ChemxSeer: search engine for chemical formulae Wednesday, 1 August 2007 11:17 am
Posted by Dongmei in Internet Resources, chemistry, search engines.1 comment so far
I blogged about CiteSeer in the past (see CiteSeer, techXtra and computer science literature), a targeted search engine for computer science, info tech, etc. Now, Dr. Giles (one of the developers of CiteSeer) and his students and colleagues at Penn State have developed another targeted search engine for chemistry.
So what is it?
From the news release:
ChemxSeer, the first publicly available search engine designed specifically for chemical formulae, can sort out when “He” refers to helium and not a person more than nine times out of 10, according to the Penn State College of Information Sciences and Technology (IST) researchers who created the tool.
With the new engine, scientists searching for research on CH4 or methane no longer have to wade through search results about Channel 4 or Chapter 4 as ChemxSeer will only return documents with references to the chemical formula. “Results from our search engine are much more relevant than results returned by popular search engines,” Giles said. “It is one of several cyber tools under development in our lab which will enable better access to and sharing of information and data among scientists and scholars.”
One of many interesting features that make-up ChemxSeer is TableSeer.
From the web site:
This tool automatically identifies tables in digital documents and extracts the contents in the cells of the tables. The contents are stored in a queryable table in a database. TableSeer extracts table metadata, and uses a novel ranking function to search for tables relevant to user queries.
(via ResourceShelf)
CiteSeer, techXtra and computer science literature Saturday, 27 May 2006 4:03 pm
Posted by Dongmei in computer science, search engines.2 comments
For those doing computer science (and applied mathematics) research, I'm sure that you use CiteSeer a lot. (CiteSeer is a public digital library and search engine in computer and information science (and related areas) that's hosted at Penn State, currently includes over 700,000 documents).
TechXtra (used to be called EEVLXtra), an initiative of Heriot Watt University cross-search 25 databases in engineering, math, and computing.
In addition to CiteSeer, TechXtra also searches arXiv ePrints, ePrints UK, NASA Technical Reports, CISTI (Canada Institute for Scientific and Technical Information), EEVL Ejournal Search Engine, Euclid Mathematics and Statistics Journals, and others.
So, when CiteSeer is busy (happens a lot), you can use techXtra as an alternative, however, I do find some discrepancy (tried "vector space model" as an example). And you can't do citation search in techXtra which is a nice feature of CiteSeer.



