Categories


Contact

Search

Links


Archive


BioMed Central Blog

Tuesday Jul 31, 2007

BioText - a search engine for open access figures

At the ISMB conference we met Anna Divoli, a postdoc at the University of California, Berkeley, who showed us the BioText Search Engine, which she was presenting as a poster, and has recently published.


BioText Search Engine logo


I came across it briefly earlier this month thanks to the blog of medical librarian David Rothman, who described it as "A supercool way to search PubMed Central", which is a pretty good description!

It is part of the text mining BioText project, and goes beyond the abstract searching in MEDLINE seen previously to extend searching to the figure legends of Open Access journals in PubMed Central.

As the homepage of PubMed Central notes, "All the articles in PMC are free (sometimes on a delayed basis). Some journals go beyond free, to Open Access". Because Open Access explicitly allows the reuse of the content of the articles in these journals (which include all 170+ BioMed Central journals) this has allowed the BioText people to create a search engine that allows keyword searching of abstracts, figure legends, titles and authors, returning results sorted by date and relevance, and in two formats: abstracts with figure thumbnails and legends, or figure legends with thumbnails.

   

For example, by running a 'Captions' search for "Drosophila eye", I quickly found these three images from BMC Developmental Biology, Journal of Biology and BMC Cell Biology:

Verdier et al., BMC Developmental Biology 2006    Junger et al., Journal of Biology 2003    Minakhina et al., BMC Cell Biology 2005

Anna hinted at upcoming functions such as returning snippets that match the search terms from the full text of the article (much as Google Scholar does). We look forward to these further developments, and we'd like to thank Anna, Marti Hearst and the others on the BioText team for developing such a useful and user friendly tool. This is a great example of how Open Access allows others to make further use of published work, in ways that the authors or publishers had not anticipated.

 

Comments:

Post a Comment:
  • HTML Syntax: Allowed