{"id":228,"date":"2009-02-18T23:04:30","date_gmt":"2009-02-18T23:04:30","guid":{"rendered":"http:\/\/occamstypewriter.org\/trading-knowledge\/2009\/02\/18\/uk_pubmedcentral\/"},"modified":"2009-02-18T23:04:30","modified_gmt":"2009-02-18T23:04:30","slug":"uk_pubmedcentral","status":"publish","type":"post","link":"https:\/\/occamstypewriter.org\/trading-knowledge\/2009\/02\/18\/uk_pubmedcentral\/","title":{"rendered":"UK PubMedCentral"},"content":{"rendered":"<p>I went to another meeting today about <a href=\"http:\/\/ukpmc.ac.uk\">UK PubMedCentral<\/a> .  For the first time I began to feel a bit excited about the resource this project is building.<br \/>\nWhenever I hear the phrase &#8220;one-stop shop&#8221; I groan inwardly.  Since bibliographic databases and journals first appeared on the Internet I have heard publishers and database developers talk about their intention to build a &#8220;one-stop shop&#8221;.  To me it betrays an arrogance beyond belief and an ignorance of the nature of information use.  Of course it&#8217;s an impossible aim.  I&#8217;m not sure whether UKPMC ever did use that phrase, but I felt that the early information about the project did have a hint of such an ambition, so I thought to myself &#8220;here we go again&#8221;.<br \/>\nHappily the meeting today eschewed any such notion and I came away feeling that the intention of the UKPMC project participants is to build a truly excellent, useful and usable resource for health science researchers.  I also felt confident that the participants have it within them to do the job.<br \/>\nSophia Ananiadou from <a href=\"http:\/\/www.nactem.ac.uk\">NaCTeM<\/a> explained the work her group has done using text mining techniques on Medline abstracts.  This is the third time I&#8217;ve heard her talk about this, and it gets more interesting each time.  Her aim is to enrich the literature by automatically creating semantic metadata, and thereby to make &#8220;undiscovered science&#8221; accessible. The <a href=\"http:\/\/www-tsujii.is.s.u-tokyo.ac.jp\/medie\/\">MEDIE<\/a> system is the most vivid example she showed, allowing you to construct a query in the form &#8220;subject &#8211; verb &#8211; object&#8221;.  For instance, you can ask &#8220;what does p53 activate&#8221; by searching for <em>subject=p53<\/em>, <em>verb=activate<\/em>.  Or you can ask &#8220;what causes colon cancer&#8221; by searching for <em>verb=cause, object=colon cancer<\/em>.  I tried <em>verb=read, object=book<\/em> but I&#8217;m not sure what question that was answering.  Currently this MEDIE system is just searching abstracts, but even so it does a pretty good job.   It gives a hint of the power of text-mining techniques; I look forward to them being applied to the full-text corpus that is growing in PubMedCentral.<br \/>\nI also enjoyed seeing Peter Stoehr&#8217;s demonstration of <a href=\"http:\/\/www.ebi.ac.uk\/citexplore\/\">CiteXplore<\/a> , that he has developed at the <a href=\"http:\/\/www.ebi.ac.uk\">EBI<\/a> .  I&#8217;ve heard of it before and looked at it briefly but never properly considered it as a serious replacement for PubMed.  Now, partly because of the additional content in it and partly because it is going to be at the heart of the UKPMC search service, I can see that it deserves more attention.<br \/>\nOne advantage it has over PubMed is the coverage &#8211; CiteXplore indexes about half a million extra references covering plant and animal science (from the Agricola database); plus a large collection of biological patents and abstracts of Chinese biological journals.  Other resources continue to be added (e.g. <a href=\"http:\/\/www.nice.org.uk\/\">NICE<\/a> guidelines). The UKPMC project is actively considering other resources to add to enhance the search service.<br \/>\nI was surprised to see that CiteXplore also has citation data. When you display a record it shows the standard bibliographic fields and abstract but it also shows where that article has been cited.  And that&#8217;s not all: instead of just showing the citing reference it also shows the sentence in which the original article was cited, thus making it easier to interpret the significance of each citation. It would be interesting to compare the number of citations that CiteXplore lists for a given article with the number listed in Scopus and Web of Science, but I&#8217;ve not done this.<br \/>\nFinally, CiteXplore has some features that draw on text-mining tools.  When you display results you can ask it to highlight proteins in the results.  It will then highlight any occurrences of protein names and turn them into links to UniProt.  You can do the same for genes or protein-interactions.<br \/>\nPutting all these together and extrapolating the power of the text-mining techniques that Sophia showed, I left the meeting feeling that before long we are going to have a rather special search tool ready to use.<br \/>\nOne caveat &#8211; all this does presuppose that UKPMC is successful in its aim to gather in the full-text of published research articles.  Open Access mandates from the research funders (MRC, CRUK, Wellcome, DoH etc), who are also funding UKPMC, will hopefully help to achieve a high rate of deposition, but it requires the cooperation of biomedical researchers, who have thus far not proved to be very enthusiastic about Open Access.  The promise of a better literature search tool may help to persuade them it is worth it.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>I went to another meeting today about UK PubMedCentral . For the first time I began to feel a bit excited about the resource this project is building. Whenever I hear the phrase &#8220;one-stop shop&#8221; I groan inwardly. Since bibliographic &hellip; <a href=\"https:\/\/occamstypewriter.org\/trading-knowledge\/2009\/02\/18\/uk_pubmedcentral\/\">Continue reading <span class=\"meta-nav\">&rarr;<\/span><\/a><\/p>\n","protected":false},"author":17,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[1],"tags":[],"class_list":["post-228","post","type-post","status-publish","format-standard","hentry","category-uncategorized"],"_links":{"self":[{"href":"https:\/\/occamstypewriter.org\/trading-knowledge\/wp-json\/wp\/v2\/posts\/228","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/occamstypewriter.org\/trading-knowledge\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/occamstypewriter.org\/trading-knowledge\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/occamstypewriter.org\/trading-knowledge\/wp-json\/wp\/v2\/users\/17"}],"replies":[{"embeddable":true,"href":"https:\/\/occamstypewriter.org\/trading-knowledge\/wp-json\/wp\/v2\/comments?post=228"}],"version-history":[{"count":0,"href":"https:\/\/occamstypewriter.org\/trading-knowledge\/wp-json\/wp\/v2\/posts\/228\/revisions"}],"wp:attachment":[{"href":"https:\/\/occamstypewriter.org\/trading-knowledge\/wp-json\/wp\/v2\/media?parent=228"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/occamstypewriter.org\/trading-knowledge\/wp-json\/wp\/v2\/categories?post=228"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/occamstypewriter.org\/trading-knowledge\/wp-json\/wp\/v2\/tags?post=228"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}