{"id":277,"date":"2009-06-22T15:34:57","date_gmt":"2009-06-22T15:34:57","guid":{"rendered":"http:\/\/occamstypewriter.org\/rpg\/2009\/06\/22\/on_article_level_metrics_and_other_animals\/"},"modified":"2009-06-22T15:34:57","modified_gmt":"2009-06-22T15:34:57","slug":"on_article_level_metrics_and_other_animals","status":"publish","type":"post","link":"https:\/\/occamstypewriter.org\/rpg\/2009\/06\/22\/on_article_level_metrics_and_other_animals\/","title":{"rendered":"On article-level metrics and other animals"},"content":{"rendered":"<p>What I wanted to talk about yesterday, before I got <a href=\"http:\/\/network.nature.com\/people\/rpg\/blog\/2009\/06\/21\/on-fathers-day\">distracted<\/a>, was a mix of things that came out of last Monday&#8217;s ALPSP meeting, a paper in PLoS One and this whole assessment schtick that we&#8217;ve got going on.<\/p>\n<p>\nWe know, don&#8217;t we, that the <a href=\"http:\/\/network.nature.com\/hubs\/nyc\/blog\/2009\/04\/12\/the-impact-of-the-impact-factor\">Impact Factor<\/a> <em>as a measure of an individual scientist&#8217;s productivity<\/em> is deeply <a href=\"http:\/\/network.nature.com\/people\/sunon77\/blog\/2009\/05\/04\/a-better-index-than-impact-factor-to-decide-the-most-influential-journals\">flawed<\/a>. Some might go as far as to say it&#8217;s flawed fundamentally and should be ditched forthwith: after all it is prone to gaming, unduly skewed by review articles, does not demonstrate a link between the &#8216;quality&#8217; of a journal and any given article therein, can <em>reduce<\/em> the measured effectiveness of a single article (if a seminal paper is cited in a review then the <em>review<\/em> rather than the original paper can be cited more frequently), has no meaning in fields (humanities, nursing\u2026) where citations are not routinely used as a measure of impact, and is very, very slow. (Seriously. We&#8217;re looking at a two year gap at least between the original work being done and it being cited in any meaningful manner\u2014there has to be the first paper published, experiments thought about and performed, results analysed, papers written, reviewed and finally another publishing round before you can measure it.) It is at least one step removed from the output it might be taken as measuring, relies on everyone else playing fair, and, just like a quantum physical experiment, perturbs in the act of measurement the very thing it is supposed to measure.<\/p>\n<p>\nThere have been attempts to massage the Impact Factor, and new formulations cooked up, such as the <a href=\"http:\/\/en.wikipedia.org\/wiki\/Hirsch_number\">Hirsch Index<\/a> (and its variants) or the <a href=\"http:\/\/www.eigenfactor.org\/whyeigenfactor.htm\">Eigenfactor<\/a>, but essentially these are all measuring not a researcher&#8217;s activity, nor even how good that activity has been, but rather how someone else has managed to exploit it. Let&#8217;s not even think about research activities that do not result in a classical &#8216;citable unit&#8217;: database annotations, talks, training of students, <a href=\"http:\/\/blogs.usyd.edu.au\/labrats\/2008\/10\/untransferable_skills.html\">fixing kit<\/a>, etc., etc. And when you start looking at this in terms of stakeholders, for example the Research <del>Excrement<\/del> Excellence Framework (&amp;c.), you get into all sorts of difficulties because people want to be able to measure you in a pretty automated way in order to finish it before the heat death of the universe. <\/p>\n<p>\n<code>Clearly there is no substitute for actually reading an article to determine its importance<\/code><\/p>\n<p>\n\u2026 which is probably why certain people last Monday were very careful to say &#8220;indicators&#8221; rather than &#8220;metrics&#8221;, and brings me onto <a href=\"http:\/\/everyone.plos.org\/2009\/05\/27\/article-level-metrics-at-plos\/\">article-level metrics<\/a> and a paper on the impact of Wellcome-funded research.<\/p>\n<p>\nPLoS ONE are trying to look at the impact (however you define <em>that<\/em>) in terms of individual articles. According to <a href=\"http:\/\/conferences.aepic.it\/index.php\/elpub\/elpub2009\/paper\/view\/114\/51\">Peter Binfield<\/a>, for each PLoS ONE article the aim <\/p>\n<p>\n<code>is to provide information relating to online usage, citation activity, blog and media coverage, commenting activity, social bookmarking, &#034;star ratings&#034; and &#034;best of&#034; picks as selected by academic experts, as well as other measures yet to be determined<\/code><\/p>\n<p>\nIn other words, you&#8217;ll be measured by what you write, not by who writes about what you wrote. And even that&#8217;s a bit tricky, because really, there is no substitute for &#8220;expert&#8221; review, as was elegantly demonstrated in PLoS One <a href=\"http:\/\/dx.doi.org\/10.1371\/journal.pone.0005910\">last week<\/a>.<\/p>\n<p>\nLiz Allen and colleagues at the Wellcome Trust analysed the output of a whole bunch of Wellcome-funded researchers, whether project or programme grant or fellowship holders. They appointed a panel, or college, of &#8216;experts&#8217; to review papers published in 2005 and essentially got this college to answer the question &#8220;how good is this stuff, I mean, really?&#8221;\u2014for nearly 700 papers. (Okay, they didn&#8217;t assess &#8216;non-citable units&#8217;, but it&#8217;s a start.)<\/p>\n<p>\nThe results are interesting. For a start, it turns out that the Impact Factor or citational analysis is a pretty poor indicator of an individually paper&#8217;s importance (or quality, or influence). We all kind of suspected this, but it&#8217;s nice to finally have some data. <\/p>\n<p>\nThe other thing, and here I declare my conflict of interest, is that the Wellcome&#8217;s expert college on the whole matched <a href=\"http:\/\/f1000.com\/\">Faculty of 1000&#8217;s<\/a> assessment. Yes, some papers were completely missed, but we think we know why and are working to make sure that in future we do spot more interesting things that appear in less well-known journals. We&#8217;re pretty stoked about this, actually.<\/p>\n<p>\nYou can&#8217;t beat human input. Not yet, anyway. What we&#8217;d really like to do is find some way of matching PLoS&#8217;s article-level <del>metrics<\/del> indicators with our F1000 Factor and synthesize a sensible measure that looks at the quality of the research as well as its influence and usage. Here would be a good place to draw your attention to Johan Bollen&#8217;s incredibly cool <a href=\"http:\/\/www.mesur.org\/services\/maps.html\">network graphs<\/a>, which could provide hours of fun for all the family. And yes, I talked to Johan on Monday and yes, he does plan to slice those data at the individual author\/research group level.<\/p>\n<p>\nYou may have noticed that we (Faculty of 1000) already &#8216;score&#8217; articles. Seeing as we have these scores, you might imagine that we&#8217;d be able to, oh I don&#8217;t know, give scores to individual journals or institutions: perhaps even individual scientists. Now there&#8217;s an idea\u2026<\/p>\n<p>\nBut you know, I do wonder if all these indicators and claims of unfairness, fairness, whatever, might be missing the point, somewhat. <strong>Whatever the metric you use, good science will out<\/strong>. I wonder if there are truly any brilliant scientists who have not got jobs, or not got funded, because the current indicators have missed them? And I&#8217;m sure you&#8217;ve all got stories you can tell me. To which I say,<\/p>\n<blockquote><p>\n<em>the plural of anecdote is not &#8216;data&#8217;<\/em>\n<\/p><\/blockquote>\n<p>\nFurthermore, we all assume that there is necessarily a link between past results and future performance, in opposition to what the investment adverts keep telling us. The Impact Factor\u2014or any citation metric\u2014is at least two years out of date. So I wonder if this holds when dishing out the cash for research? Now, I&#8217;m pretty open-minded, and a strong believer in experiment. I would really like someone to test my hypotheses. So I propose the following:<\/p>\n<p>\nTake a pot of money. Tens, if not hundreds, of millions of pounds or dollars, doesn&#8217;t matter which. Divide it into four equal portions. Invite researchers to apply for the money in the usual way, and on receipt of the applications assign them randomly to one of four groups, A, B, C or D. Group A is judged in the standard manner of competitive grant applications. Group B is judged on the basis of Hirsch Indices or Impact Factors of the PIs involved and Group C uses something like an expert college in much the same way as the papers in the Allen study were judged (but looking at individual researchers rather than their papers). Applicants in Group D get the money randomly. We&#8217;d probably want to limit to a 20% success rate for each group\u2014someone else can work out the details.<\/p>\n<p>\nAnd then, after five years, we assess, using every best method available, the productivity of the researchers who were funded (and could probably look at the careers of those who weren&#8217;t, too). <\/p>\n<p>\nThis is probably unethical, but potentially very interesting. I wouldn&#8217;t be surprised if there were a few surprises.<\/p>\n<p><!--more--><br \/>\n<span style=\"float: left;padding: 5px\"><a href=\"http:\/\/www.researchblogging.org\"><img decoding=\"async\" alt=\"ResearchBlogging.org\" src=\"http:\/\/www.researchblogging.org\/public\/citation_icons\/rb2_small.png\" style=\"border:0\" \/><\/a><\/span><\/p>\n<p>\n<span class=\"Z3988\" title=\"ctx_ver=Z39.88-2004&amp;rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&amp;rft.jtitle=PLoS+ONE&amp;rft_id=info%3Adoi%2F10.1371%2Fjournal.pone.0005910&amp;rfr_id=info%3Asid%2Fresearchblogging.org&amp;rft.atitle=Looking+for+Landmarks%3A+The+Role+of+Expert+Review+and+Bibliometric+Analysis+in+Evaluating+Scientific+Publication+Outputs&amp;rft.issn=1932-6203&amp;rft.date=2009&amp;rft.volume=4&amp;rft.issue=6&amp;rft.spage=0&amp;rft.epage=0&amp;rft.artnum=http%3A%2F%2Fdx.plos.org%2F10.1371%2Fjournal.pone.0005910&amp;rft.au=Allen%2C+L.&amp;rft.au=Jones%2C+C.&amp;rft.au=Dolby%2C+K.&amp;rft.au=Lynn%2C+D.&amp;rft.au=Walport%2C+M.&amp;rfe_dat=bpr3.included=1;bpr3.tags=Biology\">Allen, L., Jones, C., Dolby, K., Lynn, D., &amp; Walport, M. (2009). Looking for Landmarks: The Role of Expert Review and Bibliometric Analysis in Evaluating Scientific Publication Outputs <span style=\"font-style: italic\">PLoS ONE, 4<\/span> (6) DOI: <a rev=\"review\" href=\"http:\/\/dx.doi.org\/10.1371\/journal.pone.0005910\">10.1371\/journal.pone.0005910<\/a><\/span><\/p>\n","protected":false},"excerpt":{"rendered":"<p>What I wanted to talk about yesterday, before I got distracted, was a mix of things that came out of last Monday&#8217;s ALPSP meeting, a paper in PLoS One and this whole assessment schtick that we&#8217;ve got going on. We &hellip; <a href=\"https:\/\/occamstypewriter.org\/rpg\/2009\/06\/22\/on_article_level_metrics_and_other_animals\/\">Continue reading <span class=\"meta-nav\">&rarr;<\/span><\/a><\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[1],"tags":[],"class_list":["post-277","post","type-post","status-publish","format-standard","hentry","category-uncategorized"],"_links":{"self":[{"href":"https:\/\/occamstypewriter.org\/rpg\/wp-json\/wp\/v2\/posts\/277","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/occamstypewriter.org\/rpg\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/occamstypewriter.org\/rpg\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/occamstypewriter.org\/rpg\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/occamstypewriter.org\/rpg\/wp-json\/wp\/v2\/comments?post=277"}],"version-history":[{"count":0,"href":"https:\/\/occamstypewriter.org\/rpg\/wp-json\/wp\/v2\/posts\/277\/revisions"}],"wp:attachment":[{"href":"https:\/\/occamstypewriter.org\/rpg\/wp-json\/wp\/v2\/media?parent=277"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/occamstypewriter.org\/rpg\/wp-json\/wp\/v2\/categories?post=277"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/occamstypewriter.org\/rpg\/wp-json\/wp\/v2\/tags?post=277"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}