{"id":2293,"date":"2016-08-23T17:46:05","date_gmt":"2016-08-23T17:46:05","guid":{"rendered":"http:\/\/occamstypewriter.org\/trading-knowledge\/?p=2293"},"modified":"2016-08-24T10:55:18","modified_gmt":"2016-08-24T10:55:18","slug":"book-sequences","status":"publish","type":"post","link":"https:\/\/occamstypewriter.org\/trading-knowledge\/2016\/08\/23\/book-sequences\/","title":{"rendered":"Book sequences"},"content":{"rendered":"<p>You may have seen some of my <a href=\"https:\/\/twitter.com\/hashtag\/nimrlibrarybyebye\">#nimrlibrarybyebye <\/a>tweets. These were a sequence of tweets showcasing books that we have been transferring to other libraries. Each tweet included a photo of a book or a handful of books. I will write a proper post about them sometime soon. The &#8216;byebye&#8217; in the hashtag is to signify that nearly the whole of the library stock is being disposed of. \u00a0Transferring books to other collections means that part of our library will live on.<\/p>\n<p>I\u2019ve also been selecting things to keep (as mentioned in my recent\u00a0<a href=\"http:\/\/occamstypewriter.org\/trading-knowledge\/2016\/08\/22\/library-day-in-the-life-july-2016\/\">Library day in the life <\/a>post). I\u2019ve focused quite a bit on science history &#8211; but that includes recent history such as the early days of the human genome project and bioinformatics. Related to that topic, two things in particular caught my eye down in our store.<\/p>\n<h2>Protein sequences &#8211; Dayhoff<\/h2>\n<div style=\"width: 305px\" class=\"wp-caption alignleft\"><img loading=\"lazy\" decoding=\"async\" class=\"\" src=\"https:\/\/c5.staticflickr.com\/9\/8455\/29052436172_f0b36d2d0e.jpg\" width=\"295\" height=\"378\" \/><p class=\"wp-caption-text\">Atlas of protein sequence and structure<\/p><\/div>\n<p>One of these I\u2019d seen before &#8211; it is a book\u00a0of sequences. My memory told me that we had a small hard-bound book of Genbank sequences, but I must have imagined that. 
\u00a0The book is a softbound book of protein sequences, from 1967-68. However, I did remember the author correctly: Margaret O. Dayhoff.<\/p>\n<p>Margaret O. Dayhoff was originally a physical chemist and was one of the founders of the field of bioinformatics. She\u00a0created the first comprehensive, computerised and publicly available listing\u00a0of protein sequences, <em>The Atlas of Protein Sequence and Structure<\/em>. I think it started out in 1965. Read this <a href=\"http:\/\/onlinelibrary.wiley.com\/doi\/10.1002\/9780470015902.a0023939\/abstract\">biographical article <\/a>for more details about her.<\/p>\n<div style=\"width: 198px\" class=\"wp-caption alignright\"><img loading=\"lazy\" decoding=\"async\" class=\"\" src=\"https:\/\/c1.staticflickr.com\/9\/8198\/29075421752_72b8f70f61.jpg\" width=\"188\" height=\"306\" \/><p class=\"wp-caption-text\">Dedication page<\/p><\/div>\n<p>Mainly I&#8217;m tickled by the idea of printing sequences in a book!\u00a0 I love this book because today the very idea of a book full of gene or protein sequences seems bizarre. It shows how naturally we use\u00a0books to share information. There were more volumes and supplements published in this series in the next few years.<\/p>\n<p>I like the image on the dedication page too &#8211; though I&#8217;m not quite sure what the origin of the sculpture depicted is.<\/p>\n<p>The book later turned into\u00a0the <a href=\"http:\/\/nar.oxfordjournals.org\/content\/16\/5\/1869.full.pdf+html\">Protein\u00a0Identification Resource (PIR)<\/a>.<\/p>\n<p>&nbsp;<\/p>\n<p>&nbsp;<\/p>\n<p>There is an <a href=\"http:\/\/nar.oxfordjournals.org\/content\/10\/1\/157.full.pdf+html?sid=88b2aa4d-726e-46e9-8f52-ab597bc3c470\">interesting account in <em>Nucleic Acids Research<\/em><\/a> in 1981 of\u00a0another of Dayhoff&#8217;s projects\u00a0&#8211; a nucleotide sequence database which became the\u00a0model for other databanks, such as GenBank.<\/p>\n<blockquote><p>On September 15. 
1980, the Nucleic Acid Sequence Database Demonstration Project of the National Biomedical Research Foundation was made available to interested users through telephone access to our computer. Over two hundred user groups requested access during the ten months of the demonstration. &#8230;<\/p>\n<p>We had been using the computer system ourselves for some time and had found that a computerized management system was essential to minimize the overall cost of collecting, updating, and critically reviewing the data.<\/p><\/blockquote>\n<p><img loading=\"lazy\" decoding=\"async\" class=\"alignleft\" src=\"https:\/\/c3.staticflickr.com\/9\/8148\/29124775786_4405332a59.jpg\" width=\"232\" height=\"306\" \/>Margaret Dayhoff was held in some esteem by her peers, and\u00a0I discovered we also have a festschrift dedicated to her, a special issue of\u00a0the\u00a0<em>Bulletin of Mathematical Biology<\/em>.<\/p>\n<h3><\/h3>\n<div style=\"width: 353px\" class=\"wp-caption alignright\"><img loading=\"lazy\" decoding=\"async\" class=\"\" src=\"https:\/\/c7.staticflickr.com\/9\/8436\/29052435902_3788babb24.jpg\" width=\"343\" height=\"451\" \/><p class=\"wp-caption-text\">Genbank online service &#8211; manual<\/p><\/div>\n<p>&nbsp;<\/p>\n<p>&nbsp;<\/p>\n<p>&nbsp;<\/p>\n<p>While sorting out some old files in my office (I&#8217;m doing a lot of sorting out and throwing away these days!) I found a manual for the Genbank online service (GOS), 1992. I thought I must have thrown this away ages ago so was pleased to see it again.<\/p>\n<p>I remember that the GOS was my first direct contact with Genbank. Back then I occasionally had people asking me about gene\u00a0sequences. I discovered\u00a0that I could search on Medline for a gene name and then identify the sequence accession number. This allowed them to retrieve the sequence from other sources. 
With GOS, accessed using telnet, I was\u00a0able to search GenBank directly in its most current format, and I could even get the sequence too. The interface was plain but no worse than what I was used to in other online systems.<\/p>\n<p>Not long after this Gopher came along, followed swiftly by the WWW (as we called it then).\u00a0These made\u00a0it dead easy for everyone to find sequence information and my newly acquired skills with GOS became\u00a0redundant. Information skills had a high churn rate even in 1992.<\/p>\n<div style=\"width: 279px\" class=\"wp-caption alignleft\"><img loading=\"lazy\" decoding=\"async\" src=\"https:\/\/c5.staticflickr.com\/9\/8822\/29124881796_0a45bdab76.jpg\" width=\"269\" height=\"371\" \/><p class=\"wp-caption-text\">Martin Bishop&#8217;s 1994 book.\u00a0 Guide to human genome computing \/ edited by Martin J. Bishop. London: Academic Press.<\/p><\/div>\n<p>Some brave souls wrote books about sequence databases and manipulation\u00a0&#8211; knowing that by the time the book appeared in print another dozen databases and software tools would have been developed. Martin Bishop, a scientist at the MRC Human Genome Mapping Project Resource Centre (HGMP-RC), was better placed than most to keep up-to-date and his was one of the first books on the subject\u00a0I remember buying for the library.<\/p>\n<p>Other classics were Russell Doolittle&#8217;s<br \/>\n<em><a href=\"http:\/\/www.sciencedirect.com\/science\/bookseries\/00766879\/266\/supp\/C\">Computer Methods for Macromolecular Sequence Analysis<\/a>,<\/em>\u00a0part of the\u00a0<em>Methods in Enzymology<\/em> series &#8211; vol.\u00a0266\u00a0in\u00a01996. \u00a0And Andreas Baxevanis and Francis Ouellette&#8217;s\u00a0<em>Bioinformatics : a practical guide to the analysis of genes and proteins &#8211;<\/em> part of the\u00a0<em>Methods of Biochemical Analysis<\/em> series in\u00a01998. 
\u00a0These days we have both of these online as ebooks.<\/p>\n<h3>Immunological sequences &#8211; Kabat<\/h3>\n<p>The other book that summoned up memories of the days when sequences were printed in book form was Kabat. \u00a0I remember the 1987 edition was an enormous book that received quite a bit of use when I first started here in the Library. I was\u00a0excited when I spotted that there was a\u00a0new edition in 1991 and went to some lengths to purchase\u00a0a copy for the Library. That was the last\u00a0edition of the book as it then <a href=\"http:\/\/www.ncbi.nlm.nih.gov\/pubmed\/8727325\">turned into\u00a0a database.<\/a>\u00a0 See this account by Martin in 1996:<\/p>\n<blockquote><p>&#8220;The chief drawback of this database has been that it has only been available in the form of a printed book. These data have recently become available on the global computer Internet, but no method of searching the data has, as yet, been provided. Here, the development of a specialized database program for accessing the antibody data is described. 
This database software has been made accessible over the World Wide Web, together with a program which allows a novel antibody sequence to be tested against the Kabat sequence database, to identify unusual features of an antibody sequence which may represent cloning artifacts or sequencing errors.&#8221;<\/p><\/blockquote>\n<p>I was pleased to see that we have a copy of each edition of Kabat, from 1979 through to 1991, on the shelves in the Library store.<\/p>\n<div style=\"width: 510px\" class=\"wp-caption alignnone\"><img loading=\"lazy\" decoding=\"async\" class=\"\" src=\"https:\/\/c1.staticflickr.com\/9\/8775\/29052436112_a7717f1eba.jpg\" width=\"500\" height=\"322\" \/><p class=\"wp-caption-text\">Kabat &#8211; 5 different editions, 1979-1991<\/p><\/div>\n<p>A longer <a href=\"http:\/\/nar.oxfordjournals.org\/content\/28\/1\/214.full?sid=0ea4f574-aa50-4338-ae6b-928f2a4eae78\">history of Kabat <\/a>appeared in 2000 in\u00a0<em>Nucleic Acids Research.\u00a0<\/em>This described a 30-year history, going back to 1970 when the <a href=\"http:\/\/jem.rupress.org\/content\/132\/2\/211.full.pdf+html\">data compilation first appeared as an article<\/a> in <em>J Exp. Med.<\/em><\/p>\n<p>Elvin Kabat died in 2000, and the US National Academy of Sciences published a <a href=\"http:\/\/www.nap.edu\/read\/11172\/chapter\/7\">biographical memoir<\/a> of him saying he:<\/p>\n<blockquote><p>was a founding father of modern quantitative immunochemistry together with Michael Heidelberger, his doctoral mentor&#8230;<\/p>\n<p>The printed and subsequent Web version [of Kabat] was a pioneering effort that preceded the current GenBank database. Indeed, Kabat was also instrumental in urging the National Institutes of Health to support a national DNA sequence database and the development of sequence manipulation software.<\/p><\/blockquote>\n<p>It is salutary to think that the early 1990s were such a different world &#8211; no web, hardly any internet, email was just starting to be used. 
\u00a0And people thought nothing of publishing gene and protein sequences in paper format.<\/p>\n<div style=\"width: 411px\" class=\"wp-caption alignnone\"><img loading=\"lazy\" decoding=\"async\" class=\"\" src=\"https:\/\/c1.staticflickr.com\/9\/8418\/29124775616_2814286d73.jpg\" width=\"401\" height=\"500\" \/><p class=\"wp-caption-text\">A page from the Atlas of protein sequences and structure<\/p><\/div>\n<p>Nowadays the only reason for printing out sequences is to create a museum exhibit:<\/p>\n<div style=\"width: 335px\" class=\"wp-caption alignnone\"><a href=\"https:\/\/www.broadinstitute.org\/files\/imagecache\/large\/blog\/images\/2010\/Wellcome_genome_bookcase_0.png\"><img loading=\"lazy\" decoding=\"async\" src=\"https:\/\/www.broadinstitute.org\/files\/imagecache\/large\/blog\/images\/2010\/Wellcome_genome_bookcase_0.png\" width=\"325\" height=\"284\" \/><\/a><p class=\"wp-caption-text\">When the human genome is printed out in a series of books, the DNA sequence fills more than 100 books. Image Courtesy: Russ London&#8217;s photograph of the Human Genome in the &#8220;Medicine Now\u201d\u00a0room at the Wellcome Collection in London.<\/p><\/div>\n<p>&nbsp;<\/p>\n<hr \/>\n<p>A couple of reviews delve more deeply into the history of bioinformatics and computational biology:<\/p>\n<ul>\n<li>Searls DB (2010) <a href=\"http:\/\/journals.plos.org\/ploscompbiol\/article?id=10.1371\/journal.pcbi.1000809\">The Roots of Bioinformatics<\/a>. <em>PLoS Comput Biol<\/em> <strong>6<\/strong>(6): e1000809<\/li>\n<li>Hagen JB (2000)\u00a0<a href=\"http:\/\/www.nature.com\/nrg\/journal\/v1\/n3\/full\/nrg1200_231a.html\">The origins of bioinformatics<\/a>.\u00a0<em>Nature Reviews Genetics<\/em> <strong>1<\/strong>: 231-236<\/li>\n<li>Hagen JB (2011) <a href=\"http:\/\/www.ncbi.nlm.nih.gov\/pubmed\/21063941\">The origin and early reception of sequence databases<\/a>. 
<em>Methods Mol Biol.<\/em>\u00a0<strong>696<\/strong>:61-77<\/li>\n<\/ul>\n<p>&nbsp;<\/p>\n","protected":false},"excerpt":{"rendered":"<p>You may have seen some of my #nimrlibrarybyebye tweets. These were a sequence of tweets showcasing books that we have been transferring to other libraries. Each tweet included a photo of a book or a handful of books. I will &hellip; <a href=\"https:\/\/occamstypewriter.org\/trading-knowledge\/2016\/08\/23\/book-sequences\/\">Continue reading <span class=\"meta-nav\">&rarr;<\/span><\/a><\/p>\n","protected":false},"author":17,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[33,34,26,24],"tags":[],"class_list":["post-2293","post","type-post","status-publish","format-standard","hentry","category-books","category-collections","category-history","category-research-data"],"_links":{"self":[{"href":"https:\/\/occamstypewriter.org\/trading-knowledge\/wp-json\/wp\/v2\/posts\/2293","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/occamstypewriter.org\/trading-knowledge\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/occamstypewriter.org\/trading-knowledge\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/occamstypewriter.org\/trading-knowledge\/wp-json\/wp\/v2\/users\/17"}],"replies":[{"embeddable":true,"href":"https:\/\/occamstypewriter.org\/trading-knowledge\/wp-json\/wp\/v2\/comments?post=2293"}],"version-history":[{"count":0,"href":"https:\/\/occamstypewriter.org\/trading-knowledge\/wp-json\/wp\/v2\/posts\/2293\/revisions"}],"wp:attachment":[{"href":"https:\/\/occamstypewriter.org\/trading-knowledge\/wp-json\/wp\/v2\/media?parent=2293"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/occamstypewriter.org\/trading-knowledge\/wp-json\/wp\/v2\/categories?post=2293"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/occamstypewriter.org\/trading-knowledge
\/wp-json\/wp\/v2\/tags?post=2293"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}