Mining experimental evidence of molecular function claims from the literature.

  • Author(s): Crangle CE; Cherry JM; Hong EL; Zbyslaw A
  • Source:
    Bioinformatics (Oxford, England) [Bioinformatics] 2007 Dec 01; Vol. 23 (23), pp. 3232-40. Date of Electronic Publication: 2007 Oct 17.
  • Publication Type:
    Journal Article; Research Support, N.I.H., Extramural
  • Language:
    English
  • Additional Information
    • Source:
      Publisher: Oxford University Press Country of Publication: England NLM ID: 9808944 Publication Model: Print-Electronic Cited Medium: Internet ISSN: 1367-4811 (Electronic) Linking ISSN: 1367-4803 NLM ISO Abbreviation: Bioinformatics Subsets: MEDLINE
    • Publication Information:
      Original Publication: Oxford : Oxford University Press, c1998-
    • Abstract:
      Motivation: The rate at which gene-related findings appear in the scientific literature makes it difficult if not impossible for biomedical scientists to keep fully informed and up to date. The importance of these findings argues for the development of automated methods that can find, extract and summarize this information. This article reports on methods for determining the molecular function claims that are being made in a scientific article, specifically those that are backed by experimental evidence.
      Results: The most significant result is that for molecular function claims based on direct assays, our methods achieved recall of 70.7% and precision of 65.7%. Furthermore, our methods correctly identified in the text 44.6% of the specific molecular function claims backed up by direct assays, but with a precision of only 0.92%, a disappointing outcome that led to an examination of the different kinds of errors. These results were based on an analysis of 1823 articles from the literature of Saccharomyces cerevisiae (budding yeast).
      Availability: The annotation files for S. cerevisiae are available from ftp://genome-ftp.stanford.edu/pub/yeast/data_download/literature_curation/gene_association.sgd.gz. The draft protocol vocabulary is available by request from the first author.
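      Note on the metrics: the recall and precision figures in the Results can be read with the standard definitions, recall = TP / (TP + FN) and precision = TP / (TP + FP). The following is a minimal sketch of that arithmetic only. The function name and the counts are hypothetical illustrations, not values or code from the article, and the article's own scoring procedure may differ in detail.

      ```python
      def precision_recall(true_positives, false_positives, false_negatives):
          """Return (precision, recall) as fractions in [0, 1].

          Standard definitions; an assumption about how the reported
          percentages are computed, not the authors' evaluation code.
          """
          predicted = true_positives + false_positives   # claims the system reported
          actual = true_positives + false_negatives      # claims in the curated annotations
          precision = true_positives / predicted if predicted else 0.0
          recall = true_positives / actual if actual else 0.0
          return precision, recall


      # Hypothetical counts chosen for illustration only (NOT from the article).
      p, r = precision_recall(true_positives=50, false_positives=25, false_negatives=50)
      print(f"precision={p:.1%} recall={r:.1%}")
      ```

      Read this way, high recall paired with very low precision means most curated claims were found, but only among a much larger number of incorrect candidates.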
    • Grant Information:
      R03 LM009752 United States LM NLM NIH HHS; P41HG02273 United States HG NHGRI NIH HHS; P41 HG001315-16 United States HG NHGRI NIH HHS; R43 HG003600 United States HG NHGRI NIH HHS; P41 HG002273 United States HG NHGRI NIH HHS; 2P41HG001315 United States HG NHGRI NIH HHS; P41 HG001315 United States HG NHGRI NIH HHS; R43CAHG003600-01 United States PHS HHS; P41 HG002273-11 United States HG NHGRI NIH HHS
    • Substance Nomenclature:
      0 (Saccharomyces cerevisiae Proteins)
    • Entry Date(s):
      Date Created: 20071019 Date Completed: 20071221 Latest Revision: 20211020
    • Update Code:
      20231215
    • PubMed Central ID:
      PMC3041023
    • DOI:
      10.1093/bioinformatics/btm495
    • PMID:
      17942445