Reliability of plastid and mitochondrial localisation prediction declines rapidly with the evolutionary distance to the training set increasing.

Item request has been placed! ×
Item request cannot be made. ×
loading   Processing Request
  • Additional Information
    • Source:
      Publisher: Public Library of Science Country of Publication: United States NLM ID: 101238922 Publication Model: eCollection Cited Medium: Internet ISSN: 1553-7358 (Electronic) Linking ISSN: 1553734X NLM ISO Abbreviation: PLoS Comput Biol Subsets: MEDLINE
    • Publication Information:
      Original Publication: San Francisco, CA : Public Library of Science, [2005]-
    • Subject Terms:
    • Abstract:
      Mitochondria and plastids import thousands of proteins. Their experimental localisation remains a frequent task, but can be resource-intensive and sometimes impossible. Hence, hundreds of studies make use of algorithms that predict a localisation based on a protein's sequence. Their reliability across evolutionary diverse species is unknown. Here, we evaluate the performance of common algorithms (TargetP, Localizer and WoLFPSORT) for four photosynthetic eukaryotes (Arabidopsis thaliana, Zea mays, Physcomitrium patens, and Chlamydomonas reinhardtii) for which experimental plastid and mitochondrial proteome data is available, and 171 eukaryotes using orthology inferences. The match between predictions and experimental data ranges from 75% to as low as 2%. Results worsen as the evolutionary distance between training and query species increases, especially for plant mitochondria for which performance borders on random sampling. Specificity, sensitivity and precision analyses highlight cross-organelle errors and uncover the evolutionary divergence of organelles as the main driver of current performance issues. The results encourage to train the next generation of neural networks on an evolutionary more diverse set of organelle proteins for optimizing performance and reliability.
      Competing Interests: The authors have declared that no competing interests exist.
      (Copyright: © 2024 Gould et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.)
    • References:
      Plant Physiol. 2009 Jul;150(3):1272-85. (PMID: 19474214)
      Gene. 2004 Mar 31;329:11-6. (PMID: 15033524)
      Plant Physiol. 2011 Apr;155(4):1578-88. (PMID: 21350036)
      Mol Biol Evol. 2013 Jul;30(7):1563-73. (PMID: 23462316)
      Virus Res. 1985 Oct;3(3):271-86. (PMID: 3000102)
      Nucleic Acids Res. 1986 Jun 11;14(11):4683-90. (PMID: 3714490)
      Mol Membr Biol. 2010 Nov;27(8):469-89. (PMID: 21067450)
      Plant Cell Physiol. 2016 Jan;57(1):e9. (PMID: 26556651)
      Nat Protoc. 2017 Jun;12(6):1110-1135. (PMID: 28471460)
      Mol Cell Biol. 1989 Mar;9(3):1014-25. (PMID: 2524645)
      Nat Rev Mol Cell Biol. 2011 Jan;12(1):48-59. (PMID: 21139638)
      Cell. 2023 Aug 3;186(16):3499-3518.e14. (PMID: 37437571)
      Bioinformatics. 2010 Jul 1;26(13):1608-15. (PMID: 20472543)
      Sci Rep. 2017 Aug 24;7(1):9279. (PMID: 28839179)
      iScience. 2020 Mar 27;23(3):100896. (PMID: 32088393)
      PLoS Biol. 2024 May 7;22(5):e3002608. (PMID: 38713727)
      EMBO J. 1986 Jun;5(6):1343-50. (PMID: 16453686)
      FEBS Lett. 2004 Jan 16;557(1-3):109-14. (PMID: 14741350)
      Biochim Biophys Acta. 2011 Mar;1808(3):947-54. (PMID: 20659421)
      Plant J. 2015 Feb;81(3):519-28. (PMID: 25438865)
      New Phytol. 2009;183(1):224-236. (PMID: 19368670)
      Elife. 2022 Nov 10;11:. (PMID: 36355038)
      Annu Rev Biochem. 2015;84:843-64. (PMID: 25494301)
      Philos Trans R Soc Lond B Biol Sci. 2015 Sep 26;370(1678):20140330. (PMID: 26323761)
      Front Plant Sci. 2014 Oct 13;5:535. (PMID: 25352854)
      Plant Cell. 2007 Nov;19(11):3739-59. (PMID: 17981999)
      Mol Plant. 2014 Jan;7(1):121-36. (PMID: 24214895)
      J Biochem. 1982 May;91(5):1821-4. (PMID: 7096320)
      Biol Rev Camb Philos Soc. 2018 May;93(2):1125-1144. (PMID: 29230921)
      Plant Cell. 2003 Jul;15(7):1619-31. (PMID: 12837951)
      Plant Cell Physiol. 2006 Mar;47(3):432-6. (PMID: 16418230)
      Nucleic Acids Res. 2023 Jan 6;51(D1):D587-D592. (PMID: 36300620)
      J Cell Sci. 2007 Sep 1;120(Pt 17):2977-85. (PMID: 17715154)
      Photosynth Res. 2018 Dec;138(3):289-301. (PMID: 30101370)
      Proteins. 1991;11(2):95-110. (PMID: 1946347)
      Front Plant Sci. 2022 Feb 02;13:824080. (PMID: 35185991)
      Trends Biochem Sci. 2024 Feb;49(2):105-118. (PMID: 37919225)
      PLoS One. 2014 Apr 21;9(4):e95088. (PMID: 24751891)
      Protein J. 2019 Jun;38(3):343-350. (PMID: 31201619)
      Int J Mol Sci. 2012;13(7):8038-8050. (PMID: 22942688)
      Cell. 2019 Nov 14;179(5):1057-1067.e14. (PMID: 31730849)
      FEBS J. 2022 Nov;289(22):6908-6918. (PMID: 35472255)
      EMBO J. 2000 Feb 15;19(4):542-9. (PMID: 10675323)
      FEBS J. 2005 Jun;272(12):3184-96. (PMID: 15955075)
      Nucleic Acids Res. 2013 Jan;41(Database issue):D530-5. (PMID: 23161678)
      Protein J. 2019 Jun;38(3):200-216. (PMID: 31119599)
      Eukaryot Cell. 2012 Feb;11(2):217-28. (PMID: 22140228)
      Plant J. 2018 Feb;93(3):515-533. (PMID: 29237241)
      Trends Biochem Sci. 1999 Jan;24(1):34-6. (PMID: 10087920)
      J Biol Chem. 2002 Feb 15;277(7):5562-9. (PMID: 11733507)
      Proteomics. 2004 Jun;4(6):1591-6. (PMID: 15174129)
      Nat Rev Genet. 2004 Feb;5(2):123-35. (PMID: 14735123)
      Nat Commun. 2019 Jan 18;10(1):331. (PMID: 30659192)
      Nucleic Acids Res. 2017 Jan 4;45(D1):D1064-D1074. (PMID: 27899614)
      Cell. 2017 Oct 5;171(2):287-304.e15. (PMID: 28985561)
      Plant Cell Rep. 2017 Oct;36(10):1627-1640. (PMID: 28698906)
      Elife. 2016 Mar 21;5:. (PMID: 26999824)
      Photosynth Res. 2013 Oct;116(2-3):427-36. (PMID: 23873414)
      Curr Genet. 2011 Jun;57(3):151-68. (PMID: 21533645)
      PLoS Comput Biol. 2016 May 13;12(5):e1004920. (PMID: 27175778)
      Nucleic Acids Res. 2003 Jul 1;31(13):3613-7. (PMID: 12824378)
      Mol Biol Evol. 2012 Dec;29(12):3625-39. (PMID: 22826458)
      J Exp Bot. 2014 Dec;65(22):6301-35. (PMID: 25324401)
      Front Plant Sci. 2021 Jun 17;12:692024. (PMID: 34220916)
      Trends Microbiol. 2016 Jul;24(7):525-534. (PMID: 27040918)
      J Biol Chem. 2014 Dec 19;289(51):35656-67. (PMID: 25359772)
      Plant J. 2020 Nov;104(3):812-827. (PMID: 32780488)
      Genomics. 1992 Dec;14(4):897-911. (PMID: 1478671)
      J Biochem. 1983 Sep;94(3):997-1007. (PMID: 6643433)
      J Exp Bot. 2020 Feb 19;71(4):1226-1238. (PMID: 31730153)
      Appl Microbiol Biotechnol. 2022 May;106(9-10):3507-3530. (PMID: 35575915)
      Trends Plant Sci. 2016 Jun;21(6):467-476. (PMID: 26895731)
      Mol Biol Evol. 2009 Jul;26(7):1533-48. (PMID: 19349646)
      Genome Biol Evol. 2015 Sep 02;7(9):2716-26. (PMID: 26338186)
      J Biol Chem. 2015 Nov 27;290(48):28778-91. (PMID: 26446787)
      Curr Biol. 2015 Oct 5;25(19):R911-21. (PMID: 26439354)
      Bioinformatics. 2014 Dec 1;30(23):3356-64. (PMID: 25150248)
      Nat Plants. 2020 Mar;6(3):259-272. (PMID: 32170292)
      Mol Genet Genomics. 2009 May;281(5):525-38. (PMID: 19214577)
      Front Plant Sci. 2023 Mar 08;14:1108027. (PMID: 36968370)
      Philos Trans R Soc Lond B Biol Sci. 2010 Mar 12;365(1541):847-55. (PMID: 20124349)
      Nat Commun. 2014 May 28;5:3978. (PMID: 24865297)
      FEBS J. 2009 Mar;276(5):1187-95. (PMID: 19187233)
      FEBS Lett. 2006 Jul 10;580(16):3966-72. (PMID: 16806197)
      Biochim Biophys Acta. 2013 Feb;1833(2):253-9. (PMID: 22683762)
      Genome Biol. 2019 Nov 14;20(1):238. (PMID: 31727128)
      Biol Chem. 2017 May 1;398(5-6):653-661. (PMID: 28076289)
      Nucleic Acids Res. 2009 Jan;37(Database issue):D969-74. (PMID: 18832363)
      Animal. 2012 May;6(5):748-62. (PMID: 22558923)
      Biochim Biophys Acta. 2014 Apr;1840(4):1233-45. (PMID: 24080405)
      Curr Biol. 2018 Apr 23;28(8):R381-R385. (PMID: 29689219)
      Traffic. 2003 Jul;4(7):491-501. (PMID: 12795694)
      Ann Rev Mar Sci. 2020 Jan 3;12:233-265. (PMID: 31899671)
      Trends Cell Biol. 2016 Dec;26(12):894-905. (PMID: 27524662)
      Biochim Biophys Acta. 2004 Nov 11;1694(1-3):135-47. (PMID: 15546663)
      Trends Plant Sci. 2022 Sep;27(9):847-857. (PMID: 35739050)
      BMC Genomics. 2013 Mar 18;14:189. (PMID: 23506162)
      Trends Biochem Sci. 2023 Apr;48(4):345-359. (PMID: 36504138)
      Cell. 2018 Jul 12;174(2):448-464.e24. (PMID: 30007417)
      Bioessays. 2007 Oct;29(10):1048-58. (PMID: 17876808)
      Sci Rep. 2020 May 19;10(1):8281. (PMID: 32427841)
      Biochim Biophys Acta. 2013 Feb;1833(2):360-70. (PMID: 22495024)
      Nucleic Acids Res. 2012 Jan;40(Database issue):D1178-86. (PMID: 22110026)
      Nucleic Acids Res. 2023 Jan 6;51(D1):D418-D427. (PMID: 36350672)
      Genome Biol. 2021 Dec 20;22(1):345. (PMID: 34930424)
      Plant Mol Biol. 2003 Oct;53(3):341-56. (PMID: 14750523)
      J Biol Chem. 2015 Jun 12;290(24):14866-74. (PMID: 25947384)
      J Cell Sci. 2020 Feb 26;133(4):. (PMID: 32102937)
      J Biochem. 1983 Sep;94(3):981-95. (PMID: 6643432)
      Mol Plant. 2019 Jul 1;12(7):951-966. (PMID: 30890495)
      BMC Plant Biol. 2010 Nov 16;10:249. (PMID: 21078193)
      Nucleic Acids Res. 2007 Jul;35(Web Server issue):W585-7. (PMID: 17517783)
      Mol Plant. 2009 Nov;2(6):1181-97. (PMID: 19995724)
      Philos Trans R Soc Lond B Biol Sci. 2010 Mar 12;365(1541):729-48. (PMID: 20124341)
      Life Sci Alliance. 2019 Sep 30;2(5):. (PMID: 31570514)
      J Proteomics. 2010 Oct 10;73(11):2092-123. (PMID: 20816881)
      Bioinformatics. 2006 Jul 15;22(14):e408-16. (PMID: 16873501)
      Annu Rev Biochem. 2017 Jun 20;86:685-714. (PMID: 28301740)
      Nat Plants. 2020 Feb;6(2):95-106. (PMID: 31844283)
      Arch Biochem Biophys. 2013 Nov 15;539(2):102-9. (PMID: 23851381)
      Plant J. 2011 Apr;66(1):34-44. (PMID: 21443621)
      Bioinformatics. 2004 Jan 22;20(2):289-90. (PMID: 14734327)
      Curr Biol. 2006 Feb 7;16(3):221-9. (PMID: 16461275)
      J Mol Biol. 2011 Jan 21;405(3):804-18. (PMID: 21087612)
      J Mol Biol. 2021 Aug 6;433(16):166894. (PMID: 33639212)
      Nucleic Acids Res. 2001 Aug 15;29(16):E82. (PMID: 11504890)
      New Phytol. 2013 Dec;200(4):1022-33. (PMID: 23915300)
      Plant Physiol. 2013 Feb;161(2):644-62. (PMID: 23257241)
      Trends Plant Sci. 2002 Jan;7(1):14-21. (PMID: 11804822)
      FEBS Lett. 2001 Oct 12;506(3):291-5. (PMID: 11602264)
      BMC Bioinformatics. 2012;13 Suppl 16:S2. (PMID: 23176207)
      Plant Cell. 2020 May;32(5):1361-1376. (PMID: 32152187)
      Plant Physiol. 2014 Apr;164(4):2081-95. (PMID: 24515833)
      Front Plant Sci. 2022 Oct 26;13:1040688. (PMID: 36388587)
    • Accession Number:
      0 (Proteome)
    • Publication Date:
      Date Created: 20241111 Date Completed: 20241121 Latest Revision: 20241123
    • Publication Date:
      20241123
    • Accession Number:
      PMC11581415
    • Accession Number:
      10.1371/journal.pcbi.1012575
    • Accession Number:
      39527633