Statistical and Machine Learning Techniques in Human Microbiome Studies: Contemporary Challenges and Solutions.

Item request has been placed! ×
Item request cannot be made. ×
loading   Processing Request
  • Additional Information
    • Source:
      Publisher: Frontiers Research Foundation Country of Publication: Switzerland NLM ID: 101548977 Publication Model: eCollection Cited Medium: Print ISSN: 1664-302X (Print) Linking ISSN: 1664302X NLM ISO Abbreviation: Front Microbiol Subsets: PubMed not MEDLINE
    • Publication Information:
      Original Publication: Lausanne : Frontiers Research Foundation
    • Abstract:
      The human microbiome has emerged as a central research topic in human biology and biomedicine. Current microbiome studies generate high-throughput omics data across different body sites, populations, and life stages. Many of the challenges in microbiome research are similar to other high-throughput studies, the quantitative analyses need to address the heterogeneity of data, specific statistical properties, and the remarkable variation in microbiome composition across individuals and body sites. This has led to a broad spectrum of statistical and machine learning challenges that range from study design, data processing, and standardization to analysis, modeling, cross-study comparison, prediction, data science ecosystems, and reproducible reporting. Nevertheless, although many statistics and machine learning approaches and tools have been developed, new techniques are needed to deal with emerging applications and the vast heterogeneity of microbiome data. We review and discuss emerging applications of statistical and machine learning techniques in human microbiome studies and introduce the COST Action CA18131 "ML4Microbiome" that brings together microbiome researchers and machine learning experts to address current challenges such as standardization of analysis pipelines for reproducibility of data analysis results, benchmarking, improvement, or development of existing and new tools and ontologies.
      Competing Interests: The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
      (Copyright © 2021 Moreno-Indias, Lahti, Nedyalkova, Elbere, Roshchupkin, Adilovic, Aydemir, Bakir-Gungor, Santa Pau, D’Elia, Desai, Falquet, Gundogdu, Hron, Klammsteiner, Lopes, Marcos-Zambrano, Marques, Mason, May, Pašić, Pio, Pongor, Promponas, Przymus, Saez-Rodriguez, Sampri, Shigdel, Stres, Suharoschi, Truu, Truică, Vilne, Vlachakis, Yilmaz, Zeller, Zomer, Gómez-Cabrero and Claesson.)
    • References:
      Microbiome. 2014 May 05;2:15. (PMID: 24910773)
      F1000Res. 2020 Oct 15;9:1246. (PMID: 33274053)
      Nature. 2019 Apr;568(7750):43-48. (PMID: 30918406)
      Cell Host Microbe. 2017 Aug 9;22(2):134-141. (PMID: 28799899)
      Nat Commun. 2021 May 11;12(1):2671. (PMID: 33976176)
      Front Microbiol. 2021 Feb 19;12:634511. (PMID: 33737920)
      Oncotarget. 2017 Feb 7;8(6):9546-9556. (PMID: 28061434)
      BioData Min. 2017 Dec 11;10:36. (PMID: 29238404)
      Genome Biol. 2013 Jan 15;14(1):R2. (PMID: 23320958)
      Microbiome. 2018 Feb 01;6(1):23. (PMID: 29391044)
      Nat Methods. 2016 Jul;13(7):581-3. (PMID: 27214047)
      Appl Environ Microbiol. 2009 Dec;75(23):7537-41. (PMID: 19801464)
      Nucleic Acids Res. 2020 Jan 8;48(D1):D570-D578. (PMID: 31696235)
      PLoS One. 2012;7(2):e30126. (PMID: 22319561)
      mSystems. 2018 Nov 13;3(6):. (PMID: 30443602)
      PLoS One. 2013 Apr 22;8(4):e61217. (PMID: 23630581)
      Microbiome. 2017 Mar 3;5(1):27. (PMID: 28253908)
      Cell Metab. 2022 May 3;34(5):719-730.e4. (PMID: 35354069)
      Brain Behav Immun. 2014 May;38:1-12. (PMID: 24370461)
      Nat Commun. 2020 Jul 14;11(1):3514. (PMID: 32665548)
      Nat Med. 2016 Jul 7;22(7):713-22. (PMID: 27387886)
      Genome Biol. 2014;15(12):550. (PMID: 25516281)
      BMC Bioinformatics. 2008 Sep 19;9:386. (PMID: 18803844)
      BMC Bioinformatics. 2019 Jul 3;20(1):374. (PMID: 31269897)
      Nat Microbiol. 2018 Mar;3(3):347-355. (PMID: 29335554)
      mBio. 2020 Jun 9;11(3):. (PMID: 32518182)
      mSystems. 2018 Jan 9;3(1):. (PMID: 29359195)
      Nat Methods. 2014 Nov;11(11):1144-6. (PMID: 25218180)
      Nature. 2010 Mar 4;464(7285):59-65. (PMID: 20203603)
      Genome Res. 2015 Oct;25(10):1558-69. (PMID: 26260972)
      PLoS Comput Biol. 2016 Jul 11;12(7):e1004977. (PMID: 27400279)
      Front Microbiol. 2020 Feb 19;11:136. (PMID: 32140140)
      Front Microbiol. 2017 Nov 15;8:2224. (PMID: 29187837)
      Bioinformatics. 2018 Jul 1;34(13):i32-i42. (PMID: 29950008)
      BMC Genomics. 2019 Dec 10;20(1):960. (PMID: 31823721)
      Hypertension. 2020 Nov;76(5):1555-1562. (PMID: 32909848)
      Nat Methods. 2019 Jul;16(7):627-632. (PMID: 31182859)
      Nat Microbiol. 2018 Jan;3(1):8-16. (PMID: 29255284)
      Microbiome. 2018 Dec 17;6(1):226. (PMID: 30558668)
      Drug Discov Today. 2018 Sep;23(9):1644-1657. (PMID: 29890228)
      Gigascience. 2017 Aug 1;6(8):1-11. (PMID: 28637310)
      Science. 2016 Apr 29;352(6285):565-9. (PMID: 27126040)
      Cell. 2018 Mar 8;172(6):1198-1215. (PMID: 29522742)
      Mol Ecol. 2018 Jun;27(12):2714-2724. (PMID: 29761593)
      ISME J. 2012 Mar;6(3):564-76. (PMID: 21993395)
      Genome Res. 2013 Oct;23(10):1704-14. (PMID: 23861384)
      Bioinformatics. 2018 Apr 1;34(7):1235-1237. (PMID: 29194469)
      Nat Rev Gastroenterol Hepatol. 2019 Nov;16(11):656-661. (PMID: 31562390)
      Brief Bioinform. 2019 May 21;20(3):752-766. (PMID: 29077790)
      Nat Biotechnol. 2018 Dec 03;:. (PMID: 30531897)
      PeerJ. 2020 Mar 24;8:e8783. (PMID: 32231882)
      Cell. 2016 Dec 1;167(6):1469-1480.e12. (PMID: 27912057)
      J Stat Softw. 2014;59(13):1-21. (PMID: 26917999)
      J Biosci. 2019 Oct;44(5):. (PMID: 31719224)
      IEEE J Biomed Health Inform. 2020 Oct;24(10):2993-3001. (PMID: 32396115)
      Genome Biol. 2019 Dec 23;20(1):293. (PMID: 31870407)
      PLoS One. 2014 Apr 22;9(4):e95511. (PMID: 24755769)
      Front Immunol. 2019 Jan 07;9:2868. (PMID: 30666248)
      Forensic Sci Int Genet. 2019 Jul;41:72-82. (PMID: 31003081)
      PeerJ. 2017 Feb 9;5:e2969. (PMID: 28289558)
      Methods. 2019 Aug 15;166:74-82. (PMID: 30885720)
      PLoS One. 2018 Nov 9;13(11):e0207072. (PMID: 30412640)
      Nat Methods. 2011 Jul 17;8(9):761-3. (PMID: 21765408)
      Nature. 2007 Oct 18;449(7164):804-10. (PMID: 17943116)
      Nat Biotechnol. 2017 Sep 12;35(9):833-844. (PMID: 28898207)
      J Microbiol. 2020 Mar;58(3):206-216. (PMID: 32108316)
      Bioinformatics. 2020 Jul 1;36(Suppl_1):i39-i47. (PMID: 32657370)
      Front Microbiol. 2020 Apr 03;11:393. (PMID: 32318028)
      Genome Biol. 2011 Jun 24;12(6):R60. (PMID: 21702898)
      World J Gastroenterol. 2016 Jan 14;22(2):501-18. (PMID: 26811603)
      Sci Rep. 2020 Apr 7;10(1):6026. (PMID: 32265477)
      Nat Protoc. 2020 Mar;15(3):799-821. (PMID: 31942082)
      Nat Rev Immunol. 2013 Nov;13(11):790-801. (PMID: 24096337)
      Microbiome. 2020 Jun 30;8(1):103. (PMID: 32605663)
      Nat Commun. 2019 Nov 28;10(1):5416. (PMID: 31780648)
      Immunol Rev. 2017 Sep;279(1):90-105. (PMID: 28856737)
      Nat Commun. 2014 Jul 08;5:4344. (PMID: 25003530)
      Science. 2016 Apr 29;352(6285):560-4. (PMID: 27126039)
      Biostatistics. 2019 Oct 1;20(4):599-614. (PMID: 29868846)
      PLoS Comput Biol. 2019 Jul 25;15(7):e1007007. (PMID: 31344036)
      Nat Rev Genet. 2016 Jul 15;17(8):470-86. (PMID: 27418159)
      Nature. 2019 Apr;568(7753):505-510. (PMID: 30867587)
      PeerJ. 2015 Oct 08;3:e1319. (PMID: 26500826)
      Nat Rev Microbiol. 2018 Jul;16(7):410-422. (PMID: 29795328)
      mSystems. 2019 May 14;4(4):. (PMID: 31098399)
      Nat Genet. 2019 Apr;51(4):600-605. (PMID: 30778224)
      Nat Microbiol. 2020 Sep;5(9):1079-1087. (PMID: 32572223)
      J Transl Med. 2017 Apr 8;15(1):73. (PMID: 28388917)
      Sci Adv. 2020 Oct 14;6(42):. (PMID: 33055153)
      Nat Biotechnol. 2019 Aug;37(8):852-857. (PMID: 31341288)
      Mol Biol Evol. 2020 Feb 1;37(2):593-598. (PMID: 31633780)
      mBio. 2018 Jun 5;9(3):. (PMID: 29871916)
    • Contributed Indexing:
      Keywords: ML4Microbiome; biomarker identification; machine learning; microbiome; personalized medicine
    • Publication Date:
      Date Created: 20210311 Latest Revision: 20231111
    • Publication Date:
      20231111
    • Accession Number:
      PMC7937616
    • Accession Number:
      10.3389/fmicb.2021.635781
    • Accession Number:
      33692771