Monitoring the ecological status of rivers with diatom eDNA metabarcoding: a comparison of taxonomic markers and analytical approaches for the inference of a molecular diatom index.

  • publication
  • 27-09-2020

Apothéloz-Perret-Gentil L, Bouchez A, Cordier T, Cordonier A, Guéguen J, Rimet F, Vasselon V, Pawlowski J. Mol. Ecol. 2020 Sep;():. 10.1111/mec.15646.

Recently, several studies demonstrated the usefulness of diatom eDNA metabarcoding as an alternative to assess the ecological quality of rivers and streams. However, the choice of the taxonomic marker as well as the methodology for data analysis differ between these studies, hampering the comparison of their results and effectiveness. The aim of this study was to compare two taxonomic markers commonly used in diatom metabarcoding and three distinct analytical approaches to infer a molecular diatom index. We used the values of classical morphological diatom index as a benchmark for this comparison. We amplified and sequenced both a fragment of the rbcL gene and the V4 region of the 18S rRNA gene for 112 epilithic samples from Swiss and French rivers. We inferred index values using three analytical approaches: by computing it directly from taxonomically assigned sequences, by calibrating de novo the ecovalues of all metabarcodes, and by using a supervised machine learning algorithm to train predictive models. In general, the values of index obtained using the two "taxonomy-free" approaches, encompassing molecular assignment and machine learning, were closer correlated to the values of the morphological index than the values based on taxonomically assigned sequences. The correlations of the three analytical approaches were higher in the case of rbcL compared to the 18S marker, highlighting the importance of the reference database which is more complete for the rbcL marker. Our study confirms the effectiveness of diatom metabarcoding as an operational tool for rivers ecological quality assessment and shows that the analytical approaches by-passing the taxonomic assignments are particularly efficient when reference databases are incomplete.

see on Pubmed