The Leibniz Institute for the Analysis of Biodiversity Change (Leibniz-Institut zur Analyse des Biodiversitätswandels) is a research museum of the Leibniz Association

MitoGeneExtractor

Authors:
Brasseur, M. V., Astrin, J. J., Geiger, M. F., & Mayer, C.
Year of publication:
2023
Full title:
MitoGeneExtractor: Efficient extraction of mitochondrial genes from next-generation sequencing libraries
Published in:
Methods in Ecology and Evolution
Publication type:
Journal article
DOI name:
doi.org/10.1111/2041-210X.14075
Keywords: 
COI, data mining, data re-use, DNA barcoding, mitochondrial genes, ND5
Bibliographic details:
Brasseur, M. V., Astrin, J. J., Geiger, M. F., & Mayer, C. (2023). MitoGeneExtractor: Efficient extraction of mitochondrial genes from next-generation sequencing libraries. Methods in Ecology and Evolution, 00, 1–8. https://doi.org/10.1111/2041-210X.14075
Abstract: 

1. Mitochondrial DNA (mtDNA) sequences are often found as byproducts in next-generation sequencing (NGS) datasets that were originally created to capture genomic or transcriptomic information of an organism. These mtDNA sequences are often discarded, wasting this valuable sequencing information.
2. We developed MitoGeneExtractor, an innovative tool that extracts mitochondrial protein-coding genes (PCGs) of interest from NGS libraries through multiple sequence alignments of sequencing reads to amino acid references. General references, for example at the order level, are sufficient for mining mitochondrial PCGs. In a case study, we applied MitoGeneExtractor to recently published genomic datasets of 1993 birds and were able to extract complete or nearly complete sequences for all 13 mitochondrial PCGs for a large proportion of libraries. Compared to an existing assembly-guided sequence reconstruction algorithm, MitoGeneExtractor was faster and substantially more sensitive.
3. We compared COI sequences mined with MitoGeneExtractor to COI databases. Mined sequences show a high sequence similarity and correct taxonomic assignment between the recovered sequence and the assigned morphospecies in most samples. In some cases of incongruent taxonomic assignments, we found evidence for contamination in NGS libraries. 
4. MitoGeneExtractor allows fast extraction of mitochondrial PCGs from a wide range of NGS datasets. We recommend routinely harvesting and curating mitochondrial sequence information from genomic resources. MitoGeneExtractor output can be used to identify contaminated NGS libraries and to validate the species identity of the sequenced animal based on the extracted COI sequences.

Contact person

Head of Section
Deputy Data Protection Officer
+49 228 9122-403
c.mayer.zfmk [at] uni-bonn.de