A novel workflow to improve genotyping of multigene families in wildlife species: an experimental set‐up with a known model system
peer-reviewed
Erstveröffentlichung
2020-11-21Authors
Gillingham, Mark A. F.
Montero, B. Karina
Wihelm, Kerstin
Grudzus, Kara
Sommer, Simone
Wissenschaftlicher Artikel
Published in
Molecular Ecology Resources ; 21 (2020), 3. - S. 982-998. - ISSN 1755-098X. - eISSN 1755-0998
Link to original publication
https://dx.doi.org/10.1111/1755-0998.13290Faculties
Fakultät für NaturwissenschaftenInstitutions
Institut für Evolutionsökologie und NaturschutzgenomikDocument version
published version (publisher's PDF)Abstract
Abstract
Genotyping complex multigene families in novel systems is particularly challenging. Target primers frequently amplify simultaneously multiple loci leading to high PCR and sequencing artefacts such as chimeras and allele amplification bias. Most genotyping pipelines have been validated in nonmodel systems whereby the real genotype is unknown and the generation of artefacts may be highly repeatable. Further hindering accurate genotyping, the relationship between artefacts and genotype complexity (i.e. number of alleles per genotype) within a PCR remains poorly described. Here, we investigated the latter by experimentally combining multiple known major histocompatibility complex (MHC) haplotypes of a model organism (chicken, Gallus gallus, 43 artificial genotypes with 2–13 alleles per amplicon). In addition to well‐defined ‘optimal’ primers, we simulated a nonmodel species situation by designing ‘cross‐species’ primers based on sequence data from closely related Galliform species. We applied a novel open‐source genotyping pipeline (ACACIA; https://gitlab.com/psc_santos/ACACIA), and compared its performance with another, previously published pipeline (AmpliSAS). Allele calling accuracy was higher when using ACACIA (98.5% versus 97% and 77.8% versus 75% for the ‘optimal’ and ‘cross‐species’ data sets, respectively). Systematic allele dropout of three alleles owing to primer mismatch in the ‘cross‐species’ data set explained high allele calling repeatability (100% when using ACACIA) despite low accuracy, demonstrating that repeatability can be misleading when evaluating genotyping workflows. Genotype complexity was positively associated with nonchimeric artefacts, chimeric artefacts (nonlinearly by levelling when amplifying more than 4–6 alleles) and allele amplification bias. Our study exemplifies and demonstrates pitfalls researchers should avoid to reliably genotype complex multigene families.
Is supplemented by
https://figshare.com/projects/ACACIA/66485https://gitlab.com/psc_santos/ACACIA
https://gitlab.com/psc_santos/ACACIA/-/tags/V1.0
https://onlinelibrary.wiley.com/action/downloadSupplement?doi=10.1111%2F1755-0998.13290&file=men13290-sup-0001-supinfo.docx
Subject headings
[GND]: MHC[LCSH]: Acàcia
[Free subject headings]: allele dropout | amplicon genotyping | high‐throughput sequencing | multigene family | open‐source genotyping pipeline | PCR amplification bias | sequencing bias
[DDC subject group]: DDC 570 / Life sciences
Metadata
Show full item recordDOI & citation
Please use this identifier to cite or link to this item: http://dx.doi.org/10.18725/OPARU-46141
Gillingham, Mark A. F. et al. (2022): A novel workflow to improve genotyping of multigene families in wildlife species: an experimental set‐up with a known model system. Open Access Repositorium der Universität Ulm und Technischen Hochschule Ulm. http://dx.doi.org/10.18725/OPARU-46141
Citation formatter >