TY - JOUR
T1 - 2b or not 2b? 2bRAD is an effective alternative to ddRAD for phylogenomics
AU - Chambers, E. Anne
AU - Tarvin, Rebecca D.
AU - Santos, Juan C.
AU - Ron, Santiago R.
AU - Betancourth-Cundar, Mileidy
AU - Hillis, David M.
AU - Matz, Mikhail V.
AU - Cannatella, David C.
N1 - Publisher Copyright:
© 2023 The Authors. Ecology and Evolution published by John Wiley & Sons Ltd.
PY - 2023/3/8
Y1 - 2023/3/8
N2 - Restriction-site-associated DNA sequencing (RADseq) has become an accessible way to obtain genome-wide data in the form of single-nucleotide polymorphisms (SNPs) for phylogenetic inference. Nonetheless, how differences in RADseq methods influence phylogenetic estimation is poorly understood because most comparisons have largely relied on conceptual predictions rather than empirical tests. We examine how differences in ddRAD and 2bRAD data influence phylogenetic estimation in two non-model frog groups. We compare the impact of method choice on phylogenetic information, missing data, and allelic dropout, considering different sequencing depths. Given that researchers must balance input (funding, time) with output (amount and quality of data), we also provide comparisons of laboratory effort, computational time, monetary costs, and the repeatability of library preparation and sequencing. Both 2bRAD and ddRAD methods estimated well-supported trees, even at low sequencing depths, and had comparable amounts of missing data, patterns of allelic dropout, and phylogenetic signal. Compared to ddRAD, 2bRAD produced more repeatable datasets, had simpler laboratory protocols, and had an overall faster bioinformatics assembly. However, many fewer parsimony-informative sites per SNP were obtained from 2bRAD data when using native pipelines, highlighting a need for further investigation into the effects of each pipeline on resulting datasets. Our study underscores the importance of comparing RADseq methods, such as expected results and theoretical performance using empirical datasets, before undertaking costly experiments.
AB - Restriction-site-associated DNA sequencing (RADseq) has become an accessible way to obtain genome-wide data in the form of single-nucleotide polymorphisms (SNPs) for phylogenetic inference. Nonetheless, how differences in RADseq methods influence phylogenetic estimation is poorly understood because most comparisons have largely relied on conceptual predictions rather than empirical tests. We examine how differences in ddRAD and 2bRAD data influence phylogenetic estimation in two non-model frog groups. We compare the impact of method choice on phylogenetic information, missing data, and allelic dropout, considering different sequencing depths. Given that researchers must balance input (funding, time) with output (amount and quality of data), we also provide comparisons of laboratory effort, computational time, monetary costs, and the repeatability of library preparation and sequencing. Both 2bRAD and ddRAD methods estimated well-supported trees, even at low sequencing depths, and had comparable amounts of missing data, patterns of allelic dropout, and phylogenetic signal. Compared to ddRAD, 2bRAD produced more repeatable datasets, had simpler laboratory protocols, and had an overall faster bioinformatics assembly. However, many fewer parsimony-informative sites per SNP were obtained from 2bRAD data when using native pipelines, highlighting a need for further investigation into the effects of each pipeline on resulting datasets. Our study underscores the importance of comparing RADseq methods, such as expected results and theoretical performance using empirical datasets, before undertaking costly experiments.
KW - Dendrobatidae
KW - Ranidae
KW - missing data
KW - phylogenetic signal
KW - restriction-site-associated DNA sequencing
UR - http://www.scopus.com/inward/record.url?scp=85152645465&partnerID=8YFLogxK
U2 - 10.1002/ece3.9842
DO - 10.1002/ece3.9842
M3 - Article
AN - SCOPUS:85152645465
SN - 2045-7758
VL - 13
JO - Ecology and Evolution
JF - Ecology and Evolution
IS - 3
M1 - e9842
ER -