Volume 18, Issue 3
RESOURCE ARTICLE

dartr: An r package to facilitate analysis of SNP data generated from reduced representation genome sequencing

Bernd Gruber

Corresponding Author

E-mail address: bernd.gruber@canberra.edu.au

Institute for Applied Ecology, University of Canberra, Canberra, ACT, Australia

Correspondence

Bernd Gruber, Institute for Applied Ecology, University of Canberra, Canberra, ACT, Australia.

Email: bernd.gruber@canberra.edu.au

Search for more papers by this author
Peter J. Unmack

Institute for Applied Ecology, University of Canberra, Canberra, ACT, Australia

Search for more papers by this author
Oliver F. Berry

CSIRO Environomics Future Science Platform, Indian Ocean Marine Research Centre, The University of Western Australia (M097), Crawley, WA, Australia

Search for more papers by this author
Arthur Georges

Institute for Applied Ecology, University of Canberra, Canberra, ACT, Australia

Search for more papers by this author
First published: 20 December 2017
Citations: 63

Abstract

Although vast technological advances have been made and genetic software packages are growing in number, it is not a trivial task to analyse SNP data. We announce a new r package, dartr, enabling the analysis of single nucleotide polymorphism data for population genomic and phylogenomic applications. dartr provides user‐friendly functions for data quality control and marker selection, and permits rigorous evaluations of conformation to Hardy–Weinberg equilibrium, gametic‐phase disequilibrium and neutrality. The package reports standard descriptive statistics, permits exploration of patterns in the data through principal components analysis and conducts standard F‐statistics, as well as basic phylogenetic analyses, population assignment, isolation by distance and exports data to a variety of commonly used downstream applications (e.g., newhybrids, faststructure and phylogeny applications) outside of the r environment. The package serves two main purposes: first, a user‐friendly approach to lower the hurdle to analyse such data—therefore, the package comes with a detailed tutorial targeted to the r beginner to allow data analysis without requiring deep knowledge of r. Second, we use a single, well‐established format—genlight from the adegenet package—as input for all our functions to avoid data reformatting. By strictly using the genlight format, we hope to facilitate this format as the de facto standard of future software developments and hence reduce the format jungle of genetic data sets. The dartr package is available via the r CRAN network and GitHub.

Number of times cited according to CrossRef: 63

  • Fine-scale species distribution modelling and genotyping by sequencing to examine hybridisation between two narrow endemic plant species, Scientific Reports, 10.1038/s41598-020-58525-2, 10, 1, (2020).
  • Assessing the benefits and risks of translocations in depauperate species: A theoretical framework with an empirical validation, Journal of Applied Ecology, 10.1111/1365-2664.13581, 57, 4, (831-841), (2020).
  • A new strain group of common carp: The genetic differences and admixture events between Cyprinus carpio breeds, Ecology and Evolution, 10.1002/ece3.6286, 10, 12, (5431-5439), (2020).
  • Robbery in progress: Historical museum collections bring to light a mitochondrial capture within a bird species widespread across southern Australia, the Copperback Quail‐thrush Cinclosoma clarum, Ecology and Evolution, 10.1002/ece3.6403, 10, 13, (6785-6793), (2020).
  • Historical demography and climate driven distributional changes in a widespread Neotropical freshwater species with high economic importance, Ecography, 10.1111/ecog.04874, 43, 9, (1291-1304), (2020).
  • Contrasting patterns of local adaptation along climatic gradients between a sympatric parasitic and autotrophic tree species, Molecular Ecology, 10.1111/mec.15537, 29, 16, (3022-3037), (2020).
  • Fine-scale population structure and evidence for local adaptation in Australian giant black tiger shrimp (Penaeus monodon) using SNP analysis, BMC Genomics, 10.1186/s12864-020-07084-x, 21, 1, (2020).
  • Epstein-Barr Virus Genomes Reveal Population Structure and Type 1 Association with Endemic Burkitt Lymphoma, Journal of Virology, 10.1128/JVI.02007-19, 94, 17, (2020).
  • Accounting for cryptic population substructure enhances detection of inbreeding depression with genomic inbreeding coefficients: an example from a critically endangered marsupial, Molecular Ecology, 10.1111/mec.15540, 29, 16, (2978-2993), (2020).
  • Oceanic and coastal populations of a harvested macroinvertebrate Rochia nilotica in north-western Australia are isolated and may be locally adapted, Marine and Freshwater Research, 10.1071/MF19172, 71, 7, (782), (2020).
  • Genome‐wide SNP analysis reveals an increase in adaptive genetic variation through selective breeding of coral, Molecular Ecology, 10.1111/mec.15482, 29, 12, (2176-2188), (2020).
  • Genomic data and morphological re‐assessment reveals synonymy and hybridisation among Seringia taxa (Lasiopetaleae, Malvaceae) in remote north‐western Australia, TAXON, 10.1002/tax.12233, 69, 2, (307-320), (2020).
  • Radiation of tropical island bees and the role of phylogenetic niche conservatism as an important driver of biodiversity, Proceedings of the Royal Society B: Biological Sciences, 10.1098/rspb.2020.0045, 287, 1925, (20200045), (2020).
  • Unraveling hierarchical genetic structure in a marine metapopulation: A comparison of three high‐throughput genotyping approaches, Molecular Ecology, 10.1111/mec.15405, 29, 12, (2189-2203), (2020).
  • Springtail phylogeography highlights biosecurity risks of repeated invasions and intraregional transfers among remote islands, Evolutionary Applications, 10.1111/eva.12913, 13, 5, (960-973), (2020).
  • Genomic analysis of Spanish wheat landraces reveals their variability and potential for breeding, BMC Genomics, 10.1186/s12864-020-6536-x, 21, 1, (2020).
  • Inbreeding and disease avoidance in a free‐ranging koala population, Molecular Ecology, 10.1111/mec.15488, 29, 13, (2416-2430), (2020).
  • A perfect storm of genetic drift and divergence may prevent the rebuilding of the gemfish (Rexea solandri) stock on the east Australian coast, Fisheries Research, 10.1016/j.fishres.2020.105645, 230, (105645), (2020).
  • Detecting fine-scale population structure in the age of genomics: a case study of lake sturgeon in the Great Lakes, Fisheries Research, 10.1016/j.fishres.2020.105646, 230, (105646), (2020).
  • Comparative assessment of range‐wide patterns of genetic diversity and structure with SNPs and microsatellites: A case study with Iberian amphibians, Ecology and Evolution, 10.1002/ece3.6670, 10, 19, (10353-10363), (2020).
  • Phylogeography of the Cran’s bully Gobiomorphus basalis (Gobiiformes: Eleotridae) and an analysis of species boundaries within the New Zealand radiation of Gobiomorphus, Biological Journal of the Linnean Society, 10.1093/biolinnean/blaa052, (2020).
  • Genome-wide SNPs detect no evidence of genetic population structure for reef manta rays (Mobula alfredi) in southern Mozambique, Heredity, 10.1038/s41437-020-00373-x, (2020).
  • Cryptic Patterns of Speciation in Cryptic Primates: Microendemic Mouse Lemurs and the Multispecies Coalescent, Systematic Biology, 10.1093/sysbio/syaa053, (2020).
  • Population Genomic Analyses of Wild and Farmed Striped Catfish Pangasianodon Hypophthalmus in the Lower Mekong River, Journal of Marine Science and Engineering, 10.3390/jmse8060471, 8, 6, (471), (2020).
  • Shannon diversity index: a call to replace the original Shannon’s formula with unbiased estimator in the population genetics studies, PeerJ, 10.7717/peerj.9391, 8, (e9391), (2020).
  • Genetic Basis of Maize Resistance to Multiple Insect Pests: Integrated Genome-Wide Comparative Mapping and Candidate Gene Prioritization, Genes, 10.3390/genes11060689, 11, 6, (689), (2020).
  • Morphological and molecular evidence of population divergence in a widespread shorebird across its southern mainland Australian distribution, Conservation Genetics, 10.1007/s10592-020-01286-2, (2020).
  • Combining genotype, phenotype, and environmental data to delineate site‐adjusted provenance strategies for ecological restoration, Molecular Ecology Resources, 10.1111/1755-0998.13191, 0, 0, (2020).
  • Islands within islands: genetic structuring at small spatial scales has implications for long‐term persistence of a threatened species, Animal Conservation, 10.1111/acv.12603, 0, 0, (2020).
  • Subtle genetic clustering among South Australian colonies of little penguins (Eudyptula minor), Conservation Genetics, 10.1007/s10592-020-01284-4, (2020).
  • Multiple Lines of Evidence Indicate Limited Natural Recruitment of Golden Perch (Macquaria ambigua) in the Highly Regulated Lachlan River, Water, 10.3390/w12061636, 12, 6, (1636), (2020).
  • Patterns of hybridization and cryptic introgression among one- and four-needled pinyon pines, Annals of Botany, 10.1093/aob/mcaa045, (2020).
  • Phylogeography of the iconic Australian red-tailed black-cockatoo (Calyptorhynchus banksii) and implications for its conservation, Heredity, 10.1038/s41437-020-0315-y, (2020).
  • Investigating inbreeding in a free-ranging, captive population of an Australian marsupial, Conservation Genetics, 10.1007/s10592-020-01278-2, (2020).
  • Genetic diversity of coffee (Coffea spp.) in Mexico evaluated by using DArTseq and SNP markers, Genetic Resources and Crop Evolution, 10.1007/s10722-020-00940-5, (2020).
  • Crypsis and convergence: integrative taxonomic revision of the Gehyra australis group (Squamata: Gekkonidae) from northern Australia , PeerJ, 10.7717/peerj.7971, 8, (e7971), (2020).
  • Ultraconserved yet informative for species delimitation: Ultraconserved elements resolve long‐standing systematic enigma in Central European bees, Molecular Ecology, 10.1111/mec.15629, 0, 0, (2020).
  • DArTSeq SNP-based markers revealed high genetic diversity and structured population in Ethiopian cowpea [Vigna unguiculata (L.) Walp] germplasms, PLOS ONE, 10.1371/journal.pone.0239122, 15, 10, (e0239122), (2020).
  • Genome‐wide SNPs resolve spatiotemporal patterns of connectivity within striped marlin (Kajikia audax), a broadly distributed and highly migratory pelagic species, Evolutionary Applications, 10.1111/eva.12892, 13, 4, (677-698), (2019).
  • Genomic Characterization of Coho Salmon Spawning Populations from the Hood Canal, Transactions of the American Fisheries Society, 10.1002/tafs.10206, 149, 1, (3-13), (2019).
  • Weak genetic structure despite strong genomic signal in lesser sandeel in the North Sea, Evolutionary Applications, 10.1111/eva.12875, 13, 2, (376-387), (2019).
  • Taxonomy and conservation of grassland earless dragons: new species and an assessment of the first possible extinction of a reptile on mainland Australia, Royal Society Open Science, 10.1098/rsos.190233, 6, 5, (190233), (2019).
  • Boomeranging around Australia: Historical biogeography and population genomics of the anti‐equatorial fish Microcanthus strigatus (Teleostei: Microcanthidae), Molecular Ecology, 10.1111/mec.15172, 28, 16, (3771-3785), (2019).
  • Accounting for kin sampling reveals genetic connectivity in Tasmanian and New Zealand school sharks, Galeorhinus galeus, Ecology and Evolution, 10.1002/ece3.5012, 9, 8, (4465-4472), (2019).
  • A genome-wide approach for uncovering evolutionary relationships of Australian Bactrocera species complexes (Diptera: Tephritidae), Invertebrate Systematics, 10.1071/IS18065, (2019).
  • Lifetime Fitness Costs of Inbreeding and Being Inbred in a Critically Endangered Bird, Current Biology, 10.1016/j.cub.2019.06.064, (2019).
  • Novel multimarker comparisons address the genetic population structure of silvertip sharks (Carcharhinus albimarginatus), Marine and Freshwater Research, 10.1071/MF18296, 70, 7, (1007), (2019).
  • From reference genomes to population genomics: comparing three reference-aligned reduced-representation sequencing pipelines in two wildlife species, BMC Genomics, 10.1186/s12864-019-5806-y, 20, 1, (2019).
  • Phylogeography and species delimitation of Cherax destructor (Decapoda: Parastacidae) using genome-wide SNPs, Marine and Freshwater Research, 10.1071/MF18347, 70, 6, (857), (2019).
  • Perspectives on the clonal persistence of presumed ‘ghost’ genomes in unisexual or allopolyploid taxa arising via hybridization, Scientific Reports, 10.1038/s41598-019-40865-3, 9, 1, (2019).
  • Tandem host‐parasite dispersal inferred from similarities in phylogeographical patterns among Little Penguins and their ‘terrestrial’ ectoparasites, Journal of Biogeography, 10.1111/jbi.13714, 46, 11, (2520-2531), (2019).
  • Restriction-site associated DNA sequencing supports a sister group relationship of Nigritella and Gymnadenia (Orchidaceae), Molecular Phylogenetics and Evolution, 10.1016/j.ympev.2019.03.018, (2019).
  • Increased Adaptive Variation Despite Reduced Overall Genetic Diversity in a Rapidly Adapting Invader, Frontiers in Genetics, 10.3389/fgene.2019.01221, 10, (2019).
  • An Integrative Approach Using Phylogenomics and High-Resolution X-Ray Computed Tomography for Species Delimitation in Cryptic Taxa, Systematic Biology, 10.1093/sysbio/syz048, (2019).
  • Receding ice drove parallel expansions in Southern Ocean penguins, Proceedings of the National Academy of Sciences, 10.1073/pnas.1904048116, (201904048), (2019).
  • Deciphering the Evolutionary History of Arowana Fishes (Teleostei, Osteoglossiformes, Osteoglossidae): Insight from Comparative Cytogenomics, International Journal of Molecular Sciences, 10.3390/ijms20174296, 20, 17, (4296), (2019).
  • Genetic diversity mirrors trophic ecology in coral reef fish feeding guilds, Molecular Ecology, 10.1111/mec.14936, 27, 24, (5004-5018), (2018).
  • Genomewide SNP markers breathe new life into phylogeography and species delimitation for the problematic short‐necked turtles (Chelidae: Emydura) of eastern Australia, Molecular Ecology, 10.1111/mec.14925, 27, 24, (5195-5213), (2018).
  • Population Genomics: Advancing Understanding of Nature, , 10.1007/13836_2018_60, (2018).
  • Genome-Wide Assessment of Diversity and Divergence Among Extant Galapagos Giant Tortoise Species, Journal of Heredity, 10.1093/jhered/esy031, 109, 6, (611-619), (2018).
  • Assault from all sides: hybridization and introgression threaten the already critically endangered Myuchelys georgesi (Chelonia: Chelidae), Endangered Species Research, 10.3354/esr00928, 37, (239-247), (2018).
  • The State of “Omics” Research for Farmed Penaeids: Advances in Research and Impediments to Industry Utilization, Frontiers in Genetics, 10.3389/fgene.2018.00282, 9, (2018).
  • Use of High Density Single Nucleotide Polymorphism (SNP) Arrays to Assess Genetic Diversity and Population Structure of Dairy Cattle in Smallholder Dairy Systems: The Case of Girinka Programme in Rwanda, Frontiers in Genetics, 10.3389/fgene.2018.00438, 9, (2018).

The full text of this article hosted at iucr.org is unavailable due to technical difficulties.