Please use this identifier to cite or link to this item: http://dspace.azti.es/handle/24689/975
Files in This Item:
There are no files associated with this item.
Full metadata record
DC FieldValueLanguage
dc.contributor.authorDiaz-Arce, Natalia
dc.contributor.authorRodriguez-Ezpeleta, Naiara
dc.date.accessioned2020-10-07T14:25:03Z-
dc.date.available2020-10-07T14:25:03Z-
dc.date.issued2019
dc.identifierISI:000469457300001
dc.identifier.issn1664-8021
dc.identifier.urihttp://dspace.azti.es/handle/24689/975-
dc.description.abstractRestriction site-associated DNA sequencing (RAD-seq) has become a powerful and widely used tool in molecular ecology studies as it allows to cost-effectively recover thousands of polymorphic sites across individuals of non-model organisms. However, its successful implementation in population genetics relies on correct data processing that would minimize potential loci-assembly biases and consequent genotyping error rates. RAD-seq data processing when no reference genome is available involves the assembly of hundreds of thousands high-throughput sequencing reads into orthologous loci, for which various key parameter values need to be selected by the researcher. Previous studies exploring the effect of these parameter values found or assumed that a larger number of recovered polymorphic loci is associated with a better assembly. Here, using three RAD-seq datasets from different species, we explore the effect of read filtering, loci assembly and polymorphic site selection on number of markers obtained and genetic differentiation inferred using the Stacks software. We find (i) that recovery of higher numbers of polymorphic loci is not necessarily associated with higher genetic differentiation, (ii) that the presence of PCR duplicates, selected loci assembly parameters and selected SNP filtering parameters affect the number of recovered polymorphic loci and degree of genetic differentiation, and (iii) that this effect is different in each dataset, meaning that defining a systematic universal protocol for RAD-seq data analysis may lead to missing relevant information about population differentiation.
dc.language.isoEnglish
dc.publisherFRONTIERS MEDIA SA
dc.subjectrestriction site-associated DNA sequencing
dc.subjectPCR clones
dc.subjectstacks parameters
dc.subjectSNP filtering
dc.subjectde novo assembly
dc.subjectTOOL SET
dc.subjectDISCOVERY
dc.titleSelecting RAD-Seq Data Analysis Parameters for Population Genetics: The More the Better?
dc.typeArticle
dc.identifier.journalFRONTIERS IN GENETICS
dc.format.volume10
dc.contributor.funderDepartment of Agriculture, Fisheries and Food of the Basque Country through the project GENPES
dc.contributor.funder``Ayudas de formacion a jovenes investigadores y tecnologos en el entorno cientifico-tecnologico y empresarial del sector agropesquero y alimentario vasco��(Basque Government)
dc.identifier.doi10.3389/fgene.2019.00533
Appears in Publication types:Artículos científicos



Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.