Название: Genotyping by Sequencing for Crop Improvement
Автор: Группа авторов
Издательство: John Wiley & Sons Limited
Жанр: Биология
isbn: 9781119745679
isbn:
Another important pipeline widely used for NGS data analysis is dDocent pipeline (www.dDocent.com) which is a simple bash wrapper to quality analysis, assemble, map, and call SNPs from almost any kind of RAD sequencing (Puritz et al. 2014). However, most of these pipelines are hard to code for a student with little bioinformatics background. Most of these pipelines vary with respect to the complexity of the genome and computational space required. Besides there are several bioinformatics tools such as BWA, Bowtie2, SAM tools, GATK, BCFtools including a set of Perl utility scripts (Kagale et al. 2016) that can be used for GBS data analysis. However, there should be knowledge of the installation and usage of these tools for proper utilization in data analysis. With the advancements in NGS approaches, GBS has become a widely used approach in plant breeding and genetics, particularly for understanding complex quantitative traits.
DArT‐seq GBS (https://www.diversityarrays.com/technology‐and‐resources/dartseq/) somehow overcomes the limitation of the missing data point. The technique is an extension of traditional DArT technology where DArT representations are sequenced on the NGS platform. The fragment sequencing enables a dramatic increase in the number of genomic fragments analyzed and an increase in the number of reported markers thus making it a cost‐effective technology than the initial DArT method.
1.5.2 Whole‐Genome Resequencing (WGR)
WGR with high coverage and depth overcomes the limitations of GBS due to missing data points and heterozygous calls. In general, WGR involves the sequencing of enough DNA fragments (>5×–20×) to cover the whole genome of an organism. Due to sequencing cost, the technique is suitable in crop plants having smaller genome sizes such as rice. In such cases, GBS can be replaced by resequencing of a larger size population at 5–6× depth. However, WGR for few samples can be done at a much higher read depth of 10–20× as in the case of the BSA‐seq approach (Nguyen et al. 2019). One of the important BSA‐seq‐based approaches is quantitative trait loci (QTL)‐seq developed by Takagi et al. (2015) in rice. Later this technique has been widely used in several crop plants. Takagi et al. (2015) developed a pipeline for analysis of the whole genome sequence of bulks and identification of causative variants. WGR has been used in several studies for identification of genome‐wide SNPs, genotyping mapping populations for construction of high‐density linkage maps and QTL mapping, linkage and genome‐wide association studies (GWASs), of reference genome improvement, and genomic selection (Poland and Rife 2012; Bhatia et al. 2013; Chung et al. 2017; Nguyen et al. 2019).
1.5.3 SNP Arrays
Along with GBS, high‐density DNA array‐based SNP chips or SNP arrays have become a widely used SNP detection platform for high multiplex genotyping. SNP arrays work by hybridization of DNA fragments with allele‐specific oligonucleotide probes (SNP probes) and fluorescence‐based detection of signals. In general, SNP arrays can be roughly categorized into two types based on SNP detection methods: (i) nonenzymatic differential hybridization including allele‐specific hybridization, (ii) enzymatic reactions including primer extension, and mini‐sequencing (Ding and Jin 2009). For making SNP arrays, the first step is the identification of genome‐wide SNPs by sequencing (preferably WGR) of a large diverse panel. The SNPs arrays may include SNPs from coding (genic) regions only and/or genome‐wide SNPs from other noncoding regions. SNPs are in silico validated with several custom tools and final filtered SNPs are identified. The oligonucleotide probes containing SNP alleles are designed and bound on a solid glass plate surface. SNP chips can be custom designed commercially from two widely used platforms: Affymetrics (www.affymetrics.com) as Axiom Affymetrics SNP Chips (Affymetrix/Thermo Fisher Axiom®) or Illumina (https://www.illumina.com/science/technology/microarray.html) as Immunia Infinium assay (Illumina Infinium®). Affymetrics SNP array relies on differential hybridization due to different melting temperatures for matched and mismatched SNPs binding to target DNA sequence. On the other hand, Illumina Infinium assay uses Illumina BeadArray technology that relies on primer extension to distinguish two SNP alleles. The Affymetrix SNP array uses 25‐mer for SNP calling while the Illumina BeadArray uses 50‐mer for target capture. In rice, a high‐resolution 44K Affymetrix array, 50K Infinium array, and 700K high‐density rice array are available for rice SNP genotyping (McCouch et al. 2010; Tung et al. 2010; Chen et al. 2013; McCouch et al. 2015). Additionally, high‐density SNP arrays have been developed for other crop plants such as maize (Ganal et al. 2011) and sunflower (Bachlava et al. 2012) as well as domestic animal species, including cattle (Gibbs et al. 2009; Matukumalli et al. 2009) and pig (Ramos et al. 2009). One major advantage of SNP arrays is the reproducibility of data points where GBS does have some shortcomings. However, the disadvantage is the less polymorphism as compared to GBS and WGR and detection of only alleles present in the array (Table 1.2).
Table 1.2 Comparison between different marker techniques commonly used in plant research.
SSR | GBS | WGR | SNP array | KASP™ | |
---|---|---|---|---|---|
DNA quality | Moderate | High | High | High | High |
PCR‐based | Yes | Yes | No | No | No |
Allele detection | High | High | High | Low | Low |
Polymorphism | High | High | High | Low | Low |
Ease to use | Easy | Not easy | Not easy | Easy | Easy |
Reproducibility | High | Low | High | High | High |
Cost | Moderate | Low to moderate | High | High | moderate |
Cost for analysis | High | High |
СКАЧАТЬ
|