International Journal on Advanced Science, Engineering and Information Technology, Vol. 11 (2021) No. 2, pages: 820-827, DOI:10.18517/ijaseit.11.2.12535

Leveraging the 3000 Rice Genome Data for Computational Design of Polymorphic Markers in a Local Rice Variety Lacking Sequence Data

Dani Satyawan, Ahmad Warsun, Ahmad Dadang, Muhamad Yunus


DNA markers can detect DNA sequence variations in the genome, and they are useful for genetic studies, DNA fingerprinting, and genotype-based selection in breeding programs. Rice, as one of the model plants for genetic and genomic studies, has abundant DNA markers stored in various online databases. Selecting markers in rice is not limited by marker availability but rather by their polymorphism in the target population. We developed a computational method to screen millions of single nucleotide polymorphism (SNP) markers listed in IRRI 3000 rice genome database in order to find a subset of markers that are polymorphic in an F2 mapping population created from a cross between a parental line with a known genome sequence and a local Indonesian variety with no genome sequence data. The parental lines were genotyped using an affordable medium-density SNP array. The genotype data was cross-referenced with the rice genome database to perform phylogenetic analysis and identify accessions clusters with the highest genetic similarities to each parental line. The cluster data was then used to identify monomorphic SNP candidates within the cluster but exhibit consistent polymorphism between the two clusters. Using this method, we obtained a SNP marker set for a segment in rice chromosome 8 with 76.19% polymorphism rate, which is much higher than the expected 1.06% polymorphism rate if the SNP markers were chosen randomly. The improved polymorphism rate was also observed when the method was applied to other random chromosome segments and randomly chosen parental candidates.


Rice; SNP marker; computational marker design; genome sequence utilization.

Viewed: 77 times (since Sept 4, 2017)

cite this paper     download