Two different high throughput sequencing approaches identify thousands of de novo genomic markers for the genetically depleted Bornean elephant.

Reeta Sharma, Benoit Goossens, Célia Kun-Rodrigues, Tatiana Teixeira, Nurzhafarina Othman, Jason Q. Boone, Nathaniel K. Jue, Craig Obergfell, Rachel J. O'Neill, Lounès Chikhi

Research output: Contribution to journal, Article, peer-review


High throughput sequencing technologies are being applied to an increasing number of model species with a high-quality reference genome. The application and analysis of whole-genome sequence data in non-model species with no prior genomic information are currently under way. Recent sequencing technologies provide new opportunities for gathering genomic data in natural populations, laying the empirical foundation for future research in conservation and population genomics. Here we present the case study of the Bornean elephant, which is the most endangered subspecies of Asian elephant and exhibits very low genetic diversity. We used two different sequencing platforms, the Roche 454 FLX (shotgun) and the Illumina GAIIx (restriction site associated DNA, RAD), to evaluate the feasibility of the two methodologies for the discovery of de novo markers (single nucleotide polymorphisms, SNPs, and microsatellites) using low coverage data. Approximately 6,683 (shotgun) and 14,724 (RAD) SNPs were detected within our elephant sequence dataset. Genotyping of a representative sample of 194 SNPs resulted in a SNP validation rate of ∼83 to 94%, and 17% of the loci were polymorphic, with low diversity (Ho = 0.057). Different numbers of microsatellites were identified through the shotgun (27,226) and RAD (868) techniques. Out of all di-, tri-, and tetra-nucleotide microsatellite loci, 1,706 had sufficient flanking regions (shotgun), while only 7 were found with RAD. All microsatellites were monomorphic in the Bornean elephant but polymorphic in another elephant subspecies. Despite the different sample sizes and the well-known differences between the two platforms in sequence length and throughput, both approaches showed a high validation rate.
The approaches used here for marker development in a threatened species demonstrate the utility of high throughput sequencing technologies as a starting point for the development of genomic tools in a non-model species and in particular for a species with low genetic diversity.
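
The two summary statistics quoted in the abstract, the share of polymorphic loci and the observed heterozygosity (Ho), can be illustrated with a minimal Python sketch. This is not the authors' pipeline: the function names, the toy genotypes, and the choice to average Ho over all genotyped loci (rather than only the polymorphic ones) are assumptions made for illustration, and missing genotypes are not handled.

```python
from typing import Dict, List, Tuple

# A diploid genotype is modelled as a pair of allele labels (assumption).
Genotype = Tuple[str, str]


def observed_heterozygosity(genotypes: List[Genotype]) -> float:
    """Fraction of individuals carrying two different alleles at one locus."""
    if not genotypes:
        raise ValueError("no genotypes at this locus")
    heterozygotes = sum(1 for a, b in genotypes if a != b)
    return heterozygotes / len(genotypes)


def is_polymorphic(genotypes: List[Genotype]) -> bool:
    """A locus is polymorphic if more than one allele is seen in the sample."""
    alleles = {allele for genotype in genotypes for allele in genotype}
    return len(alleles) > 1


def summarize(loci: Dict[str, List[Genotype]]) -> Tuple[float, float]:
    """Return (fraction of polymorphic loci, mean Ho across all loci)."""
    if not loci:
        raise ValueError("no loci supplied")
    n_loci = len(loci)
    n_polymorphic = sum(is_polymorphic(g) for g in loci.values())
    mean_ho = sum(observed_heterozygosity(g) for g in loci.values()) / n_loci
    return n_polymorphic / n_loci, mean_ho


# Hypothetical toy data: four SNP loci typed in four individuals.
toy_loci = {
    "snp1": [("A", "A"), ("A", "G"), ("A", "A"), ("A", "A")],
    "snp2": [("C", "C")] * 4,
    "snp3": [("T", "T")] * 4,
    "snp4": [("G", "G")] * 4,
}

frac_polymorphic, mean_ho = summarize(toy_loci)
print(f"polymorphic loci: {frac_polymorphic:.2%}, mean Ho: {mean_ho:.4f}")
```

In this toy sample only one of the four loci carries a second allele, and a single heterozygous individual at that locus sets the mean Ho. That mirrors how a small share of polymorphic loci leads to a low overall Ho.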
Original language: American English
State: Published - Nov 21 2012
Externally published: Yes


  • Genetics
  • Biology
  • Bioinformatics
  • Genomics
