YRI Trio Dataset
Overview
The YRI Trio Dataset provides complete genome sequence data for three Yoruba individuals from Ibadan, Nigeria. These are the first human genomes sequenced using Illumina’s next-generation sequencing-by-synthesis technology. For each genome, the dataset contains paired 35-base reads at an average depth of more than 30x.
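To give a sense of scale, the depth figure can be turned into an approximate read count: average depth is roughly the number of reads times the read length, divided by the genome size. The Python sketch below works through that arithmetic. The genome size of 3.1 gigabases is an assumption made for illustration (the dataset description does not state it), and the function names are hypothetical.

```python
def reads_for_depth(depth, read_length, genome_size):
    """Estimate how many reads are needed to reach a target average depth.

    Uses the approximation: depth = reads * read_length / genome_size.
    """
    total_bases = depth * genome_size
    # Integer ceiling division, so the estimate never falls short of the target.
    return -(-total_bases // read_length)


def average_depth(reads, read_length, genome_size):
    """Inverse of reads_for_depth: average depth obtained from a read count."""
    return reads * read_length / genome_size


# ASSUMPTION: approximate human genome size in bases (not given in the source).
GENOME_SIZE = 3_100_000_000
READ_LENGTH = 35   # bases per read, from the dataset description
TARGET_DEPTH = 30  # the description states more than 30x, so this is a lower bound

reads = reads_for_depth(TARGET_DEPTH, READ_LENGTH, GENOME_SIZE)
pairs = -(-reads // 2)  # reads are paired, so two reads per pair

print(f"reads needed: {reads:,}")
print(f"read pairs:   {pairs:,}")
print(f"depth check:  {average_depth(reads, READ_LENGTH, GENOME_SIZE):.2f}x")
```

Under that assumption, 30x depth with 35-base reads corresponds to roughly 2.7 billion reads per genome, or about 1.3 billion read pairs.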
This dataset can be used for the following applications:
- Developing alignment algorithms
- Developing de novo assembly algorithms
- Developing algorithms that identify genetic regions of interest, sequence motifs, structural variants, copy number variations, and site-specific polymorphisms
- Testing the viability of annotation engines that start with raw sequence data
Stats
| Field | Value |
|---|---|
| Added by | fnordquist |
| Link | http://developer.amazonwebservices.com/connect/entr[ ... ]=2899&categoryID=279 |
| Created | almost 2 years ago |
| Updated | about 1 year ago |
