-
Offsite
—
BioCyc curates and maintains several databases: > BioCyc is a collection of 371 Pathway/Genome Databases. Each Pathway/Genome Database in the BioCyc collection describes the genome and metabolic pathways of a single organism, with the exception of the MetaCyc database, which is a reference source on metabolic pathways from many organisms. These include ...
-
Offsite
—
The International HapMap Project is a partnership of scientists and funding agencies from Canada, China, Japan, Nigeria, the United Kingdom and the United States to develop a public resource that will help researchers find genes associated with human disease and response to pharmaceuticals. Datasets From ...
-
Offsite
—
AceDB is a genome database system developed since 1989 primarily by Jean Thierry-Mieg (CNRS, Montpellier) and Richard Durbin (Sanger Institute). It provides a custom database kernel, with a non-standard data model designed specifically for handling scientific data flexibly, and a graphical user interface with many specific displays and tools for genomic data. AceDB is ...
-
Offsite
—
This data set contains the raw export files of the first genome sequenced by the Illumina Individual Genome Service, using Illumina’s Genome Analyzer technology with paired 75-base reads. 92,254,659,274 bases were used to generate a consensus sequence with an average coverage depth of 32x. The DNA was obtained from the peripheral blood of Jay Flatley, CEO of Illumina.
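As a rough sanity check on the figures quoted above, the usual relation "average depth = total sequenced bases / reference length" can be inverted to see what genome length the two numbers imply. The sketch below assumes that relation applies here and that the 32x figure is a rounded average; the function name `implied_genome_length` is illustrative and not part of any Illumina tooling.

```python
def implied_genome_length(total_bases: int, mean_depth: float) -> float:
    """Return the reference length implied by depth = total_bases / length.

    Assumption: coverage is the simple ratio of sequenced bases to
    reference length, with no correction for unmapped or filtered reads.
    """
    if mean_depth <= 0:
        raise ValueError("mean_depth must be positive")
    return total_bases / mean_depth


# Figures taken from the dataset description.
TOTAL_BASES = 92_254_659_274
MEAN_DEPTH = 32

length = implied_genome_length(TOTAL_BASES, MEAN_DEPTH)
print(f"{length / 1e9:.2f} Gb")  # prints "2.88 Gb"
```

The result is in the range expected for a human genome, which suggests the base count and the stated depth are mutually consistent under that simple model.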
-
Offsite
—
The YRI Trio Dataset provides complete genome sequence data for three Yoruba individuals from Ibadan, Nigeria, which represent the first human genomes sequenced using Illumina’s next-generation Sequence-by-Synthesis technology. For each genome, the dataset contains >30x average depth of paired 35-base reads. This dataset can be used for the following applications: The ...
-
Offsite
—
“The Allen Brain Atlas that shows the expression pattern of almost every gene in the mouse brain, detailed in a huge series of microscopic images. This resource, which is available to everyone on the Internet, is a wonderful tool for brain researchers” (David Linden). The Allen Mouse Brain Atlas is an interactive, genome-wide image database of gene expression. Find ISH ...
-
Offsite
—
The 1000 Genomes data is an open dataset from the biological research community containing genetic sequencing data. The complete dataset is roughly 150 TB uncompressed.