Tag

genetics

14 datasets
  • GenBank - NIH genetic sequence database

    Offsite — Description From the main page: > GenBank® is the NIH genetic sequence database, an annotated collection of all publicly available DNA sequences (Nucleic Acids Research, 2008 Jan;36(Database issue):D25-30). There are approximately 85,759,586,764 bases in 82,853,685 sequence records in the traditional GenBank divisions and 108,635,736,141 bases in 27,439,206 sequence ...
  • BioCyc

    Offsite — Description Biocyc curate and maintain several databases: > BioCyc is a collection of 371 Pathway/Genome Databases. Each Pathway/Genome Database in the BioCyc collection describes the genome and metabolic pathways of a single organism, with the exception of the MetaCyc database, which is a reference source on metabolic pathways from many organisms. These include ...
  • HapMap

    Offsite — Description The International HapMap Project is a partnership of scientists and funding agencies from Canada, China, Japan, Nigeria, the United Kingdom and the United States to develop a public resource that will help researchers find genes associated with human disease and response to pharmaceuticals. Datasets From ...
  • WikiPathways

    Offsite — About From front page: > In the new tradition of Wikipedia, WikiPathways is an open, public platform dedicated to the curation of biological pathways by and for the scientific community. Openness In December 2008 WikiPathways [switched](http://scienceblogs.com/commonknowledge/2008/12/getting_it_right_wikipathways_1.php) to CC-BY, which is compliant with the ...
  • The Arabidopsis Information Resource

    Offsite — Description > The Arabidopsis Information Resource (TAIR) maintains a database of genetic and molecular biology data for the model higher plant Arabidopsis thaliana. Data available from TAIR includes the complete genome sequence along with gene structure, gene product information, metabolism, gene expression, DNA and seed stocks, genome maps, genetic and physical ...
  • The Public Library of Science (PLoS)

    Offsite — About From website: > The Public Library of Science (PLoS) is a nonprofit organization committed to making the world’s scientific and medical literature freely available online, without restrictions on use or further distribution, free from private or government control. For more information on our organization and mission, please visit our Web site. > PLoS ...
  • Ensembl Genome Browser

    Offsite — About From website: > Ensembl is a joint project between EMBL – EBI and the Sanger Institute to develop a software system which produces and maintains automatic annotation on selected eukaryotic genomes. Ensembl is primarily funded by the Wellcome Trust. > This site provides free access to all the data and software from the Ensembl project. Click on a species name ...
  • Human Genome Data Set

    Offsite — This data set contains the raw export files of the first genome sequenced by Illumina Individual Genome Service using Illumina’s Genome Analyzer technology of paired 75-base reads. 92,254,659,274 bases were used to generate a consensus sequence with coverage of 32x average depth. The genome was obtained via peripheral blood of Jay Flatley, CEO of Illumina.
  • YRI Trio Dataset

    Offsite — The YRI Trio Dataset provides complete genome sequence data for three Yoruba individuals from Ibadan, Nigeria, which represent the first human genomes sequenced using Illumina’s next generation Sequence-by-Synthesis technology. For each genome, the dataset contains >30x average depth of paired 35-base reads. This data set can be used for the following applications: The ...
  • Ensembl - FASTA Database Files

    Offsite — FASTA database files are sequence databases of transcript and translation models predicted by the Ensembl analysis and annotation pipeline, as well as by ab initio methods. Read more about the FASTA format.
  • GenBank

    Offsite — GenBank is the NIH genetic sequence database, an annotated collection of all publicly available DNA sequences (Nucleic Acids Research, 2008 Jan;36(Database issue):D25-30). There are approximately 85,759,586,764 bases in 82,853,685 sequence records in the traditional GenBank divisions and 108,635,736,141 bases in 27,439,206 sequence records in the WGS division as of ...
  • Unigene

    Offsite — Each UniGene entry is a set of transcript sequences that appear to come from the same transcription locus (gene or expressed pseudogene), together with information on protein similarities, gene expression, cDNA clone reagents, and genomic location.
  • Ensembl Annotated Human Genome Data - for MySQL

    Offsite — This data set provides scientists with the opportunity to research and understand this important area of biology. These snapshots includes all the databases that are available at http://www.ensembl.org, as well as the Ensembl Biomart, which is a denormalized, query-optimized database that facilitates complex queries of one or more datasets. Full installation instructions ...
  • Influenza Virus (including updated Swine Flu sequences)

    Offsite — This data set includes database and sequence data from the NIAID Influenza Genome Sequencing Project and Genbank. For more information on this data set refer to the NCBI Influenza Virus Resource *Update: This data set is being updated regularly to include new sequences of swine influenza A (H1N1) submitted by the Center for Disease Control and Prevention (CDC).