Showing 21 - 37 out of 37 datasets
  • Human Genome Build 37

    Offsite — 2009 assembly of the human genome (hg19, GRCh37 Genome Reference Consortium Human Reference 37 (GCA_000001405.1)) in one gzip-compressed FASTA file per chromosome.
  • Ensembl Genome Data

    Offsite — The Ensembl project produces genome databases for vertebrates and other eukaryotic species, and makes this information freely available online. Data can be downloaded in a variety of formats, from flat files to MySQL dumps. Freely available for use under an Apache-style license.
  • Poisonous Plants of The Southern United States

    Offsite — A listing of plants in the Southern United States with descriptions, toxicity, and symptoms.
  • Plants Poisonous To Humans and Livestock

    Free Download — A listing of plants poisonous to humans and livestock. Includes scientific and common names. Source: http://www.ansci.cornell.edu/plants/index.html
  • Flora of North America

    Offsite — FNA presents for the first time, in one published reference source, information on the names, taxonomic relationships, continent-wide distributions, and morphological characteristics of all plants native and naturalized found in North America north of Mexico. Source: http://www.fna.org/
  • Human Genome Data Set

    Offsite — This data set contains the raw export files of the first genome sequenced by Illumina Individual Genome Service using Illumina’s Genome Analyzer technology of paired 75-base reads. 92,254,659,274 bases were used to generate a consensus sequence with coverage of 32x average depth. The genome was obtained via peripheral blood of Jay Flatley, CEO of Illumina.
  • YRI Trio Dataset

    Offsite — The YRI Trio Dataset provides complete genome sequence data for three Yoruba individuals from Ibadan, Nigeria, which represent the first human genomes sequenced using Illumina’s next generation Sequence-by-Synthesis technology. For each genome, the dataset contains >30x average depth of paired 35-base reads. This data set can be used for the following applications: The ...
  • Ensembl - FASTA Database Files

    Offsite — FASTA database files are sequence databases of transcript and translation models predicted by the Ensembl analysis and annotation pipeline, as well as by ab initio methods. Read more about the FASTA format.
  • 3D Version of the PubChem Library

    Offsite — This data set is a 3D Version of the PubChem Library. PubChem provides information on the biological activities of small molecules. It is a component of NIH’s Molecular Libraries Roadmap Initiative.
  • PubChem Library

    Offsite — PubChem provides information on the biological activities of small molecules. It is a component of NIH’s Molecular Libraries Roadmap Initiative.
  • GenBank

    Offsite — GenBank is the NIH genetic sequence database, an annotated collection of all publicly available DNA sequences (Nucleic Acids Research, 2008 Jan;36(Database issue):D25-30). There are approximately 85,759,586,764 bases in 82,853,685 sequence records in the traditional GenBank divisions and 108,635,736,141 bases in 27,439,206 sequence records in the WGS division as of ...
  • Unigene

    Offsite — Each UniGene entry is a set of transcript sequences that appear to come from the same transcription locus (gene or expressed pseudogene), together with information on protein similarities, gene expression, cDNA clone reagents, and genomic location.
  • Ensembl Annotated Human Genome Data - for MySQL

    Offsite — This data set provides scientists with the opportunity to research and understand this important area of biology. These snapshots includes all the databases that are available at http://www.ensembl.org, as well as the Ensembl Biomart, which is a denormalized, query-optimized database that facilitates complex queries of one or more datasets. Full installation instructions ...
  • Influenza Virus (including updated Swine Flu sequences)

    Offsite — This data set includes database and sequence data from the NIAID Influenza Genome Sequencing Project and Genbank. For more information on this data set refer to the NCBI Influenza Virus Resource *Update: This data set is being updated regularly to include new sequences of swine influenza A (H1N1) submitted by the Center for Disease Control and Prevention (CDC).
  • Allen Brain Atlas - complete gene expression pattern of mouse brain

    Offsite — “The Allen Brain Atlas that shows the expression pattern of almost every gene in the mouse brain, detailed in a huge series of microscopic images. This resource, which is available to everyone on the Internet, is a wonderful tool for brain researchers” (David Linden) The Allen Mouse Brain Atlas is an interactive, genome-wide image database of gene expression. Find ISH ...
  • 1000 Genomes Data

    Offsite — The 1000 Genomes data is an open dataset from the biological research community containing genetic sequencing data. The complete dataset is huge, at roughly 150TB uncompressed.
  • FindTheBest.com Paradoxes listing

    Offsite — Find and compare paradoxes by definitions, explanations, categories and more.