Tag

size-large

Showing 1 - 20 out of 27 datasets
  • Bulk.resource.org

    Offsite — Bulk.resource.org is a service of public.resource.org. Public.resource.org is a non-profit committed to publishing and sharing public domain materials in the United States. This system contains unsupported, as-is copies of selected U.S. government archives, including: The SEC’s EDGAR Database, Commerce Business Daily, U.S. Copyright Database, Patent Full Text Database ...
  • The CIA World Factbook

    Free Download — Description US government profiles of countries and territories around the world. Information on geography, people, government, transportation, economy, communications, etc. Openness: OPEN License: Public Domain: “The Factbook is in the public domain. Accordingly, it may be copied freely without permission of the Central Intelligence Agency (CIA).” ...
  • GenBank - NIH genetic sequence database

    Offsite — Description From the main page: > GenBank® is the NIH genetic sequence database, an annotated collection of all publicly available DNA sequences (Nucleic Acids Research, 2008 Jan;36(Database issue):D25-30). There are approximately 85,759,586,764 bases in 82,853,685 sequence records in the traditional GenBank divisions and 108,635,736,141 bases in 27,439,206 sequence ...
  • UK - Office for National Statistics

    Offsite — Description “Free access to data produced by the Office for National Statistics, government departments and devolved administrations.” Datasets Census: links to lists of all standard census tables on <http://www.statistics.gov.uk/census2001/table_list.asp> and an excel file available from ...
  • RCSB Protein Data Bank

    Offsite — Description As of August 2008 over 52 thousand structures available for download. From home page: > The RCSB PDB provides a variety of tools and resources for studying the structures of biological macromolecules and their relationships to sequence, function, and disease. > > The RCSB is a member of the wwPDB whose mission is to ensure that the PDB archive remains ...
  • CEPR Data

    Offsite — Description From the front page: > ceprDATA.org provides consistent, user-friendly versions of the Survey of Income and Program Participation (SIPP), Current Population Survey (CPS), and other datasets used at CEPR available to all interested policy researchers and academics. > > Each dataset listed above is available to download. In addition, you can download and ...
  • GeoNames

    Offsite — The geonames.org geographical database is available for download free of charge under a creative commons attribution license. It contains over eight million geographical names and consists of 6.3 million unique features whereof 2.2 million populated places and 1.8 million alternate names. All features are categorized into one out of nine feature classes and further ...
  • NBER US Patent Citation Database

    Offsite — Description [Taken verbatim from the above url] These data comprise detail information on almost 3 million U.S. patents granted between January 1963 and December 1999, all citations made to these patents between 1975 and 1999 (over 16 million), and a reasonably broad match of patents to Compustat (the data set of all firms traded in the U.S. stock market). These ...
  • FreeBase

    Offsite — Description “Freebase is an open database of the world’s information. It is built by the community and for the community—free for anyone to query, contribute to, build applications on top of, or integrate into their websites.” Openness: OPEN License: cc-by + GFDL for wikipedia derived part (large). Access: ok but no bulk (perhaps via their query engine API but ...
  • Economagic Economic Time Series

    Offsite — Description A large collection of USA time series data taken from a variety of sources, primarily state and central government in the USA. From the about page <http://www.economagic.com/about/>: > This page is meant to be a comprehensive site of free, easily available economic time series data useful for economic research, in particular economic forecasting. This ...
  • Open Directory Project (ODP)

    Offsite — From [about page](http://www.dmoz.org/about.html): > The Open Directory Project is the largest, most comprehensive human-edited directory of the Web. It is constructed and maintained by a vast, global community of volunteer editors. Openness: OPEN Access: good. (see tags) License: has a [bespoke license](http://www.dmoz.org/license.html) which looks [OKD ...
  • Archimedes Palimpsest

    Offsite — The Archimedes Palimpsest is a medieval parchment manuscript, now consisting of 174 parchment folios. While it contains no less than seven treatises by Archimedes, calling it the Archimedes Palimpsest is a little confusing. As it is now, the manuscript is a Byzantine prayerbook, written in Greek, and technically called a euchologion. This euchologion was completed by ...
  • Reference Database of Immune Cells

    Offsite — Description From home page: “RefDIC is an open-access database of quantitative mRNA/Protein profiles specifically for immune cells.” From <http://refdic.rcai.riken.jp/document.cgi>: > RefDIC is an open resource compendium of quantitative mRNA/Protein profile data specifically for immune cells. You can easily retrieve various aspects of mRNA/Protein profiles of ...
  • The National Public Transport Data Repository (traveline)

    Offsite — Description Data created by [traveline](http://www.pti.org.uk/) and used by (among others) [transportdirect](http://transportdirect.info). From <http://www.pti.org.uk/repository.htm>: > The third snapshot of the traveline data was taken in October 2006. It was based on the data that traveline was using in one week in October 2006. > > Guidance Notes were available ...
  • Federal Aviation Administration - Data and Statistics

    No Data — Description From main page: > Accident & Incident Reports: Preliminary Data, Final Data. Aviation Data & Statistics: Airline On-Time Statistics & Delay Causes, Airmen Knowledge Test Statistics. Commercial Space Data: Upcoming & Recent Launch Data ...
  • Numbrary

    Offsite — Description Not a producer of data but focused on extracting and aggregating data from other sources. Openness: OPEN License: no explicit license used but all underlying data from US government so PD. Access: ok. www: yes. bulk: no. api: no.
  • Flossmetrics - Free Libre and Open Source Software Metrics

    Offsite — Description From front page: > The main objective of FLOSSMETRICS is to construct, publish and analyse a large scale database with information and metrics about libre software development coming from several thousands of software projects, using existing methodologies, and tools already developed. The project will also provide a public platform for validation and ...
  • US Copyright Renewal Database

    Offsite — Released in 2008 and funded by Hewlett Foundation. From front page: > … This database makes searchable the copyright renewal records received by the US Copyright Office between 1950 and 1992 for books published in the US between 1923 and 1963. Note that the database includes ONLY US Class A (book) renewals. > > The period from 1923-1963 is of special interest for US ...
  • Citeseer Metadata

    Offsite — Title: Scientific Literature Digital Library Description CiteSeer is a scientific literature digital library and search engine that focuses primarily on the literature in computer and information science. CiteSeer aims to improve the dissemination and feedback of the scientific literature and to provide improvements in functionality, usability, availability, cost, ...
  • MovieLens Data Sets

    Offsite — Description This data set contains 10000054 ratings and 95580 tags applied to 10681 movies by 71567 users of the online movie recommender service MovieLens. Users were selected at random for inclusion. All users selected had rated at least 20 movies. Unlike previous MovieLens data sets, no demographic information is included. Each user is represented by an id, and ...