Collection

The Comprehensive Knowledge Archive Network (CKAN) Collection

Showing 201 - 250 out of 369 datasets

From their website:

CKAN is the Comprehensive Knowledge Archive Network, a registry of open knowledge packages and projects (and a few closed ones)…Those familiar with freshmeat, CPAN or PyPI can think of CKAN as providing an analogous service for open knowledge…CKAN is developed and maintained by the Open Knowledge Foundation. Both the CKAN code and data are open: free for anyone to use and reuse. To find out more check out the the CKAN project at knowledgeforge.net

CKAN is a peer in the global data commons and Infochimps is proud to be able to mirror their collection of over 300 datasets.

  • Journal of the American Dental Association

    Offsite — Articles free 12 months after publication.
  • World Population, GDP and Per Capita GDP, 1-2003 AD

    Offsite — Author Angus Maddison Openness: Not open No license Plus following statement attached to link to data: “Last update: March 2007, copyright Angus Maddison” Format xls (excel)
  • Open Architecture Network

    Offsite — About “The Open Architecture Network is an online, open source community dedicated to improving living conditions through innovative and sustainable design. Here designers of all persuasions can: Share their ideas, designs and plans View and review designs posted by others Collaborate with each other, people in other professions and community leaders to address ...
  • Medknow Publications

    Offsite — Medknow Publications is the largest publisher in India for academic and scientific biomedical journals. The publishing house is committed to the improving the visibility and accessibility of the science from the developing world. Medknow pioneers in ‘fee-less-free’ model of open access publishing and provides immediate free access to the electronic editions of the ...
  • District of Colombia Data Catalog

    Offsite — List of datasets made available online in convenient way by Washington DC’s Office of Chief Technology Officer. From the main page: > For years the District of Columbia has provided public access to city operational data through the Internet. Now the District provides real-time data from multiple agencies to citizens, a catalyst ensuring agencies operate as more ...
  • OpenStreetMap

    Offsite — OpenStreetMap (OSM) is aimed at creating and providing free geodata such as street maps to anyone who wants them. The project was started because most maps you think of as free actually have legal or technical restrictions on their use, holding back people from using them in creative, productive or unexpected ways. The planet.osm file described at the download link above ...
  • New Popular Edition Maps and Postcodes

    Offsite — Description New Popular Edition Maps plus postcode data submitted by users using the maps. Openness Two sets of data available Map scans themselves: Not open Access: fine (see <http://www.npemap.org.uk/FAQ.html#download> License: CC by-nc v2.5 (not open) Postcode database: Open License: Public domain Access: see download link
  • Opensecrets.org - Money in Politics Data

    Offsite — Description ‘Your Guide to Money in US Elections’. Includes databases covering money spent on: 527s Contributions Lobbying PACs Travel Also has a ‘revolving door’ database: > Track the movements of well-connected individuals between government and private interests in this searchable database of more than 6,400 people. See which influential lobbyists and firms ...
  • GenBank - NIH genetic sequence database

    Offsite — Description From the main page: > GenBank® is the NIH genetic sequence database, an annotated collection of all publicly available DNA sequences (Nucleic Acids Research, 2008 Jan;36(Database issue):D25-30). There are approximately 85,759,586,764 bases in 82,853,685 sequence records in the traditional GenBank divisions and 108,635,736,141 bases in 27,439,206 sequence ...
  • European State Finance Database

    Offsite — Description From the front page: > An International Project concerned with the collection of the sources and data of European Fiscal History and their interpretation, funded in 1989-90 by the British Academy and in 1990-3 by the Economic and Social Research Council (Award No. R000231968). ©ESFDB March 1995 Openness: Uncertain Access: all data available for ...
  • Public Domain Sounds

    Offsite — “With recording devices and microphones pdsounds volunteers acoustically discover the beauty of the world. From the million sounds of things to the pure waves of sinus. Our goal is to record it all and make it available for free.” “All pdsounds recordings are in the public domain.”
  • Public Domain 4 U

    Offsite — “There are so many great songs in the public domain arena. We are ripping and specially mastering MP3 files as fast as we can. We hope you enjoy our selections.” Growing collection of early American Music, mostly selected from the Internet Archive and assumed to be in the public domain.
  • Online Computer Library Center: WorldCat

    Offsite — Description Worldcat is an very large bibliographic service combining the catalogue of a large number of different institutions around the world. From [their site (2007-08-22)](http://www.oclc.org/firstsearch/content/worldcat/default.htm): > At the center of FirstSearch is WorldCat, a cooperative database of bibliographic records contributed by more than 57,000 ...
  • ISO 639-3 - Codes for the Representation of Names of Languages

    Offsite — About ISO 639-3 is a list of three letter codes for languages: > ISO 639-3 attempts to provide as complete an enumeration of languages as possible, including living, extinct, ancient, and constructed languages, whether major or minor, written or unwritten. Access/re-use The list can be [viewed](http://www.sil.org/iso639-3/codes.asp) or ...
  • Alt Law

    Offsite — Full text of the U.S. Supreme and Circuit Appeals Courts. For more details see altlaw.org.
  • Thrive 42

    Offsite — “thrive42 is an open source project for producing high quality content for teaching and learning.” “Both ThriveCore (the php engine) and ThriveContent (the question content) are open source. ThriveCore is released under the GNU General Public License and ThriveContent is released under the GNU Free Documentation License.”
  • Quotations Book

    Offsite — “We’ve made the best place in the world to read and share quotations.” “43,000+ quotes in one file” – available as [Google Base raw XML file](http://quotationsbook.com/full_syndication_gbase.xml.zip) or as [mixed schema raw XML file](http://quotationsbook.com/full_syndication_mixed.xml.zip). Quotes are licensed under a Creative Commons Attribution license.
  • Wikipedia 3

    Offsite — “Wikipedia³ is a conversion of the English Wikipedia into RDF. It’s a monthly updated dataset containing around 47 million triples.” “The Wikipedia³ datasets are of course licensed under the GFDL. Enjoy!”
  • Her Majesty&#x27;s Court Service - XHIBIT Daily Court Status

    Offsite — From [website](http://www.hmcourts-service.gov.uk/onlineservices/xhibit/index.htm): > XHIBIT improves the daily business of every Crown Court in England and Wales by providing hearing information to those who need it within minutes. > XHIBIT is the first step in joining-up the Criminal JusticeSystem. [Terms and ...
  • NBER US Patent Citation Database

    Offsite — Description [Taken verbatim from the above url] These data comprise detail information on almost 3 million U.S. patents granted between January 1963 and December 1999, all citations made to these patents between 1975 and 1999 (over 16 million), and a reasonably broad match of patents to Compustat (the data set of all firms traded in the U.S. stock market). These ...
  • GovTrack.us U.S Congress Legislative Data

    Offsite — U.S. Congress data including all Members of Congress since the beginning of the United States, legislative data including bills, sponsorship, roll call votes since around 1990.
  • U.S. National Park Service (NPS) - Yosemite National Park

    Offsite — About > Yosemite National Park, one of the first wilderness parks in the United States, is best known for its waterfalls, but within its nearly 1,200 square miles, you can find deep valleys, grand meadows, ancient giant sequoias, a vast wilderness area, and much more. > Yosemite Nature Notes is a video podcast series that tells unique stories about the natural and ...
  • european_business_registers

    Offsite — Wiki overview with links to European business registers – including the new register for European companies. Company information can be retrieved via Global Business Register which distributes EBR data at GBRDirect
  • Reference Database of Immune Cells

    Offsite — Description From home page: “RefDIC is an open-access database of quantitative mRNA/Protein profiles specifically for immune cells.” From <http://refdic.rcai.riken.jp/document.cgi>: > RefDIC is an open resource compendium of quantitative mRNA/Protein profile data specifically for immune cells. You can easily retrieve various aspects of mRNA/Protein profiles of ...
  • Luxembourg - Ministère des Finances - Budget

    Offsite — About Budget documents from 2006 to present in PDF format. Reuse Reproduction permitted if attribution if given – but only for noncommercial purposes. [Copyright notice](http://www.mf.public.lu/functions/apropos_du_site/index.php) states: > En l’absence d’indication contraire, la reproduction des informations contenues sur ce site est autorisée à des fins non ...
  • Slovenia - Statistical Office of the Republic of Slovenia

    Offsite — About National statistics from Slovenia. Data Topics include: Demography and social statistics Economy Environment and natural resources General Openness Open. Page footer says: > Use and publication of data is allowed provided the source is acknowledged.
  • Government Information Locator Service

    Offsite — From <http://www.gpoaccess.gov/gils/about.html> > The Government Information Locator Service (GILS) is an effort to identify, locate, and describe publicly available Federal information resources, including electronic information resources. GILS records identify public information resources within the Federal Government, describe the information available in these ...
  • NMRShiftDB

    Offsite — Description > NMRShiftDB is a NMR database (web database) for organic structures and their nuclear magnetic resonance (nmr) spectra. It allows for spectrum prediction (13C, 1H and other nuclei) as well as for searching spectra, structures and other properties. Last not least, it features peer-reviewed submission of datasets by its users. The NMRShiftDB software is ...
  • BioCyc

    Offsite — Description Biocyc curate and maintain several databases: > BioCyc is a collection of 371 Pathway/Genome Databases. Each Pathway/Genome Database in the BioCyc collection describes the genome and metabolic pathways of a single organism, with the exception of the MetaCyc database, which is a reference source on metabolic pathways from many organisms. These include ...
  • Brede Wiki

    Offsite — Brede Wiki is a wiki with structured information from neuroscience. It contains primarily information from published peer-reviewed neuroimaging articles. MediaWiki templates are used to structure the data. Bzipped XML MediaWiki dumps are available as well as SQLite SQL files.
  • UK Census - Digitised boundary datasets

    Offsite — About > Census area statistics (CAS) provide counts of people or households for geographical areas broken down by socio-demographic characteristics such as age, gender or employment. Digitised boundary datasets (DBDs) are a digitised representation of the underlying geography of the census. Access/Re-use Much of the material says it is Crown Copyright. It must be ...
  • Blue Obelisk Data Repository (BODR)

    No Data — Description “Another core Blue Obelisk project is the development of a shared data repository. This repository lists many important chemoinformatics data such as elemental properties, atomic radii, etc. including references to original literature. Software developers can use this repository on online webpages or in chemistry software for free.” Repository There is ...
  • Developmental Therapeutics Program NCI/NIH

    Offsite — About From [about page](http://dtp.nci.nih.gov/about.html): > As the drug discovery and development arm of the National Cancer Institute, the Developmental Therapeutics Program (DTP) plans, conducts, and facilitates development of therapeutic agents for cancer and AIDS. We are your resource for research materials, including Web-accessible data and tools, vialed and ...
  • A Wiki for Executable English

    Offsite — This is a kind of Wiki, for content in open vocabulary, executable English. English text (like this sentence) is normally something for a person to read, but it cannot be used as a program that you can run on a computer. On the other hand, executable English is something that a person can read, and that you can also run on a computer. Shared use of the system is free. ...
  • Enamine

    Offsite — About From website: > Since our inception in 1991 we have expanded our compound collection by adding compounds from uncommon chemical classes which feature drug-like physico-chemical properties. Our libraries have been attractive to pharma, biotech and agrochemical companies around the world and now our in house stock exceeds 1 million unique compounds. include: ...
  • European chemical Substances Information System (ESIS)

    Offsite — About ESIS (European chemical Substances Information System), is an IT System which provides you with information on chemicals, related to: EINECS (European Inventory of Existing Commercial chemical Substances) O.J. C 146A, 15.6.1990, ELINCS (European List of Notified Chemical Substances) in support of Directive 92/32/EEC, the 7th amendment to Directive ...
  • Chemical Structure Repository

    Offsite — Description From project website: > Chemical Structures [was] initiated in June 2006. It aims to provide a set of organic structures, which includes 3D coordinates, InChi code, molecular weight, melting point, etc. The last relase (v1.05) contains over 250 structures. License BSD license according to [SourceForge project ...
  • The National Public Transport Data Repository (traveline)

    Offsite — Description Data created by [traveline](http://www.pti.org.uk/) and used by (among others) [transportdirect](http://transportdirect.info). From <http://www.pti.org.uk/repository.htm>: > The third snapshot of the traveline data was taken in October 2006. It was based on the data that traveline was using in one week in October 2006. > > Guidance Notes were available ...
  • Al Jazeera Creative Commons Repository

    Offsite — About > Select Al Jazeera video footage – at this time footage of the War on Gaza – is available for free to be downloaded, shared, remixed, subtitled and eventually rebroadcasted by users and TV stations across the world with acknowledgement to Al Jazeera. Access/Re-use Available for download via website. War on Gaza footage is licensed under Creative Commons ...
  • Chemical Entities of Biological Interest (ChEBI)

    Offsite — Description > Chemical Entities of Biological Interest (ChEBI) is a freely available dictionary of molecular entities focused on ‘small’ chemical compounds. The term ‘molecular entity’ refers to any constitutionally or isotopically distinct atom, molecule, ion, ion pair, radical, radical ion, complex, conformer, etc., identifiable as a separately distinguishable ...
  • MovieLens Data Sets

    Offsite — Description This data set contains 10000054 ratings and 95580 tags applied to 10681 movies by 71567 users of the online movie recommender service MovieLens. Users were selected at random for inclusion. All users selected had rated at least 20 movies. Unlike previous MovieLens data sets, no demographic information is included. Each user is represented by an id, and ...
  • Avoiding Mass Extinctions Engine

    Offsite — About AMEE is a neutral aggregation platform to measure and track all the energy data in the world. This includes aggregating every emission factor and methodology related to CO2 and Energy Assessments, and all the consumption data (fuel, water, waste, quantitative and qualitative factors) of everything. It is a web-service (API) that combines measurement, ...
  • GSKdata

    Offsite — About > Marking another positive step in the collaborative fight against cancer, GlaxoSmithKline (GSK) has released the genomic profiling data for over 300 cancer cell lines via the National Cancer Institute’s cancer Bioinformatics Grid™ (caBIG™). Cancer cell lines can be manipulated in the laboratory and have been used extensively by GSK in the discovery and ...
  • OpenVocab

    Offsite — About From [website](http://open.vocab.org/about) > OpenVocab is a community maintained vocabulary intended for use on the Semantic Web > OpenVocab is ideal for properties and classes that don’t warrant the effort of creating or maintaining a full schema. OpenVocab allows anyone to create and modify vocabulary terms using their web browser. Each term is described ...
  • ChemBank

    Offsite — About > ChemBank is a public, web-based informatics environment created by the Broad Institute’s Chemical Biology Program and funded in large part by the National Cancer Institute’s Initiative for Chemical Genetics (ICG). This knowledge environment includes freely available data derived from small molecules and small-molecule screens, and resources for studying the ...
  • MIDAS - Heritage project

    Offsite — From the website: > What is MIDAS? > MIDAS sets out an agreed list of the items or ‘units’ of information that should be included in an inventory or other systematic record of the historic environment. These units of information are grouped together under broad headings or ‘information schemes’. These cover areas such as Monument Character, Events, People and ...
  • Global Historical Climatology Network (GHCN)

    Offsite — About data Temperature mean, max and min for 5×5 grid over all of the earth: > The Global Historical Climatology Network (GHCN-Monthly) data base contains historical temperature, precipitation, and pressure data for thousands of land stations worldwide. The period of record varies from station to station, with several thousand extending back to 1950 and several ...
  • The Berlin Stratospheric Data Series

    Offsite — About the data Stratospheric Analyses from between 1957 and 2001 by Stratospheric Research Group at the [Meteorological Institute](http://www.met.fu-berlin.de/), Free University Berlin. Data format ASCII. See the [documentation on the Gridpoint Data Format](http://strat-www.met.fu-berlin.de/products/cdrom/html/section8.html#section8-2). License Don’t ...
  • Variability Analysis of Surface Climate Observations (VASClimO)

    Offsite — About data From project webpage: > VASClimO was a joint climate research project of the Global Precipitation Climatology Centre (GPCC) at the German Met Service (DWD) and the Institute for Atmosphere and Environment – Working Group for Climatology at the Johann Wolfgang Goethe University Frankfurt. > The project was funded by the Bundesministerium für Bildung, ...
  • Asian Development Bank (ADB) - Statistical Database System (SDBS)

    Offsite — About From [front page](https://sdbs.adb.org/sdbs/): > The Statistical Database System (SDBS) is the Asian Development Bank’s central statistical database that stores macro-economic and social data of its developing member countries (DMCs). The SDBS data come from statistical contacts that are mostly national statistics offices and central banks of the DMCs. SDBS ...