Collection

The Comprehensive Knowledge Archive Network (CKAN) Collection

From their website:

CKAN is the Comprehensive Knowledge Archive Network, a registry of open knowledge packages and projects (and a few closed ones)… Those familiar with freshmeat, CPAN or PyPI can think of CKAN as providing an analogous service for open knowledge… CKAN is developed and maintained by the Open Knowledge Foundation. Both the CKAN code and data are open: free for anyone to use and reuse. To find out more check out the CKAN project at knowledgeforge.net

CKAN is a peer in the global data commons and Infochimps is proud to be able to mirror their collection of over 300 datasets.

  • A Vision of Britain Through Time

    Offsite — Description “A vision of Britain between 1801 and 2001. Including maps, statistical trends and historical descriptions.” The site focuses more on re-organising the data for access by locality than on enabling download of the raw data, but its maintainers have clearly put significant time and energy into data extraction. Specifically they have: An [administrative ...
  • Administrative boundaries of Spain

    Offsite — Basic reference datasets for Spain: geodetic vertex; administrative boundaries of Spain at scales 1:1,000,000, 1:200,000 and 1:25,000; a dataset of Spain at the scale 1:1,000,000.
  • APRS World Database

    Offsite — [begin excerpt from Integrating the Aprsworld Database Into Your Application] The aprsworld.net project was started in March 2001 by James Jefferson Jarvis, KB0THN. The goal from the beginning has been to parse the APRS internet stream into data that can be stored in a relational database system. At the time of writing (September 2003) about 1 million raw APRS packets ...
  • Placeopedia.com - Connecting Wikipedia articles with their locations

    Offsite — Description Geolocations for Wikipedia articles. Openness: OPEN License: listed as CC by-sa 2.5 on <http://www.placeopedia.com/data/> Access: good. See <http://www.placeopedia.com/data/> bulk: [xml](http://www.placeopedia.com/data/all.xml), [rss](http://www.placeopedia.com/data/all.rss), [kml](http://www.placeopedia.com/data/all.kml).
  • United Nations Common Database

    Offsite — Description “UNCDB provides selected series from numerous specialized international data sources for all available countries and areas.” 430 series as of 2007-11-25 (see <http://unstats.un.org/unsd/cdb/cdb_list_series.asp>) Openness: NOT OPEN License: Non-Commercial restrictions. Specifically on main page states: > “Usage of the United Nations Common ...
  • FreeBase

    Offsite — Description “Freebase is an open database of the world’s information. It is built by the community and for the community—free for anyone to query, contribute to, build applications on top of, or integrate into their websites.” Openness: OPEN License: cc-by + GFDL for Wikipedia-derived part (large). Access: ok but no bulk (perhaps via their query engine API but ...
  • Connexions

    Offsite — Connexions Description “A place to view and share educational material made of small knowledge chunks called modules that can be organized as courses, books, reports, etc. Anyone may view or contribute …” As of 2007-04-24 had 3944 reusable modules woven into 214 collections.
  • World Bank - Data & Research

    Offsite — License Terms and conditions on [FAQs](http://www.worldbank.org/faqs) page state: “The World Bank is pleased to allow Users to visit the Site and download and copy the information, documents and materials (collectively, “Materials”) from the Site for User’s personal, non-commercial use, without any right to resell, redistribute or create derivative works therefrom, ...
  • BioMed Central

    Offsite — “Publisher of more than 170 peer-reviewed open access journals.” “BioMed Central is an independent publishing house committed to providing immediate open access to peer-reviewed biomedical research. All original research articles published by BioMed Central are made freely and permanently accessible online immediately upon publication. BioMed Central views open access to ...
  • A Free View of the World -- OpenAerialMap

    Offsite — Description “OpenAerialMap is an open collection of aerial photographs, collected into a single coherent view of the world.” Openness: OPEN License: Public Domain or CC Attribution. > "OpenAerialMap is a young project, and as such, many of the licensing restrictions are still being worked out. In the short term, all imagery uploaded to OpenAerialMap should be ...
  • Belgium - Rekenhof (Court of Audit) - Budget

    Offsite — About Annual reports of the Court of Audit – which is responsible for matters related to the national budget. Includes figures on the budget. PDF documents covering 1998 to present. Re-use No copyright notice on site. Conditions of re-use not clear.
  • Greece - Ministry of Accounting and Finance - Budget

    Offsite — About Executive summaries of Greek national budget from 2000 to present in PDF format. Listed at: <http://www.mof-glk.gr/en/budget/> <http://www.mof-glk.gr/en/budget.htm> Reuse No copyright notice.
  • Citizendium

    Offsite — Contributions to Citizendium from Wikipedia are licenced under the GFDL.
  • Historical Events Markup Language

    Offsite — Title: Historical Event Markup Language Description Historical Event Markup and Linking Project (Heml) provides an XML schema for historical events and a Java Web app which transforms conforming documents into hyperlinked timelines, maps and tables. It aims to provide a most information-rich interchange format for historical data, and thus add a historical ...
  • Mathematics Genealogy Project

    Offsite — Description “The intent of this project is to compile information about ALL the mathematicians of the world. We earnestly solicit information from all schools who participate in the development of research level mathematics and from all individuals who may know desired information.” 115036 records as of 5 January 2008 Openness: NOT OPEN License: none ...
  • General Social Survey

    Offsite — Description From website: > The GSS contains a standard ‘core’ of demographic and attitudinal questions, plus topics of special interest. Many of the core questions have remained unchanged since 1972 to facilitate time trend studies as well as replication of earlier findings. The GSS takes the pulse of America, and is a unique and valuable resource. It has tracked the ...
  • WikiPathways

    Offsite — About From front page: > In the new tradition of Wikipedia, WikiPathways is an open, public platform dedicated to the curation of biological pathways by and for the scientific community. Openness In December 2008 WikiPathways [switched](http://scienceblogs.com/commonknowledge/2008/12/getting_it_right_wikipathways_1.php) to CC-BY, which is compliant with the ...
  • Mineral Resources On-Line Spatial Data

    Offsite — About “This is the main distribution web site for spatial data generated for, and collected by, the Mineral Resources Program (MRP) of the U.S. Geological Survey. This site is operated by the Spatial Information Delivery Project of the MRP. The site distributes detailed Mineral Resource, Geochemical, and Geophysical data along with regional and global geologic, ...
  • Internet Movie Database

    Offsite — Description Large film/movie database [claiming](http://imdb.com/Licensing/): 425,000+ titles 1,700,000 + filmographies of cast and crew members Films from 1891 to Present Foreign and independent movies, television movies and shows, new and future releases, and more Information on database structure can be found here: <http://us.imdb.com/Licensing/structure.html> ...
  • International Social Survey Programme

    Offsite — Only scientific use of the data is accepted. With your registration, you agree that you will use the data only for scientific purposes. In addition, you must describe your scientific project.
  • Average House Prices in the UK by Postcode API

    Offsite — ReST API (XML or JSON output) of current average house prices for rental and sales per postcode area. Per month. Historical data starts 2007.
  • Second Life Economy Data

    Offsite — Description Data on the economy of the Second Life virtual world. Openness: OPEN License: not specified. However availability and nature of data make it likely this is open. Access: good. Format: xls and xml. General economy: http://secondlife.com/whatis/economy-data.php LindeX: http://secondlife.com/whatis/economy-market.php
  • RCSB Protein Data Bank

    Offsite — Description As of August 2008 over 52 thousand structures available for download. From home page: > The RCSB PDB provides a variety of tools and resources for studying the structures of biological macromolecules and their relationships to sequence, function, and disease. > > The RCSB is a member of the wwPDB whose mission is to ensure that the PDB archive remains ...
  • WHO Global Price Reporting Mechanism

    No Data — Global Price Reporting Mechanism (World Health Organization – AIDS Medicines and Diagnostics Service (AMDS)) contains prices of shipments of pharmaceuticals in an international development context. For each listed transaction the drug, price, strength, dosage form, destination country, shipment method, manufacturing company, manufacturing country, the date of order, ...
  • Congresspedia

    Offsite — About From website: > Congresspedia is a collaboratively written “citizens’ encyclopedia on Congress,” designed to shine more light on the workings of the U.S. Congress. Congresspedia is part of SourceWatch, a similarly collaborative, wiki-based website documenting the people, organizations and issues shaping the public agenda. Congresspedia is a wiki, meaning that ...
  • EDINA UKBORDERS

    Offsite — About UKBORDERS provides digitised boundary datasets of the UK, available in many Geographic Information System (GIS) formats (MapInfo MIF/MID, ArcView Shape, Arc/Info Export and several others), for teachers and researchers in the UK Higher and Further Education community to download and use in their work. Re-use Available for re-use in UK HE/FE.
  • University of Huddersfield -- Circulation and Recommendation Data

    Offsite — About Circulation and Recommendation data from the [University of Huddersfield Library](http://library.hud.ac.uk). Data is comprised of two parts: > 1. Circulation Data. This breaks down the loans by year, by academic school, and by individual academic courses. This data will primarily be of interest to other academic libraries. UK academic libraries may be able to ...
  • The CIESIN World Data Center

    Offsite — Description The CIESIN World Data Center is a portal, hosted by NASA’s Socioeconomic Data and Applications Center (SEDAC), which provides access to a wide range of global data, associated documentation, and visualization and analysis tools, and to the community of experts on global data. Contains a very large number of different datasets many of which are open. ...
  • The CKAN client Python package.

    Offsite — The CKAN client software may be used to make requests on the Comprehensive Knowledge Archive Network (CKAN) REST API. Synopsis: the simplest way to make CKAN requests is: `import ckanclient`. Instantiate the CKAN client: `ckan = ckanclient.CkanClient(api_key=my_key)`. Get the package list: `ckan.package_register_get()`, then `package_list = ckan.last_message`, then print ...
  • Christian Classics Ethereal Library

    Offsite — Description Lots of texts oriented to Christian themes (though this includes works such as Dante). Openness: SEMI-OPEN License: public domain/restricted. > Most of the books at the Christian Classics Ethereal library are in the public domain. A few are public domain in the United States but not in Europe; you should check local copyright status before copying ...
  • The collaborative, 3D encyclopedia of proteins and other molecules

    Offsite — Description From the email excerpted on [Peter Murray-Rust’s blog](http://wwmm.ch.cam.ac.uk/blogs/murrayrust/?p=990): > Hi Dr. Murray-Rust, > > I’m a student in Joel Sussman’s lab at the Weizmann Institute of Science. Joel, Jaime Prilusky and I have developed Proteopedia, a new online tool/database with the overall goal of making structural biology clearer for ...
  • CiteULike datasets

    Offsite — Description From the [data page](http://www.citeulike.org/faq/data.adp): Who-posted-what data The latest data snapshot can always be downloaded at http://static.citeulike.org/data/current.bz2 Older datasets are available on a daily basis and can be found at URLs of the form http://static.citeulike.org/data/2007-05-30.bz2 Data is available from 2007-05-30 onwards. ...
  • Routing Information Service from RIPE

    Offsite — This page links to the raw data collected by the RRCs in MRT format. This format is described in an IETF draft. These files can be read using libbgpdump, a library maintained by the RIPE NCC. The data is collected using Quagga. This data is made available for researchers without restrictions. However, if you copy the data and publish an analysis, please send us a pointer ...
  • Flickr - The Commons

    Offsite — About > The key goals of The Commons on Flickr are to firstly show you hidden treasures in the world’s public photography archives, and secondly to show how your input and knowledge can help make these collections even richer. Re-use/Openness Photos have “no known copyright restrictions”. From [rights page](http://www.flickr.com/commons/usage/): > Participating ...
  • Economic History Services Databases

    Offsite — Title: EH.Net Databases Description From <http://eh.net/about> > “EH.Net operates the Economic History Services web site and several electronic mailing lists to provide resources and promote communication among scholars in economic history and related fields. EH.Net is supported by the Economic History Association and other affiliated organizations: the Business ...
  • Internet Routing Registry

    Offsite — The Internet Routing Registry (IRR) is a distributed routing database development effort. Data from the Internet Routing Registry may be used by anyone worldwide to help debug, configure, and engineer Internet routing and addressing. The IRR provides a mechanism for validating the contents of BGP announcement messages or mapping an origin AS number to a list of networks.
  • Federal Reserve Economic Data

    Offsite — Description Federal Reserve Economic Data (FRED), over 15,000 FREE US series. Download data. View charts. Get email updates. Create data lists. Can browse data or download in bulk from <http://research.stlouisfed.org/fred2/downloaddata/>. Openness: OPEN (?) License: none specified but as Federal US Govt probably public domain.
  • Website Attica: Persons of Ancient Athens

    Offsite — Description From the website: > Website Attica complements and enhances the published volumes of Persons of Ancient Athens. The addenda et corrigenda to the published volumes, which are issued as a supplement to PAA periodically, are regularly updated at this web site. Searches may be made 10,000 names of the ATHENIANS database in beta, gamma, and delta (second half ...
  • NGA GEOnet Names

    Offsite — Complete Files of Geographic Names for Geopolitical Areas from GNS. Published by the US National Geospatial-Intelligence Agency, this global gazetteer is the core data from which the geonames dataset was initially derived.
  • Werner Icking Music Archive

    Offsite — Description Lots of sheet music. While quite a bit has source files, much seems to be available only as PDF. Openness: OPEN License: not specified but strongly appears to be open; most of the works are in the public domain, so apart from typesetting issues the scores should be public domain too. Access: ok. bulk: no. api: no. www: yes.
  • GRASS GIS North Carolina Dataset

    Offsite — We developed a completely new free geospatial dataset and substituted all Spearfish (SD) examples in the previous editions with this new, much richer North Carolina (NC, USA) data set. This data set is a comprehensive collection of raster, vector and imagery data covering parts of North Carolina (NC), USA (map), prepared from public data sources provided by the North ...
  • GEneral Multilingual Environmental Thesaurus

    Offsite — A thesaurus in 20+ languages for terms related to the environment and environmental data. Published by the European Environment Agency. Available in RDF without reuse constraints.
  • PubMed Central

    Offsite — About > PubMed Central (PMC) is the U.S. National Institutes of Health (NIH) free digital archive of biomedical and life sciences journal literature. Re-use/openness Mixed. Some all rights reserved, some [open access](http://www.pubmedcentral.nih.gov/about/openftlist.html), some [public domain](http://www.pubmedcentral.nih.gov/tocrender.fcgi?journal=245). See ...
  • Economic Data from Klepper and Simons

    Offsite — Description 2 datasets: 1. Archive Data on Tire Manufacturers. All the data used in analyses in Steven Klepper and Kenneth L. Simons, “The Making of an Oligopoly: Firm Survival and Technological Change in the Evolution of the U.S. Tire Industry,” Journal of Political Economy, vol. 108 no. 4, August 2000, pp. 728-760. 2. Archive Data on Radio and Television ...
  • Grand Dictionnaire Terminologique

    Offsite — Official translations of technical terminology. From the Office of the French Language. No obvious way to download the source.
  • Bulgaria - Ministry of Finance - Budget

    Offsite — About Budget documents in PDF, 2006 to present. <http://www.minfin.bg/en/page/24> <http://www.minfin.bg/en/documents> Assorted financial data, in XLS/PDF. <http://www.minfin.bg/en/statistics/?cat=1> <http://www.minfin.bg/en/statistics/> Re-use Not clear.
  • National Digital Archive of Datasets (NDAD)

    Offsite — About > The National Digital Archive of Datasets (NDAD) preserves and provides online access to archived digital datasets and documents from UK central government departments. Our collection spans 40 years of recent history, with the earliest available dataset dating back to about 1963. Openness/re-use The ...
  • Stockholm International Peace Research Institute Databases

    Offsite — Description A whole set of databases/datasets relating to international relations (wars, military etc). Have a nice, shiny front-end named FIRST <http://first.sipri.org/> (Facts on International Relations and Security Trends) which incorporates data from a whole bunch of other places. List of databases on the databases page: > Facts on International Relations and ...
  • British Atmospheric Data Centre (BADC)

    Offsite — About The British Atmospheric Data Centre (BADC) is the Natural Environment Research Council’s (NERC) Designated Data Centre for the Atmospheric Sciences. The role of the BADC is to assist UK atmospheric researchers to locate, access and interpret atmospheric data and to ensure the long-term integrity of atmospheric data produced by NERC projects. ...
  • One Laptop Per Child Sound Samples

    Offsite — Description From download page: > Over 8.5GB of FREE Samples – Sound Effects, Loops, Grooves, Drums, Voices and Instruments – for The Children of the World. This huge and continuously expanding collection of new and original samples have been donated to Dr. Richard Boulanger @ cSounds.com specifically to support the OLPC developers, students and XO users. They are ...
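
The CiteULike entry above describes a latest snapshot at current.bz2 and older daily snapshots at URLs of the form http://static.citeulike.org/data/2007-05-30.bz2, available from 2007-05-30 onwards. A minimal Python sketch of building such URLs from a date follows. The helper name snapshot_url and the constants are hypothetical, introduced only for this example; the URL pattern and start date come from the listing, and whether the files are still served at those addresses has not been checked.

```python
from datetime import date

# Base URL and first available date are taken from the CiteULike listing entry.
BASE_URL = "http://static.citeulike.org/data"
FIRST_SNAPSHOT = date(2007, 5, 30)

# URL of the most recent snapshot, as given in the listing.
LATEST_URL = f"{BASE_URL}/current.bz2"


def snapshot_url(day: date) -> str:
    """Return the URL of the daily snapshot for `day` (hypothetical helper)."""
    if day < FIRST_SNAPSHOT:
        # The listing states that data is available from 2007-05-30 onwards.
        raise ValueError(f"no snapshot before {FIRST_SNAPSHOT.isoformat()}")
    # Daily files are named after the ISO date, e.g. 2007-05-30.bz2.
    return f"{BASE_URL}/{day.isoformat()}.bz2"
```

For example, snapshot_url(date(2008, 1, 15)) yields http://static.citeulike.org/data/2008-01-15.bz2, while a date before 2007-05-30 raises ValueError.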