-
Offsite
—
About From [website](http://aws.amazon.com/publicdatasets/): > Public Data Sets on AWS provides a centralized repository of public data sets that can be seamlessly integrated into AWS cloud-based applications. AWS is hosting the public data sets at no charge for the community, and like all AWS services, users pay only for the compute and storage they use for their ...
-
Free Download
—
Yearly population count by US county from 1980 to 2009. Format Yearly population count by US county (fips code) from 1980 to 2009 as one tab-separated-values (.tsv) file. One timeseries [1980-2009] per line. Cleaned up and ready for immediate use in Excel, Matlab, R, SQL, etc. Snippet: 01001 32259 31985 32036 32054 32134 32245 32893 33268 33636 ... (snip) 01003 78556 ...
-
Free Download
—
This dataset consists of a collection of Infoboxes from Wikipedia on the topic of Venus Crater Data.
-
Free Download
—
This dataset consists of a collection of Infoboxes from Wikipedia on the topic of Mercury Crater Data.
-
Offsite
—
This site hosts a computer program that produces data. The intended use of the program is to help with the empirical analysis of other programs, particularly those that consume data. For example, it can produce data to test sorting programs.
-
Free Download
—
This dataset consists of a collection of Infoboxes from Wikipedia on the topic of Lunar Crater Data.
-
Offsite
—
data from United Nations Statistics Division on energy.
-
Free Download
—
A simple mapping from hex color codes to color names and rgb values.
Eg:
color, hex, r, g, b
Almond,#EFDECD,239,222,205
Dodger blue,#1E90FF,30,144,255
Meat brown,#E5B73B,229,183,59
Scarlet,#FF2000,255,32,0
Tiffany Blue,#0ABAB5,10,186,181
Violet (color wheel),#7F00FF,127,0,255
Source:
http://en.wikipedia.org/wiki/List_of_colors
-
Free Download
—
This data comes from a scrape of the Twitter social network conducted by the Monkeywrench Consultancy. The full scrape consists of 40 million users, 1.6 billion tweets, and more than 1 billion relationships between users. This dataset is a list of the number of user counts by the day on which the account was created collected from tweets sent between March 2006 and March ...
-
Offsite
—
This data set contains the raw export files of the first genome sequenced by Illumina Individual Genome Service using Illumina’s Genome Analyzer technology of paired 75-base reads. 92,254,659,274 bases were used to generate a consensus sequence with coverage of 32x average depth. The genome was obtained via peripheral blood of Jay Flatley, CEO of Illumina.
-
Offsite
—
OpenStreetMap is a free editable map of the whole world. OpenStreetMap allows you to view, edit and use geographical data in a collaborative way from anywhere on Earth. This database, usable for tile rendering, map servers, analysis, and visualization, is the OpenStreetMap planet (Planet.osm) in a database cluster: thus it can easily be attached as a new database ...
-
Offsite
—
This data set contains a complete copy of all Wikimedia wikis, in the form of wikitext source and metadata embedded in XML as provided by the Wikimedia Foundation.
The data set will be updated every month and the 3 previous months will always be available for use. We will list previous snapshots in the text of this description.
-
Offsite
—
This dataset contains a 320 GB sample of the data used to power trendingtopics.org. It includes 7 months of hourly page traffic statistics for over 2.5 Million wikipedia articles (~ 1 TB uncompressed) along with the associated wikipedia content, linkgraph, & metadata. Compiled by Peter Skomoroch at Data Wrangling, LLC on May, 31, 2009 To mount the snapshot: localmachine ...
-
Free Download
—
Locations of Flood Control (Fire & Water Restoration Company) throughout the US. Data includes:BUSINESS
ADDRESS1
ADDRESS2CITYSTATEZIPPHONEURL
-
Free Download
—
Description
A flat text list of human classified spam accounts from http://twitter.com.
Fields:
twitter_user_screen_name: twitter screen name of spam account
Source(s):
http://www.writing.com/main/view_item/item_id/1618035-Twitter-Spammers
-
Offsite
—
Two representative samples of (~1 million) Facebook users collected in April 2009 with friend list, privacy settings and network membership for each user.
One representative sample of ~13K Facebook application installations by ~300K users.
Two samples of weighted college Facebook users collected in Oct 2010.
-
Free Download
—
Restaurant Inspection Scores Search Menu Enter the restaurant name, or select a city from the list, or search for inspection done between the start and end dates. All fields are optional. If you wish to look at all ratings, simply click on “Search”. If you are unsure of a name, enter as much of the name as you know. [For example, entering “Ch” returns every ...
-
Free Download
—
A spreadsheet of the properties of usable datasets, in three levels of description detail, with suggestions for metrics for each proposed property. This is a formative exploration of what might qualify as usable and measurable dataset properties. It is subject to substantial revision.
Contact: William L. Anderson — band AT praxis101 DOT com
-
Offsite
—
california school performance data
-
Offsite
—
Articles/Briefs/ Newsletters, Bill Summaries/ Databases, Reports
Lots of tables, lists of legislators’ occupations, per diem rates, rates of various minority Legislators.