Category

Events

  • Twitter Census - Conversation Metrics: One Year of URLs, Hashtags, Smileys Usage (by Hour)

    Free Download — Twitter data from millions of tweets! This is a download of Twitter data from March 2006 to November 2009. The data set consists of “tokens,” which are hashtags (#data), URLs, or emoticons (Twitter smileys or other “faces” created using keyboard characters). The data comes from analysis on the full set of tweets during that time period, which is 40 million users, 1.6 ...
  • Twitter Census - Conversation Metrics: One Year of URLs, Hashtags, Smileys Usage (Monthly)

    Free Download — Twitter data from millions of tweets! This is a download of Twitter data from March 2006 to November 2009. The data set consists of “tokens,” which are hashtags (#data), URLs, or emoticons (Twitter smileys or other “faces” created using keyboard characters). The data comes from analysis on the full set of tweets during that time period, which is 35 million users, over ...
  • Freebase Wikipedia Extraction (WEX)

    Offsite — The Freebase Wikipedia Extraction (WEX) is a processed dump of the English-language Wikipedia. The wiki markup for each article is transformed into machine-readable XML, and common relational features such as templates, infoboxes, categories, article sections, and redirects are extracted in tabular form. Freebase WEX is provided as a set of database tables in TSV format ...
  • Twitter Census: Trst Rank

    Free Download — The service for this API has ceased. Our apologies for the inconvenience this may cause. You can find a download of the data set for this API on this page. Twitter influence metrics with the click of a button! Trstrank measures Twitter user reputation, importance and influence in a way far more robust than counting the number of followers. It is a sophisticated measure ...
  • DBPedia Main

    Offsite — DBpedia is a community effort to extract structured information from Wikipedia and to make this information available on the Web. The DBpedia knowledge base currently describes more than 2.6 million things, including at least 213,000 persons, 328,000 places, 57,000 music albums, 36,000 films, 20,000 companies. The knowledge base consists of 274 million pieces of ...
  • Measuring Worth: Interest Rates - US, UK, China, Japan

    Offsite — The mission of the site is to make available to the public the highest quality and most reliable historical data on important economic aggregates, with particular emphasis on nominal measures. The data have been created using the highest standards of the fields of economics and history and are rigorously refereed by the most distinguished researchers in the fields. ...
  • Disasters worldwide from 1900-2008

    Free Download — Disaster data from 1900 – 2008, organized by start and end date, country (and sub-location), disaster type (and sub-type), disaster name, cost, and persons killed and affected by the disaster. Create disaster data trend reporting, based on geography, frequency, date or nature of the event. Design a visualization or time lapse illustrating disaster events around the ...
  • Twitter Census: Twitter Users by Location

    Free Download — Twitter location data from millions of users! This is a free download of Twitter user location data collected from March 2006 to March 2010. The location data comes from analysis on the full set of tweets during that time period, which is 40 million users, 1.6 billion tweets, and more than 1 billion relationships between users. This dataset has basically two fields: the ...
  • Word List - 1000 Most Frequent Words from an Internet Corpus

    Free Download — This file consists of the 1,000 most frequently used English words as used on the Internet computer network in 1992.
  • Twitter Census: Smileys

    Free Download — Twitter smiley data from billions of tweets! This is a free download of Twitter data from March 2006 to November 2009. The data set consists of smileys, or emoticons that follow a convention similar to these examples: . :-) ;-) :D, etc. The data comes from analysis on the full set of tweets during that time period, which is 40 million users, 1.6 billion tweets, and ...
  • Freebase Data Dump

    Offsite — Freebase data dumps provide all of the current facts and assertions within the Freebase system. The data dumps are complete, general-purpose extracts of the Freebase data in a variety of formats. Freebase releases a fresh data dump every three months. Freebase is an open database of the world’s information, covering millions of topics across hundreds of categories. ...
  • EUROPA - Activities of the European Union

    Offsite — This website provides approximately 3 000 summaries of EU legislation. They are offered in the form of factsheets disseminated under 32 thematic areas corresponding to the activities of the EU. The themes range from agriculture to transport, presenting comprehensive and up-to-date coverage of EU legislation. What is not covered, however, are legal decisions having only ...
  • U-Boats and their Victories Infoboxes

    Free Download — This dataset consists of a collection of Infoboxes from Wikipedia on the topic of U-Boats. The dataset lists U-Boats, the types of ships they sank, the total number of ships sunk, and tonnage.
  • 2008 United States Candidates for Nomination for the Presidential Elections

    Free Download — This dataset consists of a collection of Infoboxes from Wikipedia on the topic of 2008 candidates for nomination for the United States presidential elections. The dataset includes the homepage, campaign slogan, and logo for over a dozen nominees.
  • U S Secretary Infoboxes

    Free Download — This dataset consists of a collection of Infoboxes from Wikipedia on the topic of U S Secretary Box. The dataset lists the secretaries, their departments, years of service, the presidents they served under, and the individuals who served before and after them.
  • Start US Presidential Ticket Box Infoboxes

    Free Download — This dataset consists of a collection of Infoboxes from Wikipedia on the topic of Start U S Presidential Ticket Box.
  • Twitter Census: Tweets by Hour Tweeted

    Free Download — This data comes from a scrape of the Twitter social network conducted by the Monkeywrench Consultancy. The full scrape consists of 40 million users, 1.6 billion tweets, and more than 1 billion relationships between users. This dataset lists tweet counts by the hour in which each tweet was created, collected from tweets sent between March 2006 and March ...
  • Twitter Census: Stock Tweets

    Free Download — Stock tweets from millions of Twitter posts leading up to, during, and after the Global Financial Crisis (March 2006 – March 2010). The data comes from analysis of 1.6 billion tweets during that time period, from approximately 40 million users. This data set includes 2.3 million stock tweets with the ticker symbol and keyword references. Twitter users will post the ...
  • Twitter Census: Hashtags, URLs, Smileys by Day

    Free Download — Twitter data from millions of tweets! This is a download of Twitter data from March 2006 to November 2009. The data set consists of “tokens,” which are hashtags (#data), URLs, or emoticons (Twitter smileys or other “faces” created using keyboard characters). The data comes from analysis on the full set of tweets during that time period, which is 40 million users, 1.6 ...
  • Twitter Census: Hashtags, URLs, Smileys by Hour

    Free Download — Twitter data from millions of tweets! This is a download of Twitter data from March 2006 to November 2009. The data set consists of “tokens,” which are hashtags (#data), URLs, or emoticons (Twitter smileys or other “faces” created using keyboard characters). The data comes from analysis on the full set of tweets during that time period, which is 40 million users, 1.6 ...