Category

Showing 1 - 20 out of 35 datasets

Social Networks

Not finding the data sets you're looking for? Not all of our data sets are categorized yet. Try checking out tags instead.
  • Twitter Census :: Developer Tools - Mapping from Twitter User Search ID to Twitter API IDs

    Free Download — Twitter data from millions of tweets! This is a download of Twitter data from March 2006 to November 2009. The data comes from analysis on the full set of tweets during that time period, which is 35 million users, over 500 million tweets, and more than 1 billion relationships between users. This dataset maps Twitter screen names to a user’s corresponding Twitter API ID ...
  • Enron Email Dataset

    Offsite — From the CALO Project at Carnegie-Mellon University a massive dataset of emails recovered from discovery documents in the Enron trials About This dataset was collected and prepared by the CALO Project (A Cognitive Assistant that Learns and Organizes). It contains data from about 150 users, mostly senior management of Enron, organized into folders. The corpus contains a ...
  • Twitter Census - Conversation Metrics: One Year of URLs, Hashtags, Smileys Usage (by Hour)

    Free Download — Twitter data from millions of tweets! This is a download of Twitter data from March 2006 to November 2009. The data set consists of “tokens,” which are hashtags (#data), URLs, or emoticons (Twitter smileys or other “faces” created using keyboard characters). The data comes from analysis on the full set of tweets during that time period, which is 40 million users, 1.6 ...
  • Twitter Census - Conversation Metrics: One year of URLs, Hashtags, Smileys usage (Smiley Counts)

    Free Download — Twitter smiley data from millions of tweets! This is a free download of Twitter data from March 2006 to November 2009. The smiley data comes from analysis on the full set of tweets during that time period, which is 35 million users, over 500 ...
  • Twitter Census - Conversation Metrics: One year of URLs, Hashtags, Smileys usage (monthly)

    Free Download — Twitter data from millions of tweets! This is a download of Twitter data from March 2006 to November 2009. The data set consists of “tokens,” which are hashtags (#data), URLs, or emoticons (Twitter smileys or other “faces” created using keyboard characters). The data comes from analysis on the full set of tweets during that time period, which is 35 million users, over ...
  • Twitter Census: Trst Rank

    Free Download — The service for this API has ceased Our apologies for the inconvenience this may cause. You can find a download of the data set for this API on this page Twitter influence metrics with the click of a button! Trstrank measures Twitter user reputation, importance and influence in a way far more robust than counting the number of followers. It is a sophisticated measure ...
  • Marvel Universe Social Graph

    Free Download — A fun Marvel Comics character collaboration graph constructed by Cesc Rosselló, Ricardo Alberich, and Joe Miro from the University of the Balearic Islands. The Marvel Universe, that is, the artificial world that takes place in the universe of the Marvel comic books, is an example of a social collaboration network. They compare the characteristics of this universe to ...
  • Disasters worldwide from 1900-2008

    Free Download — Disaster data from 1900 – 2008, organized by start and end date, country (and sub-location), disaster type (and sub-type), disaster name, cost, and persons killed and affected by the disaster. Create disaster data trend reporting, based on geography, frequency, date or nature of the event. Design a visualization or time lapse illustrating disaster events around the ...
  • Stanford Large Network Dataset Collection

    Offsite — Stanford Large Network Dataset Collection Social networks: online social networks, edges represent interactions between people Communication networks: email communication networks with edges representing communication Citation networks: nodes represent papers, edges represent citations Collaboration networks: nodes represent scientists, edges represent collaborations ...
  • Twibs : Find the Businesses on Twitter

    Offsite — Twibs was created by a small group of people with one purpose: Give twitter users a place to find businesses on twitter. The Twibs founders are big believers in the power of twitter to connect customers with businesses. They are working on making it easy for consumers to find businesses, both local and national. Keep in mind, they’re just getting started, so there may be ...
  • Twitter Census: Twitter Users by Location

    Free Download — Twitter location data from millions of users! This is a free download of Twitter user location data collected from March 2006 to March 2010. The location data comes from analysis on the full set of tweets during that time period, which is 40 million users, 1.6 billion tweets, and more than 1 billion relationships between users. This dataset has basically two fields: the ...
  • EMDAT - The International Emergency Disasters Database

    Offsite — Description From front page: > Since 1988 the WHO Collaborating Centre for Research on the Epidemiology of Disasters (CRED) has been maintaining an Emergency Events Database EM-DAT. EM-DAT was created with the initial support of the WHO and the Belgian Government. > > The main objective of the database is to serve the purposes of humanitarian action at national and ...
  • TAGora » Integrated IMDB and Netflix Dataset

    Offsite — To support the investigation of communal data structures, such as folksonomies, in the context of recommendation, we have created a large knowledge base about movies and how users rate movies. To achieve this, a large portion of the Internet Movie Database (IMDB) was downloaded from to provide information about movies, actors and production personnel, as well a large set ...
  • Features and Friends - Color spectra and Image Features of 20k user profile pictures with user stats

    Offsite — “Features_and_friends.csv” contains 33 image features for 19,217 MySpace profile pictures. Also included is the number of friends for each user in the sample. The columns are (roughly): n – Number of brightness levels pn – A measure ...
  • Twitter Census: Smileys

    Free Download — Twitter smiley data from billions of tweets! This is a free download of Twitter data from March 2006 to November 2009. The data set consists of smileys, or emoticons that follow a convention similar to these examples: . :-) ;-) :D, etc. The data comes from analysis on the full set of tweets during that time period, which is 40 million users, 1.6 billion tweets, and ...
  • Twitter Census: Hashtags, URLs, Smileys by Month

    Free Download — Twitter data from millions of tweets! This is a download of Twitter data from March 2006 to November 2009. The data set consists of “tokens,” which are hashtags (#data), URLs, or emoticons (Twitter smileys or other “faces” created using keyboard characters). The data comes from analysis on the full set of tweets during that time period, which is 40 million users, 1.6 ...
  • Twitter Census: Tweets by Hour Tweeted

    Free Download — This data comes from a scrape of the Twitter social network conducted by the Monkeywrench Consultancy. The full scrape consists of 40 million users, 1.6 billion tweets, and more than 1 billion relationships between users. This dataset is a list of the number of tweet counts by the hour in which the tweet was created collected from tweets sent between March 2006 and March ...
  • Twitter Census: Stock Tweets

    Free Download — Stock tweets from millions of Twitter posts leading up to, during, and after the Global Financial Crisis (March 2006 – March 2010). The data comes from analysis of 1.6 billion tweets during that time period, from approximately 40 million users. This data set includes 2.3 million stock tweets with the ticker symbol and keyword references. Twitter users will post the ...
  • Twitter Census: Hashtags, URLs, Smileys by Day

    Free Download — Twitter data from millions of tweets! This is a download of Twitter data from March 2006 to November 2009. The data set consists of “tokens,” which are hashtags (#data), URLs, or emoticons (Twitter smileys or other “faces” created using keyboard characters). The data comes from analysis on the full set of tweets during that time period, which is 40 million users, 1.6 ...
  • My Facebook Photo Likes

    Free Download — My Facebook Photo Likes