Category

Showing 21 - 35 out of 35 datasets

Social Networks

Not finding the data sets you're looking for? Not all of our data sets are categorized yet. Try checking out tags instead.
  • Samples of Facebook users and Facebook user application installations

    Offsite — Two representative samples of (~1 million) Facebook users collected in April 2009 with friend list, privacy settings and network membership for each user. One representative sample of ~13K Facebook application installations by ~300K users. Two samples of weighted college Facebook users collected in Oct 2010.
  • Sample of Facebook users at a college network

    Offsite — The dataset contains all the information posted on approximately 1,700 Facebook profiles by students at an anonymous, northeastern American university. Profiles were sampled at one-year intervals, beginning in 2006. Note from source, as of June 2011: The T3 dataset is still offline as we take further steps to ensure the privacy of students in the dataset. Please check ...
  • Geocities Archive

    Offsite — YES THAT IS RIGHT, WE ARE RELEASING GEOCITIES ON A TORRENT. This is going to be one hell of a torrent – the compression is happening as we speak, and it’s making a machine or two very unhappy for weeks on end. The hope had been to upload it today, but the reality is this is a lot of stuff – probably 900 gigabytes will be in the torrent itself. It’s not perfect, it’s not ...
  • Twitter Development Talk - API Documentation

    Offsite
  • Twitter Census: Twitter Users by Background Color

    Free Download — This data comes from a scrape of the Twitter social network conducted by the Monkeywrench Consultancy. The full scrape consists of 40 million users, 1.6 billion tweets, and more than 1 billion relationships between users. This dataset is a list of user profile background color counts collected from user profiles between March 2006 and March 2010. Each color is listed as ...
  • Twitter Census: Twitter Users by Friends Count

    Free Download — Twitter follower data from millions of users! This is a free download of Twitter account follower data collected from March 2006 to March 2010. The Twitter data in this download comes from analysis on the full set of tweets during that time period, which is 40 million users, 1.6 billion tweets, and more than 1 billion relationships between users. Infochimps uses a tool ...
  • Twitter Census: Twitter Users by Followers Count

    Free Download — This data comes from a scrape of the Twitter social network conducted by the Monkeywrench Consultancy. The full scrape consists of 40 million users, 1.6 billion tweets, and more than 1 billion relationships between users. This dataset is a list of user counts for the number of followers collected from user profiles between March 2006 and March 2010. The number of ...
  • Twitter Census: Twitter Users by Month Added

    Free Download — This data comes from a scrape of the Twitter social network conducted by the Monkeywrench Consultancy. The full scrape consists of 40 million users, 1.6 billion tweets, and more than 1 billion relationships between users. This dataset is a list of the number of user counts by the month in which the account was created collected from tweets sent between March 2006 and ...
  • Twitter Census: Twitter Users by Day Added

    Free Download — This data comes from a scrape of the Twitter social network conducted by the Monkeywrench Consultancy. The full scrape consists of 40 million users, 1.6 billion tweets, and more than 1 billion relationships between users. This dataset is a list of the number of user counts by the day on which the account was created collected from tweets sent between March 2006 and March ...
  • Twitter Census: Twitter Users by Hour Added

    Free Download — This data comes from a scrape of the Twitter social network conducted by the Monkeywrench Consultancy. The full scrape consists of 40 million users, 1.6 billion tweets, and more than 1 billion relationships between users. This dataset is a list of the number of user counts by the hour in which the account was created collected from tweets sent between March 2006 and ...
  • Twitter Census: Stock Tweets

    Free Download — Stock tweets from millions of Twitter posts leading up to, during, and after the Global Financial Crisis (March 2006 – March 2010). The data comes from analysis of 1.6 billion tweets during that time period, from approximately 40 million users. This data set includes 2.3 million stock tweets with the ticker symbol and keyword references. Twitter users will post the ...
  • Twitter Census: Hashtags, URLs, Smileys by Day

    Free Download — Twitter data from millions of tweets! This is a download of Twitter data from March 2006 to November 2009. The data set consists of “tokens,” which are hashtags (#data), URLs, or emoticons (Twitter smileys or other “faces” created using keyboard characters). The data comes from analysis on the full set of tweets during that time period, which is 40 million users, 1.6 ...
  • Twitter Census: Hashtags, URLs, Smileys by Hour

    Free Download — Twitter data from millions of tweets! This is a download of Twitter data from March 2006 to November 2009. The data set consists of “tokens,” which are hashtags (#data), URLs, or emoticons (Twitter smileys or other “faces” created using keyboard characters). The data comes from analysis on the full set of tweets during that time period, which is 40 million users, 1.6 ...
  • Twitter Sentiment Dataset 2008 Debates

    Offsite — Twitter Sentiment Dataset from the 1st 2008 Presidential Debate Nick Diakopoulos and myself analysed the sentiment of tweets to characterize the Presidential debates. You can read about it in this paper. For this work, we collected sentiment judgements on 3,238 tweets from the first 2008 Presidential debate. Some notes about this data: Twitter owners own their tweets. ...
  • Enron Email Data with Manager-Subordinate Relationship Metadata

    Free Download — GraphML Representation of the Enron Email Dataset – Version 0.12 Overview This dataset contains a representation of the Enron email dataset derived from the MySQL representations previously released by USC/ISI 1 and UC Berkeley 2. In addition, it contains ground truth about a set of manager-subordinate relationships within the company that existed between January 2000 ...