Tag

token

5 datasets
  • Word Frequencies in Written & Spoken English from British National Corpus (100M-word)

    Offsite — by Geoffrey Leech, Paul Rayson, Andrew Wilson. Books of English word frequencies have in the past suffered from severe limitations of sample size and breadth. They have also tended to be restricted to word forms alone. Most importantly, almost all have dealt only with written language. This book overcomes these limitations. It is derived from ...
  • Twitter Census: Smileys

    Free Download — Twitter smiley data from billions of tweets! This is a free download of Twitter data from March 2006 to November 2009. The data set consists of smileys, or emoticons, that follow a convention similar to these examples: :-) ;-) :D, etc. The data comes from analysis of the full set of tweets during that time period, which is 40 million users, 1.6 billion tweets, and ...
  • Google Books Ngrams

    Offsite — Here are the datasets backing the Google Books Ngram Viewer. These datasets were generated in July 2009; we will update these datasets as our book scanning continues, and the updated versions will have distinct and persistent version identifiers (20090715 for the current set). Each of the links will directly download a fragment of the given corpus. For ...
  • Twitter Wordbag

    No Data — The service for this API has ceased. Our apologies for the inconvenience this may cause. Twitter Wordbag unlocks word importance and frequency data across the Twitter universe. Discover any Twitter user’s word usage frequency, or most characteristic words, relative to the average of all Twitter users, by simply querying a screen name or user ID. This Twitter word API ...
  • Twitter Word Usage

    No Data — The service for this API has ceased. Our apologies for the inconvenience this may cause. Twitter word statistics for any term! Discover how commonly a given word is used on Twitter. This Twitter word API allows you to query a term and access valuable statistics about its usage on Twitter. Query any term to obtain word usage statistics, including global frequency, ...