Tag

corpora

Showing 21 - 32 out of 32 datasets
  • Word List - Official Scrabble (TM) Player's Dictionary (OSPD) 2nd ed (with Definitions, Excel format

    Free Download — 4,160 official crosswords delta (crswd-d.txt) When combined with the 113,809 crosswords file, it produces the official crossword list compatible with the second edition of the Official Scrabble Players Dictionary. (Scrabble is a registered ...
  • Word List - List of Acronyms

    Free Download — 6,213 acronyms (acronyms.txt) common acronyms & abbreviations
  • Word List - 350,000+ Words

    Free Download — Over 354,000 single words, excluding proper names, acronyms, or compound words and phrases. This list does not exclude archaic words or significant variant spellings.
  • Word List - Official Scrabble (TM) Player's Dictionary (OSPD) 2nd ed

    Free Download — 4,160 official crosswords delta (crswd-d.txt) When combined with the 113,809 crosswords file, it produces the official crossword list compatible with the second edition of the Official Scrabble Players Dictionary. (Scrabble is a registered trademark of Milton-Bradley licensed to Merriam-Webster.)
  • Word List - 21,000+ Common Given Names (US & Great Britain)

    Free Download — 21,986 names (names.txt) This database contains the most common names used in the United States and Great Britain. Spelling checkers may want to supplement their basic word list with this one.
  • Word List - 4,900+ Common Female Given Names (English-speaking Countries)

    Free Download — 4,946 female names (names-f.txt) Frequent given names of females in English speaking countries. Spelling checkers may want to supplement their basic word list with this one.
  • Word List - 3,800+ Common Male Given Names (English-speaking Countries)

    Free Download — 3,800 male names Frequent given names of male in English speaking countries. Spelling checkers may want to supplement their basic word list with this one.
  • Word List - Commonly Misspelled English Words

    Free Download — 366 often misspelled words (oftenmis.txt) many of the most commonly misspelled words in English speaking countries
  • USPTO (US Patent Office) patents: Bulk Downloads of Full Text, Scans or OCR

    Offsite — The following USPTO patent products are available for free download from Google. Patent Grants Patent Grant Multi-Page Images (1790 – present) Patent Grant Full Text with Embedded Images (2001 – present) Patent Grant Full Text (1976 – present) Patent Grant Bibliographic Data (1976 – present) Patent Grant OCR Text (1920 – 1979) Patent Grant Single-Page Images (Oct ...
  • Google Labs - Books Ngram Viewer

    Offsite — Here are the datasets backing the Google Books Ngram Viewer. These datasets were generated in July 2009; we will update these datasets as our book scanning continues, and the updated versions will have distinct and persistent version identifiers (20090715 for the current set). Each of the links below will directly download a fragment of the given corpus. For instance, ...
  • Twitter Wordbag

    No Data — The service for this API has ceased Our apologies for the inconvenience this may cause. Twitter Wordbag unlocks word importance and frequency data across the Twitter universe. Discover any Twitter user’s word usage frequency, or most characteristic words, relative to the average of all Twitter users, by simply querying a screen name or user ID. This Twitter word API ...
  • Twitter Word Usage

    No Data — The service for this API has ceased Our apologies for the inconvenience this may cause. Twitter word statistics for any term! Discover how commonly a given word is used on Twitter. This Twitter word API allows you to query a term and access valuable statistics about its usage on Twitter. Query any term to obtain word usage statistics, including global frequency, ...