-
Free Download
—
The Statistical Abstract files are distributed by the U.S. Census Bureau as Microsoft Excel files. These files mix data with notes and references, place multiple tables on one sheet, and, worst of all, have table headers that are not easily matched to their rows and columns. A few files had extraneous characters in the title; these were corrected for consistency. A few files ...
-
Offsite
—
Quite a bit of data is available for download, but only as individual files (not in a single file). According to the web page, the collection includes:
* GINAP – code to standardize given names and correct common problems in name samples. Such standardization is an important step in the analysis of given names.
* Popular given names, US 1801 to 1999 – a collection of sets of standardized female ...
-
Free Download
—
This file consists of the 1,000 most frequently used English words as used on the Internet computer network in 1992.
-
Free Download
—
This file consists of the 1,000 most frequently used English words from a wide variety of common texts, listed in decreasing order of frequency.
-
Offsite
—
A 2006 survey of sexual frequency and satisfaction.
Via Sean Banks, you can view a spreadsheet version here
-
Offsite
—
By Geoffrey Leech, Paul Rayson, and Andrew Wilson. Books of English word frequencies have in the past suffered from severe limitations of sample size and breadth. They have also tended to be restricted to word forms alone. Most importantly, almost all have dealt only with written language. This book overcomes these limitations. It is derived from ...
-
Free Download
—
1,185 King James Version frequent substrings (KJVfreq.txt): the 1,185 most frequently occurring substrings in the King James Version Bible, ranked and counted in order of frequency.
-
Free Download
—
467 current fiction substrings (fiction.txt): the 467 most frequently occurring character sequences (n-grams) in a 1990 best-selling novel by Amy Tan.
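For readers unfamiliar with character n-grams, here is a minimal Python sketch of how such a frequency list could be produced from any text. It is not the method used to build fiction.txt; the function name `top_ngrams` and the fixed n-gram length are assumptions made for illustration.

```python
from collections import Counter


def top_ngrams(text: str, n: int, k: int) -> list[tuple[str, int]]:
    """Return the k most frequent character n-grams of length n in text.

    Hypothetical helper: the real file may combine several n-gram
    lengths or normalize the text differently.
    """
    # Slide a window of width n over the text and count each substring.
    counts = Counter(text[i:i + n] for i in range(len(text) - n + 1))
    return counts.most_common(k)


# Usage: the two most common 3-character sequences in a short sample.
print(top_ngrams("the cat and the hat", 3, 2))
```

Spaces are counted as characters here, so sequences spanning word boundaries appear in the result.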
-
Offsite
—
Three separate datasets obtained from the 1990 census. One set contains last names, one contains male first names, and one contains female first names. Each record holds the name, its frequency in percent, the cumulative frequency in percent, and the rank.
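The four fields listed above map naturally onto a small record type. The sketch below assumes each line is whitespace-delimited in the order name, frequency, cumulative frequency, rank; that layout, the `NameRecord` type, and the sample rows are all illustrative assumptions, not taken from the files themselves.

```python
from typing import NamedTuple


class NameRecord(NamedTuple):
    name: str
    freq_pct: float      # frequency in percent
    cum_freq_pct: float  # cumulative frequency in percent
    rank: int


def parse_name_line(line: str) -> NameRecord:
    """Parse one line, assuming four whitespace-separated fields."""
    name, freq, cum, rank = line.split()
    return NameRecord(name, float(freq), float(cum), int(rank))


# Made-up sample rows for illustration only (not real census values).
sample = """EXAMPLEA 1.000 1.000 1
EXAMPLEB 0.500 1.500 2"""

records = [parse_name_line(ln) for ln in sample.splitlines() if ln.strip()]
for rec in records:
    print(rec.rank, rec.name, rec.freq_pct, rec.cum_freq_pct)
```

If the actual files use fixed-width columns or a different field order, only `parse_name_line` would need to change.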
-
Offsite
—
Here are the datasets backing the Google Books Ngram Viewer. These datasets were generated in July 2009; we will update these datasets as our book scanning continues, and the updated versions will have distinct and persistent version identifiers (20090715 for the current set). Each of the links will directly download a fragment of the given corpus. For ...
-
Offsite
—
Here are the datasets backing the Google Books Ngram Viewer. These datasets were generated in July 2009; we will update these datasets as our book scanning continues, and the updated versions will have distinct and persistent version identifiers (20090715 for the current set). Each of the links below will directly download a fragment of the given corpus. For instance, ...