Moby Project License
13 datasets
Information from Grady Ward (author)
This documentation, the software and/or database are:
Public Domain material by grant from the author, January, 2001.
Conditions from Project Gutenberg
Project Gutenberg asks that you include, in whole,
the header file (attached as aaPG-Readme.txt), or in
this schema file under the credit for Project Gutenberg.
Please refer there for more information.
Information from Project Gutenberg
Copyright laws are changing all over the world, be sure to check
the laws for your country before redistributing these files!!!
Please take a look at the important information in this header.
The Legal Small Print
The small print! for public domain eTexts
Why is this “Small Print!” statement here? You know: lawyers.
They tell us you might sue us if there is something wrong with
your copy of this etext, even if you got it for free from
someone other than us, and even if what’s wrong is not our
fault. So, among other things, this “Small Print!” statement
disclaims most of our liability to you. It also tells you how
you can distribute copies of this etext if you want to.
Before! You Use Or Read This Etext
By using or reading any part of this PROJECT GUTENBERG-TM
etext, you indicate that you understand, agree to and accept
this “Small Print!” statement. If you do not, you can receive
a refund of the money (if any) you paid for this etext by
sending a request within 30 days of receiving it to the person
you got it from. If you received this etext on a physical
medium (such as a disk), you must return it with your request.
About Project Gutenberg-TM Etexts
This PROJECT GUTENBERG-TM etext, like most PROJECT GUTENBERG-TM
etexts, is a “public domain” work distributed by Professor Michael
S. Hart through the Project Gutenberg Association (the “Project”).
Among other things, this means that no one owns a United States
copyright on or for this work, so the Project (and you!) can copy
and distribute it in the United States without permission and
without paying copyright royalties. Special rules, set forth below,
apply if you wish to copy and distribute this etext under the
Project’s “PROJECT GUTENBERG” trademark.
To create these etexts, the Project expends considerable
efforts to identify, transcribe and proofread public domain
works. Despite these efforts, the Project’s etexts and any
medium they may be on may contain “Defects”. Among other
things, Defects may take the form of incomplete, inaccurate or
corrupt data, transcription errors, a copyright or other
intellectual property infringement, a defective or damaged
disk or other etext medium, a computer virus, or computer
codes that damage or cannot be read by your equipment.
LIMITED WARRANTY; DISCLAIMER OF DAMAGES
But for the “Right of Replacement or Refund” described below,
[1] the Project (and any other party you may receive this
etext from as a PROJECT GUTENBERG-TM etext) disclaims all
liability to you for damages, costs and expenses, including
legal fees, and [2] YOU HAVE NO REMEDIES FOR NEGLIGENCE OR
UNDER STRICT LIABILITY, OR FOR BREACH OF WARRANTY OR CONTRACT,
INCLUDING BUT NOT LIMITED TO INDIRECT, CONSEQUENTIAL, PUNITIVE
OR INCIDENTAL DAMAGES, EVEN IF YOU GIVE NOTICE OF THE
POSSIBILITY OF SUCH DAMAGES.
If you discover a Defect in this etext within 90 days of
receiving it, you can receive a refund of the money (if any)
you paid for it by sending an explanatory note within that
time to the person you received it from. If you received it
on a physical medium, you must return it with your note, and
such person may choose to alternatively give you a replacement
copy. If you received it electronically, such person may
choose to alternatively give you a second opportunity to
receive it electronically.
THIS ETEXT IS OTHERWISE PROVIDED TO YOU “AS-IS”. NO OTHER
WARRANTIES OF ANY KIND, EXPRESS OR IMPLIED, ARE MADE TO YOU AS
TO THE ETEXT OR ANY MEDIUM IT MAY BE ON, INCLUDING BUT NOT
LIMITED TO WARRANTIES OF MERCHANTABILITY OR FITNESS FOR A
PARTICULAR PURPOSE.
Some states do not allow disclaimers of implied warranties or
the exclusion or limitation of consequential damages, so the
above disclaimers and exclusions may not apply to you, and you
may have other legal rights.
You will indemnify and hold the Project, its directors,
officers, members and agents harmless from all liability, cost
and expense, including legal fees, that arise directly or
indirectly from any of the following that you do or cause:
[1] distribution of this etext, [2] alteration, modification,
or addition to the etext, or [3] any Defect.
DISTRIBUTION UNDER “PROJECT GUTENBERG-TM”
You may distribute copies of this etext electronically, or by
disk, book or any other medium if you either delete this
“Small Print!” and all other references to Project Gutenberg, or:
1. Only give exact copies of it. Among other things, this requires
that you do not remove, alter or modify the etext or this “small
print!” statement. You may however, if you wish, distribute this
etext in machine readable binary, compressed, mark-up, or
proprietary form, including any form resulting from conversion by
word processing or hypertext software, but only so long as EITHER:
- The etext, when displayed, is clearly readable, and does not
contain characters other than those intended by the author of the
work, although tilde (~), asterisk (*) and underline (_)
characters may be used to convey punctuation intended by the
author, and additional characters may be used to indicate
hypertext links; OR
- The etext may be readily converted by the reader at no expense
into plain ASCII, EBCDIC or equivalent form by the program that
displays the etext (as is the case, for instance, with most word
processors); OR
- You provide, or agree to also provide on request at no
additional cost, fee or expense, a copy of the etext in its
original plain ASCII form (or in EBCDIC or other equivalent
proprietary form).
2. Honor the etext refund and replacement provisions of this “Small
Print!” statement.
3. Pay a trademark license fee to the Project of 20% of the gross
profits you derive calculated using the method you already use to
calculate your applicable taxes. If you don’t derive profits, no
royalty is due. Royalties are payable to “Project Gutenberg
Literary Archive Foundation” the 60 days following each date you
prepare (or were legally required to prepare) your annual (or
equivalent periodic) tax return. Please contact us beforehand to
let us know your plans and to work out the details.
What If You Want To Send Money Even If You Don’t Have To?
The Project gratefully accepts contributions of money, time, public
domain etexts, and royalty free copyright licenses. If you are
interested in contributing scanning equipment or software or other
items, please contact Michael Hart at: firstname.lastname@example.org
end the small print! for public domain etexts ver.04.07.00
Free Download — 467 current fiction substrings (fiction.txt) The 467 most frequently occurring character sequences (n-grams) in a best-selling novel by Amy Tan in 1990.
Free Download — 6,213 acronyms (acronyms.txt) common acronyms & abbreviations
Free Download — Over 354,000 single words, excluding proper names, acronyms, or compound words and phrases. This list does not exclude archaic words or significant variant spellings.
Free Download — A word list with over 100,000 entries that are officially permitted in crossword games like Scrabble™. This word list is available in a simple, alphabetically ordered Excel format, making it convenient for reference, for spell-checking, or, in more sophisticated applications, for developers looking to build a custom spelling dictionary. The entries include variants of words: ...
Free Download — 4,160 official crosswords delta (crswd-d.txt) When combined with the 113,809 crosswords file, it produces the official crossword list compatible with the second edition of the Official Scrabble Players Dictionary. (Scrabble is a registered trademark of Milton-Bradley licensed to Merriam-Webster.)
Free Download — 1,185 King James Version frequent substrings (KJVfreq.txt) The most frequently occurring 1,185 substrings in the King James Version Bible ranked and counted by order of frequency.
Free Download — This file consists of the 1,000 most frequently used English words as used on the Internet computer network in 1992.
Free Download — 21,986 names (names.txt) This database contains the most common names used in the United States and Great Britain. Spelling checkers may want to supplement their basic word list with this one.
Free Download — 4,946 female names (names-f.txt) Frequent given names of females in English speaking countries. Spelling checkers may want to supplement their basic word list with this one.
Free Download — 3,800 male names Frequent given names of males in English-speaking countries. Spelling checkers may want to supplement their basic word list with this one.
Free Download — A common word list with over 250,000 entries of hyphenated, capitalized and compound English words. The download consists of entries containing more than one word, as well as capitalized words and acronyms. Phrases are considered “common” if they or variations of them occur in a standard dictionary or thesaurus. This word list is available in a simple, ...
Free Download — 366 often misspelled words (oftenmis.txt) many of the most commonly misspelled words in English speaking countries
Free Download — A list of more than 10,000 U.S. place names. This U.S. place name list is available in a simple, alphabetically ordered .txt format, making it convenient for reference, for spell-checking, or, in more sophisticated applications, for developers looking to build a custom location tool or database. The entries represent a sampling of U.S. place names: 10,196 places in total.
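The crosswords delta entry above says that combining crswd-d.txt with the 113,809-entry crosswords file yields the full official list. A minimal sketch of that merge follows, assuming each file holds one entry per line and can be read as Latin-1 text; neither assumption is confirmed by the listing, and the base file name used in the usage note is hypothetical.

```python
from pathlib import Path


def load_words(path):
    """Read one entry per line, skipping blank lines (assumed file layout)."""
    # Latin-1 is an assumption; it decodes any byte sequence without error.
    text = Path(path).read_text(encoding="latin-1")
    return {line.strip() for line in text.splitlines() if line.strip()}


def merge_word_lists(base_path, delta_path):
    """Return the sorted union of the entries in two word-list files."""
    return sorted(load_words(base_path) | load_words(delta_path))
```

For example, `merge_word_lists("crosswords-base.txt", "crswd-d.txt")` would return the combined list, where `crosswords-base.txt` stands in for whatever name the 113,809-entry file has locally. Using a set union means an entry present in both files appears only once in the result.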