Tag: corpus

  • USPTO (US Patent Office) patents: Bulk Downloads of Full Text, Scans or OCR

    Offsite — The following USPTO patent products are available for free download from Google.

    Patent Grants:
      - Patent Grant Multi-Page Images (1790 – present)
      - Patent Grant Full Text with Embedded Images (2001 – present)
      - Patent Grant Full Text (1976 – present)
      - Patent Grant Bibliographic Data (1976 – present)
      - Patent Grant OCR Text (1920 – 1979)
      - Patent Grant Single-Page Images (Oct ...
  • Google Labs - Books Ngram Viewer

    Offsite — Here are the datasets backing the Google Books Ngram Viewer. These datasets were generated in July 2009; we will update these datasets as our book scanning continues, and the updated versions will have distinct and persistent version identifiers (20090715 for the current set). Each of the links below will directly download a fragment of the given corpus. For instance, ...