Category

Showing 81 - 100 out of 716 datasets

Science

  • Word Lists Collection

    Offsite — The data is a smorgasbord of word lists, including spell check oriented word lists, an inflection database, parts of speech word list, jargon file word lists, the contents from Ispell, spell check dictionaries, tables that convert between American, British and Canadian spellings, and links to several other word lists.
  • Cloudiness, Wind Speed, Heating/Cooling Days, and Relative Humidity for Select Cities - 1971-2000

    Free Download — All information is airport data, except as noted. The data is from a period of record through 2005, except heating and cooling normals for period 1971-2000. The temperature is in Fahrenheit degrees. The source is the U.S. National Oceanic and ...
  • NuDat - Nuclear Structure and Decay Data

    Offsite — Nuclear atomic data from the National Nuclear Data Center for radioactive isotopes, including detailed decay schemes along with associated decay probabilities. Evaluated (recommended) nuclear structure and decay information for 3,175 nuclides, about 160,210 levels, 240,608 gamma-rays, etc. Obtained from ENSDF (Evaluated Nuclear Structure Data File) and Nuclear Wallet ...
  • Hydrofracking - Bradford PA Hydraulic Fracturing Fluid Product Disclosure

    Free Download — Hydrofracking – Bradford PA Hydraulic Fracturing Fluid Product Disclosure – ATGAS 2H CHESAPEAKE APPALACHIA LL of the Bradford PA Hydrofracking well blowout from: Daniel Spadoni | Community Relations Coordinator Department of Environmental Protection 208 West Third Street, Suite 101, Williamsport, PA 17701 Phone: (570) 327-3659 | Fax: (570) 327-3565 www.depweb.state.pa.us ...
  • The arXiv in your pocket - Downloadable Physics Pre-Print Archive

    Offsite — The arXiv Physics pre-print publishing corpus
  • Westbury Lab Usenet Corpus: 28M postings from 47000+ newsgroups 2005-2009

    Offsite — A USENET corpus (2005-2009). This corpus is a collection of public USENET postings. It was collected between Oct 2005 and Jan 2010 and covers 47,860 English-language, non-binary-file newsgroups. Despite our best efforts, this corpus includes a very small number of non-English words, non-words, and spelling errors. The corpus is untagged, raw text. It may be ...
  • Speech Accent Archive: 1200+ speech samples from a variety of language backgrounds

    Offsite — The speech accent archive uniformly presents a large set of speech samples from a variety of language backgrounds. Native and non-native speakers of English read the same paragraph and are carefully transcribed. The archive is used by people who wish to compare and analyze the accents of different English speakers. The Elicitation Paragraph: Please call Stella. Ask her ...
  • ICPSR (Inter-university Consortium for Political & Social Resource): 500,000 data sets metaindex

    Offsite — ICPSR offers more than 500,000 digital files containing social science research data. Disciplines represented include political science, sociology, demography, economics, history, gerontology, criminal justice, public health, foreign policy, ...
  • Word Frequencies in Written & Spoken English from British National Corpus (100M-word)

    Offsite — By Geoffrey Leech, Paul Rayson, and Andrew Wilson. Books of English word frequencies have in the past suffered from severe limitations of sample size and breadth. They have also tended to be restricted to word forms alone. Most importantly, almost all have dealt only with written language. This book overcomes these limitations. It is derived from ...
  • Natural Gas Hydrofracking Terms Inference Matrix 17 – Anthony Ingraffea Cornell

    Free Download — Natural Gas Hydrofracking Terms Inference Matrix 17 – Anthony Ingraffea Cornell. I have been meaning to post a demonstration dataset and with the significance and importance of the issues and points Cornell Professor Anthony Ingraffea raises from his unique experience and perspective of the natural gas industry and Hydrofracking operations pointing out it’s ...
  • Hydrofracking industry from a unique perspective-Compelling succinct qualified questions

    Free Download — All, Hydrofracking has the potential to be the number one aspect affecting society’s health and well-being. It is, or it is scheduled to be, that pervasive. In your sentiments against Hydrofracking that we share; or perhaps not, I hope you will agree that more information is necessary from the natural gas / Hydrofracking industry. Also, whether you ...
  • Natural Gas Hydrofracking Terms Inference Matrix 36 – Brad Gill IOGANY (Indep Oil & Gas Assoc of NY)

    Free Download — Natural Gas Hydrofracking Terms Inference Matrix 36 – Brad Gill IOGANY (Indep Oil & Gas Assoc of NY). This substantial Excel spreadsheet “Inference Matrix” chronicles and categorizes Internet references on web pages and online .PDF documents with ...
  • AceDB Genome Database

    Offsite — AceDB is a genome database system developed since 1989 primarily by Jean Thierry-Mieg (CNRS, Montpellier) and Richard Durbin (Sanger Institute). It provides a custom database kernel, with a non-standard data model designed specifically for handling scientific data flexibly, and a graphical user interface with many specific displays and tools for genomic data. AceDB is ...
  • Word List - 1,000+ Most Frequent words in King James Bible

    Free Download — 1,185 King James Version frequent substrings (KJVfreq.txt). The most frequently occurring 1,185 substrings in the King James Version Bible, ranked and counted by order of frequency.
  • Chemical Structure Repository

    Offsite — From the project website: Chemical Structures [was] initiated in June 2006. It aims to provide a set of organic structures, which includes 3D coordinates, InChI code, molecular weight, melting point, etc. The last release (v1.05) contains over 250 structures. License: BSD license according to [SourceForge project ...
  • Letter frequency - Substring frequency in an Amy Tan Novel

    Free Download — 467 current fiction substrings (fiction.txt). The 467 most frequently occurring character sequences (n-grams) in a best-selling novel by Amy Tan in 1990.
  • Airborne Antarctic Ozone Experiment (AAOE-87)

    Offsite — This data is from the Airborne Antarctic Ozone Experiment (AAOE) which was based in Punta Arenas, Chile during August and September 1987. The data was primarily collected onboard the NASA ER-2 and DC-8 aircraft, along with ozonesonde data collected at four Antarctic stations: Halley Bay, McMurdo, Palmer Station, and the South Pole. The experiment tested the chemical and ...
  • Word List - Official Scrabble (TM) Player's Dictionary (OSPD) 2nd ed (with Definitions, Excel format

    Free Download — 4,160 official crosswords delta (crswd-d.txt). When combined with the 113,809 crosswords file, it produces the official crossword list compatible with the second edition of the Official Scrabble Players Dictionary. (Scrabble is a registered ...
  • Mercury Crater Data

    Free Download — This dataset consists of a collection of Infoboxes from Wikipedia on the topic of Mercury Crater Data.
  • Big Four Pageants Titleholders

    Free Download — This dataset consists of a collection of Infoboxes from Wikipedia on the topic of Big Four Pageants Titleholders.