User

mrflip

Philip Kromer
1fe8a50552058b90da2c164efdcbcf6f.jpg?size=80&default=http%3a%2f%2fwww.infochimps.com%2fmarketplace%2fassets%2fgravatar-sample
Website:

http://infochimps.org

Institution:

Infochimps, inc

Uploaded datasets

Showing 21 - 40 out of 75 datasets
  • GeoNames.org Postal Code files - US Zip Code Geolocations

    Free Download — Find US zip code data and corresponding latitude and longitude in a simple text format. This zip code data set is organized alphabetically and includes every US zip code, along with the corresponding city, state, latitude and longitude. Looking for more geo data? Check out the 2010 Census Demographic Profiles API to add demographics details to your geo analysis! Format ...
  • Google Labs - Books Ngram Viewer

    Offsite — Here are the datasets backing the Google Books Ngram Viewer. These datasets were generated in July 2009; we will update these datasets as our book scanning continues, and the updated versions will have distinct and persistent version identifiers (20090715 for the current set). Each of the links below will directly download a fragment of the given corpus. For instance, ...
  • googleclusterdata - System traces of production workloads on Google clusters

    Offsite — This project is intended for the distribution of data of production workloads running on Google clusters. The first dataset (data-1), provides traces over a 7 hour period. The workload consists of a set of tasks, where each task runs on a single machine. Tasks consume memory and one or more cores (in fractional units). Each task belongs to a single job; a job may have ...
  • Growth Charts - US Children Birth to 36 months

    Offsite — For US Boys and Girls, percentile charts for Length-for-age Weight-for-age Head circumference-for-age Weight-for-length Stature-for-age Weight-for-age Body mass index-for-age
  • HIV Drug Resistance Database

    Offsite — The main functions of HIVDB are: To store, analyze and make available the diverse forms of data underlying drug resistance knowledge to the broad community of researchers and clinicians studying HIV drug resistance and using HIV drug resistance tests; To provide a publicly available online resource to help those performing HIV drug resistance surveillance, interpreting ...
  • Hostnames of Internet addresses suspected of SSH password authentication attacks

    Offsite — Dragon Research Group (DRG) sshpwauth report Entries consist of fields with identifying characteristics of a a source IP address that has been seen attempting to remotely login to a host using SSH password authentication. This report lists hosts that are highly suspicious and are likely conducting malicious SSH password authentication attacks. Each entry is sorted ...
  • ICPSR (Inter-university Consortium for Political & Social Resource): 500,000 data sets metaindex

    Offsite — ICPSR offers more than 500,000 digital files containing social science research data. Disciplines represented include political science, sociology, demography, economics, history, gerontology, criminal justice, public health, foreign policy, ...
  • Infochimps Test Dataset (Purchasable)

    Free Download — This is a dataset used for internal testing.
  • IQSS Dataverse Network

    Offsite — The IQSS Dataverse Network — access the world’s largest collection of social science research data here by searching across or browsing through one of the virtual data archives (called “dataverses”) listed below. You may also create a dataverse of your own, backed up in perpetuity by the Henry A. Murray Archive, which may easily be customized to appear as if it is on ...
  • List of Angel Groups and Investors

    Offsite — Up-to-date list of professional early-stage investors, with information on geographical location, industry, investment size and focus.
  • List of Dirty, Obscene, Banned and otherwise unacceptable words

    Free Download — A banned word list representing a collection of many lists from around the web of words considered socially unacceptable for one reason or another. What to do with a banned word list? Use this dirty word list to screen for spammers and griefers, to censor dissidents; to better understand the semiotic role of taboo signifiers in an online modality; to monitor user ...
  • List of publicly-accessible transit data feeds from googletransitdatafeed

    Offsite — This is a list of transit schedule data published by transit agencies and operators in GTFS format for developers to use. They contain scheduled times, stop locations, route information and optionally fare information and detailed route shapes. Another list of official GTFS data is maintained by the GTFS Data Exchange site. For details on the feed format, see the General ...
  • Measuring Worth: A collection of calculators to help answer questions of Comparative Relative Value

    Offsite — Have you ever wondered what the price of gold was in 1907? Or what the GDP was in 1932? Presented here are many useful historic data sets that we developed in order to answer such questions. Some of these are currently hosted on EH.Net. The Annual ...
  • Measuring Worth: Dollar-Pound Exchange Rate From 1791

    Offsite — Dollar-Pound Exchange Rate From 1791 An exchange rate is the price of one currency in terms of another currency. The exchange rate presented here is the price of the British pound in U.S. dollars, that is, the number of U.S. dollars per British pound. This is the traditional way in which the relative price of the two currencies is compared. The exchange rate between the ...
  • Measuring Worth: Interest Rates - US, UK, China, Japan

    Offsite — The mission of the site is to make available to the public the highest quality and most reliable historical data on important economic aggregates, with particular emphasis on nominal measures. The data have been created using the highest standards of the fields of economics and history and are rigorously refereed by the most distinguished researchers in the fields. ...
  • MLB Gameday: full Play-by-play, Box Score, Pitch and Pitch Trajectory

    Offsite — MLB Gameday provides data files describing every pitch for most games, beginning in the 2007 season. The XML files are available from their Gameday server. You might enjoy Alan M. Nathan’s description of the gameday fields Mike Fast’s many useful articles John Walsh’s strike zone exploration at the Hardball Times. This diagram of pitch trajectory fields might be ...
  • NASA - Five Millennium Catalog of Solar Eclipses: -1999 to +3000 (2000 BCE to 3000 CE)

    Offsite — Eclipses of the Sun can only occur when the Moon is near one of its two orbital nodes 1 during the New Moon phase . It is then possible for the Moon’s penumbral, umbral or antumbral shadows to sweep across Earth’s surface thereby producing an eclipse. The referenced tables summarizes all eclipses over this five millennium period (2000 BCE to 3000 CE) by century. Each ...
  • National Center for Educational Statistics (NCES): Tables and Figures

    Offsite — This search tool lets you locate all tables/figures/charts published in the inventory of NCES’ National Education Data Resource Center (NEDRC) Postsecondary Tables Library; the Condition of Education; the Digest of Education Statistics; Indicators of School Crime and Safety and other NCES publications. Tables are constantly being added (thousands of tables, graphs & ...
  • NCAA College Basketball: Measured Probability of Home team winning with lead vs time remaining

    Offsite — The WP modeling technique I use is sometimes called an ‘empirical matrix.’ I took a set of play-by-play data from recent years of NCAA regular season games 1,782 games from the past 3 years—360 thousand in-game observations in all] and divided it ...
  • Neuronal Wiring Network of the Caenorhabditis elegans roundworm

    Offsite — This data was first discussed by Chen, Hall, and Chklovskii, PNAS, March 21, 2006 vol. 103:12 pp.4723-4728. A full analysis of the following data can be found in “Structural properties of the C. elegans neuronal network” by Lav R. Varshney, Beth L. Chen, Eric Paniagua, David H. Hall and Dmitri B. Chklovskii. Provided is a compilation of an updated version of C. elegans ...