Infochimps Logo
Products Solutions Documentation Blog Login

We’re a Ruby shop that works with big data. Here are the open source tools that we’ve contributed to or built to make this happen.

Many thanks to the other contributors to these projects. If you are one of them, maybe you should join our team.

Wukong is Ruby for Hadoop—it makes Hadoop so easy a chimpanzee can use it.

  • owner
  • contributors
  • main contributor

Cluster_chef is a powerful tool for maintaining and describing the software configurations that let a machine provide its services.

  • owner
  • contributors
  • main contributor

Swineherd is for running scripts and workflows on filesystems.

  • owner
  • contributors
  • main contributor

HbaseBulkloader is a bulkloader for HBase that explores various strategies. Includes Apache Pig load and store functions.

  • owner
  • contributors
  • main contributor

IMW is the Infinite Monkeywrench (IMW) is a Ruby frameworks to simplify the tasks of acquiring, extracting, transforming, loading, and packaging data.

  • owner
  • contributors
  • main contributor

Wonderdog is a bulkloader for Elastic Search. Includes a simple storefunc for Apache Pig.

  • owner
  • contributors
  • main contributor

The ChimpMARK-2010 is a collection of massive real-world data sets, interesting real-world problems, and simple example code to solve them.

  • owner
  • contributors
  • main contributor

Infochimps

  • About us
  • Team
  • Careers
  • Press

Explore

  • Geo and IP APIs
  • Social Media APIs
  • Datasets
  • Solutions

Developers

  • Documentation
  • Code Examples
  • HOWTO Guides
  • Labs

Legal Stuff

  • Terms of use
  • Security
  • Privacy
  • Copyright

Help

  • FAQ
  • Feature Request
  • Request Data
  • Contact

Follow us

  • TwitterTwitter
  • FacebookFacebook
  • RssBlog

© 2012 Infochimps, Inc. All Rights Reserved.