Introduction to the MR4C repo

About MR4C

MR4C is an implementation framework that allows you to run native code within the Hadoop execution framework. Pairing the performance and flexibility of natively developed algorithms with the unfettered scalability and throughput inherent in Hadoop, MR4C enables large-scale deployment of advanced data processing applications.

Map to this repo

This repository includes the user guide, tutorials, and source code for the MR4C framework created by Google Inc. We suggest you work through this repo in the following order:

  1. Make sure that you have all dependencies and build (see below).
  2. Test that the MR4C installation was successful
    • Run from the test directory
  3. Study up on MR4C
    • The material in the UserGuide directory covers the basic concepts behind MR4C
  4. Run through the example algorithms in the tutorial directory
  5. Build your own algorithm using the examples as templates and let us know if you have questions or comments!



Four scripts are included to build, clean, deploy, and remove MR4C. Build with:


Clean previous builds with:


Deploy to /usr/local/mr4c using:


Remove all components with:


If you get stuck, have questions, or would like to provide feedback, please don’t hesitate to contact us at [email protected]. Let’s do big things together.