Overview Build Status Maven Central

Herd is big data governance for the cloud. The herd unified data catalog helps separate compute from storage in the cloud. Herd job orchestration manages your ETL and analytics processes while tracking all data in the catalog. Here is a quick summary of features:

Find out more about herd features on our GitHub project page

Quick Start

The best way to start learning about herd is through these links. The demo installation process is quick and easy - you can have herd up and running in AWS in 10-15 minutes and start registering data immediately afterwards.

Get Involved

We are actively seeking organizations and individuals that are interested in adopting herd and contributing to the development effort. Find out more in the contributions section of our GitHub project page. If you have any questions or discussion topics, post them on GitHub Issues or email us at [email protected]


Herd is licensed under Apache License 2.0