sparkMeasure


SparkMeasure is a tool for performance troubleshooting of Apache Spark workloads

SparkMeasure simplifies the collection and analysis of Spark performance metrics. Use it to troubleshoot interactive and batch Spark workloads, to collect metrics for long-term retention, or to gather metrics as part of a CI/CD pipeline. SparkMeasure is also intended as a working example of how to use Spark Listeners to collect Spark task metrics.
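The listener idea mentioned above can be pictured with a small, Spark-free toy model. This is only an illustrative sketch, not sparkMeasure's implementation and not the Spark API: the `TaskEnd` event, the `TaskMetricsListener` class, and the metric field names are all hypothetical stand-ins. It shows the general pattern of a callback receiving one event per finished task and accumulating totals per stage.

```python
from collections import defaultdict
from dataclasses import dataclass


@dataclass
class TaskEnd:
    """Hypothetical stand-in for a task-completion event (not a Spark class)."""
    stage_id: int
    executor_run_time_ms: int
    bytes_read: int


class TaskMetricsListener:
    """Toy listener: accumulates per-stage totals as task-end events arrive."""

    def __init__(self):
        self.totals = defaultdict(
            lambda: {"tasks": 0, "run_time_ms": 0, "bytes_read": 0}
        )

    def on_task_end(self, event: TaskEnd) -> None:
        stage = self.totals[event.stage_id]
        stage["tasks"] += 1
        stage["run_time_ms"] += event.executor_run_time_ms
        stage["bytes_read"] += event.bytes_read


# Simulate a scheduler delivering three task-end events to the listener.
listener = TaskMetricsListener()
for event in [TaskEnd(0, 120, 1024), TaskEnd(0, 80, 2048), TaskEnd(1, 300, 0)]:
    listener.on_task_end(event)

for stage_id, metrics in sorted(listener.totals.items()):
    print(stage_id, metrics)
```

In a real Spark application the events are delivered by Spark's listener bus and carry many more metrics; the aggregation step is the part this sketch is meant to convey.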

Getting started with sparkMeasure, by example
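A minimal sketch of a first measurement from PySpark, assuming the `sparkmeasure` Python package is installed and a matching sparkMeasure JAR is available on the Spark classpath. The helper name `measure_sql` is hypothetical and introduced only for this example; `StageMetrics`, `begin`, `end`, and `print_report` are the calls from sparkMeasure's Python API that it relies on.

```python
def measure_sql(spark, sql_text):
    """Run one SQL statement and print sparkMeasure's stage-level report.

    `spark` is an existing SparkSession; `measure_sql` is a hypothetical
    helper written for this example, not part of sparkMeasure.
    """
    # Imported here so this file still loads where sparkmeasure is absent.
    from sparkmeasure import StageMetrics

    stagemetrics = StageMetrics(spark)
    stagemetrics.begin()          # start collecting stage metrics
    spark.sql(sql_text).show()    # the workload being measured
    stagemetrics.end()            # stop collecting
    stagemetrics.print_report()   # print the aggregated metrics


# Usage, from a pyspark session where `spark` is already defined:
# measure_sql(spark, "select count(*) from range(1000) cross join range(1000)")
```

Wrapping the workload between `begin()` and `end()` keeps the measured region explicit, so only the stages triggered by that statement are included in the report.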

One tool for different use cases, links to documentation and examples

Architecture diagram

[Figure: sparkMeasure architecture diagram]

Main concepts underlying sparkMeasure

FAQ: