Bunsen lets users load, transform, and analyze FHIR data with Apache Spark. It offers Java and Python APIs that convert FHIR resources into Spark Datasets, which can then be explored with the full power of that platform, including Spark SQL. For details, see the Bunsen documentation.
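As a rough illustration of the idea (FHIR resources flattened into rows and then queried with SQL), here is a minimal sketch. It deliberately does not use Bunsen or Spark: Python's built-in sqlite3 module stands in for Spark SQL so the example runs without a cluster. The sample Patient resources, the chosen columns, and the helper names `patient_rows` and `count_by_gender` are all hypothetical and are not part of Bunsen's API; see the Bunsen documentation for the real calls.

```python
import json
import sqlite3

# Made-up FHIR-style Patient resources (illustrative sample data only).
PATIENTS_JSON = """
[
  {"resourceType": "Patient", "id": "p1", "gender": "female", "birthDate": "1980-04-12"},
  {"resourceType": "Patient", "id": "p2", "gender": "male", "birthDate": "1975-09-30"},
  {"resourceType": "Patient", "id": "p3", "gender": "female", "birthDate": "2001-01-05"}
]
"""


def patient_rows(resources):
    """Flatten Patient resources into (id, gender, birth_date) tuples.

    Hypothetical helper: the column choice is an assumption for this sketch.
    """
    return [
        (r["id"], r.get("gender"), r.get("birthDate"))
        for r in resources
        if r.get("resourceType") == "Patient"
    ]


def count_by_gender(resources):
    """Load the flattened rows into an in-memory table and aggregate with SQL.

    sqlite3 is a stand-in for Spark SQL here; it is not what Bunsen uses.
    """
    conn = sqlite3.connect(":memory:")
    try:
        conn.execute("CREATE TABLE patient (id TEXT, gender TEXT, birth_date TEXT)")
        conn.executemany(
            "INSERT INTO patient VALUES (?, ?, ?)", patient_rows(resources)
        )
        cursor = conn.execute(
            "SELECT gender, COUNT(*) FROM patient GROUP BY gender ORDER BY gender"
        )
        return cursor.fetchall()
    finally:
        conn.close()


if __name__ == "__main__":
    print(count_by_gender(json.loads(PATIENTS_JSON)))
```

With Bunsen, the flattening step is handled by the library and the query runs on Spark, so the same kind of aggregation can scale to datasets far larger than fit in a single process.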


Bunsen is built and tested with Apache Maven and follows the standard Maven lifecycle to build, install, and deploy it (for example, mvn clean install).

User documentation is built with Sphinx. PySpark must be installed in the environment to generate the Python documentation. With that in place, run make html in the docs directory to build the documentation, and make deploy in the same directory to publish it to the GitHub Pages site.


Bunsen is hosted in the Maven Central repository.


Bunsen's Java code should follow the Google Java Style Guide.


Please use GitHub issues to record any requests or issues for this project.




Copyright 2017 Cerner Innovation, Inc.

Licensed under the Apache License, Version 2.0 (the "License"); you may not use this file except in compliance with the License. You may obtain a copy of the License at

    http://www.apache.org/licenses/LICENSE-2.0


Unless required by applicable law or agreed to in writing, software distributed under the License is distributed on an "AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. See the License for the specific language governing permissions and limitations under the License.