TOBIAS - Transcription factor Occupancy prediction By Investigation of ATAC-seq Signal

PyPI Version install with bioconda bioRxiv badge

Introduction

ATAC-seq (Assay for Transposase-Accessible Chromatin using high-throughput sequencing) is a sequencing assay for investigating genome-wide chromatin accessibility. The assay applies a Tn5 Transposase to insert sequencing adapters into accessible chromatin, enabling mapping of regulatory regions across the genome. Additionally, the local distribution of Tn5 insertions contains information about transcription factor binding due to the visible depletion of insertions around sites bound by protein - known as footprints.

TOBIAS is a collection of command-line bioinformatics tools for performing footprinting analysis on ATAC-seq data, and includes:

For information on each tool, please see the wiki.

Installation

TOBIAS is written as a python package and can be quickly installed via pip:

$ pip install tobias

TOBIAS is also available as a conda package on the Bioconda channel:

$ conda install tobias -c bioconda

Please see the installation page for more info.

Usage

All tools are available through the command-line as TOBIAS <TOOLNAME>, for example:

$ TOBIAS ATACorrect
__________________________________________________________________________________________

                                   TOBIAS ~ ATACorrect
__________________________________________________________________________________________

ATACorrect corrects the cutsite-signal from ATAC-seq with regard to the underlying
sequence preference of Tn5 transposase.

Usage:
TOBIAS ATACorrect --bam <reads.bam> --genome <genome.fa> --peaks <peaks.bed>

Output files:
- <outdir>/<prefix>_uncorrected.bw
- <outdir>/<prefix>_bias.bw
- <outdir>/<prefix>_expected.bw
- <outdir>/<prefix>_corrected.bw
- <outdir>/<prefix>_atacorrect.pdf

(...)

Overview and command-line examples

Pipelines

While each TOBIAS tool can be run independently, they are developed to be run as part of an analysis pipeline. We provide ready-made pipelines for performing bias-correction, footprinting, differential binding and visualization for multiple conditions automatically.

Snakemake pipeline
We provide a pre-set snakemake workflow which is found here.

Nextflow pipeline
You can also run the TOBIAS tool as a nextflow pipeline. The pre-set workflow can be found here.

Nextflow kubernetes/de.NBI cloud aware pipeline
We also provide the TOBIAS nextflow pipeline for a cloud computing environment. One version utilizes a kubernetes framework, and a second version utilizing a webbased job scheduler, started automatically within a local TOBIAS run, making use of the de.NBI cloud.

License

This project is licensed under the MIT license.

Contact

Mette Bentsen (mette.bentsen (at) mpi-bn.mpg.de)