wf-cas9 is a nextflow workflow for the multiplexed analysis of Oxford Nanopore Cas9 enrichment sequencing.
The ONT Cas9 sequencing kit allows the enrichment of genomic regions of interest by amplifying target regions from adapters ligated to Cas9 cleavage sites. The purpose of this workflow is to assess the effectiveness of such Cas9 enrichment, but it can be applied to other enrichment approaches. The workflow outputs help assess the effectiveness of the enrichement strategy and can be used to diagnose issues such as poorly performing probes.
Inputs to the workflow are: a reference genome file, FASTQ reads from enrichment sequencing, and a BED file detailing the regions of interests (targets). The main outputs are a report containing summary statistics and plots which give an overview of the enrichment, and a BAM file with target-overlapping reads.
The main steps of the workflow are alignemnt of reads to the genome using minimap2 and the analaysis of read-target overlap with bedtools.
The workflow uses nextflow to manage compute and software resources, as such nextflow will need to be installed before attempting to run the workflow.
The workflow can currently be run using either Docker or singularity to provide isolation of the required software. Both methods are automated out-of-the-box provided either docker or singularity is installed.
It is not required to clone or download the git repository in order to run the workflow. For more information on running EPI2ME Labs workflows visit out website.
To obtain the workflow, having installed
nextflow, users can run:
nextflow run epi2me-labs/wf-cas9 --help
to see the options for the workflow.
The main inputs are:
To test on a small dataset with two targets and two chromosomes: first download and unpack the demo data
wget https://ont-exd-int-s3-euwst1-epi2me-labs.s3.amazonaws.com/wf-cas9/wf-cas9-demo.tar.gz \&& tar -xvf wf-cas9-demo.tar.gz```shellnextflow run epi2me-labs/wf-cas9 \--fastq wf-cas9-demo/fastq/ \--ref_genome wf-cas9-demo/grch38/grch38_chr19_22.fa.gz \--targets wf-cas9-demo/targets.bed \--full_report
The primary outputs of the workflow include:
--full_report, the report will also contain the following elements that may be useful for
diagnosing issues with the experiment. These are turned off by default as they can lead to slow loading of the