wf-cas9 documentation

By EPI2ME Labs
2 min read

wf-cas9

wf-cas9 is a nextflow workflow for the multiplexed analysis of Oxford Nanopore Cas9 enrichment sequencing.

Introduction

The ONT Cas9 sequencing kit allows the enrichment of genomic regions of interest by amplifying target regions from adapters ligated to Cas9 cleavage sites. The purpose of this workflow is to assess the effectiveness of such Cas9 enrichment, but it can be applied to other enrichment approaches. The workflow outputs help assess the effectiveness of the enrichement strategy and can be used to diagnose issues such as poorly performing probes.

Inputs to the workflow are: a reference genome file, FASTQ reads from enrichment sequencing, and a BED file detailing the regions of interests (targets). The main outputs are a report containing summary statistics and plots which give an overview of the enrichment, and a BAM file with target-overlapping reads.

The main steps of the workflow are alignemnt of reads to the genome using minimap2 and the analaysis of read-target overlap with bedtools.

Quickstart

The workflow uses nextflow to manage compute and software resources, as such nextflow will need to be installed before attempting to run the workflow.

The workflow can currently be run using either Docker or singularity to provide isolation of the required software. Both methods are automated out-of-the-box provided either docker or singularity is installed.

It is not required to clone or download the git repository in order to run the workflow. For more information on running EPI2ME Labs workflows visit out website.

Workflow options

To obtain the workflow, having installed nextflow, users can run:

nextflow run epi2me-labs/wf-cas9 --help

to see the options for the workflow.

The main inputs are:

  • Folder of FASTQ reads.
  • Genome reference file.
  • Target BED file with 4 columns:
    • chromosome
    • start
    • end
    • target_name

To test on a small dataset with two targets and two chromosomes: first download and unpack the demo data

wget https://ont-exd-int-s3-euwst1-epi2me-labs.s3.amazonaws.com/wf-cas9/wf-cas9-demo.tar.gz \
&& tar -xvf wf-cas9-demo.tar.gz
```shell
nextflow run epi2me-labs/wf-cas9 \
--fastq wf-cas9-demo/fastq/ \
--reference_genome wf-cas9-demo/grch38/grch38_chr19_22.fa.gz \
--targets wf-cas9-demo/targets.bed

Workflow outputs

The primary outputs of the workflow include:

  • A folder per sample containing:
    • BAM file filtered to contain reads overlapping with targets (*_on_target.bam).
    • BED file with alignment information for on-target reads (*on_target.bed).
    • A simple text file providing a summary of sequencing reads (*.stats).
  • sample_summary.csv - read and alignment summary for each sample.
  • target_summary.csv - read and alignment summary for reads overlapping each target.
  • A combined HTML report detailing the primary findings of the workflow across all samples including:
    • Sequencing quality plots.
    • Tables summarising targeted sequencing results.
    • Plots of stranded coverage at each target.
    • Histograms of on and off-target coverage for each sample.
    • Off-target hotspot region tables.

Share

EPI2ME Labs

EPI2ME Labs

Senior Button Pusher

Quick Links

TutorialsWorkflowsOpen DataContact

Social Media

© 2020 - 2023 Oxford Nanopore Technologies plc. All rights reserved. Registered Office: Gosling Building, Edmund Halley Road, Oxford Science Park, OX4 4DQ, UK | Registered No. 05386273 | VAT No 336942382. Oxford Nanopore Technologies, the Wheel icon, EPI2ME, Flongle, GridION, Metrichor, MinION, MinIT, MinKNOW, Plongle, PromethION, SmidgION, Ubik and VolTRAX are registered trademarks of Oxford Nanopore Technologies plc in various countries. Oxford Nanopore Technologies products are not intended for use for health assessment or to diagnose, treat, mitigate, cure, or prevent any disease or condition.