The Oxford Nanopore Technologies Open Data (ont-open-data) provides reference sequencing data from Oxford Nanopore sequencing devices. Data access is provided through the Registry of Open Data on AWS.

The Oxford Nanopore Technologies Open Data project aims to provide exemplar datasets from state of the art long-read sequencing. Datasets are provided without restriction on availability or use to aid researchers, primarily in the field of genomics and transcriptomics. All data is hosted by our collaborator Amazon Web Services on the Registry of Open Data on AWS. Our previous data releases have been used to aid the development of new algorithms for small variant and structural variant calling, and in the creation of new single-cell transcriptomics analyses. Data has also been used together with datasets from other sources for the complete telomere to telomere assembly of the Genome In A Bottle sample HG002.

Use of Oxford Nanopore Open Data

The data are freely available to all and can be used for:

Exploration of the characteristics of nanopore sequence data,
Assessment and reproduction of performance benchmarks,
Example datasets for analysis on EPI2ME, Oxford Nanopore Technologies’ cloud compute infrastructure,
Development of tools and methods.

The data deposited showcases DNA sequences from a representative subset of sequencing chemistries. The datasets correspond to publicly-available reference samples including the widely available Genome In A Bottle human reference samples. Raw data are provided with metadata and scripts to describe sample and data provenance.

Latest datasets and analyses

The list below represents the most recent dataset or analysis of its class. As new datasets, basecallers, and analysis methods are developed this list will change.

Genome In a Bottle samples sequenced with R10.4.2 and LSK114.
A Tumor Normal Pair sequenced with LSK114 to high coverage.
Phased CpG modification calling with Remora in matched bisulfite and nanopore sequencing.
Reduced Representation Methylation Sequencing (RRMS) Seqeuncing of a cancer/normal pair with adaptive sampling.
D.melanogaster iso-1 D. melanogaster (iso-1) sequenced with R10.4.1 Kit 14.

We strive to keep all analyses current, though some may fall behind the more current sequencing runs and basecalling results.

Archived datasets

Historical datasets can also by found in the repository. Our previous data release blog posts are archived under the data-releases category.

Data access

All data is available under from the Registry of Open Data on AWS. See the Oxford Nanopore Open Data Tutorials page for more information.

Use of Oxford Nanopore Open Data

Latest datasets and analyses

Archived datasets

Data access

Quick Links

Tutorials Workflows Open Data Contact

Oxford Nanopore Open Data project

Use of Oxford Nanopore Open Data

Latest datasets and analyses

Archived datasets

Data access

Share

Table Of Contents

Oxford Nanopore Open Data project

.css-3n7dj1{box-sizing:border-box;margin:0;min-width:0;display:block;color:var(--theme-ui-colors-heading,#edf2f7);font-weight:bold;-webkit-text-decoration:none;text-decoration:none;margin-bottom:1rem;font-size:1.5rem;position:relative;}Use of Oxford Nanopore Open Data

Latest datasets and analyses

Archived datasets

Data access

Share

Table Of Contents

Use of Oxford Nanopore Open Data