Introduction
This report contains tables and plots to help interpret the results of wf-amplicon. The workflow was run in variant calling mode. The individual sections of the report summarize the outcomes of the different steps of the workflow (read filtering, mapping against the reference file containing the amplicon sequences, variant calling).
Note: If the sequence IDs in the reference file contained special characters, they were replaced with underscores.
The input data contained:
3 samples: barcode01, barcode02, barcode03
2 amplicons: katG_NC_000962_3_2154725_2155670, rpoB_NC_000962_3_760285_761376
Note: The data was downsampled to 1500 reads per sample.
At a glance
Key results for the individual samples are shown below. You can use the dropdown menu to view the results for a different sample.
Metric | barcode01 | barcode02 | barcode03 |
---|---|---|---|
Reads | 165 | 132 | 89 |
Bases | 162,744 | 130,231 | 83,668 |
Mean length | 986.3 | 986.6 | 940.1 |
Mean quality | 13.8 | 13.8 | 13.5 |
Amplicons detected | 2 / 2 | 2 / 2 | 2 / 2 |
Mean coverage across all amplicons | 68.9 | 55.1 | 30.5 |
Smallest mean coverage for any amplicon | 66.8 | 49.4 | 3.9 |
SNVs | 2 | 2 | 2 |
Indels | 0 | 0 | 0 |
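As a quick consistency check on the per-sample values above, the reported mean read length can be recomputed from the read and base counts. This is a minimal sketch that assumes the mean length is simply total bases divided by the number of reads; the helper name `mean_length` is illustrative and not part of wf-amplicon.

```python
# (reads, bases, reported mean length), copied from the per-sample values above
AT_A_GLANCE = [
    (165, 162_744, 986.3),
    (132, 130_231, 986.6),
    (89, 83_668, 940.1),
]


def mean_length(reads: int, bases: int) -> float:
    """Total bases divided by read count, rounded to one decimal place."""
    return round(bases / reads, 1)


for reads, bases, reported in AT_A_GLANCE:
    # Print the recomputed value next to the one shown in the report
    print(f"reads={reads} bases={bases} computed={mean_length(reads, bases)} reported={reported}")
```

Under that assumption, the recomputed values agree with the reported ones for all three samples.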
Preprocessing
The table below shows basic statistics for the raw reads, for the reads remaining after the initial filtering step (based on length and mean quality), and for the reads remaining after downsampling and trimming.
Condition | Reads | Bases | Min read length | Max read length | Mean quality |
---|---|---|---|---|---|
Raw | 407 | 435.5 k | 327 | 2,147 | 13.5 |
Filtered | 407 | 435.5 k | 327 | 2,147 | 13.5 |
Downsampled, trimmed | 386 | 376.6 k | 243 | 1,139 | 13.8 |
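To put the table in relative terms, the fraction of reads and bases retained at each stage can be computed against the raw input. This is a rough sketch: the base counts are only given to the nearest 0.1 k, so the base fractions are approximate, and the names `STAGES` and `retained_fraction` are illustrative.

```python
# (reads, bases) per preprocessing stage; bases are the rounded "k" values from the table
STAGES = {
    "Raw": (407, 435_500),
    "Filtered": (407, 435_500),
    "Downsampled, trimmed": (386, 376_600),
}


def retained_fraction(stage: str, baseline: str = "Raw") -> tuple[float, float]:
    """Return (read fraction, base fraction) of `stage` relative to `baseline`."""
    reads, bases = STAGES[stage]
    base_reads, base_bases = STAGES[baseline]
    return reads / base_reads, bases / base_bases


for name in STAGES:
    read_frac, base_frac = retained_fraction(name)
    print(f"{name}: {read_frac:.1%} of reads, {base_frac:.1%} of bases")
```

Filtering removed no reads in this run, so any loss relative to the raw data comes from the downsampling and trimming step.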
The following plots show the read quality and length distributions as well as the base yield after filtering (but before downsampling / trimming) for each sample (use the dropdown menu to view the plots for the individual samples).
Summary
The two tables below (one per tab) briefly summarize the main results of mapping the reads to the provided amplicon references and subsequent variant calling. Percentages of unmapped reads are relative to the number of reads for that particular sample. Other percentages are relative to the total number of reads / bases including all samples.
Sample alias | Reads | Bases | Median read length | Amplicons | Unmapped | Variants (indels) |
---|---|---|---|---|---|---|
barcode01 | 02 | 02 | 985 | 2 | 01 | 2 (0) |
barcode02 | 01 | 01 | 987 | 2 | 00 | 2 (0) |
barcode03 | 00 | 00 | 952 | 2 | 02 | 2 (0) |
Amplicon | Reads | Bases | Median read length | Samples | Mean cov. | Mean acc. | Variants (indels) |
---|---|---|---|---|---|---|---|
katG_NC_000962_3_2154725_2155670 | 02 | 02 | 954 | 3 | 97.4 | 95.3 | 3 (0) |
rpoB_NC_000962_3_760285_761376 | 01 | 01 | 1099 | 3 | 97.1 | 95.9 | 3 (0) |
Unmapped | 00 | 00 | 951 | 3 | 0.0 | 0.0 | 0 (0) |
The following table breaks the results down further (one sample–amplicon combination per row).
Sample | Amplicon | Reads | Bases | Median read length | Mean cov. | Mean acc. | Variants (indels) |
---|---|---|---|---|---|---|---|
barcode01 | katG_NC_000962_3_2154725_2155670 | 07 | 06 | 954 | 96.9 | 95.4 | 1 (0) |
barcode01 | rpoB_NC_000962_3_760285_761376 | 08 | 08 | 1099 | 97.2 | 95.8 | 1 (0) |
barcode01 | Unmapped | 02 | 02 | 964 | 0.0 | 0.0 | 0 (0) |
barcode02 | katG_NC_000962_3_2154725_2155670 | 04 | 04 | 953 | 96.5 | 95.3 | 1 (0) |
barcode02 | rpoB_NC_000962_3_760285_761376 | 05 | 07 | 1099 | 96.7 | 95.9 | 1 (0) |
barcode02 | Unmapped | 01 | 01 | 963 | 0.0 | 0.0 | 0 (0) |
barcode03 | katG_NC_000962_3_2154725_2155670 | 06 | 05 | 952 | 98.9 | 95.2 | 1 (0) |
barcode03 | rpoB_NC_000962_3_760285_761376 | 00 | 00 | 1098 | 99.8 | 97.0 | 1 (0) |
barcode03 | Unmapped | 03 | 03 | 937 | 0.0 | 0.0 | 0 (0) |
Depth of coverage
Coverage along the individual amplicons (use the dropdown menu to view the plots for the individual amplicons).
Variants
Haploid variant calling was performed with Medaka. Variants with low depth (i.e. smaller than `--min_coverage`) are shown under the "Low depth" tab. The numbers in the "depth" column relate to the sequencing depth used to perform variant calling.
Sample | Amplicon | Position | Ref. allele | Alt. allele | Type | Depth |
---|---|---|---|---|---|---|
barcode01 | katG_NC_000962_3_2154725_2155670 | 443 | C | G | SNP | 68 |
barcode01 | rpoB_NC_000962_3_760285_761376 | 870 | C | T | SNP | 71 |
barcode02 | katG_NC_000962_3_2154725_2155670 | 443 | C | G | SNP | 50 |
barcode02 | rpoB_NC_000962_3_760285_761376 | 870 | C | T | SNP | 60 |
barcode03 | katG_NC_000962_3_2154725_2155670 | 443 | C | G | SNP | 62 |

Low depth

Sample | Amplicon | Position | Ref. allele | Alt. allele | Type | Depth |
---|---|---|---|---|---|---|
barcode03 | rpoB_NC_000962_3_760285_761376 | 870 | C | T | SNP | 4 |
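
The split between regular and low-depth variants can be sketched as a simple threshold on the depth column. The report does not state the value of `--min_coverage` used for this run, so the threshold of 20 below is a placeholder assumption, and `Variant` and `split_by_depth` are illustrative names rather than wf-amplicon code.

```python
from typing import NamedTuple


class Variant(NamedTuple):
    sample: str
    amplicon: str
    position: int
    ref: str
    alt: str
    depth: int


KATG = "katG_NC_000962_3_2154725_2155670"
RPOB = "rpoB_NC_000962_3_760285_761376"

# All variant rows reported above (regular and low-depth tables combined)
VARIANTS = [
    Variant("barcode01", KATG, 443, "C", "G", 68),
    Variant("barcode01", RPOB, 870, "C", "T", 71),
    Variant("barcode02", KATG, 443, "C", "G", 50),
    Variant("barcode02", RPOB, 870, "C", "T", 60),
    Variant("barcode03", KATG, 443, "C", "G", 62),
    Variant("barcode03", RPOB, 870, "C", "T", 4),
]

MIN_COVERAGE = 20  # assumed placeholder; the actual --min_coverage is not given in the report


def split_by_depth(variants, min_coverage):
    """Return (passing, low_depth); low depth means depth < min_coverage."""
    passing = [v for v in variants if v.depth >= min_coverage]
    low_depth = [v for v in variants if v.depth < min_coverage]
    return passing, low_depth


passing, low_depth = split_by_depth(VARIANTS, MIN_COVERAGE)
for v in low_depth:
    print(f"Low depth: {v.sample} {v.amplicon} pos {v.position} (depth {v.depth})")
```

With this assumed threshold, only the barcode03 rpoB variant (depth 4) falls into the low-depth group, which matches the split shown in the two tables.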