Contents [Hide]
1 Sequencing production
In this project, a total of 2 DNA samples were sequenced using CycloneSEQ platform. The data output and quality statistics are as follows: CYC sequencing produces an average of 2,359,869 reads per sample, and the total number of bases is 43.44 Gbp. The average N50 of sequencing read reaches 25,915 bp, the average read length reaches 18,421 bp, and the average sequencing quality reaches 17.55. See the table Table1 for specific statistical results. The sequence read length distribution is shown in the figure (Figure1). The estimate of read length and sequencing mass nuclear density (kde) is shown in (Figure2).
Table 1 Data Statistics (Download)
| Sample Sample ID | Total bases (Gb) Total bases | Read length N50 (bp) Reads length N50 | Mean read length (bp) Mean reads length | Median read length (bp) Median reads length | Mean read quality Mean read quality | Median read quality Median reads quality | Number of reads Number of reads |
|---|---|---|---|---|---|---|---|
| Moph15A | 42.62 | 26,727 | 18,805 | 15,736 | 17.60 | 18.20 | 2,266,456 |
| Moph11A | 44.25 | 25,103 | 18,037 | 15,471 | 17.50 | 18.20 | 2,453,283 |
| Average | 43.44 | 25,915 | 18,421 | 15,603 | 17.55 | 18.20 | 2,359,869 |
The annotation of table is as follows:
Samples: Sample ID
Total bases(Gb): Total bases
Read length N50(bp): N50 length of reads
Mean read length(bp): Mean length of reads
Median read length (bp): Median length of reads
Mean read quality: Mean quality of reads
Median read quality: Median quality of reads
Number of reads: Total reads
The X-axis represents the length of sequencing reads (after log conversion), and the Y-axis represents the number of reads with the corresponding length. Under normal circumstances, the average length of DNA we can obtain is more than 10,000 bp (different according to different special requirements), reflecting the main peak of the measured read length distribution in the figure to the right of 10,000 bp.
The X-axis represents the length of reads (the upper curve of the figure is the estimated distribution of read length and nuclear density), and the Y-axis represents the read sequencing quality (the curve on the right of the figure is the estimated distribution of reads sequencing quality> quantity nuclear density). The picture is a two-dimensional (length, quality) nuclear density estimation map. In theory, the higher the core, the better the sequencing quality of most reads, and the narrower the vertical distribution (especially the core area).It shows that the quality of sequencing is more stable.

