Basic Statistics
| Measure | Value |
|---|---|
| Filename | Geoduck-larvae-day5-RNA-EPI-99-1_S8_L001_R1_001.fastq.gz |
| File type | Conventional base calls |
| Encoding | Sanger / Illumina 1.9 |
| Total Sequences | 61304431 |
| Sequences flagged as poor quality | 0 |
| Sequence length | 35-151 |
| %GC | 38 |
Per base sequence quality
Per tile sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
| Sequence | Count | Percentage | Possible Source |
|---|---|---|---|
| GATCGGAAGAGCACACGTCTGAACTCCAGTCACGTTTCGGAATCTCGTAT | 513122 | 0.8370063821324759 | TruSeq Adapter, Index 21 (97% over 40bp) |
| ATCGGAAGAGCACACGTCTGAACTCCAGTCACGTTTCGGAATCTCGTATG | 231135 | 0.3770282118759083 | TruSeq Adapter, Index 21 (97% over 39bp) |
Adapter Content
Kmer Content
| Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
|---|---|---|---|---|
| GATCGGA | 95280 | 0.0 | 81.73274 | 1 |
| TCGGAAG | 113800 | 0.0 | 68.818886 | 3 |
| ATCGGAA | 120015 | 0.0 | 65.34625 | 2 |
| GAAGAGC | 123300 | 0.0 | 63.665993 | 6 |
| CGGAAGA | 125760 | 0.0 | 62.425247 | 4 |
| AGAGCAC | 134020 | 0.0 | 59.597183 | 8 |
| GGAAGAG | 135915 | 0.0 | 57.818413 | 5 |
| GAGCACA | 143345 | 0.0 | 55.908737 | 9 |
| AAGAGCA | 146385 | 0.0 | 53.6779 | 7 |
| AGCACAC | 153835 | 0.0 | 23.741978 | 9 |
| TATGCCG | 99950 | 0.0 | 23.582556 | 45-49 |
| CGTTTCG | 101820 | 0.0 | 22.885721 | 30-34 |
| CGTATGC | 103230 | 0.0 | 22.768766 | 45-49 |
| GCCGTCT | 107670 | 0.0 | 21.721006 | 50-54 |
| GTTTCGG | 107250 | 0.0 | 21.712814 | 30-34 |
| ATGCCGT | 109245 | 0.0 | 21.45546 | 45-49 |
| CGGAATC | 108535 | 0.0 | 21.288916 | 35-39 |
| AGTCACG | 111150 | 0.0 | 21.100117 | 25-29 |
| TCGGAAT | 111015 | 0.0 | 20.769629 | 35-39 |
| AATCTCG | 111830 | 0.0 | 20.659914 | 40-44 |