Basic Statistics
| Measure | Value |
|---|---|
| Filename | Geoduck-heart-RNA-4_S26_L004_R1_001.fastq.gz |
| File type | Conventional base calls |
| Encoding | Sanger / Illumina 1.9 |
| Total Sequences | 7793 |
| Sequences flagged as poor quality | 0 |
| Sequence length | 35-151 |
| %GC | 38 |
Per base sequence quality
Per tile sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
| Sequence | Count | Percentage | Possible Source |
|---|---|---|---|
| AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA | 36 | 0.4619530347747979 | No Hit |
| GAACTCCAGTCACACAGTGATCTCGTATGCCGTCTTCTGCTTGAAAAAAA | 17 | 0.21814448864365454 | Illumina PCR Primer Index 5 (100% over 43bp) |
| GATCGGAAGAGCACACGTCTGAACTCCAGTCACAGTCAACAATCTCGTAT | 12 | 0.15398434492493263 | TruSeq Adapter, Index 13 (97% over 40bp) |
| GATCGGAAGAGCACACGTCTGAACTCCAGTCACAGTTCCGTATCTCGTAT | 11 | 0.14115231618118826 | TruSeq Adapter, Index 14 (97% over 44bp) |
| ACCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC | 9 | 0.11548825869369947 | No Hit |
Adapter Content
Kmer Content
| Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
|---|---|---|---|---|
| TCATATT | 15 | 0.0026416457 | 198.95891 | 145 |
| GACACCT | 10 | 0.0076353042 | 140.55484 | 3 |
| GAAGAGC | 20 | 4.212088E-4 | 105.41613 | 6 |
| ATCGGAA | 20 | 4.212088E-4 | 105.41613 | 2 |
| GATCGGA | 25 | 9.964519E-4 | 84.880516 | 1 |
| AAGAGCA | 25 | 0.0010226604 | 84.3329 | 7 |
| CGGAAGA | 25 | 0.0010226604 | 84.3329 | 4 |
| AGAGCAC | 25 | 0.0010226604 | 84.3329 | 8 |
| TCGGAAG | 30 | 0.0021088757 | 70.27741 | 3 |
| GAGCACA | 30 | 0.0021088757 | 70.27741 | 9 |
| GGAAGAG | 40 | 0.006591682 | 52.708065 | 5 |
| GGGGGGG | 25 | 6.076119E-4 | 28.110968 | 20-24 |