Bowtie 2 seems to be working fine (tested command '/home/shared/bowtie2-2.4.4-linux-x86_64/bowtie2 --version' [2.4.4]) Output format is BAM (default) Alignments will be written out in BAM format. Samtools found here: '/usr/bin/samtools' Reference genome folder provided is ../data/genome/ (absolute path is '/home/shared/16TB_HDD_01/sr320/github/Claudia-Halibut/project/data/genome/)' FastQ format assumed (by default) Attention: using more than 4 cores per alignment thread has been reported to have diminishing returns. If possible try to limit -p to a value of 4 Each Bowtie 2 instance is going to be run with 8 threads. Please monitor performance closely and tune down if necessary! Input files to be analysed (in current folder '/home/shared/16TB_HDD_01/sr320/github/Claudia-Halibut/project/code'): /home/shared/16TB_HDD_01/fish546/data/RRBS-run2/raw-data/E1_T1_C2_R1.fq.gz /home/shared/16TB_HDD_01/fish546/data/RRBS-run2/raw-data/E1_T1_C2_R2.fq.gz Library was specified to be not strand-specific (non-directional), therefore alignments to all four possible bisulfite strands (OT, CTOT, OB and CTOB) will be reported Output will be written into the directory: /home/shared/16TB_HDD_01/sr320/github/Claudia-Halibut/project/output/02/ Setting parallelization to single-threaded (default) Summary of all aligner options: -q --score-min L,0,-0.8 -p 8 --reorder --ignore-quals --no-mixed --no-discordant --dovetail --maxins 500 Current working directory is: /home/shared/16TB_HDD_01/sr320/github/Claudia-Halibut/project/code Now reading in and storing sequence information of the genome specified in: /home/shared/16TB_HDD_01/sr320/github/Claudia-Halibut/project/data/genome/ chr NC_061483.1 (32845235 bp) chr NC_061484.1 (32262650 bp) chr NC_061485.1 (31235357 bp) chr NC_061486.1 (28897756 bp) chr NC_061487.1 (29170857 bp) chr NC_061488.1 (28229454 bp) chr NC_061489.1 (28174910 bp) chr NC_061490.1 (27367920 bp) chr NC_061491.1 (29329910 bp) chr NC_061492.1 (27288303 bp) chr NC_061493.1 (25468064 bp) chr NC_061494.1 (25132731 bp) chr NC_061495.1 (24770608 bp) chr NC_061496.1 (28479552 bp) chr NC_061497.1 (25155020 bp) chr NC_061498.1 (23268637 bp) chr NC_061499.1 (22875723 bp) chr NC_061500.1 (21182577 bp) chr NC_061501.1 (20810600 bp) chr NC_061502.1 (21255706 bp) chr NC_061503.1 (20593300 bp) chr NC_061504.1 (18680547 bp) chr NC_061505.1 (17159251 bp) chr NC_061506.1 (11267222 bp) chr NW_025899746.1 (458395 bp) chr NW_025899747.1 (32022 bp) chr NW_025899748.1 (786 bp) chr NW_025899749.1 (255824 bp) chr NW_025899750.1 (3031 bp) chr NW_025899751.1 (5290 bp) chr NW_025899752.1 (3054 bp) chr NW_025899753.1 (48340 bp) chr NW_025899754.1 (45079 bp) chr NW_025899755.1 (25155 bp) chr NW_025899756.1 (113933 bp) chr NW_025899757.1 (1843 bp) chr NW_025899758.1 (82466 bp) chr NW_025899759.1 (4916 bp) chr NW_025899760.1 (71054 bp) chr NW_025899761.1 (8568 bp) chr NW_025899762.1 (11567 bp) chr NW_025899763.1 (6375 bp) chr NW_025899764.1 (53904 bp) chr NW_025899767.1 (3932 bp) chr NW_025899768.1 (2857 bp) chr NW_025899769.1 (2018 bp) chr NW_025899770.1 (1484 bp) chr NW_025899771.1 (996 bp) chr NW_025899772.1 (680 bp) chr NW_025899773.1 (673 bp) chr NW_025899774.1 (633 bp) chr NW_025899775.1 (582 bp) chr NC_009710.1 (17841 bp) Single-core mode: setting pid to 1 Paired-end alignments will be performed ======================================= The provided filenames for paired-end alignments are /home/shared/16TB_HDD_01/fish546/data/RRBS-run2/raw-data/E1_T1_C2_R1.fq.gz and /home/shared/16TB_HDD_01/fish546/data/RRBS-run2/raw-data/E1_T1_C2_R2.fq.gz Input files are in FastQ format Writing a C -> T converted version of the input file E1_T1_C2_R1.fq.gz to E1_T1_C2_R1.fq.gz_C_to_T.fastq Writing a G -> A converted version of the input file E1_T1_C2_R1.fq.gz to E1_T1_C2_R1.fq.gz_G_to_A.fastq Created C -> T as well as G -> A converted versions of the FastQ file E1_T1_C2_R1.fq.gz (104370060 sequences in total) Writing a C -> T converted version of the input file E1_T1_C2_R2.fq.gz to E1_T1_C2_R2.fq.gz_C_to_T.fastq Writing a G -> A converted version of the input file E1_T1_C2_R2.fq.gz to E1_T1_C2_R2.fq.gz_G_to_A.fastq Created C -> T as well as G -> A converted versions of the FastQ file E1_T1_C2_R2.fq.gz (104370060 sequences in total) Input files are E1_T1_C2_R1.fq.gz_C_to_T.fastq and E1_T1_C2_R1.fq.gz_G_to_A.fastq and E1_T1_C2_R2.fq.gz_C_to_T.fastq and E1_T1_C2_R2.fq.gz_G_to_A.fastq (FastQ) Now running 4 individual instances of Bowtie 2 against the bisulfite genome of /home/shared/16TB_HDD_01/sr320/github/Claudia-Halibut/project/data/genome/ with the specified options: -q --score-min L,0,-0.8 -p 8 --reorder --ignore-quals --no-mixed --no-discordant --dovetail --maxins 500 Now starting a Bowtie 2 paired-end alignment for CTread1GAread2CTgenome (reading in sequences from E1_T1_C2_R1.fq.gz_C_to_T.fastq and E1_T1_C2_R2.fq.gz_G_to_A.fastq, with the options: -q --score-min L,0,-0.8 -p 8 --reorder --ignore-quals --no-mixed --no-discordant --dovetail --maxins 500 --norc)) Found first alignment: lh00134:396:22JL32LT3:2:1101:1148:1064_1:N:0:AGTTCAGG+TCTGTTGG:ACACACGAG/1 77 * 0 0 * * 0 0 TNAATTTAAATTTTATTATAAATTTAAAATTTTTATGTAATTTGAAATTTAAATTATATTTATATTAATTTTAAAATTAATTAATTTTTGATATTATTTTTAAATTTATAATTAATAAAATTTTAAAAATTGAGATTGGAAGATTATTTT 9#IIIIIII-III9IIIIIIIIIIIII9IIIIIIIIIIIIIIII9IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII9IIIIIIIIIIIIIIIIIIIIIII9II9IIIIII9IIII99I9-IIII-III-I- YT:Z:UP lh00134:396:22JL32LT3:2:1101:1148:1064_3:N:0:AGTTCAGG+TCTGTTGG:ACACACGAG/2 141 * 0 0 * * 0 0 CAATTTTTAAAATTTTATTAATTATAAATTTAAAAATAATATCAAAAATTAATTAATTTTAAAATTAATATAAATATAATTTAAATTTCAAATTACATAAAAATTTTAAATTTATAATAAAATTTAAATTTAAAATCAAAAAAACATCAT II9I9IIII9IIIIIII9IIII9IIIIIIIIII9IIIIIIIIIIII9I-IIIII-IIIIIII9IIIIII9IIIIIII9IIIIIIII-IIIIII9IIIIIIIII9IIIIIIIII9IIIIIIII9III9IIIIIIIIIII-IIIIIIIIIII YT:Z:UP Now starting a Bowtie 2 paired-end alignment for GAread1CTread2GAgenome (reading in sequences from E1_T1_C2_R1.fq.gz_G_to_A.fastq and E1_T1_C2_R2.fq.gz_C_to_T.fastq, with the options: -q --score-min L,0,-0.8 -p 8 --reorder --ignore-quals --no-mixed --no-discordant --dovetail --maxins 500 --norc)) Found first alignment: lh00134:396:22JL32LT3:2:1101:1148:1064_1:N:0:AGTTCAGG+TCTGTTGG:ACACACGAG/1 99 NC_061503.1_GA_converted 108825 2 136M4I10M = 108812 -159 CNAACTTAAACTCTACTACAAATCTAAAACCCCTACATAATTCAAAACCCAAACTACACCTATATTAATTTCAAAATCAACTAACCTCTAACATCATCTTTAAACCTATAACTAACAAAACTTTAAAAACCAAAATCAAAAAATCACTCT 9#IIIIIII-III9IIIIIIIIIIIII9IIIIIIIIIIIIIIII9IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII9IIIIIIIIIIIIIIIIIIIIIII9II9IIIIII9IIII99I9-IIII-III-I- AS:i:-54 XS:i:-54 XN:i:0 XM:i:7 XO:i:1 XG:i:4 NM:i:11 MD:Z:1A63A68C4C3C0A0A0 YS:i:-54 YT:Z:CP lh00134:396:22JL32LT3:2:1101:1148:1064_3:N:0:AGTTCAGG+TCTGTTGG:ACACACGAG/2 147 NC_061503.1_GA_converted 108812 2 5M3I1M1I7M1I132M = 108825 159 ACAACACTCTTCCAATCTCAAACTTAAACTCTACTACAAATCTAAAACCCCTACATAATTCAAAACCCAAACTACACCTATATTAATTTCAAAATCAACTAACCTCTAACATCATCTTTAAACCTATAACTAACAAAACTTTAAAAACCA IIIIIIIIIII-IIIIIIIIIII9III9IIIIIIII9IIIIIIIII9IIIIIIIII9IIIIII-IIIIIIII9IIIIIII9IIIIII9IIIIIII-IIIII-I9IIIIIIIIIIII9IIIIIIIIII9IIII9IIIIIII9IIII9I9II AS:i:-54 XS:i:-55 XN:i:0 XM:i:4 XO:i:3 XG:i:5 NM:i:9 MD:Z:7A0A2A66A66 YS:i:-54 YT:Z:CP Now starting a Bowtie 2 paired-end alignment for GAread1CTread2CTgenome (reading in sequences from E1_T1_C2_R1.fq.gz_G_to_A.fastq and E1_T1_C2_R2.fq.gz_C_to_T.fastq, with the options: -q --score-min L,0,-0.8 -p 8 --reorder --ignore-quals --no-mixed --no-discordant --dovetail --maxins 500 --nofw)) Found first alignment: lh00134:396:22JL32LT3:2:1101:1148:1064_1:N:0:AGTTCAGG+TCTGTTGG:ACACACGAG/1 83 NC_061505.1_CT_converted 16606866 0 10M4I136M = 16606880 161 AGAGTGATTTTTTGATTTTGGTTTTTAAAGTTTTGTTAGTTATAGGTTTAAAGATGATGTTAGAGGTTAGTTGATTTTGAAATTAATATAGGTGTAGTTTGGGTTTTGAATTATGTAGGGGTTTTAGATTTGTAGTAGAGTTTAAGTTNG -I-III-IIII-9I99IIII9IIIIII9II9IIIIIIIIIIIIIIIIIIIIIII9IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII9IIIIIIIIIIIIIIII9IIIIIIIIIIIII9III-IIIIIII#9 AS:i:-54 XN:i:0 XM:i:7 XO:i:1 XG:i:4 NM:i:11 MD:Z:0T0T0G3G4G68T63T1 YS:i:-55 YT:Z:CP lh00134:396:22JL32LT3:2:1101:1148:1064_3:N:0:AGTTCAGG+TCTGTTGG:ACACACGAG/2 163 NC_061505.1_CT_converted 16606880 0 137M2I3M1I7M = 16606866 -161 TGGTTTTTAAAGTTTTGTTAGTTATAGGTTTAAAGATGATGTTAGAGGTTAGTTGATTTTGAAATTAATATAGGTGTAGTTTGGGTTTTGAATTATGTAGGGGTTTTAGATTTGTAGTAGAGTTTAAGTTTGAGATTGGAAGAGTGTTGT II9I9IIII9IIIIIII9IIII9IIIIIIIIII9IIIIIIIIIIII9I-IIIII-IIIIIII9IIIIII9IIIIIII9IIIIIIII-IIIIII9IIIIIIIII9IIIIIIIII9IIIIIIII9III9IIIIIIIIIII-IIIIIIIIIII AS:i:-55 XS:i:-85 XN:i:0 XM:i:6 XO:i:2 XG:i:3 NM:i:9 MD:Z:66T65G0T3T3A1G3 YS:i:-54 YT:Z:CP Now starting a Bowtie 2 paired-end alignment for CTread1GAread2GAgenome (reading in sequences from E1_T1_C2_R1.fq.gz_C_to_T.fastq and E1_T1_C2_R2.fq.gz_G_to_A.fastq, with the options: -q --score-min L,0,-0.8 -p 8 --reorder --ignore-quals --no-mixed --no-discordant --dovetail --maxins 500 --nofw)) Found first alignment: lh00134:396:22JL32LT3:2:1101:1148:1064_1:N:0:AGTTCAGG+TCTGTTGG:ACACACGAG/1 77 * 0 0 * * 0 0 TNAATTTAAATTTTATTATAAATTTAAAATTTTTATGTAATTTGAAATTTAAATTATATTTATATTAATTTTAAAATTAATTAATTTTTGATATTATTTTTAAATTTATAATTAATAAAATTTTAAAAATTGAGATTGGAAGATTATTTT 9#IIIIIII-III9IIIIIIIIIIIII9IIIIIIIIIIIIIIII9IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII9IIIIIIIIIIIIIIIIIIIIIII9II9IIIIII9IIII99I9-IIII-III-I- YT:Z:UP lh00134:396:22JL32LT3:2:1101:1148:1064_3:N:0:AGTTCAGG+TCTGTTGG:ACACACGAG/2 141 * 0 0 * * 0 0 CAATTTTTAAAATTTTATTAATTATAAATTTAAAAATAATATCAAAAATTAATTAATTTTAAAATTAATATAAATATAATTTAAATTTCAAATTACATAAAAATTTTAAATTTATAATAAAATTTAAATTTAAAATCAAAAAAACATCAT II9I9IIII9IIIIIII9IIII9IIIIIIIIII9IIIIIIIIIIII9I-IIIII-IIIIIII9IIIIII9IIIIIII9IIIIIIII-IIIIII9IIIIIIIII9IIIIIIIII9IIIIIIII9III9IIIIIIIIIII-IIIIIIIIIII YT:Z:UP >>> Writing bisulfite mapping results to E1_T1_C2_R1_bismark_bt2_pe.bam <<< Reading in the sequence files /home/shared/16TB_HDD_01/fish546/data/RRBS-run2/raw-data/E1_T1_C2_R1.fq.gz and /home/shared/16TB_HDD_01/fish546/data/RRBS-run2/raw-data/E1_T1_C2_R2.fq.gz Processed 1000000 sequence pairs so far Processed 2000000 sequence pairs so far Processed 3000000 sequence pairs so far Processed 4000000 sequence pairs so far Processed 5000000 sequence pairs so far Processed 6000000 sequence pairs so far Processed 7000000 sequence pairs so far Processed 8000000 sequence pairs so far Processed 9000000 sequence pairs so far Processed 10000000 sequence pairs so far Processed 11000000 sequence pairs so far Processed 12000000 sequence pairs so far Processed 13000000 sequence pairs so far Processed 14000000 sequence pairs so far Processed 15000000 sequence pairs so far Processed 16000000 sequence pairs so far Processed 17000000 sequence pairs so far Processed 18000000 sequence pairs so far Processed 19000000 sequence pairs so far Processed 20000000 sequence pairs so far Processed 21000000 sequence pairs so far Processed 22000000 sequence pairs so far Processed 23000000 sequence pairs so far Processed 24000000 sequence pairs so far Processed 25000000 sequence pairs so far Processed 26000000 sequence pairs so far Processed 27000000 sequence pairs so far Processed 28000000 sequence pairs so far Processed 29000000 sequence pairs so far Processed 30000000 sequence pairs so far Processed 31000000 sequence pairs so far Processed 32000000 sequence pairs so far Processed 33000000 sequence pairs so far Processed 34000000 sequence pairs so far Processed 35000000 sequence pairs so far Processed 36000000 sequence pairs so far Processed 37000000 sequence pairs so far Processed 38000000 sequence pairs so far Processed 39000000 sequence pairs so far Processed 40000000 sequence pairs so far Processed 41000000 sequence pairs so far Processed 42000000 sequence pairs so far Processed 43000000 sequence pairs so far Processed 44000000 sequence pairs so far Processed 45000000 sequence pairs so far Processed 46000000 sequence pairs so far Processed 47000000 sequence pairs so far Processed 48000000 sequence pairs so far Processed 49000000 sequence pairs so far Processed 50000000 sequence pairs so far Processed 51000000 sequence pairs so far Processed 52000000 sequence pairs so far Processed 53000000 sequence pairs so far Processed 54000000 sequence pairs so far Processed 55000000 sequence pairs so far Processed 56000000 sequence pairs so far Processed 57000000 sequence pairs so far Processed 58000000 sequence pairs so far Processed 59000000 sequence pairs so far Processed 60000000 sequence pairs so far Processed 61000000 sequence pairs so far Processed 62000000 sequence pairs so far Processed 63000000 sequence pairs so far Processed 64000000 sequence pairs so far Processed 65000000 sequence pairs so far Processed 66000000 sequence pairs so far Processed 67000000 sequence pairs so far Processed 68000000 sequence pairs so far Processed 69000000 sequence pairs so far Processed 70000000 sequence pairs so far Processed 71000000 sequence pairs so far Processed 72000000 sequence pairs so far Processed 73000000 sequence pairs so far Processed 74000000 sequence pairs so far Processed 75000000 sequence pairs so far Processed 76000000 sequence pairs so far Processed 77000000 sequence pairs so far Processed 78000000 sequence pairs so far Processed 79000000 sequence pairs so far Processed 80000000 sequence pairs so far Processed 81000000 sequence pairs so far Processed 82000000 sequence pairs so far Processed 83000000 sequence pairs so far Processed 84000000 sequence pairs so far Processed 85000000 sequence pairs so far Processed 86000000 sequence pairs so far Processed 87000000 sequence pairs so far Processed 88000000 sequence pairs so far Processed 89000000 sequence pairs so far Processed 90000000 sequence pairs so far Processed 91000000 sequence pairs so far Processed 92000000 sequence pairs so far Processed 93000000 sequence pairs so far Processed 94000000 sequence pairs so far Processed 95000000 sequence pairs so far Processed 96000000 sequence pairs so far Processed 97000000 sequence pairs so far Processed 98000000 sequence pairs so far Processed 99000000 sequence pairs so far Processed 100000000 sequence pairs so far Processed 101000000 sequence pairs so far Processed 102000000 sequence pairs so far Processed 103000000 sequence pairs so far Processed 104000000 sequence pairs so far 104370060 reads; of these: 104370060 (100.00%) were paired; of these: 104298466 (99.93%) aligned concordantly 0 times 49274 (0.05%) aligned concordantly exactly 1 time 22320 (0.02%) aligned concordantly >1 times 0.07% overall alignment rate 104370060 reads; of these: 104370060 (100.00%) were paired; of these: 81709475 (78.29%) aligned concordantly 0 times 9097660 (8.72%) aligned concordantly exactly 1 time 13562925 (13.00%) aligned concordantly >1 times 21.71% overall alignment rate 104370060 reads; of these: 104370060 (100.00%) were paired; of these: 104305393 (99.94%) aligned concordantly 0 times 43941 (0.04%) aligned concordantly exactly 1 time 20726 (0.02%) aligned concordantly >1 times 0.06% overall alignment rate 104370060 reads; of these: 104370060 (100.00%) were paired; of these: 81720016 (78.30%) aligned concordantly 0 times 9252957 (8.87%) aligned concordantly exactly 1 time 13397087 (12.84%) aligned concordantly >1 times 21.70% overall alignment rate Processed 104370060 sequences in total Failed to close filehandle AMBIG_1: Bad file descriptor at /home/shared/Bismark-0.24.0/bismark line 2641, line 417480240. Failed to close filehandle AMBIG_2: Bad file descriptor at /home/shared/Bismark-0.24.0/bismark line 2642, line 417480240. Failed to close filehandle UNMAPPED_1: Bad file descriptor at /home/shared/Bismark-0.24.0/bismark line 2643, line 417480240. Failed to close filehandle UNMAPPED_2: Bad file descriptor at /home/shared/Bismark-0.24.0/bismark line 2644, line 417480240. Successfully deleted the temporary files E1_T1_C2_R1.fq.gz_C_to_T.fastq, E1_T1_C2_R1.fq.gz_G_to_A.fastq, E1_T1_C2_R2.fq.gz_C_to_T.fastq and E1_T1_C2_R2.fq.gz_G_to_A.fastq Final Alignment report ====================== Sequence pairs analysed in total: 104370060 Number of paired-end alignments with a unique best hit: 24809426 Mapping efficiency: 23.8% Sequence pairs with no alignments under any condition: 73665453 Sequence pairs did not map uniquely: 5895181 Sequence pairs which were discarded because genomic sequence could not be extracted: 0 Number of sequence pairs with unique best (first) alignment came from the bowtie output: CT/GA/CT: 2457 ((converted) top strand) GA/CT/CT: 11644570 (complementary to (converted) top strand) GA/CT/GA: 13159346 (complementary to (converted) bottom strand) CT/GA/GA: 3053 ((converted) bottom strand) Final Cytosine Methylation Report ================================= Total number of C's analysed: 1807901486 Total methylated C's in CpG context: 193138601 Total methylated C's in CHG context: 14299427 Total methylated C's in CHH context: 34064869 Total methylated C's in Unknown context: 10655042 Total unmethylated C's in CpG context: 132374388 Total unmethylated C's in CHG context: 490849972 Total unmethylated C's in CHH context: 943174229 Total unmethylated C's in Unknown context: 12910164 C methylated in CpG context: 59.3% C methylated in CHG context: 2.8% C methylated in CHH context: 3.5% C methylated in unknown context (CN or CHN): 45.2% Bismark completed in 0d 10h 19m 46s ==================== Bismark run complete ====================