Bowtie 2 seems to be working fine (tested command 'bowtie2 --version' [2.5.4]) Output format is BAM (default) Alignments will be written out in BAM format. Samtools found here: '/srlab/programs/samtools-1.20/samtools' Reference genome folder provided is ../../data/genome/ (absolute path is '/mmfs1/gscratch/scrubbed/sr320/github/ceasmallr/data/genome/)' FastQ format assumed (by default) Processing sequences up to read no. 10000 from the input file Attention: early reports suggested that high values of -p to have diminishing returns. Please test different values using a small subset of data for your hardware setting. Each Bowtie 2 instance is going to be run with 8 threads. Please monitor performance closely and tune down if necessary! Input files to be analysed (in current folder '/mmfs1/gscratch/scrubbed/sr320/github/ceasmallr/output/04.2-bismark-align'): ../../data/EF04-EM05-Larvae_R1_001.fastp-trim.20220827.fq.gz ../../data/EF04-EM05-Larvae_R2_001.fastp-trim.20220827.fq.gz Library was specified to be not strand-specific (non-directional), therefore alignments to all four possible bisulfite strands (OT, CTOT, OB and CTOB) will be reported Output will be written into the directory: /mmfs1/gscratch/scrubbed/sr320/github/ceasmallr/output/04.2-bismark-align/EF04-EM05-Larvae_score_L-1-0.6/ Setting parallelization to single-threaded (default) Summary of all aligner options: -q --score-min L,-1,-0.6 -p 8 --reorder --ignore-quals --no-mixed --no-discordant --dovetail --maxins 500 Current working directory is: /mmfs1/gscratch/scrubbed/sr320/github/ceasmallr/output/04.2-bismark-align Now reading in and storing sequence information of the genome specified in: /mmfs1/gscratch/scrubbed/sr320/github/ceasmallr/data/genome/ Single-core mode: setting pid to 1 Paired-end alignments will be performed ======================================= The provided filenames for paired-end alignments are ../../data/EF04-EM05-Larvae_R1_001.fastp-trim.20220827.fq.gz and ../../data/EF04-EM05-Larvae_R2_001.fastp-trim.20220827.fq.gz Input files are in FastQ format Processing reads up to sequence no. 10000 from ../../data/EF04-EM05-Larvae_R1_001.fastp-trim.20220827.fq.gz Writing a C -> T converted version of the input file EF04-EM05-Larvae_R1_001.fastp-trim.20220827.fq.gz to EF04-EM05-Larvae_R1_001.fastp-trim.20220827.fq.gz_C_to_T.fastq Writing a G -> A converted version of the input file EF04-EM05-Larvae_R1_001.fastp-trim.20220827.fq.gz to EF04-EM05-Larvae_R1_001.fastp-trim.20220827.fq.gz_G_to_A.fastq Created C -> T as well as G -> A converted versions of the FastQ file EF04-EM05-Larvae_R1_001.fastp-trim.20220827.fq.gz (10001 sequences in total) Processing reads up to sequence no. 10000 from ../../data/EF04-EM05-Larvae_R2_001.fastp-trim.20220827.fq.gz Writing a C -> T converted version of the input file EF04-EM05-Larvae_R2_001.fastp-trim.20220827.fq.gz to EF04-EM05-Larvae_R2_001.fastp-trim.20220827.fq.gz_C_to_T.fastq Writing a G -> A converted version of the input file EF04-EM05-Larvae_R2_001.fastp-trim.20220827.fq.gz to EF04-EM05-Larvae_R2_001.fastp-trim.20220827.fq.gz_G_to_A.fastq Created C -> T as well as G -> A converted versions of the FastQ file EF04-EM05-Larvae_R2_001.fastp-trim.20220827.fq.gz (10001 sequences in total) Input files are EF04-EM05-Larvae_R1_001.fastp-trim.20220827.fq.gz_C_to_T.fastq and EF04-EM05-Larvae_R1_001.fastp-trim.20220827.fq.gz_G_to_A.fastq and EF04-EM05-Larvae_R2_001.fastp-trim.20220827.fq.gz_C_to_T.fastq and EF04-EM05-Larvae_R2_001.fastp-trim.20220827.fq.gz_G_to_A.fastq (FastQ) Now running 4 individual instances of Bowtie 2 against the bisulfite genome of /mmfs1/gscratch/scrubbed/sr320/github/ceasmallr/data/genome/ with the specified options: -q --score-min L,-1,-0.6 -p 8 --reorder --ignore-quals --no-mixed --no-discordant --dovetail --maxins 500 Now starting a Bowtie 2 paired-end alignment for CTread1GAread2CTgenome (reading in sequences from EF04-EM05-Larvae_R1_001.fastp-trim.20220827.fq.gz_C_to_T.fastq and EF04-EM05-Larvae_R2_001.fastp-trim.20220827.fq.gz_G_to_A.fastq, with the options: -q --score-min L,-1,-0.6 -p 8 --reorder --ignore-quals --no-mixed --no-discordant --dovetail --maxins 500 --norc)) Found first alignment: GWNJ-1012:512:GW210315000:4:1101:25979:1047_1:N:0:CTTGGTAT+GGACTTGG/1 77 * 0 0 * * 0 0 ANTAATGGGGAGTTATAGATTTTTTATGTGATAGTATAAATAGTGGTTTGATTATAGTTTGATTAAATTTTATTTAAAGATTTTTTTGTTAAGTTTGGTTGTAATTGGTTTAGTTATTTTGGTAAAGGAAATGAAAATGTGAAAAAATTT F#FFFFFFFFFFFFFFFFFFFFFFFFF:FFFFFFFFFFFFFFFFFFFFFFF:FFFFFFFFFFFFFFFFFFFFFFF:FFFFFF:FFFFFFFFF:FFFF:FFFFFFFFF:FFFF:FFFFFFF,FFFFFF:FFFFFF:FFFFFFFFFF:FFFF YT:Z:UP GWNJ-1012:512:GW210315000:4:1101:25979:1047_2:N:0:CTTGGTAT+GGACTTGG/2 141 * 0 0 * * 0 0 ATCAAAATTTAAATATAATCAAATAAAAAACCACTCTCCCTTTTAAATAATAAATTATTTAAATTAACAAAAATATAAAAATAATATCATAAATAAATTTTCTCCTAAAAATCAAACTTTTTCACTAAATCTCATATTTAAACTCATCCA FF:FFFFFFFFFFFF:FFFFFFFFFFFFFFF:F:F:FFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFF:FFF:FFFFFFF,FF,FFFFF:FFFF,:F:FF:FFF:FFFF,FFFFFFFFFFF YT:Z:UP Now starting a Bowtie 2 paired-end alignment for GAread1CTread2GAgenome (reading in sequences from EF04-EM05-Larvae_R1_001.fastp-trim.20220827.fq.gz_G_to_A.fastq and EF04-EM05-Larvae_R2_001.fastp-trim.20220827.fq.gz_C_to_T.fastq, with the options: -q --score-min L,-1,-0.6 -p 8 --reorder --ignore-quals --no-mixed --no-discordant --dovetail --maxins 500 --norc)) Found first alignment: GWNJ-1012:512:GW210315000:4:1101:25979:1047_1:N:0:CTTGGTAT+GGACTTGG/1 77 * 0 0 * * 0 0 ANTAATAAAAAATTATAAATTTTTTATATAATAATATAAATAATAATTTAATTATAATTTAATTAAATTTTATTTAAAAATTTTTTTATTAAATTTAATTATAATTAATTTAATTATTTTAATAAAAAAAATAAAAATATAAAAAAATTT F#FFFFFFFFFFFFFFFFFFFFFFFFF:FFFFFFFFFFFFFFFFFFFFFFF:FFFFFFFFFFFFFFFFFFFFFFF:FFFFFF:FFFFFFFFF:FFFF:FFFFFFFFF:FFFF:FFFFFFF,FFFFFF:FFFFFF:FFFFFFFFFF:FFFF YT:Z:UP GWNJ-1012:512:GW210315000:4:1101:25979:1047_2:N:0:CTTGGTAT+GGACTTGG/2 141 * 0 0 * * 0 0 ATTAAAATTTAAATATAATTAAATAAAAAATTATTTTTTTTTTTAAATAATAAATTATTTAAATTAATAAAAATATAAAAATAATATTATAAATAAATTTTTTTTTAAAAATTGAATTTTTTTATTAAATTTTATATTTAAATTTATTTG FF:FFFFFFFFFFFF:FFFFFFFFFFFFFFF:F:F:FFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFF:FFF:FFFFFFF,FF,FFFFF:FFFF,:F:FF:FFF:FFFF,FFFFFFFFFFF YT:Z:UP Now starting a Bowtie 2 paired-end alignment for GAread1CTread2CTgenome (reading in sequences from EF04-EM05-Larvae_R1_001.fastp-trim.20220827.fq.gz_G_to_A.fastq and EF04-EM05-Larvae_R2_001.fastp-trim.20220827.fq.gz_C_to_T.fastq, with the options: -q --score-min L,-1,-0.6 -p 8 --reorder --ignore-quals --no-mixed --no-discordant --dovetail --maxins 500 --nofw)) Found first alignment: GWNJ-1012:512:GW210315000:4:1101:25979:1047_1:N:0:CTTGGTAT+GGACTTGG/1 77 * 0 0 * * 0 0 ANTAATAAAAAATTATAAATTTTTTATATAATAATATAAATAATAATTTAATTATAATTTAATTAAATTTTATTTAAAAATTTTTTTATTAAATTTAATTATAATTAATTTAATTATTTTAATAAAAAAAATAAAAATATAAAAAAATTT F#FFFFFFFFFFFFFFFFFFFFFFFFF:FFFFFFFFFFFFFFFFFFFFFFF:FFFFFFFFFFFFFFFFFFFFFFF:FFFFFF:FFFFFFFFF:FFFF:FFFFFFFFF:FFFF:FFFFFFF,FFFFFF:FFFFFF:FFFFFFFFFF:FFFF YT:Z:UP GWNJ-1012:512:GW210315000:4:1101:25979:1047_2:N:0:CTTGGTAT+GGACTTGG/2 141 * 0 0 * * 0 0 ATTAAAATTTAAATATAATTAAATAAAAAATTATTTTTTTTTTTAAATAATAAATTATTTAAATTAATAAAAATATAAAAATAATATTATAAATAAATTTTTTTTTAAAAATTGAATTTTTTTATTAAATTTTATATTTAAATTTATTTG FF:FFFFFFFFFFFF:FFFFFFFFFFFFFFF:F:F:FFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFF:FFF:FFFFFFF,FF,FFFFF:FFFF,:F:FF:FFF:FFFF,FFFFFFFFFFF YT:Z:UP Now starting a Bowtie 2 paired-end alignment for CTread1GAread2GAgenome (reading in sequences from EF04-EM05-Larvae_R1_001.fastp-trim.20220827.fq.gz_C_to_T.fastq and EF04-EM05-Larvae_R2_001.fastp-trim.20220827.fq.gz_G_to_A.fastq, with the options: -q --score-min L,-1,-0.6 -p 8 --reorder --ignore-quals --no-mixed --no-discordant --dovetail --maxins 500 --nofw)) Found first alignment: GWNJ-1012:512:GW210315000:4:1101:25979:1047_1:N:0:CTTGGTAT+GGACTTGG/1 83 NC_035785.1_GA_converted 37740619 24 134M1D16M = 37740383 -387 AAATTTTTTCACATTTTCATTTCCTTTACCAAAATAACTAAACCAATTACAACCAAACTTAACAAAAAAATCTTTAAATAAAATTTAATCAAACTATAATCAAACCACTATTTATACTATCACATAAAAAATCTATAACTCCCCATTANT FFFF:FFFFFFFFFF:FFFFFF:FFFFFF,FFFFFFF:FFFF:FFFFFFFFF:FFFF:FFFFFFFFF:FFFFFF:FFFFFFFFFFFFFFFFFFFFFFF:FFFFFFFFFFFFFFFFFFFFFFF:FFFFFFFFFFFFFFFFFFFFFFFFF#F AS:i:-33 XN:i:0 XM:i:5 XO:i:1 XG:i:1 NM:i:6 MD:Z:2T61T3T27A37^A14C1 YS:i:-26 YT:Z:CP GWNJ-1012:512:GW210315000:4:1101:25979:1047_2:N:0:CTTGGTAT+GGACTTGG/2 163 NC_035785.1_GA_converted 37740383 24 44M1D106M = 37740619 387 ATCAAAATTTAAATATAATCAAATAAAAAACCACTCTCCCTTTTAAATAATAAATTATTTAAATTAACAAAAATATAAAAATAATATCATAAATAAATTTTCTCCTAAAAATCAAACTTTTTCACTAAATCTCATATTTAAACTCATCCA FF:FFFFFFFFFFFF:FFFFFFFFFFFFFFF:F:F:FFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFF:FFF:FFFFFFF,FF,FFFFF:FFFF,:F:FF:FFF:FFFF,FFFFFFFFFFF AS:i:-26 XN:i:0 XM:i:3 XO:i:1 XG:i:1 NM:i:4 MD:Z:1A0A41^A101T4 YS:i:-33 YT:Z:CP >>> Writing bisulfite mapping results to EF04-EM05-Larvae_L-1-0.6_pe.bam <<< Reading in the sequence files ../../data/EF04-EM05-Larvae_R1_001.fastp-trim.20220827.fq.gz and ../../data/EF04-EM05-Larvae_R2_001.fastp-trim.20220827.fq.gz 1000010000 reads; of these: reads; of these: 10000 reads; of these: 10000 10000 ( (10000100.00 (%100.00) were paired; of these:100.00% %) were paired; of these: ) were paired; of these: 8309 ( 807583.098002 (% (80.75) aligned concordantly 0 times80.02% %) aligned concordantly 0 times ) aligned concordantly 0 times 766 ( 8677.66920 (% (8.67) aligned concordantly exactly 1 time9.20% %) aligned concordantly exactly 1 time ) aligned concordantly exactly 1 time 925 ( 10589.251078 (% (10.58) aligned concordantly >1 times10.78% %) aligned concordantly >1 times16.91) aligned concordantly >1 times % 19.25 overall alignment rate19.98% % overall alignment rate overall alignment rate 10000 reads; of these: 10000 (100.00%) were paired; of these: 8417 (84.17%) aligned concordantly 0 times 684 (6.84%) aligned concordantly exactly 1 time 899 (8.99%) aligned concordantly >1 times 15.83% overall alignment rate Processed 10000 sequences in total Successfully deleted the temporary files EF04-EM05-Larvae_R1_001.fastp-trim.20220827.fq.gz_C_to_T.fastq, EF04-EM05-Larvae_R1_001.fastp-trim.20220827.fq.gz_G_to_A.fastq, EF04-EM05-Larvae_R2_001.fastp-trim.20220827.fq.gz_C_to_T.fastq and EF04-EM05-Larvae_R2_001.fastp-trim.20220827.fq.gz_G_to_A.fastq Final Alignment report ====================== Sequence pairs analysed in total: 10000 Final Cytosine Methylation Report ================================= Total number of C's analysed: 151595 Total methylated C's in CpG context: 3931 Total methylated C's in CHG context: 481 Total methylated C's in CHH context: 2042 Total methylated C's in Unknown context: 194 Total unmethylated C's in CpG context: 18146 Total unmethylated C's in CHG context: 32215 Total unmethylated C's in CHH context: 94780 Total unmethylated C's in Unknown context: 690 C methylated in CpG context: 17.8% C methylated in CHG context: 1.5% C methylated in CHH context: 2.1% C methylated in Unknown context (CN or CHN): 21.9% Bismark completed in 0d 0h 0m 15s ==================== Bismark run complete ====================