1 Intro

1.1 Inputs

This notebook is a continuation of using Bismark (Krueger and Andrews 2011) to perform methylation analysis. Read alignments were run in 02.00-bismark-bowtie2-alignment.Rmd. This step will perform deduplication (Bismark manual) of the BAM files generated by alignment. Results will be summarized by MultiQC (Ewels et al. 2016).

1.2 Outputs

  • *_dedup.sorted.bam: Deduplicated, sorted BAMs.

  • *_dedup.sorted.bam.bai: Deduplicated, sorted BAM index file. Useful for IGV.

  • *.deduplicated.bam: Deduplicated BAMs. These are not sorted.

  • *.deduplication_report.txt: Individual summary reports for each deduplication process.

  • multiqc_report.html: A summary report of the alignment results generated by MultiQC, in HTML format.

Due to the large file sizes of BAMS, these cannot be hosted in the ceasmallr GitHub repo. As such these files are available for download here:

2 Create a Bash variables file

This allows usage of Bash variables across R Markdown chunks.

{
echo "#### Assign Variables ####"
echo ""

echo "# Data directories"
echo 'export repo_dir=/home/shared/8TB_HDD_01/sam/gitrepos/ceasmallr'
echo 'export output_dir_top=${repo_dir}/output/02.10-bismark-deduplication'
echo 'export alignment_dir="${repo_dir}/output/02.00-bismark-bowtie2-alignment"'
echo ""


echo "# Paths to programs"
echo 'export programs_dir="/home/shared"'
echo 'export bismark_dir="${programs_dir}/Bismark-0.24.0"'
echo 'export bowtie2_dir="${programs_dir}/bowtie2-2.4.4-linux-x86_64"'
echo 'export multiqc="/home/sam/programs/mambaforge/bin/multiqc"'
echo 'export samtools_dir="${programs_dir}/samtools-1.12"'
echo ""

echo "# Set number of CPUs to use"
echo 'export threads=40'


echo "# Print formatting"
echo 'export line="--------------------------------------------------------"'
echo ""
} > .bashvars

cat .bashvars
#### Assign Variables ####

# Data directories
export repo_dir=/home/shared/8TB_HDD_01/sam/gitrepos/ceasmallr
export output_dir_top=${repo_dir}/output/02.10-bismark-deduplication
export alignment_dir="${repo_dir}/output/02.00-bismark-bowtie2-alignment"

# Paths to programs
export programs_dir="/home/shared"
export bismark_dir="${programs_dir}/Bismark-0.24.0"
export bowtie2_dir="${programs_dir}/bowtie2-2.4.4-linux-x86_64"
export multiqc="/home/sam/programs/mambaforge/bin/multiqc"
export samtools_dir="${programs_dir}/samtools-1.12"

# Set number of CPUs to use
export threads=40
# Print formatting
export line="--------------------------------------------------------"

3 Deduplication

# Load bash variables into memory
source .bashvars

# Make output directory, if it doesn't exist
mkdir --parents "${output_dir_top}"

cd "${alignment_dir}"

for bam in *_bismark_bt2_pe.bam
do
  # Set stderr filename
  stderr_name=$(echo "${bam}" | awk -F"." '{print $1"."$2"."$3".deduplication.stderr"}')
  
  # Run deduplication
  ${bismark_dir}/deduplicate_bismark \
  --samtools_path ${samtools_dir} \
  --paired \
  --bam \
  --output_dir ${output_dir_top} \
  ${bam} \
  > ${output_dir_top}/${stderr_name} 2>&1
done

4 Sorting

# Load bash variables into memory
source .bashvars

cd "${output_dir_top}"


#Sorting for Downstream Applications
find *deduplicated.bam \
| xargs basename -s _R1_001.fastp-trim.20220827_bismark_bt2_pe.deduplicated.bam \
| xargs -I{} ${samtools_dir}/samtools \
sort {}_R1_001.fastp-trim.20220827_bismark_bt2_pe.deduplicated.bam \
-@ "${threads}" \
-o {}_dedup.sorted.bam

5 Indexing for Downstream Applications

# Load bash variables into memory
source .bashvars

cd "${output_dir_top}"
find *dedup.sorted.bam \
| xargs basename -s _dedup.sorted.bam \
| xargs -I{} ${samtools_dir}/samtools \
index {}_dedup.sorted.bam

6 MultiQC

# Load bash variables into memory
source .bashvars

cd "${output_dir_top}"

${multiqc} .

7 Checksums

# Load bash variables into memory
source .bashvars

cd "${output_dir_top}"

for file in *; do
  if [ "${file}" != "checksums.md5" ]; then
    md5sum "${file}" >> checksums.md5
  fi
done

8 List output files

# Load bash variables into memory
source .bashvars

cd "${output_dir_top}"

ls -lh
total 213G
-rw-r--r-- 1 sam sam 455M Nov 14 07:15 CF01-CM01-Zygote_dedup.sorted.bam
-rw-r--r-- 1 sam sam 603K Nov 14 08:30 CF01-CM01-Zygote_dedup.sorted.bam.bai
-rw-r--r-- 1 sam sam 498M Nov 13 13:56 CF01-CM01-Zygote_R1_001.fastp-trim.20220827_bismark_bt2_pe.deduplicated.bam
-rw-r--r-- 1 sam sam  315 Nov 13 13:56 CF01-CM01-Zygote_R1_001.fastp-trim.20220827_bismark_bt2_pe.deduplication_report.txt
-rw-r--r-- 1 sam sam 3.4K Nov 13 13:56 CF01-CM01-Zygote_R1_001.fastp-trim.20220827_bismark_bt2_pe.deduplication.stderr
-rw-r--r-- 1 sam sam 4.4G Nov 14 07:19 CF01-CM02-Larvae_dedup.sorted.bam
-rw-r--r-- 1 sam sam 1.4M Nov 14 08:31 CF01-CM02-Larvae_dedup.sorted.bam.bai
-rw-r--r-- 1 sam sam 5.9G Nov 13 14:12 CF01-CM02-Larvae_R1_001.fastp-trim.20220827_bismark_bt2_pe.deduplicated.bam
-rw-r--r-- 1 sam sam  319 Nov 13 14:12 CF01-CM02-Larvae_R1_001.fastp-trim.20220827_bismark_bt2_pe.deduplication_report.txt
-rw-r--r-- 1 sam sam 3.4K Nov 13 14:12 CF01-CM02-Larvae_R1_001.fastp-trim.20220827_bismark_bt2_pe.deduplication.stderr
-rw-r--r-- 1 sam sam 2.1G Nov 14 07:20 CF02-CM02-Zygote_dedup.sorted.bam
-rw-r--r-- 1 sam sam 921K Nov 14 08:32 CF02-CM02-Zygote_dedup.sorted.bam.bai
-rw-r--r-- 1 sam sam 2.5G Nov 13 14:19 CF02-CM02-Zygote_R1_001.fastp-trim.20220827_bismark_bt2_pe.deduplicated.bam
-rw-r--r-- 1 sam sam  319 Nov 13 14:19 CF02-CM02-Zygote_R1_001.fastp-trim.20220827_bismark_bt2_pe.deduplication_report.txt
-rw-r--r-- 1 sam sam 3.4K Nov 13 14:19 CF02-CM02-Zygote_R1_001.fastp-trim.20220827_bismark_bt2_pe.deduplication.stderr
-rw-r--r-- 1 sam sam 3.0G Nov 14 07:22 CF03-CM03-Zygote_dedup.sorted.bam
-rw-r--r-- 1 sam sam 1.2M Nov 14 08:33 CF03-CM03-Zygote_dedup.sorted.bam.bai
-rw-r--r-- 1 sam sam 3.7G Nov 13 14:29 CF03-CM03-Zygote_R1_001.fastp-trim.20220827_bismark_bt2_pe.deduplicated.bam
-rw-r--r-- 1 sam sam  318 Nov 13 14:29 CF03-CM03-Zygote_R1_001.fastp-trim.20220827_bismark_bt2_pe.deduplication_report.txt
-rw-r--r-- 1 sam sam 3.4K Nov 13 14:29 CF03-CM03-Zygote_R1_001.fastp-trim.20220827_bismark_bt2_pe.deduplication.stderr
-rw-r--r-- 1 sam sam 3.4G Nov 14 07:24 CF03-CM04-Larvae_dedup.sorted.bam
-rw-r--r-- 1 sam sam 1.3M Nov 14 08:34 CF03-CM04-Larvae_dedup.sorted.bam.bai
-rw-r--r-- 1 sam sam 4.6G Nov 13 14:43 CF03-CM04-Larvae_R1_001.fastp-trim.20220827_bismark_bt2_pe.deduplicated.bam
-rw-r--r-- 1 sam sam  320 Nov 13 14:43 CF03-CM04-Larvae_R1_001.fastp-trim.20220827_bismark_bt2_pe.deduplication_report.txt
-rw-r--r-- 1 sam sam 3.4K Nov 13 14:43 CF03-CM04-Larvae_R1_001.fastp-trim.20220827_bismark_bt2_pe.deduplication.stderr
-rw-r--r-- 1 sam sam 4.1G Nov 14 07:26 CF03-CM05-Larvae_dedup.sorted.bam
-rw-r--r-- 1 sam sam 1.4M Nov 14 08:35 CF03-CM05-Larvae_dedup.sorted.bam.bai
-rw-r--r-- 1 sam sam 5.4G Nov 13 14:58 CF03-CM05-Larvae_R1_001.fastp-trim.20220827_bismark_bt2_pe.deduplicated.bam
-rw-r--r-- 1 sam sam  318 Nov 13 14:58 CF03-CM05-Larvae_R1_001.fastp-trim.20220827_bismark_bt2_pe.deduplication_report.txt
-rw-r--r-- 1 sam sam 3.4K Nov 13 14:58 CF03-CM05-Larvae_R1_001.fastp-trim.20220827_bismark_bt2_pe.deduplication.stderr
-rw-r--r-- 1 sam sam 3.0G Nov 14 07:28 CF04-CM04-Zygote_dedup.sorted.bam
-rw-r--r-- 1 sam sam 1.2M Nov 14 08:36 CF04-CM04-Zygote_dedup.sorted.bam.bai
-rw-r--r-- 1 sam sam 3.7G Nov 13 15:08 CF04-CM04-Zygote_R1_001.fastp-trim.20220827_bismark_bt2_pe.deduplicated.bam
-rw-r--r-- 1 sam sam  319 Nov 13 15:08 CF04-CM04-Zygote_R1_001.fastp-trim.20220827_bismark_bt2_pe.deduplication_report.txt
-rw-r--r-- 1 sam sam 3.4K Nov 13 15:08 CF04-CM04-Zygote_R1_001.fastp-trim.20220827_bismark_bt2_pe.deduplication.stderr
-rw-r--r-- 1 sam sam 5.4G Nov 14 07:33 CF05-CM02-Larvae_dedup.sorted.bam
-rw-r--r-- 1 sam sam 1.6M Nov 14 08:38 CF05-CM02-Larvae_dedup.sorted.bam.bai
-rw-r--r-- 1 sam sam 7.5G Nov 13 15:29 CF05-CM02-Larvae_R1_001.fastp-trim.20220827_bismark_bt2_pe.deduplicated.bam
-rw-r--r-- 1 sam sam  320 Nov 13 15:29 CF05-CM02-Larvae_R1_001.fastp-trim.20220827_bismark_bt2_pe.deduplication_report.txt
-rw-r--r-- 1 sam sam 3.4K Nov 13 15:30 CF05-CM02-Larvae_R1_001.fastp-trim.20220827_bismark_bt2_pe.deduplication.stderr
-rw-r--r-- 1 sam sam 3.5G Nov 14 07:35 CF05-CM05-Zygote_dedup.sorted.bam
-rw-r--r-- 1 sam sam 1.3M Nov 14 08:39 CF05-CM05-Zygote_dedup.sorted.bam.bai
-rw-r--r-- 1 sam sam 4.4G Nov 13 15:42 CF05-CM05-Zygote_R1_001.fastp-trim.20220827_bismark_bt2_pe.deduplicated.bam
-rw-r--r-- 1 sam sam  319 Nov 13 15:42 CF05-CM05-Zygote_R1_001.fastp-trim.20220827_bismark_bt2_pe.deduplication_report.txt
-rw-r--r-- 1 sam sam 3.4K Nov 13 15:42 CF05-CM05-Zygote_R1_001.fastp-trim.20220827_bismark_bt2_pe.deduplication.stderr
-rw-r--r-- 1 sam sam 3.3G Nov 14 07:37 CF06-CM01-Zygote_dedup.sorted.bam
-rw-r--r-- 1 sam sam 1.3M Nov 14 08:40 CF06-CM01-Zygote_dedup.sorted.bam.bai
-rw-r--r-- 1 sam sam 4.0G Nov 13 15:52 CF06-CM01-Zygote_R1_001.fastp-trim.20220827_bismark_bt2_pe.deduplicated.bam
-rw-r--r-- 1 sam sam  318 Nov 13 15:52 CF06-CM01-Zygote_R1_001.fastp-trim.20220827_bismark_bt2_pe.deduplication_report.txt
-rw-r--r-- 1 sam sam 3.4K Nov 13 15:53 CF06-CM01-Zygote_R1_001.fastp-trim.20220827_bismark_bt2_pe.deduplication.stderr
-rw-r--r-- 1 sam sam 2.6G Nov 14 07:38 CF06-CM02-Larvae_dedup.sorted.bam
-rw-r--r-- 1 sam sam 1.1M Nov 14 08:41 CF06-CM02-Larvae_dedup.sorted.bam.bai
-rw-r--r-- 1 sam sam 3.1G Nov 13 16:01 CF06-CM02-Larvae_R1_001.fastp-trim.20220827_bismark_bt2_pe.deduplicated.bam
-rw-r--r-- 1 sam sam  318 Nov 13 16:01 CF06-CM02-Larvae_R1_001.fastp-trim.20220827_bismark_bt2_pe.deduplication_report.txt
-rw-r--r-- 1 sam sam 3.4K Nov 13 16:01 CF06-CM02-Larvae_R1_001.fastp-trim.20220827_bismark_bt2_pe.deduplication.stderr
-rw-r--r-- 1 sam sam 2.0G Nov 14 07:39 CF07-CM02-Zygote_dedup.sorted.bam
-rw-r--r-- 1 sam sam 968K Nov 14 08:41 CF07-CM02-Zygote_dedup.sorted.bam.bai
-rw-r--r-- 1 sam sam 2.4G Nov 13 16:08 CF07-CM02-Zygote_R1_001.fastp-trim.20220827_bismark_bt2_pe.deduplicated.bam
-rw-r--r-- 1 sam sam  318 Nov 13 16:08 CF07-CM02-Zygote_R1_001.fastp-trim.20220827_bismark_bt2_pe.deduplication_report.txt
-rw-r--r-- 1 sam sam 3.4K Nov 13 16:08 CF07-CM02-Zygote_R1_001.fastp-trim.20220827_bismark_bt2_pe.deduplication.stderr
-rw-r--r-- 1 sam sam 2.2G Nov 14 07:41 CF08-CM03-Zygote_dedup.sorted.bam
-rw-r--r-- 1 sam sam 950K Nov 14 08:42 CF08-CM03-Zygote_dedup.sorted.bam.bai
-rw-r--r-- 1 sam sam 2.6G Nov 13 16:15 CF08-CM03-Zygote_R1_001.fastp-trim.20220827_bismark_bt2_pe.deduplicated.bam
-rw-r--r-- 1 sam sam  319 Nov 13 16:15 CF08-CM03-Zygote_R1_001.fastp-trim.20220827_bismark_bt2_pe.deduplication_report.txt
-rw-r--r-- 1 sam sam 3.4K Nov 13 16:15 CF08-CM03-Zygote_R1_001.fastp-trim.20220827_bismark_bt2_pe.deduplication.stderr
-rw-r--r-- 1 sam sam  58M Nov 14 07:41 CF08-CM04-Larvae_dedup.sorted.bam
-rw-r--r-- 1 sam sam 1.1M Nov 14 08:42 CF08-CM04-Larvae_dedup.sorted.bam.bai
-rw-r--r-- 1 sam sam  60M Nov 13 16:15 CF08-CM04-Larvae_R1_001.fastp-trim.20220827_bismark_bt2_pe.deduplicated.bam
-rw-r--r-- 1 sam sam  312 Nov 13 16:15 CF08-CM04-Larvae_R1_001.fastp-trim.20220827_bismark_bt2_pe.deduplication_report.txt
-rw-r--r-- 1 sam sam 3.4K Nov 13 16:15 CF08-CM04-Larvae_R1_001.fastp-trim.20220827_bismark_bt2_pe.deduplication.stderr
-rw-r--r-- 1 sam sam  50M Nov 14 07:41 CF08-CM05-Larvae_dedup.sorted.bam
-rw-r--r-- 1 sam sam 1.1M Nov 14 08:42 CF08-CM05-Larvae_dedup.sorted.bam.bai
-rw-r--r-- 1 sam sam  52M Nov 13 16:15 CF08-CM05-Larvae_R1_001.fastp-trim.20220827_bismark_bt2_pe.deduplicated.bam
-rw-r--r-- 1 sam sam  314 Nov 13 16:15 CF08-CM05-Larvae_R1_001.fastp-trim.20220827_bismark_bt2_pe.deduplication_report.txt
-rw-r--r-- 1 sam sam 3.4K Nov 13 16:15 CF08-CM05-Larvae_R1_001.fastp-trim.20220827_bismark_bt2_pe.deduplication.stderr
-rw-r--r-- 1 sam sam  16K Nov 14 09:44 checksums.md5
-rw-r--r-- 1 sam sam 2.8G Nov 14 07:42 EF01-EM01-Zygote_dedup.sorted.bam
-rw-r--r-- 1 sam sam 1.2M Nov 14 08:43 EF01-EM01-Zygote_dedup.sorted.bam.bai
-rw-r--r-- 1 sam sam 3.5G Nov 13 16:25 EF01-EM01-Zygote_R1_001.fastp-trim.20220827_bismark_bt2_pe.deduplicated.bam
-rw-r--r-- 1 sam sam  318 Nov 13 16:25 EF01-EM01-Zygote_R1_001.fastp-trim.20220827_bismark_bt2_pe.deduplication_report.txt
-rw-r--r-- 1 sam sam 3.4K Nov 13 16:25 EF01-EM01-Zygote_R1_001.fastp-trim.20220827_bismark_bt2_pe.deduplication.stderr
-rw-r--r-- 1 sam sam 3.0G Nov 14 07:44 EF02-EM02-Zygote_dedup.sorted.bam
-rw-r--r-- 1 sam sam 1.2M Nov 14 08:44 EF02-EM02-Zygote_dedup.sorted.bam.bai
-rw-r--r-- 1 sam sam 3.7G Nov 13 16:35 EF02-EM02-Zygote_R1_001.fastp-trim.20220827_bismark_bt2_pe.deduplicated.bam
-rw-r--r-- 1 sam sam  318 Nov 13 16:35 EF02-EM02-Zygote_R1_001.fastp-trim.20220827_bismark_bt2_pe.deduplication_report.txt
-rw-r--r-- 1 sam sam 3.4K Nov 13 16:35 EF02-EM02-Zygote_R1_001.fastp-trim.20220827_bismark_bt2_pe.deduplication.stderr
-rw-r--r-- 1 sam sam 2.0G Nov 14 07:45 EF03-EM03-Zygote_dedup.sorted.bam
-rw-r--r-- 1 sam sam 897K Nov 14 08:45 EF03-EM03-Zygote_dedup.sorted.bam.bai
-rw-r--r-- 1 sam sam 2.3G Nov 13 16:41 EF03-EM03-Zygote_R1_001.fastp-trim.20220827_bismark_bt2_pe.deduplicated.bam
-rw-r--r-- 1 sam sam  318 Nov 13 16:41 EF03-EM03-Zygote_R1_001.fastp-trim.20220827_bismark_bt2_pe.deduplication_report.txt
-rw-r--r-- 1 sam sam 3.4K Nov 13 16:41 EF03-EM03-Zygote_R1_001.fastp-trim.20220827_bismark_bt2_pe.deduplication.stderr
-rw-r--r-- 1 sam sam 4.1G Nov 14 07:47 EF03-EM04-Larvae_dedup.sorted.bam
-rw-r--r-- 1 sam sam 1.4M Nov 14 08:46 EF03-EM04-Larvae_dedup.sorted.bam.bai
-rw-r--r-- 1 sam sam 5.1G Nov 13 16:55 EF03-EM04-Larvae_R1_001.fastp-trim.20220827_bismark_bt2_pe.deduplicated.bam
-rw-r--r-- 1 sam sam  319 Nov 13 16:55 EF03-EM04-Larvae_R1_001.fastp-trim.20220827_bismark_bt2_pe.deduplication_report.txt
-rw-r--r-- 1 sam sam 3.4K Nov 13 16:56 EF03-EM04-Larvae_R1_001.fastp-trim.20220827_bismark_bt2_pe.deduplication.stderr
-rw-r--r-- 1 sam sam 4.1G Nov 14 07:50 EF03-EM05-Larvae_dedup.sorted.bam
-rw-r--r-- 1 sam sam 1.5M Nov 14 08:47 EF03-EM05-Larvae_dedup.sorted.bam.bai
-rw-r--r-- 1 sam sam 5.4G Nov 13 17:10 EF03-EM05-Larvae_R1_001.fastp-trim.20220827_bismark_bt2_pe.deduplicated.bam
-rw-r--r-- 1 sam sam  318 Nov 13 17:10 EF03-EM05-Larvae_R1_001.fastp-trim.20220827_bismark_bt2_pe.deduplication_report.txt
-rw-r--r-- 1 sam sam 3.4K Nov 13 17:11 EF03-EM05-Larvae_R1_001.fastp-trim.20220827_bismark_bt2_pe.deduplication.stderr
-rw-r--r-- 1 sam sam 1.1G Nov 14 07:50 EF04-EM04-Zygote_dedup.sorted.bam
-rw-r--r-- 1 sam sam 600K Nov 14 08:47 EF04-EM04-Zygote_dedup.sorted.bam.bai
-rw-r--r-- 1 sam sam 1.2G Nov 13 17:14 EF04-EM04-Zygote_R1_001.fastp-trim.20220827_bismark_bt2_pe.deduplicated.bam
-rw-r--r-- 1 sam sam  317 Nov 13 17:14 EF04-EM04-Zygote_R1_001.fastp-trim.20220827_bismark_bt2_pe.deduplication_report.txt
-rw-r--r-- 1 sam sam 3.4K Nov 13 17:14 EF04-EM04-Zygote_R1_001.fastp-trim.20220827_bismark_bt2_pe.deduplication.stderr
-rw-r--r-- 1 sam sam 4.9G Nov 14 07:55 EF04-EM05-Larvae_dedup.sorted.bam
-rw-r--r-- 1 sam sam 1.5M Nov 14 08:49 EF04-EM05-Larvae_dedup.sorted.bam.bai
-rw-r--r-- 1 sam sam 6.8G Nov 13 17:33 EF04-EM05-Larvae_R1_001.fastp-trim.20220827_bismark_bt2_pe.deduplicated.bam
-rw-r--r-- 1 sam sam  320 Nov 13 17:33 EF04-EM05-Larvae_R1_001.fastp-trim.20220827_bismark_bt2_pe.deduplication_report.txt
-rw-r--r-- 1 sam sam 3.4K Nov 13 17:33 EF04-EM05-Larvae_R1_001.fastp-trim.20220827_bismark_bt2_pe.deduplication.stderr
-rw-r--r-- 1 sam sam 3.1G Nov 14 07:57 EF05-EM01-Larvae_dedup.sorted.bam
-rw-r--r-- 1 sam sam 1.2M Nov 14 08:50 EF05-EM01-Larvae_dedup.sorted.bam.bai
-rw-r--r-- 1 sam sam 4.1G Nov 13 17:45 EF05-EM01-Larvae_R1_001.fastp-trim.20220827_bismark_bt2_pe.deduplicated.bam
-rw-r--r-- 1 sam sam  318 Nov 13 17:45 EF05-EM01-Larvae_R1_001.fastp-trim.20220827_bismark_bt2_pe.deduplication_report.txt
-rw-r--r-- 1 sam sam 3.4K Nov 13 17:45 EF05-EM01-Larvae_R1_001.fastp-trim.20220827_bismark_bt2_pe.deduplication.stderr
-rw-r--r-- 1 sam sam 702M Nov 14 07:57 EF05-EM05-Zygote_dedup.sorted.bam
-rw-r--r-- 1 sam sam 567K Nov 14 08:50 EF05-EM05-Zygote_dedup.sorted.bam.bai
-rw-r--r-- 1 sam sam 756M Nov 13 17:47 EF05-EM05-Zygote_R1_001.fastp-trim.20220827_bismark_bt2_pe.deduplicated.bam
-rw-r--r-- 1 sam sam  316 Nov 13 17:47 EF05-EM05-Zygote_R1_001.fastp-trim.20220827_bismark_bt2_pe.deduplication_report.txt
-rw-r--r-- 1 sam sam 3.4K Nov 13 17:47 EF05-EM05-Zygote_R1_001.fastp-trim.20220827_bismark_bt2_pe.deduplication.stderr
-rw-r--r-- 1 sam sam 5.0G Nov 14 08:02 EF05-EM06-Larvae_dedup.sorted.bam
-rw-r--r-- 1 sam sam 1.6M Nov 14 08:52 EF05-EM06-Larvae_dedup.sorted.bam.bai
-rw-r--r-- 1 sam sam 7.0G Nov 13 18:07 EF05-EM06-Larvae_R1_001.fastp-trim.20220827_bismark_bt2_pe.deduplicated.bam
-rw-r--r-- 1 sam sam  320 Nov 13 18:07 EF05-EM06-Larvae_R1_001.fastp-trim.20220827_bismark_bt2_pe.deduplication_report.txt
-rw-r--r-- 1 sam sam 3.4K Nov 13 18:07 EF05-EM06-Larvae_R1_001.fastp-trim.20220827_bismark_bt2_pe.deduplication.stderr
-rw-r--r-- 1 sam sam 4.9G Nov 14 08:06 EF06-EM01-Larvae_dedup.sorted.bam
-rw-r--r-- 1 sam sam 1.6M Nov 14 08:53 EF06-EM01-Larvae_dedup.sorted.bam.bai
-rw-r--r-- 1 sam sam 6.8G Nov 13 18:26 EF06-EM01-Larvae_R1_001.fastp-trim.20220827_bismark_bt2_pe.deduplicated.bam
-rw-r--r-- 1 sam sam  319 Nov 13 18:26 EF06-EM01-Larvae_R1_001.fastp-trim.20220827_bismark_bt2_pe.deduplication_report.txt
-rw-r--r-- 1 sam sam 3.4K Nov 13 18:26 EF06-EM01-Larvae_R1_001.fastp-trim.20220827_bismark_bt2_pe.deduplication.stderr
-rw-r--r-- 1 sam sam 4.0G Nov 14 08:09 EF06-EM02-Larvae_dedup.sorted.bam
-rw-r--r-- 1 sam sam 1.4M Nov 14 08:55 EF06-EM02-Larvae_dedup.sorted.bam.bai
-rw-r--r-- 1 sam sam 5.2G Nov 13 18:41 EF06-EM02-Larvae_R1_001.fastp-trim.20220827_bismark_bt2_pe.deduplicated.bam
-rw-r--r-- 1 sam sam  319 Nov 13 18:41 EF06-EM02-Larvae_R1_001.fastp-trim.20220827_bismark_bt2_pe.deduplication_report.txt
-rw-r--r-- 1 sam sam 3.4K Nov 13 18:41 EF06-EM02-Larvae_R1_001.fastp-trim.20220827_bismark_bt2_pe.deduplication.stderr
-rw-r--r-- 1 sam sam 1.2G Nov 14 08:09 EF06-EM06-Larvae_dedup.sorted.bam
-rw-r--r-- 1 sam sam 574K Nov 14 08:55 EF06-EM06-Larvae_dedup.sorted.bam.bai
-rw-r--r-- 1 sam sam 1.3G Nov 13 18:48 EF06-EM06-Larvae_R1_001.fastp-trim.20220827_bismark_bt2_pe.deduplicated.bam
-rw-r--r-- 1 sam sam  318 Nov 13 18:48 EF06-EM06-Larvae_R1_001.fastp-trim.20220827_bismark_bt2_pe.deduplication_report.txt
-rw-r--r-- 1 sam sam 3.4K Nov 13 18:48 EF06-EM06-Larvae_R1_001.fastp-trim.20220827_bismark_bt2_pe.deduplication.stderr
-rw-r--r-- 1 sam sam 1.5G Nov 14 08:10 EF07-EM01-Zygote_dedup.sorted.bam
-rw-r--r-- 1 sam sam 730K Nov 14 08:55 EF07-EM01-Zygote_dedup.sorted.bam.bai
-rw-r--r-- 1 sam sam 1.7G Nov 13 18:53 EF07-EM01-Zygote_R1_001.fastp-trim.20220827_bismark_bt2_pe.deduplicated.bam
-rw-r--r-- 1 sam sam  319 Nov 13 18:53 EF07-EM01-Zygote_R1_001.fastp-trim.20220827_bismark_bt2_pe.deduplication_report.txt
-rw-r--r-- 1 sam sam 3.4K Nov 13 18:53 EF07-EM01-Zygote_R1_001.fastp-trim.20220827_bismark_bt2_pe.deduplication.stderr
-rw-r--r-- 1 sam sam 3.8G Nov 14 08:12 EF07-EM03-Larvae_dedup.sorted.bam
-rw-r--r-- 1 sam sam 1.4M Nov 14 08:57 EF07-EM03-Larvae_dedup.sorted.bam.bai
-rw-r--r-- 1 sam sam 5.2G Nov 13 19:08 EF07-EM03-Larvae_R1_001.fastp-trim.20220827_bismark_bt2_pe.deduplicated.bam
-rw-r--r-- 1 sam sam  319 Nov 13 19:08 EF07-EM03-Larvae_R1_001.fastp-trim.20220827_bismark_bt2_pe.deduplication_report.txt
-rw-r--r-- 1 sam sam 3.4K Nov 13 19:08 EF07-EM03-Larvae_R1_001.fastp-trim.20220827_bismark_bt2_pe.deduplication.stderr
-rw-r--r-- 1 sam sam 4.3G Nov 14 08:17 EF08-EM03-Larvae_dedup.sorted.bam
-rw-r--r-- 1 sam sam 1.4M Nov 14 08:58 EF08-EM03-Larvae_dedup.sorted.bam.bai
-rw-r--r-- 1 sam sam 6.0G Nov 13 19:25 EF08-EM03-Larvae_R1_001.fastp-trim.20220827_bismark_bt2_pe.deduplicated.bam
-rw-r--r-- 1 sam sam  320 Nov 13 19:25 EF08-EM03-Larvae_R1_001.fastp-trim.20220827_bismark_bt2_pe.deduplication_report.txt
-rw-r--r-- 1 sam sam 3.4K Nov 13 19:25 EF08-EM03-Larvae_R1_001.fastp-trim.20220827_bismark_bt2_pe.deduplication.stderr
-rw-r--r-- 1 sam sam 3.8G Nov 14 08:19 EF08-EM04-Larvae_dedup.sorted.bam
-rw-r--r-- 1 sam sam 1.3M Nov 14 08:59 EF08-EM04-Larvae_dedup.sorted.bam.bai
-rw-r--r-- 1 sam sam 4.9G Nov 13 19:38 EF08-EM04-Larvae_R1_001.fastp-trim.20220827_bismark_bt2_pe.deduplicated.bam
-rw-r--r-- 1 sam sam  319 Nov 13 19:38 EF08-EM04-Larvae_R1_001.fastp-trim.20220827_bismark_bt2_pe.deduplication_report.txt
-rw-r--r-- 1 sam sam 3.4K Nov 13 19:39 EF08-EM04-Larvae_R1_001.fastp-trim.20220827_bismark_bt2_pe.deduplication.stderr
drwxr-xr-x 2 sam sam 4.0K Nov 14 09:30 multiqc_data
-rw-r--r-- 1 sam sam 1.1M Nov 14 09:30 multiqc_report.html
Ewels, Philip, Måns Magnusson, Sverker Lundin, and Max Käller. 2016. “MultiQC: Summarize Analysis Results for Multiple Tools and Samples in a Single Report.” Bioinformatics 32 (19): 3047–48. https://doi.org/10.1093/bioinformatics/btw354.
Krueger, Felix, and Simon R. Andrews. 2011. “Bismark: A Flexible Aligner and Methylation Caller for Bisulfite-Seq Applications.” Bioinformatics 27 (11): 1571–72. https://doi.org/10.1093/bioinformatics/btr167.
LS0tCnRpdGxlOiAiMDIuMTAtYmlzbWFyay1kZWR1cGxpY2F0aW9uIgphdXRob3I6ICJTYW0gV2hpdGUiCmRhdGU6ICIyMDI0LTExLTA5IgpvdXRwdXQ6IAogIGJvb2tkb3duOjpodG1sX2RvY3VtZW50MjoKICAgIHRoZW1lOiBjb3NtbwogICAgdG9jOiB0cnVlCiAgICB0b2NfZmxvYXQ6IHRydWUKICAgIG51bWJlcl9zZWN0aW9uczogdHJ1ZQogICAgY29kZV9mb2xkaW5nOiBzaG93CiAgICBjb2RlX2Rvd25sb2FkOiB0cnVlCiAgZ2l0aHViX2RvY3VtZW50OgogICAgdG9jOiB0cnVlCiAgICBudW1iZXJfc2VjdGlvbnM6IHRydWUKICBodG1sX2RvY3VtZW50OgogICAgdGhlbWU6IGNvc21vCiAgICB0b2M6IHRydWUKICAgIHRvY19mbG9hdDogdHJ1ZQogICAgbnVtYmVyX3NlY3Rpb25zOiB0cnVlCiAgICBjb2RlX2ZvbGRpbmc6IHNob3cKICAgIGNvZGVfZG93bmxvYWQ6IHRydWUKYmlibGlvZ3JhcGh5OiByZWZlcmVuY2VzLmJpYgotLS0KIyBJbnRybwoKIyMgSW5wdXRzCgpUaGlzIG5vdGVib29rIGlzIGEgY29udGludWF0aW9uIG9mIHVzaW5nIFtCaXNtYXJrXShodHRwczovL2ZlbGl4a3J1ZWdlci5naXRodWIuaW8vQmlzbWFyay8pIFtAa3J1ZWdlcjIwMTFdIHRvIHBlcmZvcm0gbWV0aHlsYXRpb24gYW5hbHlzaXMuIFJlYWQgYWxpZ25tZW50cyB3ZXJlIHJ1biBpbiBbMDIuMDAtYmlzbWFyay1ib3d0aWUyLWFsaWdubWVudC5SbWRdKC4vMDIuMDAtYmlzbWFyay1ib3d0aWUyLWFsaWdubWVudC5SbWQpLiBUaGlzIHN0ZXAgd2lsbCBwZXJmb3JtIFtkZWR1cGxpY2F0aW9uXShodHRwczovL2ZlbGl4a3J1ZWdlci5naXRodWIuaW8vQmlzbWFyay9iaXNtYXJrL2RlZHVwbGljYXRpb24vKSAoQmlzbWFyayBtYW51YWwpIG9mIHRoZSBCQU0gZmlsZXMgZ2VuZXJhdGVkIGJ5IGFsaWdubWVudC4gUmVzdWx0cyB3aWxsIGJlIHN1bW1hcml6ZWQgYnkgW011bHRpUUNdKGh0dHBzOi8vZ2l0aHViLmNvbS9NdWx0aVFDL011bHRpUUMpIFtAZXdlbHMyMDE2XS4KCiMjIE91dHB1dHMKCi0gYCpfZGVkdXAuc29ydGVkLmJhbWA6IERlZHVwbGljYXRlZCwgc29ydGVkIEJBTXMuCgotIGAqX2RlZHVwLnNvcnRlZC5iYW0uYmFpYDogRGVkdXBsaWNhdGVkLCBzb3J0ZWQgQkFNIGluZGV4IGZpbGUuIFVzZWZ1bCBmb3IgSUdWLgoKLSBgKi5kZWR1cGxpY2F0ZWQuYmFtYDogRGVkdXBsaWNhdGVkIEJBTXMuIFRoZXNlIGFyZSBfbm90XyBzb3J0ZWQuCgotIGAqLmRlZHVwbGljYXRpb25fcmVwb3J0LnR4dGA6IEluZGl2aWR1YWwgc3VtbWFyeSByZXBvcnRzIGZvciBlYWNoIGRlZHVwbGljYXRpb24gcHJvY2Vzcy4KCi0gYG11bHRpcWNfcmVwb3J0Lmh0bWxgOiBBIHN1bW1hcnkgcmVwb3J0IG9mIHRoZSBhbGlnbm1lbnQgcmVzdWx0cyBnZW5lcmF0ZWQgYnkgW011bHRpUUNdKGh0dHBzOi8vZ2l0aHViLmNvbS9NdWx0aVFDL011bHRpUUMpLCBpbiBIVE1MIGZvcm1hdC4KCkR1ZSB0byB0aGUgbGFyZ2UgZmlsZSBzaXplcyBvZiBCQU1TLCB0aGVzZSBjYW5ub3QgYmUgaG9zdGVkIGluIHRoZSBbY2Vhc21hbGxyIEdpdEh1YiByZXBvXShodHRwczovL2dpdGh1Yi5jb20vc3IzMjAvY2Vhc21hbGxyKS4gQXMgc3VjaCB0aGVzZSBmaWxlcyBhcmUgYXZhaWxhYmxlIGZvciBkb3dubG9hZCBoZXJlOgoKLSBbaHR0cHM6Ly9nYW5uZXQuZmlzaC53YXNoaW5ndG9uLmVkdS9BdHVtZWZhY2llbnMvZ2l0cmVwb3MvY2Vhc21hbGxyL291dHB1dC8wMi4xMC1iaXNtYXJrLWRlZHVwbGljYXRpb24vXShodHRwczovL2dhbm5ldC5maXNoLndhc2hpbmd0b24uZWR1L0F0dW1lZmFjaWVucy9naXRyZXBvcy9jZWFzbWFsbHIvb3V0cHV0LzAyLjEwLWJpc21hcmstZGVkdXBsaWNhdGlvbi8pCgoKYGBge3Igc2V0dXAsIGluY2x1ZGU9RkFMU0V9CmxpYnJhcnkoa25pdHIpCmtuaXRyOjpvcHRzX2NodW5rJHNldCgKICBlY2hvID0gVFJVRSwgICAgICAgICAjIERpc3BsYXkgY29kZSBjaHVua3MKICBldmFsID0gRkFMU0UsICAgICAgICAjIEV2YWx1YXRlIGNvZGUgY2h1bmtzCiAgd2FybmluZyA9IEZBTFNFLCAgICAgIyBIaWRlIHdhcm5pbmdzCiAgbWVzc2FnZSA9IEZBTFNFLCAgICAgIyBIaWRlIG1lc3NhZ2VzCiAgY29tbWVudCA9ICIiICAgICAgICAgIyBQcmV2ZW50cyBhcHBlbmRpbmcgJyMjJyB0byBiZWdpbm5pbmcgb2YgbGluZXMgaW4gY29kZSBvdXRwdXQKKQpgYGAKCiMgQ3JlYXRlIGEgQmFzaCB2YXJpYWJsZXMgZmlsZQoKVGhpcyBhbGxvd3MgdXNhZ2Ugb2YgQmFzaCB2YXJpYWJsZXMgYWNyb3NzIFIgTWFya2Rvd24gY2h1bmtzLgoKYGBge3Igc2F2ZS1iYXNoLXZhcmlhYmxlcy10by1ydmFycy1maWxlLCBlbmdpbmU9J2Jhc2gnLCBldmFsPVRSVUV9CnsKZWNobyAiIyMjIyBBc3NpZ24gVmFyaWFibGVzICMjIyMiCmVjaG8gIiIKCmVjaG8gIiMgRGF0YSBkaXJlY3RvcmllcyIKZWNobyAnZXhwb3J0IHJlcG9fZGlyPS9ob21lL3NoYXJlZC84VEJfSEREXzAxL3NhbS9naXRyZXBvcy9jZWFzbWFsbHInCmVjaG8gJ2V4cG9ydCBvdXRwdXRfZGlyX3RvcD0ke3JlcG9fZGlyfS9vdXRwdXQvMDIuMTAtYmlzbWFyay1kZWR1cGxpY2F0aW9uJwplY2hvICdleHBvcnQgYWxpZ25tZW50X2Rpcj0iJHtyZXBvX2Rpcn0vb3V0cHV0LzAyLjAwLWJpc21hcmstYm93dGllMi1hbGlnbm1lbnQiJwplY2hvICIiCgoKZWNobyAiIyBQYXRocyB0byBwcm9ncmFtcyIKZWNobyAnZXhwb3J0IHByb2dyYW1zX2Rpcj0iL2hvbWUvc2hhcmVkIicKZWNobyAnZXhwb3J0IGJpc21hcmtfZGlyPSIke3Byb2dyYW1zX2Rpcn0vQmlzbWFyay0wLjI0LjAiJwplY2hvICdleHBvcnQgYm93dGllMl9kaXI9IiR7cHJvZ3JhbXNfZGlyfS9ib3d0aWUyLTIuNC40LWxpbnV4LXg4Nl82NCInCmVjaG8gJ2V4cG9ydCBtdWx0aXFjPSIvaG9tZS9zYW0vcHJvZ3JhbXMvbWFtYmFmb3JnZS9iaW4vbXVsdGlxYyInCmVjaG8gJ2V4cG9ydCBzYW10b29sc19kaXI9IiR7cHJvZ3JhbXNfZGlyfS9zYW10b29scy0xLjEyIicKZWNobyAiIgoKZWNobyAiIyBTZXQgbnVtYmVyIG9mIENQVXMgdG8gdXNlIgplY2hvICdleHBvcnQgdGhyZWFkcz00MCcKCgplY2hvICIjIFByaW50IGZvcm1hdHRpbmciCmVjaG8gJ2V4cG9ydCBsaW5lPSItLS0tLS0tLS0tLS0tLS0tLS0tLS0tLS0tLS0tLS0tLS0tLS0tLS0tLS0tLS0tLS0tLS0tLS0tLSInCmVjaG8gIiIKfSA+IC5iYXNodmFycwoKY2F0IC5iYXNodmFycwpgYGAKCiMgRGVkdXBsaWNhdGlvbgpgYGB7YmFzaCBkZWR1cGxpY2F0aW9uLCBlbmdpbmU9J2Jhc2gnLCBldmFsPUZBTFNFfQojIExvYWQgYmFzaCB2YXJpYWJsZXMgaW50byBtZW1vcnkKc291cmNlIC5iYXNodmFycwoKIyBNYWtlIG91dHB1dCBkaXJlY3RvcnksIGlmIGl0IGRvZXNuJ3QgZXhpc3QKbWtkaXIgLS1wYXJlbnRzICIke291dHB1dF9kaXJfdG9wfSIKCmNkICIke2FsaWdubWVudF9kaXJ9IgoKZm9yIGJhbSBpbiAqX2Jpc21hcmtfYnQyX3BlLmJhbQpkbwogICMgU2V0IHN0ZGVyciBmaWxlbmFtZQogIHN0ZGVycl9uYW1lPSQoZWNobyAiJHtiYW19IiB8IGF3ayAtRiIuIiAne3ByaW50ICQxIi4iJDIiLiIkMyIuZGVkdXBsaWNhdGlvbi5zdGRlcnIifScpCiAgCiAgIyBSdW4gZGVkdXBsaWNhdGlvbgogICR7YmlzbWFya19kaXJ9L2RlZHVwbGljYXRlX2Jpc21hcmsgXAogIC0tc2FtdG9vbHNfcGF0aCAke3NhbXRvb2xzX2Rpcn0gXAogIC0tcGFpcmVkIFwKICAtLWJhbSBcCiAgLS1vdXRwdXRfZGlyICR7b3V0cHV0X2Rpcl90b3B9IFwKICAke2JhbX0gXAogID4gJHtvdXRwdXRfZGlyX3RvcH0vJHtzdGRlcnJfbmFtZX0gMj4mMQpkb25lCmBgYAoKIyBTb3J0aW5nCmBgYHtiYXNoIHNvcnRpbmcsIGVuZ2luZT0nYmFzaCcsIGV2YWw9RkFMU0V9CiMgTG9hZCBiYXNoIHZhcmlhYmxlcyBpbnRvIG1lbW9yeQpzb3VyY2UgLmJhc2h2YXJzCgpjZCAiJHtvdXRwdXRfZGlyX3RvcH0iCgoKI1NvcnRpbmcgZm9yIERvd25zdHJlYW0gQXBwbGljYXRpb25zCmZpbmQgKmRlZHVwbGljYXRlZC5iYW0gXAp8IHhhcmdzIGJhc2VuYW1lIC1zIF9SMV8wMDEuZmFzdHAtdHJpbS4yMDIyMDgyN19iaXNtYXJrX2J0Ml9wZS5kZWR1cGxpY2F0ZWQuYmFtIFwKfCB4YXJncyAtSXt9ICR7c2FtdG9vbHNfZGlyfS9zYW10b29scyBcCnNvcnQge31fUjFfMDAxLmZhc3RwLXRyaW0uMjAyMjA4MjdfYmlzbWFya19idDJfcGUuZGVkdXBsaWNhdGVkLmJhbSBcCi1AICIke3RocmVhZHN9IiBcCi1vIHt9X2RlZHVwLnNvcnRlZC5iYW0KYGBgCgoKIyBJbmRleGluZyBmb3IgRG93bnN0cmVhbSBBcHBsaWNhdGlvbnMKYGBge3IgaW5kZXhpbmcsIGVuZ2luZT0nYmFzaCcsIGV2YWw9RkFMU0V9CiMgTG9hZCBiYXNoIHZhcmlhYmxlcyBpbnRvIG1lbW9yeQpzb3VyY2UgLmJhc2h2YXJzCgpjZCAiJHtvdXRwdXRfZGlyX3RvcH0iCmZpbmQgKmRlZHVwLnNvcnRlZC5iYW0gXAp8IHhhcmdzIGJhc2VuYW1lIC1zIF9kZWR1cC5zb3J0ZWQuYmFtIFwKfCB4YXJncyAtSXt9ICR7c2FtdG9vbHNfZGlyfS9zYW10b29scyBcCmluZGV4IHt9X2RlZHVwLnNvcnRlZC5iYW0KCmBgYAoKIyBNdWx0aVFDCmBgYHtyIG11bHRpcWMsIGVuZ2luZT0nYmFzaCcsIGV2YWw9RkFMU0V9CiMgTG9hZCBiYXNoIHZhcmlhYmxlcyBpbnRvIG1lbW9yeQpzb3VyY2UgLmJhc2h2YXJzCgpjZCAiJHtvdXRwdXRfZGlyX3RvcH0iCgoke211bHRpcWN9IC4KYGBgCgojIENoZWNrc3VtcwpgYGB7ciBnZW5lcmF0ZS1jaGVja3N1bXMsIGVuZ2luZT0nYmFzaCcsIGV2YWw9RkFMU0V9CiMgTG9hZCBiYXNoIHZhcmlhYmxlcyBpbnRvIG1lbW9yeQpzb3VyY2UgLmJhc2h2YXJzCgpjZCAiJHtvdXRwdXRfZGlyX3RvcH0iCgpmb3IgZmlsZSBpbiAqOyBkbwogIGlmIFsgIiR7ZmlsZX0iICE9ICJjaGVja3N1bXMubWQ1IiBdOyB0aGVuCiAgICBtZDVzdW0gIiR7ZmlsZX0iID4+IGNoZWNrc3Vtcy5tZDUKICBmaQpkb25lCmBgYAoKIyBMaXN0IG91dHB1dCBmaWxlcwpgYGB7ciBsaXN0LW91dHB1dHMsIGVuZ2luZT0nYmFzaCcsIGV2YWw9VFJVRX0KIyBMb2FkIGJhc2ggdmFyaWFibGVzIGludG8gbWVtb3J5CnNvdXJjZSAuYmFzaHZhcnMKCmNkICIke291dHB1dF9kaXJfdG9wfSIKCmxzIC1saApgYGA=