{ "cells": [ { "cell_type": "markdown", "metadata": {}, "source": [ "# Characterizing the general methylation landscape" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "In this notebook, I will characterize the general methylation landscape. This will provide context I need to understand the significance of differentially methylated loci I obtain with `methylKit`. To characterize CpG methylation, I will use individual samples, as well as a union BEDgraph that concatenates all sample information.\n", "\n", "1. Concatenate coverage information\n", "2. Characterize methylation for each CpG dinucleotide in individual samples and union BEDgraph\n", "2. Determine genomic location of methylated, sparsely methylated, and unmethylated CpGs" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "## 0. Set working directory" ] }, { "cell_type": "code", "execution_count": 1, "metadata": { "collapsed": false }, "outputs": [ { "name": "stdout", "output_type": "stream", "text": [ "/Users/yaamini/Documents/project-oyster-oa/code/Haws\r\n" ] } ], "source": [ "!pwd" ] }, { "cell_type": "code", "execution_count": 2, "metadata": { "collapsed": false }, "outputs": [ { "name": "stdout", "output_type": "stream", "text": [ "/Users/yaamini/Documents/project-oyster-oa/analyses\n" ] } ], "source": [ "cd ../../analyses/" ] }, { "cell_type": "code", "execution_count": 3, "metadata": { "collapsed": true }, "outputs": [], "source": [ "#!mkdir Haws_06-methylation-landscape" ] }, { "cell_type": "code", "execution_count": 3, "metadata": { "collapsed": false, "scrolled": true }, "outputs": [ { "name": "stdout", "output_type": "stream", "text": [ "/Users/yaamini/Documents/project-oyster-oa/analyses/Haws_06-methylation-landscape\n" ] } ], "source": [ "cd Haws_06-methylation-landscape/" ] }, { "cell_type": "code", "execution_count": 4, "metadata": { "collapsed": false }, "outputs": [ { "name": "stdout", "output_type": "stream", "text": [ "0.18.1\n" ] } ], "source": [ "#Install pandas for this notebook\n", "import pandas as pd\n", "print(pd.__version__)" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "## 1. Obtain sample BEDgraphs" ] }, { "cell_type": "code", "execution_count": 6, "metadata": { "collapsed": false }, "outputs": [ { "name": "stdout", "output_type": "stream", "text": [ "--2021-05-17 20:50:07-- https://gannet.fish.washington.edu/spartina/project-oyster-oa/Haws/bismark-2/\n", "Resolving gannet.fish.washington.edu (gannet.fish.washington.edu)... 128.95.149.52\n", "Connecting to gannet.fish.washington.edu (gannet.fish.washington.edu)|128.95.149.52|:443... connected.\n", "WARNING: cannot verify gannet.fish.washington.edu's certificate, issued by ‘CN=InCommon RSA Server CA,OU=InCommon,O=Internet2,L=Ann Arbor,ST=MI,C=US’:\n", " Unable to locally verify the issuer's authority.\n", "HTTP request sent, awaiting response... 200 OK\n", "Length: unspecified [text/html]\n", "Saving to: ‘./index.html.tmp’\n", "\n", "index.html.tmp [ <=> ] 168.69K --.-KB/s in 0.004s \n", "\n", "2021-05-17 20:50:12 (45.3 MB/s) - ‘./index.html.tmp’ saved [172743]\n", "\n", "Loading robots.txt; please ignore errors.\n", "--2021-05-17 20:50:12-- https://gannet.fish.washington.edu/robots.txt\n", "Reusing existing connection to gannet.fish.washington.edu:443.\n", "HTTP request sent, awaiting response... 404 Not Found\n", "2021-05-17 20:50:12 ERROR 404: Not Found.\n", "\n", "Removing ./index.html.tmp since it should be rejected.\n", "\n", "--2021-05-17 20:50:12-- https://gannet.fish.washington.edu/spartina/project-oyster-oa/Haws/bismark-2/?C=N;O=D\n", "Reusing existing connection to gannet.fish.washington.edu:443.\n", "HTTP request sent, awaiting response... 200 OK\n", "Length: unspecified [text/html]\n", "Saving to: ‘./index.html?C=N;O=D.tmp’\n", "\n", "index.html?C=N;O=D. [ <=> ] 168.69K --.-KB/s in 0.003s \n", "\n", "2021-05-17 20:50:16 (59.2 MB/s) - ‘./index.html?C=N;O=D.tmp’ saved [172743]\n", "\n", "Removing ./index.html?C=N;O=D.tmp since it should be rejected.\n", "\n", "--2021-05-17 20:50:16-- https://gannet.fish.washington.edu/spartina/project-oyster-oa/Haws/bismark-2/?C=M;O=A\n", "Reusing existing connection to gannet.fish.washington.edu:443.\n", "HTTP request sent, awaiting response... 200 OK\n", "Length: unspecified [text/html]\n", "Saving to: ‘./index.html?C=M;O=A.tmp’\n", "\n", "index.html?C=M;O=A. [ <=> ] 168.69K --.-KB/s in 0.003s \n", "\n", "2021-05-17 20:50:21 (57.9 MB/s) - ‘./index.html?C=M;O=A.tmp’ saved [172743]\n", "\n", "Removing ./index.html?C=M;O=A.tmp since it should be rejected.\n", "\n", "--2021-05-17 20:50:21-- https://gannet.fish.washington.edu/spartina/project-oyster-oa/Haws/bismark-2/?C=S;O=A\n", "Reusing existing connection to gannet.fish.washington.edu:443.\n", "HTTP request sent, awaiting response... 200 OK\n", "Length: unspecified [text/html]\n", "Saving to: ‘./index.html?C=S;O=A.tmp’\n", "\n", "index.html?C=S;O=A. [ <=> ] 168.69K --.-KB/s in 0.003s \n", "\n", "2021-05-17 20:50:26 (54.9 MB/s) - ‘./index.html?C=S;O=A.tmp’ saved [172743]\n", "\n", "Removing ./index.html?C=S;O=A.tmp since it should be rejected.\n", "\n", "--2021-05-17 20:50:26-- https://gannet.fish.washington.edu/spartina/project-oyster-oa/Haws/bismark-2/?C=D;O=A\n", "Reusing existing connection to gannet.fish.washington.edu:443.\n", "HTTP request sent, awaiting response... 200 OK\n", "Length: unspecified [text/html]\n", "Saving to: ‘./index.html?C=D;O=A.tmp’\n", "\n", "index.html?C=D;O=A. [ <=> ] 168.69K --.-KB/s in 0.003s \n", "\n", "2021-05-17 20:50:30 (52.7 MB/s) - ‘./index.html?C=D;O=A.tmp’ saved [172743]\n", "\n", "Removing ./index.html?C=D;O=A.tmp since it should be rejected.\n", "\n", "--2021-05-17 20:50:30-- https://gannet.fish.washington.edu/spartina/project-oyster-oa/Haws/bismark-2/bismark_summary_report.html\n", "Reusing existing connection to gannet.fish.washington.edu:443.\n", "HTTP request sent, awaiting response... 200 OK\n", "Length: 3192063 (3.0M) [text/html]\n", "Saving to: ‘./bismark_summary_report.html.tmp’\n", "\n", "bismark_summary_rep 100%[===================>] 3.04M --.-KB/s in 0.06s \n", "\n", "2021-05-17 20:50:30 (49.8 MB/s) - ‘./bismark_summary_report.html.tmp’ saved [3192063/3192063]\n", "\n", "Removing ./bismark_summary_report.html.tmp since it should be rejected.\n", "\n", "--2021-05-17 20:50:31-- https://gannet.fish.washington.edu/spartina/project-oyster-oa/Haws/bismark-2/multiqc_data/\n", "Reusing existing connection to gannet.fish.washington.edu:443.\n", "HTTP request sent, awaiting response... 200 OK\n", "Length: 2171 (2.1K) [text/html]\n", "Saving to: ‘./index.html.tmp’\n", "\n", "index.html.tmp 100%[===================>] 2.12K --.-KB/s in 0s \n", "\n", "2021-05-17 20:50:31 (2.02 GB/s) - ‘./index.html.tmp’ saved [2171/2171]\n", "\n", "Removing ./index.html.tmp since it should be rejected.\n", "\n", "--2021-05-17 20:50:31-- https://gannet.fish.washington.edu/spartina/project-oyster-oa/Haws/bismark-2/multiqc_report.html\n", "Reusing existing connection to gannet.fish.washington.edu:443.\n", "HTTP request sent, awaiting response... 200 OK\n", "Length: 1448633 (1.4M) [text/html]\n", "Saving to: ‘./multiqc_report.html.tmp’\n", "\n", "multiqc_report.html 100%[===================>] 1.38M --.-KB/s in 0.03s \n", "\n", "2021-05-17 20:50:31 (50.3 MB/s) - ‘./multiqc_report.html.tmp’ saved [1448633/1448633]\n", "\n", "Removing ./multiqc_report.html.tmp since it should be rejected.\n", "\n", "--2021-05-17 20:50:31-- https://gannet.fish.washington.edu/spartina/project-oyster-oa/Haws/bismark-2/zr3644_1_R1_val_1_val_1_val_1_bismark_bt2_PE_report.html\n", "Reusing existing connection to gannet.fish.washington.edu:443.\n", "HTTP request sent, awaiting response... 200 OK\n", "Length: 3162787 (3.0M) [text/html]\n", "Saving to: ‘./zr3644_1_R1_val_1_val_1_val_1_bismark_bt2_PE_report.html.tmp’\n", "\n", "zr3644_1_R1_val_1_v 100%[===================>] 3.02M --.-KB/s in 0.05s \n", "\n", "2021-05-17 20:50:31 (59.3 MB/s) - ‘./zr3644_1_R1_val_1_val_1_val_1_bismark_bt2_PE_report.html.tmp’ saved [3162787/3162787]\n", "\n", "Removing ./zr3644_1_R1_val_1_val_1_val_1_bismark_bt2_PE_report.html.tmp since it should be rejected.\n", "\n", "--2021-05-17 20:50:31-- https://gannet.fish.washington.edu/spartina/project-oyster-oa/Haws/bismark-2/zr3644_1_R1_val_1_val_1_val_1_bismark_bt2_pe._5x.bedgraph\n", "Reusing existing connection to gannet.fish.washington.edu:443.\n", "HTTP request sent, awaiting response... 200 OK\n", "Length: 238066536 (227M)\n", "Saving to: ‘./zr3644_1_R1_val_1_val_1_val_1_bismark_bt2_pe._5x.bedgraph’\n", "\n", "zr3644_1_R1_val_1_v 100%[===================>] 227.04M 77.0MB/s in 2.9s \n", "\n", "2021-05-17 20:50:34 (77.0 MB/s) - ‘./zr3644_1_R1_val_1_val_1_val_1_bismark_bt2_pe._5x.bedgraph’ saved [238066536/238066536]\n", "\n", "--2021-05-17 20:50:34-- https://gannet.fish.washington.edu/spartina/project-oyster-oa/Haws/bismark-2/zr3644_2_R1_val_1_val_1_val_1_bismark_bt2_PE_report.html\n", "Reusing existing connection to gannet.fish.washington.edu:443.\n", "HTTP request sent, awaiting response... 200 OK\n", "Length: 3162899 (3.0M) [text/html]\n", "Saving to: ‘./zr3644_2_R1_val_1_val_1_val_1_bismark_bt2_PE_report.html.tmp’\n", "\n", "zr3644_2_R1_val_1_v 100%[===================>] 3.02M --.-KB/s in 0.1s \n", "\n", "2021-05-17 20:50:34 (31.0 MB/s) - ‘./zr3644_2_R1_val_1_val_1_val_1_bismark_bt2_PE_report.html.tmp’ saved [3162899/3162899]\n", "\n", "Removing ./zr3644_2_R1_val_1_val_1_val_1_bismark_bt2_PE_report.html.tmp since it should be rejected.\n", "\n", "--2021-05-17 20:50:34-- https://gannet.fish.washington.edu/spartina/project-oyster-oa/Haws/bismark-2/zr3644_2_R1_val_1_val_1_val_1_bismark_bt2_pe._5x.bedgraph\n", "Reusing existing connection to gannet.fish.washington.edu:443.\n", "HTTP request sent, awaiting response... 200 OK\n", "Length: 259414417 (247M)\n", "Saving to: ‘./zr3644_2_R1_val_1_val_1_val_1_bismark_bt2_pe._5x.bedgraph’\n", "\n", "zr3644_2_R1_val_1_v 100%[===================>] 247.40M 70.3MB/s in 3.5s \n", "\n", "2021-05-17 20:50:38 (70.3 MB/s) - ‘./zr3644_2_R1_val_1_val_1_val_1_bismark_bt2_pe._5x.bedgraph’ saved [259414417/259414417]\n", "\n", "--2021-05-17 20:50:38-- https://gannet.fish.washington.edu/spartina/project-oyster-oa/Haws/bismark-2/zr3644_3_R1_val_1_val_1_val_1_bismark_bt2_PE_report.html\n", "Reusing existing connection to gannet.fish.washington.edu:443.\n", "HTTP request sent, awaiting response... 200 OK\n", "Length: 3162848 (3.0M) [text/html]\n", "Saving to: ‘./zr3644_3_R1_val_1_val_1_val_1_bismark_bt2_PE_report.html.tmp’\n", "\n", "zr3644_3_R1_val_1_v 100%[===================>] 3.02M --.-KB/s in 0.09s \n", "\n", "2021-05-17 20:50:38 (32.2 MB/s) - ‘./zr3644_3_R1_val_1_val_1_val_1_bismark_bt2_PE_report.html.tmp’ saved [3162848/3162848]\n", "\n", "Removing ./zr3644_3_R1_val_1_val_1_val_1_bismark_bt2_PE_report.html.tmp since it should be rejected.\n", "\n", "--2021-05-17 20:50:38-- https://gannet.fish.washington.edu/spartina/project-oyster-oa/Haws/bismark-2/zr3644_3_R1_val_1_val_1_val_1_bismark_bt2_pe._5x.bedgraph\n", "Reusing existing connection to gannet.fish.washington.edu:443.\n", "HTTP request sent, awaiting response... 200 OK\n", "Length: 213140550 (203M)\n", "Saving to: ‘./zr3644_3_R1_val_1_val_1_val_1_bismark_bt2_pe._5x.bedgraph’\n", "\n", "zr3644_3_R1_val_1_v 100%[===================>] 203.27M 75.6MB/s in 2.7s \n", "\n", "2021-05-17 20:50:41 (75.6 MB/s) - ‘./zr3644_3_R1_val_1_val_1_val_1_bismark_bt2_pe._5x.bedgraph’ saved [213140550/213140550]\n", "\n", "--2021-05-17 20:50:41-- https://gannet.fish.washington.edu/spartina/project-oyster-oa/Haws/bismark-2/zr3644_4_R1_val_1_val_1_val_1_bismark_bt2_PE_report.html\n", "Reusing existing connection to gannet.fish.washington.edu:443.\n", "HTTP request sent, awaiting response... 200 OK\n", "Length: 3162857 (3.0M) [text/html]\n", "Saving to: ‘./zr3644_4_R1_val_1_val_1_val_1_bismark_bt2_PE_report.html.tmp’\n", "\n", "zr3644_4_R1_val_1_v 100%[===================>] 3.02M 19.9MB/s in 0.2s \n", "\n", "2021-05-17 20:50:41 (19.9 MB/s) - ‘./zr3644_4_R1_val_1_val_1_val_1_bismark_bt2_PE_report.html.tmp’ saved [3162857/3162857]\n", "\n", "Removing ./zr3644_4_R1_val_1_val_1_val_1_bismark_bt2_PE_report.html.tmp since it should be rejected.\n", "\n", "--2021-05-17 20:50:41-- https://gannet.fish.washington.edu/spartina/project-oyster-oa/Haws/bismark-2/zr3644_4_R1_val_1_val_1_val_1_bismark_bt2_pe._5x.bedgraph\n", "Reusing existing connection to gannet.fish.washington.edu:443.\n", "HTTP request sent, awaiting response... 200 OK\n", "Length: 246432525 (235M)\n", "Saving to: ‘./zr3644_4_R1_val_1_val_1_val_1_bismark_bt2_pe._5x.bedgraph’\n", "\n", "zr3644_4_R1_val_1_v 100%[===================>] 235.02M 68.8MB/s in 3.5s \n", "\n", "2021-05-17 20:50:45 (67.5 MB/s) - ‘./zr3644_4_R1_val_1_val_1_val_1_bismark_bt2_pe._5x.bedgraph’ saved [246432525/246432525]\n", "\n", "--2021-05-17 20:50:45-- https://gannet.fish.washington.edu/spartina/project-oyster-oa/Haws/bismark-2/zr3644_5_R1_val_1_val_1_val_1_bismark_bt2_PE_report.html\n", "Reusing existing connection to gannet.fish.washington.edu:443.\n", "HTTP request sent, awaiting response... 200 OK\n", "Length: 3162890 (3.0M) [text/html]\n", "Saving to: ‘./zr3644_5_R1_val_1_val_1_val_1_bismark_bt2_PE_report.html.tmp’\n", "\n", "zr3644_5_R1_val_1_v 100%[===================>] 3.02M 17.1MB/s in 0.2s \n", "\n", "2021-05-17 20:50:45 (17.1 MB/s) - ‘./zr3644_5_R1_val_1_val_1_val_1_bismark_bt2_PE_report.html.tmp’ saved [3162890/3162890]\n", "\n", "Removing ./zr3644_5_R1_val_1_val_1_val_1_bismark_bt2_PE_report.html.tmp since it should be rejected.\n", "\n", "--2021-05-17 20:50:45-- https://gannet.fish.washington.edu/spartina/project-oyster-oa/Haws/bismark-2/zr3644_5_R1_val_1_val_1_val_1_bismark_bt2_pe._5x.bedgraph\n", "Reusing existing connection to gannet.fish.washington.edu:443.\n", "HTTP request sent, awaiting response... 200 OK\n", "Length: 231531553 (221M)\n", "Saving to: ‘./zr3644_5_R1_val_1_val_1_val_1_bismark_bt2_pe._5x.bedgraph’\n", "\n", "zr3644_5_R1_val_1_v 100%[===================>] 220.81M 77.3MB/s in 2.9s \n", "\n", "2021-05-17 20:50:48 (77.3 MB/s) - ‘./zr3644_5_R1_val_1_val_1_val_1_bismark_bt2_pe._5x.bedgraph’ saved [231531553/231531553]\n", "\n", "--2021-05-17 20:50:48-- https://gannet.fish.washington.edu/spartina/project-oyster-oa/Haws/bismark-2/zr3644_6_R1_val_1_val_1_val_1_bismark_bt2_PE_report.html\n", "Reusing existing connection to gannet.fish.washington.edu:443.\n", "HTTP request sent, awaiting response... 200 OK\n", "Length: 3162907 (3.0M) [text/html]\n", "Saving to: ‘./zr3644_6_R1_val_1_val_1_val_1_bismark_bt2_PE_report.html.tmp’\n", "\n", "zr3644_6_R1_val_1_v 100%[===================>] 3.02M --.-KB/s in 0.1s \n", "\n", "2021-05-17 20:50:49 (24.0 MB/s) - ‘./zr3644_6_R1_val_1_val_1_val_1_bismark_bt2_PE_report.html.tmp’ saved [3162907/3162907]\n", "\n", "Removing ./zr3644_6_R1_val_1_val_1_val_1_bismark_bt2_PE_report.html.tmp since it should be rejected.\n", "\n", "--2021-05-17 20:50:49-- https://gannet.fish.washington.edu/spartina/project-oyster-oa/Haws/bismark-2/zr3644_6_R1_val_1_val_1_val_1_bismark_bt2_pe._5x.bedgraph\n", "Reusing existing connection to gannet.fish.washington.edu:443.\n", "HTTP request sent, awaiting response... 200 OK\n", "Length: 255016574 (243M)\n", "Saving to: ‘./zr3644_6_R1_val_1_val_1_val_1_bismark_bt2_pe._5x.bedgraph’\n", "\n", "zr3644_6_R1_val_1_v 100%[===================>] 243.20M 76.5MB/s in 3.2s \n", "\n", "2021-05-17 20:50:52 (76.5 MB/s) - ‘./zr3644_6_R1_val_1_val_1_val_1_bismark_bt2_pe._5x.bedgraph’ saved [255016574/255016574]\n", "\n", "--2021-05-17 20:50:52-- https://gannet.fish.washington.edu/spartina/project-oyster-oa/Haws/bismark-2/zr3644_7_R1_val_1_val_1_val_1_bismark_bt2_PE_report.html\n", "Reusing existing connection to gannet.fish.washington.edu:443.\n", "HTTP request sent, awaiting response... 200 OK\n", "Length: 3163096 (3.0M) [text/html]\n", "Saving to: ‘./zr3644_7_R1_val_1_val_1_val_1_bismark_bt2_PE_report.html.tmp’\n", "\n", "zr3644_7_R1_val_1_v 100%[===================>] 3.02M --.-KB/s in 0.07s \n", "\n", "2021-05-17 20:50:52 (42.7 MB/s) - ‘./zr3644_7_R1_val_1_val_1_val_1_bismark_bt2_PE_report.html.tmp’ saved [3163096/3163096]\n", "\n", "Removing ./zr3644_7_R1_val_1_val_1_val_1_bismark_bt2_PE_report.html.tmp since it should be rejected.\n", "\n", "--2021-05-17 20:50:52-- https://gannet.fish.washington.edu/spartina/project-oyster-oa/Haws/bismark-2/zr3644_7_R1_val_1_val_1_val_1_bismark_bt2_pe._5x.bedgraph\n", "Reusing existing connection to gannet.fish.washington.edu:443.\n", "HTTP request sent, awaiting response... 200 OK\n", "Length: 312303076 (298M)\n", "Saving to: ‘./zr3644_7_R1_val_1_val_1_val_1_bismark_bt2_pe._5x.bedgraph’\n", "\n", "zr3644_7_R1_val_1_v 100%[===================>] 297.83M 76.8MB/s in 3.9s \n", "\n", "2021-05-17 20:50:56 (75.9 MB/s) - ‘./zr3644_7_R1_val_1_val_1_val_1_bismark_bt2_pe._5x.bedgraph’ saved [312303076/312303076]\n", "\n", "--2021-05-17 20:50:56-- https://gannet.fish.washington.edu/spartina/project-oyster-oa/Haws/bismark-2/zr3644_8_R1_val_1_val_1_val_1_bismark_bt2_PE_report.html\n", "Reusing existing connection to gannet.fish.washington.edu:443.\n", "HTTP request sent, awaiting response... 200 OK\n", "Length: 3162893 (3.0M) [text/html]\n", "Saving to: ‘./zr3644_8_R1_val_1_val_1_val_1_bismark_bt2_PE_report.html.tmp’\n", "\n", "zr3644_8_R1_val_1_v 100%[===================>] 3.02M --.-KB/s in 0.06s \n", "\n", "2021-05-17 20:50:57 (48.8 MB/s) - ‘./zr3644_8_R1_val_1_val_1_val_1_bismark_bt2_PE_report.html.tmp’ saved [3162893/3162893]\n", "\n", "Removing ./zr3644_8_R1_val_1_val_1_val_1_bismark_bt2_PE_report.html.tmp since it should be rejected.\n", "\n", "--2021-05-17 20:50:57-- https://gannet.fish.washington.edu/spartina/project-oyster-oa/Haws/bismark-2/zr3644_8_R1_val_1_val_1_val_1_bismark_bt2_pe._5x.bedgraph\n", "Reusing existing connection to gannet.fish.washington.edu:443.\n", "HTTP request sent, awaiting response... 200 OK\n", "Length: 223070567 (213M)\n", "Saving to: ‘./zr3644_8_R1_val_1_val_1_val_1_bismark_bt2_pe._5x.bedgraph’\n", "\n", "zr3644_8_R1_val_1_v 100%[===================>] 212.74M 82.8MB/s in 2.6s \n", "\n", "2021-05-17 20:50:59 (82.8 MB/s) - ‘./zr3644_8_R1_val_1_val_1_val_1_bismark_bt2_pe._5x.bedgraph’ saved [223070567/223070567]\n", "\n", "--2021-05-17 20:50:59-- https://gannet.fish.washington.edu/spartina/project-oyster-oa/Haws/bismark-2/zr3644_9_R1_val_1_val_1_val_1_bismark_bt2_PE_report.html\n", "Reusing existing connection to gannet.fish.washington.edu:443.\n", "HTTP request sent, awaiting response... 200 OK\n", "Length: 3162880 (3.0M) [text/html]\n", "Saving to: ‘./zr3644_9_R1_val_1_val_1_val_1_bismark_bt2_PE_report.html.tmp’\n", "\n", "zr3644_9_R1_val_1_v 100%[===================>] 3.02M --.-KB/s in 0.07s \n", "\n", "2021-05-17 20:51:00 (42.7 MB/s) - ‘./zr3644_9_R1_val_1_val_1_val_1_bismark_bt2_PE_report.html.tmp’ saved [3162880/3162880]\n", "\n", "Removing ./zr3644_9_R1_val_1_val_1_val_1_bismark_bt2_PE_report.html.tmp since it should be rejected.\n", "\n", "--2021-05-17 20:51:00-- https://gannet.fish.washington.edu/spartina/project-oyster-oa/Haws/bismark-2/zr3644_9_R1_val_1_val_1_val_1_bismark_bt2_pe._5x.bedgraph\n", "Reusing existing connection to gannet.fish.washington.edu:443.\n", "HTTP request sent, awaiting response... 200 OK\n", "Length: 228264905 (218M)\n", "Saving to: ‘./zr3644_9_R1_val_1_val_1_val_1_bismark_bt2_pe._5x.bedgraph’\n", "\n", "zr3644_9_R1_val_1_v 100%[===================>] 217.69M 83.9MB/s in 2.6s \n", "\n", "2021-05-17 20:51:02 (83.9 MB/s) - ‘./zr3644_9_R1_val_1_val_1_val_1_bismark_bt2_pe._5x.bedgraph’ saved [228264905/228264905]\n", "\n", "--2021-05-17 20:51:02-- https://gannet.fish.washington.edu/spartina/project-oyster-oa/Haws/bismark-2/zr3644_10_R1_val_1_val_1_val_1_bismark_bt2_PE_report.html\n", "Reusing existing connection to gannet.fish.washington.edu:443.\n", "HTTP request sent, awaiting response... 200 OK\n", "Length: 3162948 (3.0M) [text/html]\n", "Saving to: ‘./zr3644_10_R1_val_1_val_1_val_1_bismark_bt2_PE_report.html.tmp’\n", "\n", "zr3644_10_R1_val_1_ 100%[===================>] 3.02M --.-KB/s in 0.07s \n", "\n", "2021-05-17 20:51:03 (41.1 MB/s) - ‘./zr3644_10_R1_val_1_val_1_val_1_bismark_bt2_PE_report.html.tmp’ saved [3162948/3162948]\n", "\n", "Removing ./zr3644_10_R1_val_1_val_1_val_1_bismark_bt2_PE_report.html.tmp since it should be rejected.\n", "\n", "--2021-05-17 20:51:03-- https://gannet.fish.washington.edu/spartina/project-oyster-oa/Haws/bismark-2/zr3644_10_R1_val_1_val_1_val_1_bismark_bt2_pe._5x.bedgraph\n", "Reusing existing connection to gannet.fish.washington.edu:443.\n", "HTTP request sent, awaiting response... 200 OK\n", "Length: 251375395 (240M)\n", "Saving to: ‘./zr3644_10_R1_val_1_val_1_val_1_bismark_bt2_pe._5x.bedgraph’\n", "\n", "zr3644_10_R1_val_1_ 100%[===================>] 239.73M 78.6MB/s in 3.1s \n", "\n", "2021-05-17 20:51:06 (78.6 MB/s) - ‘./zr3644_10_R1_val_1_val_1_val_1_bismark_bt2_pe._5x.bedgraph’ saved [251375395/251375395]\n", "\n", "--2021-05-17 20:51:06-- https://gannet.fish.washington.edu/spartina/project-oyster-oa/Haws/bismark-2/zr3644_11_R1_val_1_val_1_val_1_bismark_bt2_PE_report.html\n", "Reusing existing connection to gannet.fish.washington.edu:443.\n", "HTTP request sent, awaiting response... 200 OK\n", "Length: 3162937 (3.0M) [text/html]\n", "Saving to: ‘./zr3644_11_R1_val_1_val_1_val_1_bismark_bt2_PE_report.html.tmp’\n", "\n", "zr3644_11_R1_val_1_ 100%[===================>] 3.02M --.-KB/s in 0.09s \n", "\n", "2021-05-17 20:51:06 (34.2 MB/s) - ‘./zr3644_11_R1_val_1_val_1_val_1_bismark_bt2_PE_report.html.tmp’ saved [3162937/3162937]\n", "\n", "Removing ./zr3644_11_R1_val_1_val_1_val_1_bismark_bt2_PE_report.html.tmp since it should be rejected.\n", "\n", "--2021-05-17 20:51:06-- https://gannet.fish.washington.edu/spartina/project-oyster-oa/Haws/bismark-2/zr3644_11_R1_val_1_val_1_val_1_bismark_bt2_pe._5x.bedgraph\n", "Reusing existing connection to gannet.fish.washington.edu:443.\n", "HTTP request sent, awaiting response... 200 OK\n", "Length: 263423261 (251M)\n", "Saving to: ‘./zr3644_11_R1_val_1_val_1_val_1_bismark_bt2_pe._5x.bedgraph’\n", "\n", "zr3644_11_R1_val_1_ 100%[===================>] 251.22M 78.4MB/s in 3.3s \n", "\n", "2021-05-17 20:51:09 (75.5 MB/s) - ‘./zr3644_11_R1_val_1_val_1_val_1_bismark_bt2_pe._5x.bedgraph’ saved [263423261/263423261]\n", "\n", "--2021-05-17 20:51:10-- https://gannet.fish.washington.edu/spartina/project-oyster-oa/Haws/bismark-2/zr3644_12_R1_val_1_val_1_val_1_bismark_bt2_PE_report.html\n", "Reusing existing connection to gannet.fish.washington.edu:443.\n", "HTTP request sent, awaiting response... 200 OK\n", "Length: 3162813 (3.0M) [text/html]\n", "Saving to: ‘./zr3644_12_R1_val_1_val_1_val_1_bismark_bt2_PE_report.html.tmp’\n", "\n", "zr3644_12_R1_val_1_ 100%[===================>] 3.02M --.-KB/s in 0.05s \n", "\n", "2021-05-17 20:51:10 (61.0 MB/s) - ‘./zr3644_12_R1_val_1_val_1_val_1_bismark_bt2_PE_report.html.tmp’ saved [3162813/3162813]\n", "\n", "Removing ./zr3644_12_R1_val_1_val_1_val_1_bismark_bt2_PE_report.html.tmp since it should be rejected.\n", "\n", "--2021-05-17 20:51:10-- https://gannet.fish.washington.edu/spartina/project-oyster-oa/Haws/bismark-2/zr3644_12_R1_val_1_val_1_val_1_bismark_bt2_pe._5x.bedgraph\n", "Reusing existing connection to gannet.fish.washington.edu:443.\n", "HTTP request sent, awaiting response... 200 OK\n", "Length: 238209851 (227M)\n", "Saving to: ‘./zr3644_12_R1_val_1_val_1_val_1_bismark_bt2_pe._5x.bedgraph’\n", "\n", "zr3644_12_R1_val_1_ 100%[===================>] 227.17M 74.1MB/s in 3.1s \n", "\n", "2021-05-17 20:51:13 (74.1 MB/s) - ‘./zr3644_12_R1_val_1_val_1_val_1_bismark_bt2_pe._5x.bedgraph’ saved [238209851/238209851]\n", "\n", "--2021-05-17 20:51:13-- https://gannet.fish.washington.edu/spartina/project-oyster-oa/Haws/bismark-2/zr3644_13_R1_val_1_val_1_val_1_bismark_bt2_PE_report.html\n", "Reusing existing connection to gannet.fish.washington.edu:443.\n", "HTTP request sent, awaiting response... 200 OK\n", "Length: 3162939 (3.0M) [text/html]\n", "Saving to: ‘./zr3644_13_R1_val_1_val_1_val_1_bismark_bt2_PE_report.html.tmp’\n", "\n", "zr3644_13_R1_val_1_ 100%[===================>] 3.02M 19.0MB/s in 0.2s \n", "\n", "2021-05-17 20:51:13 (19.0 MB/s) - ‘./zr3644_13_R1_val_1_val_1_val_1_bismark_bt2_PE_report.html.tmp’ saved [3162939/3162939]\n", "\n", "Removing ./zr3644_13_R1_val_1_val_1_val_1_bismark_bt2_PE_report.html.tmp since it should be rejected.\n", "\n", "--2021-05-17 20:51:13-- https://gannet.fish.washington.edu/spartina/project-oyster-oa/Haws/bismark-2/zr3644_13_R1_val_1_val_1_val_1_bismark_bt2_pe._5x.bedgraph\n", "Reusing existing connection to gannet.fish.washington.edu:443.\n", "HTTP request sent, awaiting response... 200 OK\n", "Length: 259038737 (247M)\n", "Saving to: ‘./zr3644_13_R1_val_1_val_1_val_1_bismark_bt2_pe._5x.bedgraph’\n", "\n", "zr3644_13_R1_val_1_ 100%[===================>] 247.04M 77.7MB/s in 3.3s \n", "\n", "2021-05-17 20:51:17 (75.7 MB/s) - ‘./zr3644_13_R1_val_1_val_1_val_1_bismark_bt2_pe._5x.bedgraph’ saved [259038737/259038737]\n", "\n", "--2021-05-17 20:51:17-- https://gannet.fish.washington.edu/spartina/project-oyster-oa/Haws/bismark-2/zr3644_14_R1_val_1_val_1_val_1_bismark_bt2_PE_report.html\n", "Reusing existing connection to gannet.fish.washington.edu:443.\n", "HTTP request sent, awaiting response... 200 OK\n", "Length: 3162943 (3.0M) [text/html]\n", "Saving to: ‘./zr3644_14_R1_val_1_val_1_val_1_bismark_bt2_PE_report.html.tmp’\n", "\n", "zr3644_14_R1_val_1_ 100%[===================>] 3.02M --.-KB/s in 0.1s \n", "\n", "2021-05-17 20:51:17 (21.0 MB/s) - ‘./zr3644_14_R1_val_1_val_1_val_1_bismark_bt2_PE_report.html.tmp’ saved [3162943/3162943]\n", "\n", "Removing ./zr3644_14_R1_val_1_val_1_val_1_bismark_bt2_PE_report.html.tmp since it should be rejected.\n", "\n", "--2021-05-17 20:51:17-- https://gannet.fish.washington.edu/spartina/project-oyster-oa/Haws/bismark-2/zr3644_14_R1_val_1_val_1_val_1_bismark_bt2_pe._5x.bedgraph\n", "Reusing existing connection to gannet.fish.washington.edu:443.\n", "HTTP request sent, awaiting response... 200 OK\n", "Length: 200672316 (191M)\n", "Saving to: ‘./zr3644_14_R1_val_1_val_1_val_1_bismark_bt2_pe._5x.bedgraph’\n", "\n", "zr3644_14_R1_val_1_ 100%[===================>] 191.38M 77.1MB/s in 2.5s \n", "\n", "2021-05-17 20:51:20 (77.1 MB/s) - ‘./zr3644_14_R1_val_1_val_1_val_1_bismark_bt2_pe._5x.bedgraph’ saved [200672316/200672316]\n", "\n", "--2021-05-17 20:51:20-- https://gannet.fish.washington.edu/spartina/project-oyster-oa/Haws/bismark-2/zr3644_15_R1_val_1_val_1_val_1_bismark_bt2_PE_report.html\n", "Reusing existing connection to gannet.fish.washington.edu:443.\n", "HTTP request sent, awaiting response... 200 OK\n", "Length: 3163012 (3.0M) [text/html]\n", "Saving to: ‘./zr3644_15_R1_val_1_val_1_val_1_bismark_bt2_PE_report.html.tmp’\n", "\n", "zr3644_15_R1_val_1_ 100%[===================>] 3.02M 13.4MB/s in 0.2s \n", "\n", "2021-05-17 20:51:20 (13.4 MB/s) - ‘./zr3644_15_R1_val_1_val_1_val_1_bismark_bt2_PE_report.html.tmp’ saved [3163012/3163012]\n", "\n", "Removing ./zr3644_15_R1_val_1_val_1_val_1_bismark_bt2_PE_report.html.tmp since it should be rejected.\n", "\n", "--2021-05-17 20:51:20-- https://gannet.fish.washington.edu/spartina/project-oyster-oa/Haws/bismark-2/zr3644_15_R1_val_1_val_1_val_1_bismark_bt2_pe._5x.bedgraph\n", "Reusing existing connection to gannet.fish.washington.edu:443.\n", "HTTP request sent, awaiting response... 200 OK\n", "Length: 270532466 (258M)\n", "Saving to: ‘./zr3644_15_R1_val_1_val_1_val_1_bismark_bt2_pe._5x.bedgraph’\n", "\n", "zr3644_15_R1_val_1_ 100%[===================>] 258.00M 78.3MB/s in 3.4s \n", "\n", "2021-05-17 20:51:23 (75.8 MB/s) - ‘./zr3644_15_R1_val_1_val_1_val_1_bismark_bt2_pe._5x.bedgraph’ saved [270532466/270532466]\n", "\n", "--2021-05-17 20:51:24-- https://gannet.fish.washington.edu/spartina/project-oyster-oa/Haws/bismark-2/zr3644_16_R1_val_1_val_1_val_1_bismark_bt2_PE_report.html\n", "Reusing existing connection to gannet.fish.washington.edu:443.\n", "HTTP request sent, awaiting response... 200 OK\n", "Length: 3162888 (3.0M) [text/html]\n", "Saving to: ‘./zr3644_16_R1_val_1_val_1_val_1_bismark_bt2_PE_report.html.tmp’\n", "\n", "zr3644_16_R1_val_1_ 100%[===================>] 3.02M --.-KB/s in 0.1s \n", "\n", "2021-05-17 20:51:24 (27.9 MB/s) - ‘./zr3644_16_R1_val_1_val_1_val_1_bismark_bt2_PE_report.html.tmp’ saved [3162888/3162888]\n", "\n", "Removing ./zr3644_16_R1_val_1_val_1_val_1_bismark_bt2_PE_report.html.tmp since it should be rejected.\n", "\n", "--2021-05-17 20:51:24-- https://gannet.fish.washington.edu/spartina/project-oyster-oa/Haws/bismark-2/zr3644_16_R1_val_1_val_1_val_1_bismark_bt2_pe._5x.bedgraph\n", "Reusing existing connection to gannet.fish.washington.edu:443.\n", "HTTP request sent, awaiting response... 200 OK\n", "Length: 237862261 (227M)\n", "Saving to: ‘./zr3644_16_R1_val_1_val_1_val_1_bismark_bt2_pe._5x.bedgraph’\n", "\n", "zr3644_16_R1_val_1_ 100%[===================>] 226.84M 74.9MB/s in 3.0s \n", "\n", "2021-05-17 20:51:27 (74.9 MB/s) - ‘./zr3644_16_R1_val_1_val_1_val_1_bismark_bt2_pe._5x.bedgraph’ saved [237862261/237862261]\n", "\n", "--2021-05-17 20:51:27-- https://gannet.fish.washington.edu/spartina/project-oyster-oa/Haws/bismark-2/zr3644_17_R1_val_1_val_1_val_1_bismark_bt2_PE_report.html\n", "Reusing existing connection to gannet.fish.washington.edu:443.\n", "HTTP request sent, awaiting response... 200 OK\n", "Length: 3162911 (3.0M) [text/html]\n", "Saving to: ‘./zr3644_17_R1_val_1_val_1_val_1_bismark_bt2_PE_report.html.tmp’\n", "\n", "zr3644_17_R1_val_1_ 100%[===================>] 3.02M --.-KB/s in 0.07s \n", "\n", "2021-05-17 20:51:27 (45.3 MB/s) - ‘./zr3644_17_R1_val_1_val_1_val_1_bismark_bt2_PE_report.html.tmp’ saved [3162911/3162911]\n", "\n", "Removing ./zr3644_17_R1_val_1_val_1_val_1_bismark_bt2_PE_report.html.tmp since it should be rejected.\n", "\n", "--2021-05-17 20:51:27-- https://gannet.fish.washington.edu/spartina/project-oyster-oa/Haws/bismark-2/zr3644_17_R1_val_1_val_1_val_1_bismark_bt2_pe._5x.bedgraph\n", "Reusing existing connection to gannet.fish.washington.edu:443.\n", "HTTP request sent, awaiting response... 200 OK\n", "Length: 255762365 (244M)\n", "Saving to: ‘./zr3644_17_R1_val_1_val_1_val_1_bismark_bt2_pe._5x.bedgraph’\n", "\n", "zr3644_17_R1_val_1_ 100%[===================>] 243.91M 78.2MB/s in 3.1s \n", "\n", "2021-05-17 20:51:30 (78.2 MB/s) - ‘./zr3644_17_R1_val_1_val_1_val_1_bismark_bt2_pe._5x.bedgraph’ saved [255762365/255762365]\n", "\n", "--2021-05-17 20:51:31-- https://gannet.fish.washington.edu/spartina/project-oyster-oa/Haws/bismark-2/zr3644_18_R1_val_1_val_1_val_1_bismark_bt2_PE_report.html\n", "Reusing existing connection to gannet.fish.washington.edu:443.\n", "HTTP request sent, awaiting response... 200 OK\n", "Length: 3162932 (3.0M) [text/html]\n", "Saving to: ‘./zr3644_18_R1_val_1_val_1_val_1_bismark_bt2_PE_report.html.tmp’\n", "\n", "zr3644_18_R1_val_1_ 100%[===================>] 3.02M --.-KB/s in 0.1s \n", "\n", "2021-05-17 20:51:31 (26.6 MB/s) - ‘./zr3644_18_R1_val_1_val_1_val_1_bismark_bt2_PE_report.html.tmp’ saved [3162932/3162932]\n", "\n", "Removing ./zr3644_18_R1_val_1_val_1_val_1_bismark_bt2_PE_report.html.tmp since it should be rejected.\n", "\n", "--2021-05-17 20:51:31-- https://gannet.fish.washington.edu/spartina/project-oyster-oa/Haws/bismark-2/zr3644_18_R1_val_1_val_1_val_1_bismark_bt2_pe._5x.bedgraph\n", "Reusing existing connection to gannet.fish.washington.edu:443.\n", "HTTP request sent, awaiting response... 200 OK\n", "Length: 246151472 (235M)\n", "Saving to: ‘./zr3644_18_R1_val_1_val_1_val_1_bismark_bt2_pe._5x.bedgraph’\n", "\n", "zr3644_18_R1_val_1_ 100%[===================>] 234.75M 73.9MB/s in 3.2s \n", "\n", "2021-05-17 20:51:34 (73.8 MB/s) - ‘./zr3644_18_R1_val_1_val_1_val_1_bismark_bt2_pe._5x.bedgraph’ saved [246151472/246151472]\n", "\n", "--2021-05-17 20:51:34-- https://gannet.fish.washington.edu/spartina/project-oyster-oa/Haws/bismark-2/zr3644_19_R1_val_1_val_1_val_1_bismark_bt2_PE_report.html\n", "Reusing existing connection to gannet.fish.washington.edu:443.\n", "HTTP request sent, awaiting response... 200 OK\n", "Length: 3162886 (3.0M) [text/html]\n", "Saving to: ‘./zr3644_19_R1_val_1_val_1_val_1_bismark_bt2_PE_report.html.tmp’\n", "\n", "zr3644_19_R1_val_1_ 100%[===================>] 3.02M --.-KB/s in 0.07s \n", "\n", "2021-05-17 20:51:34 (45.6 MB/s) - ‘./zr3644_19_R1_val_1_val_1_val_1_bismark_bt2_PE_report.html.tmp’ saved [3162886/3162886]\n", "\n", "Removing ./zr3644_19_R1_val_1_val_1_val_1_bismark_bt2_PE_report.html.tmp since it should be rejected.\n", "\n", "--2021-05-17 20:51:34-- https://gannet.fish.washington.edu/spartina/project-oyster-oa/Haws/bismark-2/zr3644_19_R1_val_1_val_1_val_1_bismark_bt2_pe._5x.bedgraph\n", "Reusing existing connection to gannet.fish.washington.edu:443.\n", "HTTP request sent, awaiting response... 200 OK\n", "Length: 243551222 (232M)\n", "Saving to: ‘./zr3644_19_R1_val_1_val_1_val_1_bismark_bt2_pe._5x.bedgraph’\n", "\n", "zr3644_19_R1_val_1_ 100%[===================>] 232.27M 75.2MB/s in 3.1s \n", "\n", "2021-05-17 20:51:37 (75.2 MB/s) - ‘./zr3644_19_R1_val_1_val_1_val_1_bismark_bt2_pe._5x.bedgraph’ saved [243551222/243551222]\n", "\n", "--2021-05-17 20:51:38-- https://gannet.fish.washington.edu/spartina/project-oyster-oa/Haws/bismark-2/zr3644_20_R1_val_1_val_1_val_1_bismark_bt2_PE_report.html\n", "Reusing existing connection to gannet.fish.washington.edu:443.\n", "HTTP request sent, awaiting response... 200 OK\n", "Length: 3162920 (3.0M) [text/html]\n", "Saving to: ‘./zr3644_20_R1_val_1_val_1_val_1_bismark_bt2_PE_report.html.tmp’\n", "\n", "zr3644_20_R1_val_1_ 100%[===================>] 3.02M --.-KB/s in 0.04s \n", "\n", "2021-05-17 20:51:38 (72.8 MB/s) - ‘./zr3644_20_R1_val_1_val_1_val_1_bismark_bt2_PE_report.html.tmp’ saved [3162920/3162920]\n", "\n", "Removing ./zr3644_20_R1_val_1_val_1_val_1_bismark_bt2_PE_report.html.tmp since it should be rejected.\n", "\n", "--2021-05-17 20:51:38-- https://gannet.fish.washington.edu/spartina/project-oyster-oa/Haws/bismark-2/zr3644_20_R1_val_1_val_1_val_1_bismark_bt2_pe._5x.bedgraph\n", "Reusing existing connection to gannet.fish.washington.edu:443.\n", "HTTP request sent, awaiting response... 200 OK\n", "Length: 255785102 (244M)\n", "Saving to: ‘./zr3644_20_R1_val_1_val_1_val_1_bismark_bt2_pe._5x.bedgraph’\n", "\n", "zr3644_20_R1_val_1_ 100%[===================>] 243.94M 81.8MB/s in 3.0s \n", "\n", "2021-05-17 20:51:41 (81.8 MB/s) - ‘./zr3644_20_R1_val_1_val_1_val_1_bismark_bt2_pe._5x.bedgraph’ saved [255785102/255785102]\n", "\n", "--2021-05-17 20:51:41-- https://gannet.fish.washington.edu/spartina/project-oyster-oa/Haws/bismark-2/zr3644_21_R1_val_1_val_1_val_1_bismark_bt2_PE_report.html\n", "Reusing existing connection to gannet.fish.washington.edu:443.\n", "HTTP request sent, awaiting response... 200 OK\n", "Length: 3162929 (3.0M) [text/html]\n", "Saving to: ‘./zr3644_21_R1_val_1_val_1_val_1_bismark_bt2_PE_report.html.tmp’\n", "\n", "zr3644_21_R1_val_1_ 100%[===================>] 3.02M --.-KB/s in 0.06s \n", "\n", "2021-05-17 20:51:41 (48.0 MB/s) - ‘./zr3644_21_R1_val_1_val_1_val_1_bismark_bt2_PE_report.html.tmp’ saved [3162929/3162929]\n", "\n", "Removing ./zr3644_21_R1_val_1_val_1_val_1_bismark_bt2_PE_report.html.tmp since it should be rejected.\n", "\n", "--2021-05-17 20:51:41-- https://gannet.fish.washington.edu/spartina/project-oyster-oa/Haws/bismark-2/zr3644_21_R1_val_1_val_1_val_1_bismark_bt2_pe._5x.bedgraph\n", "Reusing existing connection to gannet.fish.washington.edu:443.\n", "HTTP request sent, awaiting response... 200 OK\n", "Length: 251936294 (240M)\n", "Saving to: ‘./zr3644_21_R1_val_1_val_1_val_1_bismark_bt2_pe._5x.bedgraph’\n", "\n", "zr3644_21_R1_val_1_ 100%[===================>] 240.26M 66.9MB/s in 3.5s \n", "\n", "2021-05-17 20:51:45 (68.9 MB/s) - ‘./zr3644_21_R1_val_1_val_1_val_1_bismark_bt2_pe._5x.bedgraph’ saved [251936294/251936294]\n", "\n", "--2021-05-17 20:51:45-- https://gannet.fish.washington.edu/spartina/project-oyster-oa/Haws/bismark-2/zr3644_22_R1_val_1_val_1_val_1_bismark_bt2_PE_report.html\n", "Reusing existing connection to gannet.fish.washington.edu:443.\n", "HTTP request sent, awaiting response... 200 OK\n", "Length: 3162894 (3.0M) [text/html]\n", "Saving to: ‘./zr3644_22_R1_val_1_val_1_val_1_bismark_bt2_PE_report.html.tmp’\n", "\n", "zr3644_22_R1_val_1_ 100%[===================>] 3.02M 19.9MB/s in 0.2s \n", "\n", "2021-05-17 20:51:45 (19.9 MB/s) - ‘./zr3644_22_R1_val_1_val_1_val_1_bismark_bt2_PE_report.html.tmp’ saved [3162894/3162894]\n", "\n", "Removing ./zr3644_22_R1_val_1_val_1_val_1_bismark_bt2_PE_report.html.tmp since it should be rejected.\n", "\n", "--2021-05-17 20:51:45-- https://gannet.fish.washington.edu/spartina/project-oyster-oa/Haws/bismark-2/zr3644_22_R1_val_1_val_1_val_1_bismark_bt2_pe._5x.bedgraph\n", "Reusing existing connection to gannet.fish.washington.edu:443.\n", "HTTP request sent, awaiting response... 200 OK\n", "Length: 236205711 (225M)\n", "Saving to: ‘./zr3644_22_R1_val_1_val_1_val_1_bismark_bt2_pe._5x.bedgraph’\n", "\n", "zr3644_22_R1_val_1_ 100%[===================>] 225.26M 77.7MB/s in 2.9s \n", "\n", "2021-05-17 20:51:48 (77.7 MB/s) - ‘./zr3644_22_R1_val_1_val_1_val_1_bismark_bt2_pe._5x.bedgraph’ saved [236205711/236205711]\n", "\n", "--2021-05-17 20:51:48-- https://gannet.fish.washington.edu/spartina/project-oyster-oa/Haws/bismark-2/zr3644_23_R1_val_1_val_1_val_1_bismark_bt2_PE_report.html\n", "Reusing existing connection to gannet.fish.washington.edu:443.\n", "HTTP request sent, awaiting response... 200 OK\n", "Length: 3162830 (3.0M) [text/html]\n", "Saving to: ‘./zr3644_23_R1_val_1_val_1_val_1_bismark_bt2_PE_report.html.tmp’\n", "\n", "zr3644_23_R1_val_1_ 100%[===================>] 3.02M --.-KB/s in 0.06s \n", "\n", "2021-05-17 20:51:48 (50.5 MB/s) - ‘./zr3644_23_R1_val_1_val_1_val_1_bismark_bt2_PE_report.html.tmp’ saved [3162830/3162830]\n", "\n", "Removing ./zr3644_23_R1_val_1_val_1_val_1_bismark_bt2_PE_report.html.tmp since it should be rejected.\n", "\n", "--2021-05-17 20:51:48-- https://gannet.fish.washington.edu/spartina/project-oyster-oa/Haws/bismark-2/zr3644_23_R1_val_1_val_1_val_1_bismark_bt2_pe._5x.bedgraph\n", "Reusing existing connection to gannet.fish.washington.edu:443.\n", "HTTP request sent, awaiting response... 200 OK\n", "Length: 210792563 (201M)\n", "Saving to: ‘./zr3644_23_R1_val_1_val_1_val_1_bismark_bt2_pe._5x.bedgraph’\n", "\n", "zr3644_23_R1_val_1_ 100%[===================>] 201.03M 71.8MB/s in 2.8s \n", "\n", "2021-05-17 20:51:51 (71.8 MB/s) - ‘./zr3644_23_R1_val_1_val_1_val_1_bismark_bt2_pe._5x.bedgraph’ saved [210792563/210792563]\n", "\n", "--2021-05-17 20:51:51-- https://gannet.fish.washington.edu/spartina/project-oyster-oa/Haws/bismark-2/zr3644_24_R1_val_1_val_1_val_1_bismark_bt2_PE_report.html\n", "Reusing existing connection to gannet.fish.washington.edu:443.\n", "HTTP request sent, awaiting response... 200 OK\n", "Length: 3162911 (3.0M) [text/html]\n", "Saving to: ‘./zr3644_24_R1_val_1_val_1_val_1_bismark_bt2_PE_report.html.tmp’\n", "\n", "zr3644_24_R1_val_1_ 100%[===================>] 3.02M --.-KB/s in 0.1s \n", "\n", "2021-05-17 20:51:51 (22.5 MB/s) - ‘./zr3644_24_R1_val_1_val_1_val_1_bismark_bt2_PE_report.html.tmp’ saved [3162911/3162911]\n", "\n", "Removing ./zr3644_24_R1_val_1_val_1_val_1_bismark_bt2_PE_report.html.tmp since it should be rejected.\n", "\n", "--2021-05-17 20:51:51-- https://gannet.fish.washington.edu/spartina/project-oyster-oa/Haws/bismark-2/zr3644_24_R1_val_1_val_1_val_1_bismark_bt2_pe._5x.bedgraph\n", "Reusing existing connection to gannet.fish.washington.edu:443.\n", "HTTP request sent, awaiting response... 200 OK\n", "Length: 250700953 (239M)\n", "Saving to: ‘./zr3644_24_R1_val_1_val_1_val_1_bismark_bt2_pe._5x.bedgraph’\n", "\n", "zr3644_24_R1_val_1_ 100%[===================>] 239.09M 74.8MB/s in 3.3s \n", "\n", "2021-05-17 20:51:55 (72.9 MB/s) - ‘./zr3644_24_R1_val_1_val_1_val_1_bismark_bt2_pe._5x.bedgraph’ saved [250700953/250700953]\n", "\n", "--2021-05-17 20:51:55-- https://gannet.fish.washington.edu/spartina/project-oyster-oa/Haws/bismark-2/?C=N;O=A\n", "Reusing existing connection to gannet.fish.washington.edu:443.\n", "HTTP request sent, awaiting response... 200 OK\n", "Length: unspecified [text/html]\n", "Saving to: ‘./index.html?C=N;O=A.tmp’\n", "\n", "index.html?C=N;O=A. [ <=> ] 168.69K --.-KB/s in 0.003s \n", "\n", "2021-05-17 20:52:00 (61.0 MB/s) - ‘./index.html?C=N;O=A.tmp’ saved [172743]\n", "\n", "Removing ./index.html?C=N;O=A.tmp since it should be rejected.\n", "\n", "--2021-05-17 20:52:00-- https://gannet.fish.washington.edu/spartina/project-oyster-oa/Haws/bismark-2/?C=M;O=D\n", "Reusing existing connection to gannet.fish.washington.edu:443.\n", "HTTP request sent, awaiting response... 200 OK\n", "Length: unspecified [text/html]\n", "Saving to: ‘./index.html?C=M;O=D.tmp’\n", "\n", "index.html?C=M;O=D. [ <=> ] 168.69K --.-KB/s in 0.004s \n", "\n", "2021-05-17 20:52:04 (39.0 MB/s) - ‘./index.html?C=M;O=D.tmp’ saved [172743]\n", "\n", "Removing ./index.html?C=M;O=D.tmp since it should be rejected.\n", "\n", "--2021-05-17 20:52:04-- https://gannet.fish.washington.edu/spartina/project-oyster-oa/Haws/bismark-2/?C=S;O=D\n", "Reusing existing connection to gannet.fish.washington.edu:443.\n", "HTTP request sent, awaiting response... 200 OK\n", "Length: unspecified [text/html]\n", "Saving to: ‘./index.html?C=S;O=D.tmp’\n", "\n", "index.html?C=S;O=D. [ <=> ] 168.69K --.-KB/s in 0.004s \n", "\n", "2021-05-17 20:52:09 (39.0 MB/s) - ‘./index.html?C=S;O=D.tmp’ saved [172743]\n", "\n", "Removing ./index.html?C=S;O=D.tmp since it should be rejected.\n", "\n", "--2021-05-17 20:52:09-- https://gannet.fish.washington.edu/spartina/project-oyster-oa/Haws/bismark-2/?C=D;O=D\n", "Reusing existing connection to gannet.fish.washington.edu:443.\n", "HTTP request sent, awaiting response... 200 OK\n", "Length: unspecified [text/html]\n", "Saving to: ‘./index.html?C=D;O=D.tmp’\n", "\n", "index.html?C=D;O=D. [ <=> ] 168.69K --.-KB/s in 0.004s \n", "\n", "2021-05-17 20:52:14 (38.8 MB/s) - ‘./index.html?C=D;O=D.tmp’ saved [172743]\n", "\n", "Removing ./index.html?C=D;O=D.tmp since it should be rejected.\n", "\n", "--2021-05-17 20:52:14-- https://gannet.fish.washington.edu/spartina/project-oyster-oa/Haws/bismark-2/multiqc_data/?C=N;O=D\n", "Reusing existing connection to gannet.fish.washington.edu:443.\n", "HTTP request sent, awaiting response... 200 OK\n", "Length: 2171 (2.1K) [text/html]\n", "Saving to: ‘./index.html?C=N;O=D.tmp’\n", "\n", "index.html?C=N;O=D. 100%[===================>] 2.12K --.-KB/s in 0s \n", "\n", "2021-05-17 20:52:14 (2.02 GB/s) - ‘./index.html?C=N;O=D.tmp’ saved [2171/2171]\n", "\n", "Removing ./index.html?C=N;O=D.tmp since it should be rejected.\n", "\n", "--2021-05-17 20:52:14-- https://gannet.fish.washington.edu/spartina/project-oyster-oa/Haws/bismark-2/multiqc_data/?C=M;O=A\n", "Reusing existing connection to gannet.fish.washington.edu:443.\n", "HTTP request sent, awaiting response... 200 OK\n", "Length: 2171 (2.1K) [text/html]\n", "Saving to: ‘./index.html?C=M;O=A.tmp’\n", "\n", "index.html?C=M;O=A. 100%[===================>] 2.12K --.-KB/s in 0s \n", "\n", "2021-05-17 20:52:14 (2.02 GB/s) - ‘./index.html?C=M;O=A.tmp’ saved [2171/2171]\n", "\n", "Removing ./index.html?C=M;O=A.tmp since it should be rejected.\n", "\n", "--2021-05-17 20:52:14-- https://gannet.fish.washington.edu/spartina/project-oyster-oa/Haws/bismark-2/multiqc_data/?C=S;O=A\n", "Reusing existing connection to gannet.fish.washington.edu:443.\n", "HTTP request sent, awaiting response... 200 OK\n", "Length: 2171 (2.1K) [text/html]\n", "Saving to: ‘./index.html?C=S;O=A.tmp’\n", "\n", "index.html?C=S;O=A. 100%[===================>] 2.12K --.-KB/s in 0s \n", "\n", "2021-05-17 20:52:14 (2.02 GB/s) - ‘./index.html?C=S;O=A.tmp’ saved [2171/2171]\n", "\n", "Removing ./index.html?C=S;O=A.tmp since it should be rejected.\n", "\n", "--2021-05-17 20:52:14-- https://gannet.fish.washington.edu/spartina/project-oyster-oa/Haws/bismark-2/multiqc_data/?C=D;O=A\n", "Reusing existing connection to gannet.fish.washington.edu:443.\n", "HTTP request sent, awaiting response... 200 OK\n", "Length: 2171 (2.1K) [text/html]\n", "Saving to: ‘./index.html?C=D;O=A.tmp’\n", "\n", "index.html?C=D;O=A. 100%[===================>] 2.12K --.-KB/s in 0s \n", "\n", "2021-05-17 20:52:14 (2.02 GB/s) - ‘./index.html?C=D;O=A.tmp’ saved [2171/2171]\n", "\n", "Removing ./index.html?C=D;O=A.tmp since it should be rejected.\n", "\n", "--2021-05-17 20:52:14-- https://gannet.fish.washington.edu/spartina/project-oyster-oa/Haws/bismark-2/multiqc_data/?C=N;O=A\n", "Reusing existing connection to gannet.fish.washington.edu:443.\n", "HTTP request sent, awaiting response... 200 OK\n", "Length: 2171 (2.1K) [text/html]\n", "Saving to: ‘./index.html?C=N;O=A.tmp’\n", "\n", "index.html?C=N;O=A. 100%[===================>] 2.12K --.-KB/s in 0s \n", "\n", "2021-05-17 20:52:14 (2.02 GB/s) - ‘./index.html?C=N;O=A.tmp’ saved [2171/2171]\n", "\n", "Removing ./index.html?C=N;O=A.tmp since it should be rejected.\n", "\n", "--2021-05-17 20:52:14-- https://gannet.fish.washington.edu/spartina/project-oyster-oa/Haws/bismark-2/multiqc_data/?C=M;O=D\n", "Reusing existing connection to gannet.fish.washington.edu:443.\n", "HTTP request sent, awaiting response... 200 OK\n", "Length: 2171 (2.1K) [text/html]\n", "Saving to: ‘./index.html?C=M;O=D.tmp’\n", "\n", "index.html?C=M;O=D. 100%[===================>] 2.12K --.-KB/s in 0s \n", "\n", "2021-05-17 20:52:14 (2.02 GB/s) - ‘./index.html?C=M;O=D.tmp’ saved [2171/2171]\n", "\n", "Removing ./index.html?C=M;O=D.tmp since it should be rejected.\n", "\n", "--2021-05-17 20:52:14-- https://gannet.fish.washington.edu/spartina/project-oyster-oa/Haws/bismark-2/multiqc_data/?C=S;O=D\n", "Reusing existing connection to gannet.fish.washington.edu:443.\n", "HTTP request sent, awaiting response... 200 OK\n", "Length: 2171 (2.1K) [text/html]\n", "Saving to: ‘./index.html?C=S;O=D.tmp’\n", "\n", "index.html?C=S;O=D. 100%[===================>] 2.12K --.-KB/s in 0s \n", "\n", "2021-05-17 20:52:14 (2.02 GB/s) - ‘./index.html?C=S;O=D.tmp’ saved [2171/2171]\n", "\n", "Removing ./index.html?C=S;O=D.tmp since it should be rejected.\n", "\n", "--2021-05-17 20:52:14-- https://gannet.fish.washington.edu/spartina/project-oyster-oa/Haws/bismark-2/multiqc_data/?C=D;O=D\n", "Reusing existing connection to gannet.fish.washington.edu:443.\n", "HTTP request sent, awaiting response... 200 OK\n", "Length: 2171 (2.1K) [text/html]\n", "Saving to: ‘./index.html?C=D;O=D.tmp’\n", "\n", "index.html?C=D;O=D. 100%[===================>] 2.12K --.-KB/s in 0s \n", "\n", "2021-05-17 20:52:14 (2.02 GB/s) - ‘./index.html?C=D;O=D.tmp’ saved [2171/2171]\n", "\n", "Removing ./index.html?C=D;O=D.tmp since it should be rejected.\n", "\n", "FINISHED --2021-05-17 20:52:14--\n", "Total wall clock time: 2m 7s\n", "Downloaded: 68 files, 5.6G in 1m 17s (74.0 MB/s)\n" ] } ], "source": [ "#Download 5x bedgraphs\n", "!wget -r \\\n", "--no-check-certificate --no-directories --no-parent --reject \"index.html*\" \\\n", "-P . \\\n", "-A \"*5x.bedgraph\" https://gannet.fish.washington.edu/spartina/project-oyster-oa/Haws/bismark-2/" ] }, { "cell_type": "code", "execution_count": 7, "metadata": { "collapsed": false, "scrolled": true }, "outputs": [ { "name": "stdout", "output_type": "stream", "text": [ "total 11482968\r\n", "-rw-r--r-- 1 yaamini staff 240M Mar 11 02:25 zr3644_10_R1_val_1_val_1_val_1_bismark_bt2_pe._5x.bedgraph\r\n", "-rw-r--r-- 1 yaamini staff 251M Mar 11 02:25 zr3644_11_R1_val_1_val_1_val_1_bismark_bt2_pe._5x.bedgraph\r\n", "-rw-r--r-- 1 yaamini staff 227M Mar 11 02:25 zr3644_12_R1_val_1_val_1_val_1_bismark_bt2_pe._5x.bedgraph\r\n", "-rw-r--r-- 1 yaamini staff 247M Mar 11 02:26 zr3644_13_R1_val_1_val_1_val_1_bismark_bt2_pe._5x.bedgraph\r\n", "-rw-r--r-- 1 yaamini staff 191M Mar 11 02:26 zr3644_14_R1_val_1_val_1_val_1_bismark_bt2_pe._5x.bedgraph\r\n", "-rw-r--r-- 1 yaamini staff 258M Mar 11 02:26 zr3644_15_R1_val_1_val_1_val_1_bismark_bt2_pe._5x.bedgraph\r\n", "-rw-r--r-- 1 yaamini staff 227M Mar 11 02:26 zr3644_16_R1_val_1_val_1_val_1_bismark_bt2_pe._5x.bedgraph\r\n", "-rw-r--r-- 1 yaamini staff 244M Mar 11 02:27 zr3644_17_R1_val_1_val_1_val_1_bismark_bt2_pe._5x.bedgraph\r\n", "-rw-r--r-- 1 yaamini staff 235M Mar 11 02:27 zr3644_18_R1_val_1_val_1_val_1_bismark_bt2_pe._5x.bedgraph\r\n", "-rw-r--r-- 1 yaamini staff 232M Mar 11 02:27 zr3644_19_R1_val_1_val_1_val_1_bismark_bt2_pe._5x.bedgraph\r\n", "-rw-r--r-- 1 yaamini staff 227M Mar 11 02:27 zr3644_1_R1_val_1_val_1_val_1_bismark_bt2_pe._5x.bedgraph\r\n", "-rw-r--r-- 1 yaamini staff 244M Mar 11 02:28 zr3644_20_R1_val_1_val_1_val_1_bismark_bt2_pe._5x.bedgraph\r\n", "-rw-r--r-- 1 yaamini staff 240M Mar 11 02:28 zr3644_21_R1_val_1_val_1_val_1_bismark_bt2_pe._5x.bedgraph\r\n", "-rw-r--r-- 1 yaamini staff 225M Mar 11 02:28 zr3644_22_R1_val_1_val_1_val_1_bismark_bt2_pe._5x.bedgraph\r\n", "-rw-r--r-- 1 yaamini staff 201M Mar 11 02:28 zr3644_23_R1_val_1_val_1_val_1_bismark_bt2_pe._5x.bedgraph\r\n", "-rw-r--r-- 1 yaamini staff 239M Mar 11 02:29 zr3644_24_R1_val_1_val_1_val_1_bismark_bt2_pe._5x.bedgraph\r\n", "-rw-r--r-- 1 yaamini staff 247M Mar 11 02:29 zr3644_2_R1_val_1_val_1_val_1_bismark_bt2_pe._5x.bedgraph\r\n", "-rw-r--r-- 1 yaamini staff 203M Mar 11 02:29 zr3644_3_R1_val_1_val_1_val_1_bismark_bt2_pe._5x.bedgraph\r\n", "-rw-r--r-- 1 yaamini staff 235M Mar 11 02:29 zr3644_4_R1_val_1_val_1_val_1_bismark_bt2_pe._5x.bedgraph\r\n", "-rw-r--r-- 1 yaamini staff 221M Mar 11 02:30 zr3644_5_R1_val_1_val_1_val_1_bismark_bt2_pe._5x.bedgraph\r\n", "-rw-r--r-- 1 yaamini staff 243M Mar 11 02:30 zr3644_6_R1_val_1_val_1_val_1_bismark_bt2_pe._5x.bedgraph\r\n", "-rw-r--r-- 1 yaamini staff 298M Mar 11 02:30 zr3644_7_R1_val_1_val_1_val_1_bismark_bt2_pe._5x.bedgraph\r\n", "-rw-r--r-- 1 yaamini staff 213M Mar 11 02:31 zr3644_8_R1_val_1_val_1_val_1_bismark_bt2_pe._5x.bedgraph\r\n", "-rw-r--r-- 1 yaamini staff 218M Mar 11 02:31 zr3644_9_R1_val_1_val_1_val_1_bismark_bt2_pe._5x.bedgraph\r\n" ] } ], "source": [ "#Check directory for all files\n", "!ls -lh" ] }, { "cell_type": "code", "execution_count": 8, "metadata": { "collapsed": false }, "outputs": [ { "name": "stdout", "output_type": "stream", "text": [ "MD5 (zr3644_10_R1_val_1_val_1_val_1_bismark_bt2_pe._5x.bedgraph) = f0dc26c38229b3640fa93fb29e1fa491\n", "MD5 (zr3644_11_R1_val_1_val_1_val_1_bismark_bt2_pe._5x.bedgraph) = 0fd1e7003a0cb80de0e094cfdb8a7d0a\n", "MD5 (zr3644_12_R1_val_1_val_1_val_1_bismark_bt2_pe._5x.bedgraph) = f4c8c3b70c40770c6d3376a2b7140925\n", "MD5 (zr3644_13_R1_val_1_val_1_val_1_bismark_bt2_pe._5x.bedgraph) = 54cc1f3a915e03c34aa905fff5be2b63\n", "MD5 (zr3644_14_R1_val_1_val_1_val_1_bismark_bt2_pe._5x.bedgraph) = cba3c994a0dc7502a64c5e0ae2c8727d\n", "MD5 (zr3644_15_R1_val_1_val_1_val_1_bismark_bt2_pe._5x.bedgraph) = 9d54e2bd92b198b7ba4ab036d297b801\n", "MD5 (zr3644_16_R1_val_1_val_1_val_1_bismark_bt2_pe._5x.bedgraph) = 54a28e3fd4ce60f6908ff11fc84e72c2\n", "MD5 (zr3644_17_R1_val_1_val_1_val_1_bismark_bt2_pe._5x.bedgraph) = 650e074555739b4aac40df54abc79814\n", "MD5 (zr3644_18_R1_val_1_val_1_val_1_bismark_bt2_pe._5x.bedgraph) = 2736303b8d17bce072892b08ac1ad978\n", "MD5 (zr3644_19_R1_val_1_val_1_val_1_bismark_bt2_pe._5x.bedgraph) = 28ec1746f1cd1122eaea62bec434d38d\n", "MD5 (zr3644_1_R1_val_1_val_1_val_1_bismark_bt2_pe._5x.bedgraph) = 2503628f1878ccbd7896e344d7b59b7e\n", "MD5 (zr3644_20_R1_val_1_val_1_val_1_bismark_bt2_pe._5x.bedgraph) = 8fb3cddf5456db9fa34f811160ed43d4\n", "MD5 (zr3644_21_R1_val_1_val_1_val_1_bismark_bt2_pe._5x.bedgraph) = 715237cee96784068c8e8235751ebd59\n", "MD5 (zr3644_22_R1_val_1_val_1_val_1_bismark_bt2_pe._5x.bedgraph) = c1afbf70c7246c2f024b839917df82fb\n", "MD5 (zr3644_23_R1_val_1_val_1_val_1_bismark_bt2_pe._5x.bedgraph) = c86bcefa56dc2bdde66882c090c72d7b\n", "MD5 (zr3644_24_R1_val_1_val_1_val_1_bismark_bt2_pe._5x.bedgraph) = ff86f7c813db1977bbfd498b47b39aa8\n", "MD5 (zr3644_2_R1_val_1_val_1_val_1_bismark_bt2_pe._5x.bedgraph) = 2f3f9b9a55a112f70a7c77ec58f58792\n", "MD5 (zr3644_3_R1_val_1_val_1_val_1_bismark_bt2_pe._5x.bedgraph) = 9e0ec9815033382dd7e0bbb19881568c\n", "MD5 (zr3644_4_R1_val_1_val_1_val_1_bismark_bt2_pe._5x.bedgraph) = a5d35602cd7072a3c126dc966c4016fb\n", "MD5 (zr3644_5_R1_val_1_val_1_val_1_bismark_bt2_pe._5x.bedgraph) = b2262d9ebd98790380fe724ee2c02edf\n", "MD5 (zr3644_6_R1_val_1_val_1_val_1_bismark_bt2_pe._5x.bedgraph) = a292bfa5e869aa2bb0ead3ba26d94e31\n", "MD5 (zr3644_7_R1_val_1_val_1_val_1_bismark_bt2_pe._5x.bedgraph) = 304a82d417d018107f4361f57a8eb047\n", "MD5 (zr3644_8_R1_val_1_val_1_val_1_bismark_bt2_pe._5x.bedgraph) = 9e4ba208aa2b28af19f34c089c8c8b36\n", "MD5 (zr3644_9_R1_val_1_val_1_val_1_bismark_bt2_pe._5x.bedgraph) = 0e9c8026b61cc46e0726bd3d977a5de0\n" ] } ], "source": [ "#Obtain md5\n", "!md5 *" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "## 2. Concatenate 1x methylation information" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "I will use `unionBedGraphs` to concatenate information for all loci across samples." ] }, { "cell_type": "code", "execution_count": 4, "metadata": { "collapsed": true }, "outputs": [], "source": [ "bedtoolsDirectory = \"/Users/Shared/bioinformatics/bedtools2/bin/\"" ] }, { "cell_type": "code", "execution_count": 10, "metadata": { "collapsed": false, "scrolled": true }, "outputs": [ { "name": "stdout", "output_type": "stream", "text": [ "\r\n", "Tool: bedtools unionbedg (aka unionBedGraphs)\r\n", "Version: v2.26.0\r\n", "Summary: Combines multiple BedGraph files into a single file,\r\n", "\t allowing coverage comparisons between them.\r\n", "\r\n", "Usage: bedtools unionbedg [OPTIONS] -i FILE1 FILE2 .. FILEn\r\n", "\t Assumes that each BedGraph file is sorted by chrom/start \r\n", "\t and that the intervals in each are non-overlapping.\r\n", "\r\n", "Options: \r\n", "\t-header\t\tPrint a header line.\r\n", "\t\t\t(chrom/start/end + names of each file).\r\n", "\r\n", "\t-names\t\tA list of names (one/file) to describe each file in -i.\r\n", "\t\t\tThese names will be printed in the header line.\r\n", "\r\n", "\t-g\t\tUse genome file to calculate empty regions.\r\n", "\t\t\t- STRING.\r\n", "\r\n", "\t-empty\t\tReport empty regions (i.e., start/end intervals w/o\r\n", "\t\t\tvalues in all files).\r\n", "\t\t\t- Requires the '-g FILE' parameter.\r\n", "\r\n", "\t-filler TEXT\tUse TEXT when representing intervals having no value.\r\n", "\t\t\t- Default is '0', but you can use 'N/A' or any text.\r\n", "\r\n", "\t-examples\tShow detailed usage examples.\r\n", "\r\n" ] } ], "source": [ "!{bedtoolsDirectory}unionBedGraphs -h" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "### 2a. Create a union BEDgraph" ] }, { "cell_type": "code", "execution_count": 17, "metadata": { "collapsed": true }, "outputs": [ { "name": "stdout", "output_type": "stream", "text": [ "building file list ... \n", "24 files to consider\n", "zr3644_10_R1_val_1_val_1_val_1_bismark_bt2_pe..CpG_report.merged_CpG_evidence.cov\n", " 443999580 100% 48.14MB/s 0:00:08 (xfer#1, to-check=23/24)\n", "zr3644_11_R1_val_1_val_1_val_1_bismark_bt2_pe..CpG_report.merged_CpG_evidence.cov\n", " 449661867 100% 45.31MB/s 0:00:09 (xfer#2, to-check=22/24)\n", "zr3644_12_R1_val_1_val_1_val_1_bismark_bt2_pe..CpG_report.merged_CpG_evidence.cov\n", " 439244022 100% 46.69MB/s 0:00:08 (xfer#3, to-check=21/24)\n", "zr3644_13_R1_val_1_val_1_val_1_bismark_bt2_pe..CpG_report.merged_CpG_evidence.cov\n", " 447737467 100% 44.12MB/s 0:00:09 (xfer#4, to-check=20/24)\n", "zr3644_14_R1_val_1_val_1_val_1_bismark_bt2_pe..CpG_report.merged_CpG_evidence.cov\n", " 410215416 100% 43.46MB/s 0:00:09 (xfer#5, to-check=19/24)\n", "zr3644_15_R1_val_1_val_1_val_1_bismark_bt2_pe..CpG_report.merged_CpG_evidence.cov\n", " 451618171 100% 44.59MB/s 0:00:09 (xfer#6, to-check=18/24)\n", "zr3644_16_R1_val_1_val_1_val_1_bismark_bt2_pe..CpG_report.merged_CpG_evidence.cov\n", " 436757128 100% 45.79MB/s 0:00:09 (xfer#7, to-check=17/24)\n", "zr3644_17_R1_val_1_val_1_val_1_bismark_bt2_pe..CpG_report.merged_CpG_evidence.cov\n", " 434181566 100% 46.89MB/s 0:00:08 (xfer#8, to-check=16/24)\n", "zr3644_18_R1_val_1_val_1_val_1_bismark_bt2_pe..CpG_report.merged_CpG_evidence.cov\n", " 441807091 100% 44.97MB/s 0:00:09 (xfer#9, to-check=15/24)\n", "zr3644_19_R1_val_1_val_1_val_1_bismark_bt2_pe..CpG_report.merged_CpG_evidence.cov\n", " 442130872 100% 47.06MB/s 0:00:08 (xfer#10, to-check=14/24)\n", "zr3644_1_R1_val_1_val_1_val_1_bismark_bt2_pe..CpG_report.merged_CpG_evidence.cov\n", " 424135061 100% 44.24MB/s 0:00:09 (xfer#11, to-check=13/24)\n", "zr3644_20_R1_val_1_val_1_val_1_bismark_bt2_pe..CpG_report.merged_CpG_evidence.cov\n", " 447382883 100% 47.79MB/s 0:00:08 (xfer#12, to-check=12/24)\n", "zr3644_21_R1_val_1_val_1_val_1_bismark_bt2_pe..CpG_report.merged_CpG_evidence.cov\n", " 445117118 100% 44.52MB/s 0:00:09 (xfer#13, to-check=11/24)\n", "zr3644_22_R1_val_1_val_1_val_1_bismark_bt2_pe..CpG_report.merged_CpG_evidence.cov\n", " 436556632 100% 46.20MB/s 0:00:09 (xfer#14, to-check=10/24)\n", "zr3644_23_R1_val_1_val_1_val_1_bismark_bt2_pe..CpG_report.merged_CpG_evidence.cov\n", " 427636223 100% 45.55MB/s 0:00:08 (xfer#15, to-check=9/24)\n", "zr3644_24_R1_val_1_val_1_val_1_bismark_bt2_pe..CpG_report.merged_CpG_evidence.cov\n", " 442700790 100% 44.31MB/s 0:00:09 (xfer#16, to-check=8/24)\n", "zr3644_2_R1_val_1_val_1_val_1_bismark_bt2_pe..CpG_report.merged_CpG_evidence.cov\n", " 436094286 100% 44.73MB/s 0:00:09 (xfer#17, to-check=7/24)\n", "zr3644_3_R1_val_1_val_1_val_1_bismark_bt2_pe..CpG_report.merged_CpG_evidence.cov\n", " 432708920 100% 48.23MB/s 0:00:08 (xfer#18, to-check=6/24)\n", "zr3644_4_R1_val_1_val_1_val_1_bismark_bt2_pe..CpG_report.merged_CpG_evidence.cov\n", " 444562817 100% 44.07MB/s 0:00:09 (xfer#19, to-check=5/24)\n", "zr3644_5_R1_val_1_val_1_val_1_bismark_bt2_pe..CpG_report.merged_CpG_evidence.cov\n", " 436724546 100% 45.71MB/s 0:00:09 (xfer#20, to-check=4/24)\n", "zr3644_6_R1_val_1_val_1_val_1_bismark_bt2_pe..CpG_report.merged_CpG_evidence.cov\n", " 445362200 100% 46.70MB/s 0:00:09 (xfer#21, to-check=3/24)\n", "zr3644_7_R1_val_1_val_1_val_1_bismark_bt2_pe..CpG_report.merged_CpG_evidence.cov\n", " 470209215 100% 45.57MB/s 0:00:09 (xfer#22, to-check=2/24)\n", "zr3644_8_R1_val_1_val_1_val_1_bismark_bt2_pe..CpG_report.merged_CpG_evidence.cov\n", " 433334314 100% 44.71MB/s 0:00:09 (xfer#23, to-check=1/24)\n", "zr3644_9_R1_val_1_val_1_val_1_bismark_bt2_pe..CpG_report.merged_CpG_evidence.cov\n", " 434307274 100% 48.02MB/s 0:00:08 (xfer#24, to-check=0/24)\n", "\n", "sent 10555476948 bytes received 548 bytes 50625791.35 bytes/sec\n", "total size is 10554185459 speedup is 1.00\n" ] } ], "source": [ "!rsync --archive --progress --verbose /Volumes/web/spartina/project-oyster-oa/Haws/bismark-2/*cov ." ] }, { "cell_type": "code", "execution_count": 18, "metadata": { "collapsed": false, "scrolled": false }, "outputs": [ { "name": "stdout", "output_type": "stream", "text": [ "zr3644_10_R1_val_1_val_1_val_1_bismark_bt2_pe..CpG_report.merged_CpG_evidence.cov\r\n", "zr3644_11_R1_val_1_val_1_val_1_bismark_bt2_pe..CpG_report.merged_CpG_evidence.cov\r\n", "zr3644_12_R1_val_1_val_1_val_1_bismark_bt2_pe..CpG_report.merged_CpG_evidence.cov\r\n", "zr3644_13_R1_val_1_val_1_val_1_bismark_bt2_pe..CpG_report.merged_CpG_evidence.cov\r\n", "zr3644_14_R1_val_1_val_1_val_1_bismark_bt2_pe..CpG_report.merged_CpG_evidence.cov\r\n", "zr3644_15_R1_val_1_val_1_val_1_bismark_bt2_pe..CpG_report.merged_CpG_evidence.cov\r\n", "zr3644_16_R1_val_1_val_1_val_1_bismark_bt2_pe..CpG_report.merged_CpG_evidence.cov\r\n", "zr3644_17_R1_val_1_val_1_val_1_bismark_bt2_pe..CpG_report.merged_CpG_evidence.cov\r\n", "zr3644_18_R1_val_1_val_1_val_1_bismark_bt2_pe..CpG_report.merged_CpG_evidence.cov\r\n", "zr3644_19_R1_val_1_val_1_val_1_bismark_bt2_pe..CpG_report.merged_CpG_evidence.cov\r\n", "zr3644_1_R1_val_1_val_1_val_1_bismark_bt2_pe..CpG_report.merged_CpG_evidence.cov\r\n", "zr3644_20_R1_val_1_val_1_val_1_bismark_bt2_pe..CpG_report.merged_CpG_evidence.cov\r\n", "zr3644_21_R1_val_1_val_1_val_1_bismark_bt2_pe..CpG_report.merged_CpG_evidence.cov\r\n", "zr3644_22_R1_val_1_val_1_val_1_bismark_bt2_pe..CpG_report.merged_CpG_evidence.cov\r\n", "zr3644_23_R1_val_1_val_1_val_1_bismark_bt2_pe..CpG_report.merged_CpG_evidence.cov\r\n", "zr3644_24_R1_val_1_val_1_val_1_bismark_bt2_pe..CpG_report.merged_CpG_evidence.cov\r\n", "zr3644_2_R1_val_1_val_1_val_1_bismark_bt2_pe..CpG_report.merged_CpG_evidence.cov\r\n", "zr3644_3_R1_val_1_val_1_val_1_bismark_bt2_pe..CpG_report.merged_CpG_evidence.cov\r\n", "zr3644_4_R1_val_1_val_1_val_1_bismark_bt2_pe..CpG_report.merged_CpG_evidence.cov\r\n", "zr3644_5_R1_val_1_val_1_val_1_bismark_bt2_pe..CpG_report.merged_CpG_evidence.cov\r\n", "zr3644_6_R1_val_1_val_1_val_1_bismark_bt2_pe..CpG_report.merged_CpG_evidence.cov\r\n", "zr3644_7_R1_val_1_val_1_val_1_bismark_bt2_pe..CpG_report.merged_CpG_evidence.cov\r\n", "zr3644_8_R1_val_1_val_1_val_1_bismark_bt2_pe..CpG_report.merged_CpG_evidence.cov\r\n", "zr3644_9_R1_val_1_val_1_val_1_bismark_bt2_pe..CpG_report.merged_CpG_evidence.cov\r\n" ] } ], "source": [ "!find *cov" ] }, { "cell_type": "code", "execution_count": 33, "metadata": { "collapsed": false }, "outputs": [ { "name": "stdout", "output_type": "stream", "text": [ "NC_001276.1\t34\t36\t5.303030\t7\t125\r\n", "NC_001276.1\t123\t125\t1.315789\t5\t375\r\n", "NC_001276.1\t305\t307\t3.873239\t11\t273\r\n", "NC_001276.1\t433\t435\t2.153110\t9\t409\r\n", "NC_001276.1\t457\t459\t1.830664\t8\t429\r\n", "NC_001276.1\t482\t484\t0.477327\t2\t417\r\n", "NC_001276.1\t609\t611\t1.716738\t8\t458\r\n", "NC_001276.1\t781\t783\t1.434426\t7\t481\r\n", "NC_001276.1\t826\t828\t1.162791\t5\t425\r\n", "NC_001276.1\t951\t953\t3.603604\t12\t321\r\n" ] } ], "source": [ "#Columns: chr, start, end, %meth, reads meth, reads unmeth\n", "!head zr3644_9_R1_val_1_val_1_val_1_bismark_bt2_pe..CpG_report.merged_CpG_evidence.cov" ] }, { "cell_type": "code", "execution_count": 34, "metadata": { "collapsed": true }, "outputs": [], "source": [ "%%bash\n", "\n", "for f in *.cov\n", "do\n", " awk '{print $1\"\\t\"$2\"\\t\"$3\"\\t\"$4}' ${f} \\\n", " > $(basename ${f%..CpG_report.merged_CpG_evidence.cov})_1x.bedgraph\n", "done" ] }, { "cell_type": "code", "execution_count": 38, "metadata": { "collapsed": false }, "outputs": [ { "name": "stdout", "output_type": "stream", "text": [ "zr3644_10_R1_val_1_val_1_val_1_bismark_bt2_pe_1x.bedgraph\r\n", "zr3644_11_R1_val_1_val_1_val_1_bismark_bt2_pe_1x.bedgraph\r\n", "zr3644_12_R1_val_1_val_1_val_1_bismark_bt2_pe_1x.bedgraph\r\n", "zr3644_13_R1_val_1_val_1_val_1_bismark_bt2_pe_1x.bedgraph\r\n", "zr3644_14_R1_val_1_val_1_val_1_bismark_bt2_pe_1x.bedgraph\r\n", "zr3644_15_R1_val_1_val_1_val_1_bismark_bt2_pe_1x.bedgraph\r\n", "zr3644_16_R1_val_1_val_1_val_1_bismark_bt2_pe_1x.bedgraph\r\n", "zr3644_17_R1_val_1_val_1_val_1_bismark_bt2_pe_1x.bedgraph\r\n", "zr3644_18_R1_val_1_val_1_val_1_bismark_bt2_pe_1x.bedgraph\r\n", "zr3644_19_R1_val_1_val_1_val_1_bismark_bt2_pe_1x.bedgraph\r\n", "zr3644_1_R1_val_1_val_1_val_1_bismark_bt2_pe_1x.bedgraph\r\n", "zr3644_20_R1_val_1_val_1_val_1_bismark_bt2_pe_1x.bedgraph\r\n", "zr3644_21_R1_val_1_val_1_val_1_bismark_bt2_pe_1x.bedgraph\r\n", "zr3644_22_R1_val_1_val_1_val_1_bismark_bt2_pe_1x.bedgraph\r\n", "zr3644_23_R1_val_1_val_1_val_1_bismark_bt2_pe_1x.bedgraph\r\n", "zr3644_24_R1_val_1_val_1_val_1_bismark_bt2_pe_1x.bedgraph\r\n", "zr3644_2_R1_val_1_val_1_val_1_bismark_bt2_pe_1x.bedgraph\r\n", "zr3644_3_R1_val_1_val_1_val_1_bismark_bt2_pe_1x.bedgraph\r\n", "zr3644_4_R1_val_1_val_1_val_1_bismark_bt2_pe_1x.bedgraph\r\n", "zr3644_5_R1_val_1_val_1_val_1_bismark_bt2_pe_1x.bedgraph\r\n", "zr3644_6_R1_val_1_val_1_val_1_bismark_bt2_pe_1x.bedgraph\r\n", "zr3644_7_R1_val_1_val_1_val_1_bismark_bt2_pe_1x.bedgraph\r\n", "zr3644_8_R1_val_1_val_1_val_1_bismark_bt2_pe_1x.bedgraph\r\n", "zr3644_9_R1_val_1_val_1_val_1_bismark_bt2_pe_1x.bedgraph\r\n" ] } ], "source": [ "!find *1x.bedgraph" ] }, { "cell_type": "code", "execution_count": 39, "metadata": { "collapsed": true }, "outputs": [], "source": [ "%%bash\n", "\n", "for f in *1x.bedgraph\n", "do\n", " /Users/Shared/bioinformatics/bedtools2/bin/sortBed \\\n", " -i ${f} \\\n", " > $(basename ${f%_1x.bedgraph})_1x.sort.bedgraph\n", "done" ] }, { "cell_type": "code", "execution_count": 40, "metadata": { "collapsed": false, "scrolled": true }, "outputs": [ { "name": "stdout", "output_type": "stream", "text": [ "zr3644_10_R1_val_1_val_1_val_1_bismark_bt2_pe_1x.sort.bedgraph\r\n", "zr3644_11_R1_val_1_val_1_val_1_bismark_bt2_pe_1x.sort.bedgraph\r\n", "zr3644_12_R1_val_1_val_1_val_1_bismark_bt2_pe_1x.sort.bedgraph\r\n", "zr3644_13_R1_val_1_val_1_val_1_bismark_bt2_pe_1x.sort.bedgraph\r\n", "zr3644_14_R1_val_1_val_1_val_1_bismark_bt2_pe_1x.sort.bedgraph\r\n", "zr3644_15_R1_val_1_val_1_val_1_bismark_bt2_pe_1x.sort.bedgraph\r\n", "zr3644_16_R1_val_1_val_1_val_1_bismark_bt2_pe_1x.sort.bedgraph\r\n", "zr3644_17_R1_val_1_val_1_val_1_bismark_bt2_pe_1x.sort.bedgraph\r\n", "zr3644_18_R1_val_1_val_1_val_1_bismark_bt2_pe_1x.sort.bedgraph\r\n", "zr3644_19_R1_val_1_val_1_val_1_bismark_bt2_pe_1x.sort.bedgraph\r\n", "zr3644_1_R1_val_1_val_1_val_1_bismark_bt2_pe_1x.sort.bedgraph\r\n", "zr3644_20_R1_val_1_val_1_val_1_bismark_bt2_pe_1x.sort.bedgraph\r\n", "zr3644_21_R1_val_1_val_1_val_1_bismark_bt2_pe_1x.sort.bedgraph\r\n", "zr3644_22_R1_val_1_val_1_val_1_bismark_bt2_pe_1x.sort.bedgraph\r\n", "zr3644_23_R1_val_1_val_1_val_1_bismark_bt2_pe_1x.sort.bedgraph\r\n", "zr3644_24_R1_val_1_val_1_val_1_bismark_bt2_pe_1x.sort.bedgraph\r\n", "zr3644_2_R1_val_1_val_1_val_1_bismark_bt2_pe_1x.sort.bedgraph\r\n", "zr3644_3_R1_val_1_val_1_val_1_bismark_bt2_pe_1x.sort.bedgraph\r\n", "zr3644_4_R1_val_1_val_1_val_1_bismark_bt2_pe_1x.sort.bedgraph\r\n", "zr3644_5_R1_val_1_val_1_val_1_bismark_bt2_pe_1x.sort.bedgraph\r\n", "zr3644_6_R1_val_1_val_1_val_1_bismark_bt2_pe_1x.sort.bedgraph\r\n", "zr3644_7_R1_val_1_val_1_val_1_bismark_bt2_pe_1x.sort.bedgraph\r\n", "zr3644_8_R1_val_1_val_1_val_1_bismark_bt2_pe_1x.sort.bedgraph\r\n", "zr3644_9_R1_val_1_val_1_val_1_bismark_bt2_pe_1x.sort.bedgraph\r\n" ] } ], "source": [ "!ls *1x.sort.bedgraph" ] }, { "cell_type": "code", "execution_count": 41, "metadata": { "collapsed": false }, "outputs": [], "source": [ "#Create union BEDgraph from sorted files\n", "#Include a header\n", "#Use N/A when there is no data for a CpG in a sample\n", "#Define sample IDs\n", "#Use sorted bedgraphs\n", "#Save output\n", "!{bedtoolsDirectory}unionBedGraphs \\\n", "-header \\\n", "-filler N/A \\\n", "-names 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 \\\n", "-i \\\n", "*1x.sort.bedgraph \\\n", "> union_1x.bedgraph" ] }, { "cell_type": "code", "execution_count": 42, "metadata": { "collapsed": false, "scrolled": true }, "outputs": [ { "name": "stdout", "output_type": "stream", "text": [ "chrom\tstart\tend\t1\t2\t3\t4\t5\t6\t7\t8\t9\t10\t11\t12\t13\t14\t15\t16\t17\t18\t19\t20\t21\t22\t23\t24\n", "NC_001276.1\t34\t36\t3.092784\t1.162791\t3.061224\t3.401361\t3.614458\t2.659574\t4.375000\t4.347826\t4.054054\t3.603604\t2.343750\t3.468208\t2.083333\t2.453988\t6.250000\t5.479452\t9.615385\t6.862745\t5.217391\t1.785714\t2.150538\t3.389831\t2.380952\t5.303030\n", "NC_001276.1\t123\t125\t0.851064\t0.425532\t0.826446\t0.755668\t1.562500\t0.821355\t0.493827\t1.100917\t0.819672\t0.387597\t2.006689\t0.956938\t0.000000\t0.512821\t1.115242\t1.020408\t0.378788\t0.396825\t1.689189\t0.727273\t1.176471\t0.845666\t1.287554\t1.315789\n", "NC_001276.1\t305\t307\t3.314917\t1.081081\t3.225806\t1.955307\t6.578947\t2.739726\t1.973684\t2.403846\t2.427184\t1.777778\t2.525253\t3.669725\t4.848485\t3.846154\t3.255814\t5.166052\t2.843602\t5.687204\t2.517986\t1.621622\t1.630435\t4.450262\t4.040404\t3.873239\n", "NC_001276.1\t433\t435\t1.986755\t1.167315\t1.133144\t0.569260\t0.881057\t1.798561\t1.212121\t1.658375\t1.292407\t1.273885\t0.284900\t1.541426\t0.743494\t0.649351\t1.562500\t1.834862\t1.277955\t2.473498\t0.997506\t1.506024\t1.683502\t1.680672\t2.135231\t2.153110\n", "NC_001276.1\t457\t459\t1.538462\t2.158273\t0.529101\t0.172117\t1.195219\t1.487603\t0.560748\t1.397516\t0.765697\t1.479290\t1.033592\t0.910747\t1.754386\t0.769231\t0.857143\t0.643777\t1.159420\t0.619195\t2.142857\t1.983003\t1.238390\t1.374046\t2.317881\t1.830664\n", "NC_001276.1\t482\t484\t0.638978\t2.076125\t1.749271\t0.743494\t0.000000\t1.587302\t1.242236\t1.490066\t1.162791\t1.428571\t1.149425\t0.589391\t0.729927\t1.568627\t1.269841\t1.207729\t2.395210\t1.612903\t1.790281\t1.436782\t0.911854\t1.176471\t1.736111\t0.477327\n", "NC_001276.1\t609\t611\t1.479290\t1.543210\t2.425876\t1.589825\t1.742160\t1.497006\t1.876173\t1.923077\t2.737752\t1.734104\t0.997506\t2.101576\t2.654867\t1.883562\t2.777778\t2.024291\t1.837270\t3.954802\t0.938967\t3.359173\t3.200000\t2.352941\t2.866242\t1.716738\n", "NC_001276.1\t781\t783\t1.457726\t1.298701\t1.941748\t1.379310\t2.238806\t1.121795\t1.740812\t3.153153\t2.489019\t2.747253\t2.843602\t1.325758\t1.597444\t1.449275\t3.651685\t1.840491\t1.608579\t2.295082\t1.909308\t2.209945\t2.727273\t3.744150\t3.508772\t1.434426\n", "NC_001276.1\t826\t828\t1.346801\t2.205882\t2.298851\t0.200401\t0.823045\t2.161100\t0.883002\t2.117264\t1.036269\t1.796407\t0.831025\t1.237113\t1.379310\t1.131222\t1.010101\t1.345291\t1.333333\t1.503759\t0.879765\t1.577287\t1.048951\t2.229846\t1.886792\t1.162791\n", " 12794075 union_1x.bedgraph\n" ] } ], "source": [ "#Check output\n", "!head union_1x.bedgraph\n", "!wc -l union_1x.bedgraph" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "## 3. Concatenate 5x methylation information" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "### 3a. Create a union BEDgraph" ] }, { "cell_type": "code", "execution_count": 11, "metadata": { "collapsed": true }, "outputs": [], "source": [ "%%bash\n", "\n", "for f in *5x.bedgraph\n", "do\n", "/Users/Shared/bioinformatics/bedtools2/bin/sortBed \\\n", "-i ${f} \\\n", "> $(basename ${f%_5x.bedgraph})_5x.sort.bedgraph\n", "done" ] }, { "cell_type": "code", "execution_count": 12, "metadata": { "collapsed": false, "scrolled": true }, "outputs": [ { "name": "stdout", "output_type": "stream", "text": [ "zr3644_10_R1_val_1_val_1_val_1_bismark_bt2_pe._5x.sort.bedgraph\r\n", "zr3644_11_R1_val_1_val_1_val_1_bismark_bt2_pe._5x.sort.bedgraph\r\n", "zr3644_12_R1_val_1_val_1_val_1_bismark_bt2_pe._5x.sort.bedgraph\r\n", "zr3644_13_R1_val_1_val_1_val_1_bismark_bt2_pe._5x.sort.bedgraph\r\n", "zr3644_14_R1_val_1_val_1_val_1_bismark_bt2_pe._5x.sort.bedgraph\r\n", "zr3644_15_R1_val_1_val_1_val_1_bismark_bt2_pe._5x.sort.bedgraph\r\n", "zr3644_16_R1_val_1_val_1_val_1_bismark_bt2_pe._5x.sort.bedgraph\r\n", "zr3644_17_R1_val_1_val_1_val_1_bismark_bt2_pe._5x.sort.bedgraph\r\n", "zr3644_18_R1_val_1_val_1_val_1_bismark_bt2_pe._5x.sort.bedgraph\r\n", "zr3644_19_R1_val_1_val_1_val_1_bismark_bt2_pe._5x.sort.bedgraph\r\n", "zr3644_1_R1_val_1_val_1_val_1_bismark_bt2_pe._5x.sort.bedgraph\r\n", "zr3644_20_R1_val_1_val_1_val_1_bismark_bt2_pe._5x.sort.bedgraph\r\n", "zr3644_21_R1_val_1_val_1_val_1_bismark_bt2_pe._5x.sort.bedgraph\r\n", "zr3644_22_R1_val_1_val_1_val_1_bismark_bt2_pe._5x.sort.bedgraph\r\n", "zr3644_23_R1_val_1_val_1_val_1_bismark_bt2_pe._5x.sort.bedgraph\r\n", "zr3644_24_R1_val_1_val_1_val_1_bismark_bt2_pe._5x.sort.bedgraph\r\n", "zr3644_2_R1_val_1_val_1_val_1_bismark_bt2_pe._5x.sort.bedgraph\r\n", "zr3644_3_R1_val_1_val_1_val_1_bismark_bt2_pe._5x.sort.bedgraph\r\n", "zr3644_4_R1_val_1_val_1_val_1_bismark_bt2_pe._5x.sort.bedgraph\r\n", "zr3644_5_R1_val_1_val_1_val_1_bismark_bt2_pe._5x.sort.bedgraph\r\n", "zr3644_6_R1_val_1_val_1_val_1_bismark_bt2_pe._5x.sort.bedgraph\r\n", "zr3644_7_R1_val_1_val_1_val_1_bismark_bt2_pe._5x.sort.bedgraph\r\n", "zr3644_8_R1_val_1_val_1_val_1_bismark_bt2_pe._5x.sort.bedgraph\r\n", "zr3644_9_R1_val_1_val_1_val_1_bismark_bt2_pe._5x.sort.bedgraph\r\n" ] } ], "source": [ "!ls *sort*" ] }, { "cell_type": "code", "execution_count": 13, "metadata": { "collapsed": false }, "outputs": [], "source": [ "#Create union BEDgraph from sorted files\n", "#Include a header\n", "#Use N/A when there is no data for a CpG in a sample\n", "#Define sample IDs\n", "#Use sorted bedgraphs\n", "#Save output\n", "!{bedtoolsDirectory}unionBedGraphs \\\n", "-header \\\n", "-filler N/A \\\n", "-names 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 \\\n", "-i \\\n", "*5x.sort.bedgraph \\\n", "> union_5x.bedgraph" ] }, { "cell_type": "code", "execution_count": 14, "metadata": { "collapsed": false, "scrolled": true }, "outputs": [ { "name": "stdout", "output_type": "stream", "text": [ "chrom\tstart\tend\t1\t2\t3\t4\t5\t6\t7\t8\t9\t10\t11\t12\t13\t14\t15\t16\t17\t18\t19\t20\t21\t22\t23\t24\n", "NC_001276.1\t34\t36\t3.092784\t1.162791\t3.061224\t3.401361\t3.614458\t2.659574\t4.375000\t4.347826\t4.054054\t3.603604\t2.343750\t3.468208\t2.083333\t2.453988\t6.250000\t5.479452\t9.615385\t6.862745\t5.217391\t1.785714\t2.150538\t3.389831\t2.380952\t5.303030\n", "NC_001276.1\t123\t125\t0.851064\t0.425532\t0.826446\t0.755668\t1.562500\t0.821355\t0.493827\t1.100917\t0.819672\t0.387597\t2.006689\t0.956938\t0.000000\t0.512821\t1.115242\t1.020408\t0.378788\t0.396825\t1.689189\t0.727273\t1.176471\t0.845666\t1.287554\t1.315789\n", "NC_001276.1\t305\t307\t3.314917\t1.081081\t3.225806\t1.955307\t6.578947\t2.739726\t1.973684\t2.403846\t2.427184\t1.777778\t2.525253\t3.669725\t4.848485\t3.846154\t3.255814\t5.166052\t2.843602\t5.687204\t2.517986\t1.621622\t1.630435\t4.450262\t4.040404\t3.873239\n", "NC_001276.1\t433\t435\t1.986755\t1.167315\t1.133144\t0.569260\t0.881057\t1.798561\t1.212121\t1.658375\t1.292407\t1.273885\t0.284900\t1.541426\t0.743494\t0.649351\t1.562500\t1.834862\t1.277955\t2.473498\t0.997506\t1.506024\t1.683502\t1.680672\t2.135231\t2.153110\n", "NC_001276.1\t457\t459\t1.538462\t2.158273\t0.529101\t0.172117\t1.195219\t1.487603\t0.560748\t1.397516\t0.765697\t1.479290\t1.033592\t0.910747\t1.754386\t0.769231\t0.857143\t0.643777\t1.159420\t0.619195\t2.142857\t1.983003\t1.238390\t1.374046\t2.317881\t1.830664\n", "NC_001276.1\t482\t484\t0.638978\t2.076125\t1.749271\t0.743494\t0.000000\t1.587302\t1.242236\t1.490066\t1.162791\t1.428571\t1.149425\t0.589391\t0.729927\t1.568627\t1.269841\t1.207729\t2.395210\t1.612903\t1.790281\t1.436782\t0.911854\t1.176471\t1.736111\t0.477327\n", "NC_001276.1\t609\t611\t1.479290\t1.543210\t2.425876\t1.589825\t1.742160\t1.497006\t1.876173\t1.923077\t2.737752\t1.734104\t0.997506\t2.101576\t2.654867\t1.883562\t2.777778\t2.024291\t1.837270\t3.954802\t0.938967\t3.359173\t3.200000\t2.352941\t2.866242\t1.716738\n", "NC_001276.1\t781\t783\t1.457726\t1.298701\t1.941748\t1.379310\t2.238806\t1.121795\t1.740812\t3.153153\t2.489019\t2.747253\t2.843602\t1.325758\t1.597444\t1.449275\t3.651685\t1.840491\t1.608579\t2.295082\t1.909308\t2.209945\t2.727273\t3.744150\t3.508772\t1.434426\n", "NC_001276.1\t826\t828\t1.346801\t2.205882\t2.298851\t0.200401\t0.823045\t2.161100\t0.883002\t2.117264\t1.036269\t1.796407\t0.831025\t1.237113\t1.379310\t1.131222\t1.010101\t1.345291\t1.333333\t1.503759\t0.879765\t1.577287\t1.048951\t2.229846\t1.886792\t1.162791\n", " 10497321 union_5x.bedgraph\n" ] } ], "source": [ "#Check output\n", "!head union_5x.bedgraph\n", "!wc -l union_5x.bedgraph" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "### 3b. Manipulate with `pandas`" ] }, { "cell_type": "code", "execution_count": 5, "metadata": { "collapsed": false }, "outputs": [ { "data": { "text/html": [ "
\n", " | chrom | \n", "start | \n", "end | \n", "1 | \n", "2 | \n", "3 | \n", "4 | \n", "5 | \n", "6 | \n", "7 | \n", "... | \n", "15 | \n", "16 | \n", "17 | \n", "18 | \n", "19 | \n", "20 | \n", "21 | \n", "22 | \n", "23 | \n", "24 | \n", "
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
0 | \n", "NC_001276.1 | \n", "34 | \n", "36 | \n", "3.092784 | \n", "1.162791 | \n", "3.061224 | \n", "3.401361 | \n", "3.614458 | \n", "2.659574 | \n", "4.375000 | \n", "... | \n", "6.250000 | \n", "5.479452 | \n", "9.615385 | \n", "6.862745 | \n", "5.217391 | \n", "1.785714 | \n", "2.150538 | \n", "3.389831 | \n", "2.380952 | \n", "5.303030 | \n", "
1 | \n", "NC_001276.1 | \n", "123 | \n", "125 | \n", "0.851064 | \n", "0.425532 | \n", "0.826446 | \n", "0.755668 | \n", "1.562500 | \n", "0.821355 | \n", "0.493827 | \n", "... | \n", "1.115242 | \n", "1.020408 | \n", "0.378788 | \n", "0.396825 | \n", "1.689189 | \n", "0.727273 | \n", "1.176471 | \n", "0.845666 | \n", "1.287554 | \n", "1.315789 | \n", "
2 | \n", "NC_001276.1 | \n", "305 | \n", "307 | \n", "3.314917 | \n", "1.081081 | \n", "3.225806 | \n", "1.955307 | \n", "6.578947 | \n", "2.739726 | \n", "1.973684 | \n", "... | \n", "3.255814 | \n", "5.166052 | \n", "2.843602 | \n", "5.687204 | \n", "2.517986 | \n", "1.621622 | \n", "1.630435 | \n", "4.450262 | \n", "4.040404 | \n", "3.873239 | \n", "
3 | \n", "NC_001276.1 | \n", "433 | \n", "435 | \n", "1.986755 | \n", "1.167315 | \n", "1.133144 | \n", "0.569260 | \n", "0.881057 | \n", "1.798561 | \n", "1.212121 | \n", "... | \n", "1.562500 | \n", "1.834862 | \n", "1.277955 | \n", "2.473498 | \n", "0.997506 | \n", "1.506024 | \n", "1.683502 | \n", "1.680672 | \n", "2.135231 | \n", "2.153110 | \n", "
4 | \n", "NC_001276.1 | \n", "457 | \n", "459 | \n", "1.538462 | \n", "2.158273 | \n", "0.529101 | \n", "0.172117 | \n", "1.195219 | \n", "1.487603 | \n", "0.560748 | \n", "... | \n", "0.857143 | \n", "0.643777 | \n", "1.159420 | \n", "0.619195 | \n", "2.142857 | \n", "1.983003 | \n", "1.238390 | \n", "1.374046 | \n", "2.317881 | \n", "1.830664 | \n", "
5 rows × 27 columns
\n", "\n", " | chrom | \n", "start | \n", "end | \n", "1 | \n", "2 | \n", "3 | \n", "4 | \n", "5 | \n", "6 | \n", "7 | \n", "... | \n", "16 | \n", "17 | \n", "18 | \n", "19 | \n", "20 | \n", "21 | \n", "22 | \n", "23 | \n", "24 | \n", "total | \n", "
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
10497310 | \n", "NW_022994998.1 | \n", "54647 | \n", "54649 | \n", "0.0 | \n", "0.0 | \n", "9.090909 | \n", "0.0 | \n", "0.0 | \n", "0.0 | \n", "0.0 | \n", "... | \n", "0.0 | \n", "0.0 | \n", "0.0 | \n", "0.0 | \n", "0.0 | \n", "0.0 | \n", "0.0 | \n", "0.0 | \n", "0.0 | \n", "0.378788 | \n", "
10497311 | \n", "NW_022994998.1 | \n", "54770 | \n", "54772 | \n", "0.0 | \n", "0.0 | \n", "0.000000 | \n", "0.0 | \n", "0.0 | \n", "0.0 | \n", "0.0 | \n", "... | \n", "0.0 | \n", "0.0 | \n", "0.0 | \n", "0.0 | \n", "0.0 | \n", "0.0 | \n", "0.0 | \n", "0.0 | \n", "NaN | \n", "0.000000 | \n", "
10497312 | \n", "NW_022994998.1 | \n", "54834 | \n", "54836 | \n", "0.0 | \n", "0.0 | \n", "0.000000 | \n", "NaN | \n", "0.0 | \n", "0.0 | \n", "0.0 | \n", "... | \n", "0.0 | \n", "0.0 | \n", "0.0 | \n", "0.0 | \n", "0.0 | \n", "0.0 | \n", "0.0 | \n", "NaN | \n", "NaN | \n", "0.000000 | \n", "
10497313 | \n", "NW_022994998.1 | \n", "54843 | \n", "54845 | \n", "0.0 | \n", "NaN | \n", "0.000000 | \n", "NaN | \n", "0.0 | \n", "0.0 | \n", "10.0 | \n", "... | \n", "0.0 | \n", "0.0 | \n", "0.0 | \n", "0.0 | \n", "0.0 | \n", "0.0 | \n", "0.0 | \n", "NaN | \n", "NaN | \n", "0.954545 | \n", "
10497314 | \n", "NW_022994998.1 | \n", "54860 | \n", "54862 | \n", "0.0 | \n", "NaN | \n", "0.000000 | \n", "NaN | \n", "0.0 | \n", "0.0 | \n", "0.0 | \n", "... | \n", "NaN | \n", "0.0 | \n", "0.0 | \n", "0.0 | \n", "0.0 | \n", "0.0 | \n", "0.0 | \n", "NaN | \n", "NaN | \n", "0.000000 | \n", "
10497315 | \n", "NW_022994998.1 | \n", "54872 | \n", "54874 | \n", "0.0 | \n", "NaN | \n", "0.000000 | \n", "NaN | \n", "0.0 | \n", "0.0 | \n", "0.0 | \n", "... | \n", "NaN | \n", "0.0 | \n", "0.0 | \n", "0.0 | \n", "0.0 | \n", "0.0 | \n", "0.0 | \n", "NaN | \n", "NaN | \n", "0.000000 | \n", "
10497316 | \n", "NW_022994998.1 | \n", "54934 | \n", "54936 | \n", "0.0 | \n", "NaN | \n", "0.000000 | \n", "NaN | \n", "NaN | \n", "0.0 | \n", "NaN | \n", "... | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "0.0 | \n", "0.0 | \n", "NaN | \n", "0.000000 | \n", "
10497317 | \n", "NW_022994998.1 | \n", "54949 | \n", "54951 | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "0.0 | \n", "NaN | \n", "... | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "0.0 | \n", "NaN | \n", "0.000000 | \n", "
10497318 | \n", "NW_022994998.1 | \n", "54953 | \n", "54955 | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "0.0 | \n", "NaN | \n", "... | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "0.0 | \n", "NaN | \n", "0.000000 | \n", "
10497319 | \n", "NW_022994998.1 | \n", "54958 | \n", "54960 | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "0.0 | \n", "NaN | \n", "... | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "0.0 | \n", "NaN | \n", "0.000000 | \n", "
10 rows × 28 columns
\n", "\n", " | chrom | \n", "start | \n", "end | \n", "1 | \n", "2 | \n", "3 | \n", "4 | \n", "5 | \n", "6 | \n", "7 | \n", "... | \n", "18 | \n", "19 | \n", "20 | \n", "21 | \n", "22 | \n", "23 | \n", "24 | \n", "total | \n", "diploid | \n", "triploid | \n", "
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
10497310 | \n", "NW_022994998.1 | \n", "54647 | \n", "54649 | \n", "0.0 | \n", "0.0 | \n", "9.090909 | \n", "0.0 | \n", "0.0 | \n", "0.0 | \n", "0.0 | \n", "... | \n", "0.0 | \n", "0.0 | \n", "0.0 | \n", "0.0 | \n", "0.0 | \n", "0.0 | \n", "0.0 | \n", "0.378788 | \n", "0.757576 | \n", "0.0 | \n", "
10497311 | \n", "NW_022994998.1 | \n", "54770 | \n", "54772 | \n", "0.0 | \n", "0.0 | \n", "0.000000 | \n", "0.0 | \n", "0.0 | \n", "0.0 | \n", "0.0 | \n", "... | \n", "0.0 | \n", "0.0 | \n", "0.0 | \n", "0.0 | \n", "0.0 | \n", "0.0 | \n", "NaN | \n", "0.000000 | \n", "0.000000 | \n", "0.0 | \n", "
10497312 | \n", "NW_022994998.1 | \n", "54834 | \n", "54836 | \n", "0.0 | \n", "0.0 | \n", "0.000000 | \n", "NaN | \n", "0.0 | \n", "0.0 | \n", "0.0 | \n", "... | \n", "0.0 | \n", "0.0 | \n", "0.0 | \n", "0.0 | \n", "0.0 | \n", "NaN | \n", "NaN | \n", "0.000000 | \n", "0.000000 | \n", "0.0 | \n", "
10497313 | \n", "NW_022994998.1 | \n", "54843 | \n", "54845 | \n", "0.0 | \n", "NaN | \n", "0.000000 | \n", "NaN | \n", "0.0 | \n", "0.0 | \n", "10.0 | \n", "... | \n", "0.0 | \n", "0.0 | \n", "0.0 | \n", "0.0 | \n", "0.0 | \n", "NaN | \n", "NaN | \n", "0.954545 | \n", "1.909091 | \n", "0.0 | \n", "
10497314 | \n", "NW_022994998.1 | \n", "54860 | \n", "54862 | \n", "0.0 | \n", "NaN | \n", "0.000000 | \n", "NaN | \n", "0.0 | \n", "0.0 | \n", "0.0 | \n", "... | \n", "0.0 | \n", "0.0 | \n", "0.0 | \n", "0.0 | \n", "0.0 | \n", "NaN | \n", "NaN | \n", "0.000000 | \n", "0.000000 | \n", "0.0 | \n", "
10497315 | \n", "NW_022994998.1 | \n", "54872 | \n", "54874 | \n", "0.0 | \n", "NaN | \n", "0.000000 | \n", "NaN | \n", "0.0 | \n", "0.0 | \n", "0.0 | \n", "... | \n", "0.0 | \n", "0.0 | \n", "0.0 | \n", "0.0 | \n", "0.0 | \n", "NaN | \n", "NaN | \n", "0.000000 | \n", "0.000000 | \n", "0.0 | \n", "
10497316 | \n", "NW_022994998.1 | \n", "54934 | \n", "54936 | \n", "0.0 | \n", "NaN | \n", "0.000000 | \n", "NaN | \n", "NaN | \n", "0.0 | \n", "NaN | \n", "... | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "0.0 | \n", "0.0 | \n", "NaN | \n", "0.000000 | \n", "0.000000 | \n", "0.0 | \n", "
10497317 | \n", "NW_022994998.1 | \n", "54949 | \n", "54951 | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "0.0 | \n", "NaN | \n", "... | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "0.0 | \n", "NaN | \n", "0.000000 | \n", "0.000000 | \n", "0.0 | \n", "
10497318 | \n", "NW_022994998.1 | \n", "54953 | \n", "54955 | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "0.0 | \n", "NaN | \n", "... | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "0.0 | \n", "NaN | \n", "0.000000 | \n", "0.000000 | \n", "0.0 | \n", "
10497319 | \n", "NW_022994998.1 | \n", "54958 | \n", "54960 | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "0.0 | \n", "NaN | \n", "... | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "0.0 | \n", "NaN | \n", "0.000000 | \n", "0.000000 | \n", "0.0 | \n", "
10 rows × 30 columns
\n", "\n", " | chrom | \n", "start | \n", "end | \n", "1 | \n", "2 | \n", "3 | \n", "4 | \n", "5 | \n", "6 | \n", "7 | \n", "... | \n", "15 | \n", "16 | \n", "17 | \n", "18 | \n", "19 | \n", "20 | \n", "21 | \n", "22 | \n", "23 | \n", "24 | \n", "
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
0 | \n", "NC_001276.1 | \n", "34 | \n", "36 | \n", "97.0 | \n", "86.0 | \n", "98.0 | \n", "147.0 | \n", "83.0 | \n", "188.0 | \n", "160.0 | \n", "... | \n", "112.0 | \n", "146.0 | \n", "104.0 | \n", "102.0 | \n", "115.0 | \n", "112.0 | \n", "93.0 | \n", "177.0 | \n", "84.0 | \n", "132.0 | \n", "
1 | \n", "NC_001276.1 | \n", "123 | \n", "125 | \n", "235.0 | \n", "235.0 | \n", "242.0 | \n", "397.0 | \n", "192.0 | \n", "487.0 | \n", "405.0 | \n", "... | \n", "269.0 | \n", "392.0 | \n", "264.0 | \n", "252.0 | \n", "296.0 | \n", "275.0 | \n", "255.0 | \n", "473.0 | \n", "233.0 | \n", "380.0 | \n", "
2 | \n", "NC_001276.1 | \n", "305 | \n", "307 | \n", "181.0 | \n", "185.0 | \n", "186.0 | \n", "358.0 | \n", "152.0 | \n", "365.0 | \n", "304.0 | \n", "... | \n", "215.0 | \n", "271.0 | \n", "211.0 | \n", "211.0 | \n", "278.0 | \n", "185.0 | \n", "184.0 | \n", "382.0 | \n", "198.0 | \n", "284.0 | \n", "
3 | \n", "NC_001276.1 | \n", "433 | \n", "435 | \n", "302.0 | \n", "257.0 | \n", "353.0 | \n", "527.0 | \n", "227.0 | \n", "556.0 | \n", "495.0 | \n", "... | \n", "320.0 | \n", "436.0 | \n", "313.0 | \n", "283.0 | \n", "401.0 | \n", "332.0 | \n", "297.0 | \n", "595.0 | \n", "281.0 | \n", "418.0 | \n", "
4 | \n", "NC_001276.1 | \n", "457 | \n", "459 | \n", "325.0 | \n", "278.0 | \n", "378.0 | \n", "581.0 | \n", "251.0 | \n", "605.0 | \n", "535.0 | \n", "... | \n", "350.0 | \n", "466.0 | \n", "345.0 | \n", "323.0 | \n", "420.0 | \n", "353.0 | \n", "323.0 | \n", "655.0 | \n", "302.0 | \n", "437.0 | \n", "
5 rows × 27 columns
\n", "\n", " | chrom | \n", "start | \n", "end | \n", "1 | \n", "2 | \n", "3 | \n", "4 | \n", "5 | \n", "6 | \n", "7 | \n", "... | \n", "17 | \n", "18 | \n", "19 | \n", "20 | \n", "21 | \n", "22 | \n", "23 | \n", "24 | \n", "diploid | \n", "triploid | \n", "
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
12794064 | \n", "NW_022994998.1 | \n", "54834 | \n", "54836 | \n", "7.0 | \n", "5.0 | \n", "13.0 | \n", "3.0 | \n", "8.0 | \n", "7.0 | \n", "10.0 | \n", "... | \n", "6.0 | \n", "5.0 | \n", "7.0 | \n", "8.0 | \n", "6.0 | \n", "14.0 | \n", "4.0 | \n", "2.0 | \n", "8.083333 | \n", "7.000000 | \n", "
12794065 | \n", "NW_022994998.1 | \n", "54843 | \n", "54845 | \n", "7.0 | \n", "4.0 | \n", "12.0 | \n", "4.0 | \n", "8.0 | \n", "7.0 | \n", "10.0 | \n", "... | \n", "6.0 | \n", "5.0 | \n", "7.0 | \n", "8.0 | \n", "6.0 | \n", "12.0 | \n", "4.0 | \n", "3.0 | \n", "8.000000 | \n", "6.833333 | \n", "
12794066 | \n", "NW_022994998.1 | \n", "54860 | \n", "54862 | \n", "5.0 | \n", "4.0 | \n", "12.0 | \n", "3.0 | \n", "7.0 | \n", "6.0 | \n", "11.0 | \n", "... | \n", "6.0 | \n", "6.0 | \n", "7.0 | \n", "6.0 | \n", "6.0 | \n", "14.0 | \n", "4.0 | \n", "3.0 | \n", "7.416667 | \n", "6.833333 | \n", "
12794067 | \n", "NW_022994998.1 | \n", "54872 | \n", "54874 | \n", "5.0 | \n", "4.0 | \n", "11.0 | \n", "2.0 | \n", "7.0 | \n", "6.0 | \n", "8.0 | \n", "... | \n", "6.0 | \n", "6.0 | \n", "6.0 | \n", "6.0 | \n", "6.0 | \n", "13.0 | \n", "4.0 | \n", "3.0 | \n", "6.583333 | \n", "6.333333 | \n", "
12794068 | \n", "NW_022994998.1 | \n", "54934 | \n", "54936 | \n", "6.0 | \n", "2.0 | \n", "5.0 | \n", "2.0 | \n", "3.0 | \n", "7.0 | \n", "3.0 | \n", "... | \n", "3.0 | \n", "4.0 | \n", "2.0 | \n", "2.0 | \n", "2.0 | \n", "5.0 | \n", "5.0 | \n", "1.0 | \n", "4.181818 | \n", "2.916667 | \n", "
12794069 | \n", "NW_022994998.1 | \n", "54949 | \n", "54951 | \n", "4.0 | \n", "2.0 | \n", "3.0 | \n", "1.0 | \n", "2.0 | \n", "6.0 | \n", "2.0 | \n", "... | \n", "1.0 | \n", "4.0 | \n", "2.0 | \n", "1.0 | \n", "2.0 | \n", "3.0 | \n", "5.0 | \n", "1.0 | \n", "2.909091 | \n", "2.500000 | \n", "
12794070 | \n", "NW_022994998.1 | \n", "54953 | \n", "54955 | \n", "2.0 | \n", "2.0 | \n", "3.0 | \n", "1.0 | \n", "2.0 | \n", "6.0 | \n", "2.0 | \n", "... | \n", "1.0 | \n", "3.0 | \n", "2.0 | \n", "1.0 | \n", "2.0 | \n", "3.0 | \n", "5.0 | \n", "1.0 | \n", "2.636364 | \n", "2.363636 | \n", "
12794071 | \n", "NW_022994998.1 | \n", "54958 | \n", "54960 | \n", "2.0 | \n", "2.0 | \n", "3.0 | \n", "1.0 | \n", "2.0 | \n", "6.0 | \n", "2.0 | \n", "... | \n", "1.0 | \n", "3.0 | \n", "2.0 | \n", "1.0 | \n", "2.0 | \n", "3.0 | \n", "5.0 | \n", "1.0 | \n", "2.636364 | \n", "2.272727 | \n", "
12794072 | \n", "NW_022994998.1 | \n", "55001 | \n", "55003 | \n", "NaN | \n", "1.0 | \n", "1.0 | \n", "1.0 | \n", "1.0 | \n", "4.0 | \n", "NaN | \n", "... | \n", "NaN | \n", "1.0 | \n", "NaN | \n", "NaN | \n", "NaN | \n", "3.0 | \n", "2.0 | \n", "NaN | \n", "1.600000 | \n", "1.600000 | \n", "
12794073 | \n", "NW_022994998.1 | \n", "55026 | \n", "55028 | \n", "NaN | \n", "1.0 | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "... | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "1.000000 | \n", "1.000000 | \n", "
10 rows × 29 columns
\n", "