--- author: Sam White toc-title: Contents toc-depth: 5 toc-location: left date: 2016-12-30 21:38:38+00:00 layout: post slug: data-management-geoduck-rrbs-data-integrity-verification title: Data Management - Geoduck RRBS Data Integrity Verification categories: - 2016 - Miscellaneous tags: - geoduck - jupyter notebook - md5 - Panopea generosa - RRBS --- [Yesterday, I downloaded the Illumina FASTQ files provided by Genewiz](https://robertslab.github.io/sams-notebook/posts/2016/2016-12-30-data-received-geoduck-rrbs-sequencing-data/) for Hollie Putnam's reduced representation bisulfite geoduck libraries. However, Genewiz had not provided a checksum file at the time. I received the checksum file from Genewiz and have verified that the data is intact. Verification is described in the Jupyter notebook below. Data files are located here: [owl/web/nightingales/P_generosa](https://owl.fish.washington.edu/nightingales/P_generosa/) Jupyter notebook (GitHub): [20161230_docker_geoduck_RRBS_md5_checks.ipynb](https://github.com/sr320/LabDocs/blob/master/jupyter_nbs/sam/20161230_docker_geoduck_RRBS_md5_checks.ipynb)