--- author: Sam White toc-title: Contents toc-depth: 5 toc-location: left date: 2017-02-28 00:29:16+00:00 layout: post slug: data-received-jays-coral-radseq-and-hollies-geoduck-epi-radseq title: Data Received - Jay's Coral RADseq and Hollie's Geoduck Epi-RADseq categories: - 2017 - Miscellaneous tags: - Anthopleura elegantissima - docker - geoduck - jupyter notebook - Panopea generosa - Porites astreoides - wget --- Jay received notice from [UC Berkeley](https://qb3.berkeley.edu/gsl/) that the sequencing data from [his coral RADseq](http://onsnetwork.org/jdimond/2017/02/14/rad-library-prep/) was ready. In addition, [the sequencing contains some epiRADseq data from samples provided by Hollie Putnam](https://hputnam.github.io/Putnam_Lab_Notebook/Geoduck_Larval_DNA_Extractions/). See his notebook for multiple links that describe library preparation (indexing and barcodes), sample pooling, and species breakdown. For quickest reference, here's Jay's spreadsheet with virtually all the sample/index/barcode/pooling info (Google Sheet): [ddRAD/EpiRAD_Jan_16](https://docs.google.com/spreadsheets/d/1zS7lGuESGLiRUs8qdDf1aYxaYBmNHnwx51YtsAs83O4/edit#gid=1930556752) I've downloaded both the demultiplexed and non-demultiplexed data, verified data integrity by generating and comparing MD5 checksums, copied the files to each of the three species folders on owl/nightingales that were sequenced (_Panopea generosa_, _Anthopleura elegantissima_, _Porites astreoides_), generated and compared MD5 checksums for the files in their directories on owl/nightingales, and created/updated the readme files in each respective folder. Data management is detailed in the Jupyter notebook below. The notebook is embedded in this post, but it may be easier to view on GitHub (linked below). Readme files were updated outside of the notebook. Jupyter notebook (GitHub): [20170227_docker_jay_ngs_data_retrieval.ipynb](https://github.com/sr320/LabDocs/blob/master/jupyter_nbs/sam/20170227_docker_jay_ngs_data_retrieval.ipynb)