---
author: Sam White
toc-title: Contents
toc-depth: 5
toc-location: left
date: 2017-11-30 23:46:29+00:00
layout: post
slug: genome-assembly-olympia-oyster-illumina-pacbio-using-pb-jelly-wbgi-scaffold-assembly-2
title: Genome Assembly – Olympia Oyster Illumina & PacBio Using PB Jelly w/BGI Scaffold Assembly
categories:
  - 2017
  - Olympia Oyster Genome Sequencing
tags:
  - assembly
  - Illumina
  - jupyter notebook
  - olympia oyster
  - Ostrea lurida
  - PacBio
  - PB Jelly
  - pbjelly
  - QUAST
---

[After another attempt to fix PB Jelly](https://robertslab.github.io/sams-notebook/posts/2017/2017-11-30-troubleshooting-pb-jelly-install-on-emu-continued/), I ran it again. We'll see how it goes this time...

Re-ran this using the [BGI Illumina scaffolds FASTA](https://owl.fish.washington.edu/O_lurida_genome_assemblies_BGI/20161201/cdts-hk.genomics.cn/Ostrea_lurida/Ostrea_lurida.fa).

* [PB Jelly Documentation](https://sourceforge.net/p/pb-jelly/wiki/Home/)

Here's a brief rundown of how this was run:

* Default PB Jelly settings (including default settings for blasr).
* Illumina reference FASTA: [BGI Illumina scaffolds FASTA](https://owl.fish.washington.edu/O_lurida_genome_assemblies_BGI/20161201/cdts-hk.genomics.cn/Ostrea_lurida/Ostrea_lurida.fa)
* PacBio reads for mapping
* [Protocol.xml file needed for PB Jelly to run](https://owl.fish.washington.edu/Athaliana/20171130_oly_pbjelly/Protocol.xml)

See the Jupyter Notebook for full details of run (see Results section below).

#### Results:

Output folder: [https://owl.fish.washington.edu/Athaliana/20171130_oly_pbjelly/](http://owl.fish.washington.edu/Athaliana/20171130_oly_pbjelly/)

Output FASTA file: [https://owl.fish.washington.edu/Athaliana/20171130_oly_pbjelly/jelly.out.fasta](http://owl.fish.washington.edu/Athaliana/20171130_oly_pbjelly/jelly.out.fasta)

Quast assessment of output FASTA:

| Assembly | jelly.out |
|---|---|
| # contigs (>= 0 bp) | 696946 |
| # contigs (>= 1000 bp) | 159429 |
| # contigs (>= 5000 bp) | 68750 |
| # contigs (>= 10000 bp) | 35320 |
| # contigs (>= 25000 bp) | 7048 |
| # contigs (>= 50000 bp) | 894 |
| Total length (>= 0 bp) | 1253001795 |
| Total length (>= 1000 bp) | 1140787867 |
| Total length (>= 5000 bp) | 932263178 |
| Total length (>= 10000 bp) | 691523275 |
| Total length (>= 25000 bp) | 261425921 |
| Total length (>= 50000 bp) | 57741906 |
| # contigs | 213264 |
| Largest contig | 194507 |
| Total length | 1180563613 |
| GC (%) | 36.57 |
| N50 | 12433 |
| N75 | 5983 |
| L50 | 26241 |
| L75 | 60202 |
| # N's per 100 kbp | 6580.58 |

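
For anyone unfamiliar with the N50/N75 and L50/L75 values in the Quast stats above, here's a minimal sketch of how those numbers are generally defined. This is not Quast's own code. The function name `nx_lx` and the `min_length` parameter are just made up for illustration, and the toy contig lengths are not from this assembly. As far as I know, Quast only counts contigs of at least 500 bp for these stats by default, which would explain why "# contigs" is smaller than "# contigs (>= 0 bp)".

```python
def nx_lx(lengths, fraction=0.5, min_length=0):
    """Return (Nx, Lx) for a collection of contig lengths.

    Nx is the length of the contig at which the running total of the
    longest-first sorted contigs first reaches `fraction` of the total
    assembly length. Lx is how many contigs it took to get there.
    `min_length` (hypothetical parameter) drops short contigs first.
    """
    # Keep only contigs at or above the cutoff, longest first.
    ordered = sorted((n for n in lengths if n >= min_length), reverse=True)
    threshold = sum(ordered) * fraction

    running = 0
    for count, length in enumerate(ordered, start=1):
        running += length
        if running >= threshold:
            return length, count

    # Only reached when no contigs survive the filter.
    raise ValueError("no contig lengths supplied")


# Toy example: nine contigs totalling 54 bp.
toy_lengths = [2, 3, 4, 5, 6, 7, 8, 9, 10]
print(nx_lx(toy_lengths))                 # (8, 3): N50 = 8, L50 = 3
print(nx_lx(toy_lengths, fraction=0.75))  # (5, 6): N75 = 5, L75 = 6
```

In the toy example, the three longest contigs (10 + 9 + 8 = 27) cover half of the 54 bp total, so N50 is 8 and L50 is 3.
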
Have added this assembly to our [Olympia oyster genome assemblies table](https://github.com/RobertsLab/project-olympia.oyster-genomic/wiki/Genome-Assemblies).

This took an insanely long time to complete (nearly six weeks)!!! After some internet searching, I've found a potential solution to this and have initiated another PB Jelly run to see if it will run faster.

Regardless, it'll be interesting to see how the results compare from two independent runs of PB Jelly.

Jupyter Notebook (GitHub): [20171130_emu_pbjelly.ipynb](https://github.com/sr320/LabDocs/blob/master/jupyter_nbs/sam/20171130_emu_pbjelly.ipynb)