1 Data

Existing RNA-Seq data was retrieved from the following complete RNASeq data available in the NCBI database:
- SRR19782039 Exposure to Valsartan & Carbamazepine
- SRR16771870 Exposure to a synthetic hormone 17 a-Ethinylestradiol (EE2)
- SRR7725722 Diarrhetic Shellfish Poisoning (DSP) toxins associated with Harmful Algal Blooms (HABs)
- SRR13013756 Hypoxia
- Mytilus galloprovincialis Reference Genome

Moving CDS file from other place on Raven

cp /home/shared/8TB_HDD_01/sr320/ncbi/ncbi_dataset/data/GCA_900618805.1/cds_from_genomic.fna ../data/cds_from_genomic.fna 
head ../data/cds_from_genomic.fna
## >lcl|UYJE01000001.1_cds_VDH88688.1_1 [locus_tag=MGAL_10B017214] [protein=Hypothetical predicted protein] [protein_id=VDH88688.1] [location=complement(join(55975..56224,59152..59365,60239..60337,61332..61522,64535..64608))] [gbkey=CDS]
## ATGAATAGAATTACTGATAGGGACTACGACTACTATGACTTTGAAGATGACAGTGACCACGAGCCTTGCGATAGTTCTGA
## TGATGATATCGAGGTTATTTTACATGGAACACCTGAACAGAAGCGTAAATTACAGACCAAAGTCCAACAAAGACATGATT
## CTTCAAGTGAAGATGACTTTGAAAAGGAAATGAATAATGAACTTAACAAACATATTAAAGGACTGGTAAATGAAAGATCA
## AGTAATGTTGCAGAAACTGTTCAAGGTAGTAGCAAAGCTCAAGACCAAGAGAAACCAACAGAACAACAACAATTTTATGA
## TGATATTTATTACGATTCAGAAGAAGAGGAAATGGTTTTACAAGGTGATGAACGTGTCAAAAGAAGACAACCTGTTCAAA
## GCAATGATGACTTATTGTACGATCCTGACCTAGACGAAGAAGACCAGCGATGGGTTGATGCTGAACGACAAGCTTATCAG
## CTGCCTGTACCCTCAGGATCCAAATCAAAACGTCAAAACAGTGATGCAGTTTTAAACTGTCCCGCTTGTATGACATTACT
## GTGTCTTGATTGTCAGGGGCATGATGTTTATGAAAACCAGTACAGAGCTATGTTTGTTAAGAACTGTCGTGTCGATACAT
## CAGAATTATTAAAACAGCCGTTACAGAAGAAAAAACGTAAAAAAAAACAGAAGACATTGGACACTACAAATAATGAAACA

2 Index Gene Set

Create the index file to align my short read files to the genes from the MGAL_10

/home/shared/kallisto/kallisto \
index \
-i ../data/MGAL_cds.index \
../data/cds_from_genomic.fna

3 Runnning Kallisto

/home/shared/kallisto/kallisto quant
## kallisto 0.46.1
## Computes equivalence classes for reads and quantifies abundances
## 
## Usage: kallisto quant [arguments] FASTQ-files
## 
## Required arguments:
## -i, --index=STRING            Filename for the kallisto index to be used for
##                               quantification
## -o, --output-dir=STRING       Directory to write output to
## 
## Optional arguments:
##     --bias                    Perform sequence based bias correction
## -b, --bootstrap-samples=INT   Number of bootstrap samples (default: 0)
##     --seed=INT                Seed for the bootstrap sampling (default: 42)
##     --plaintext               Output plaintext instead of HDF5
##     --fusion                  Search for fusions for Pizzly
##     --single                  Quantify single-end reads
##     --single-overhang         Include reads where unobserved rest of fragment is
##                               predicted to lie outside a transcript
##     --fr-stranded             Strand specific reads, first read forward
##     --rf-stranded             Strand specific reads, first read reverse
## -l, --fragment-length=DOUBLE  Estimated average fragment length
## -s, --sd=DOUBLE               Estimated standard deviation of fragment length
##                               (default: -l, -s values are estimated from paired
##                                end data, but are required when using --single)
## -t, --threads=INT             Number of threads to use (default: 1)
##     --pseudobam               Save pseudoalignments to transcriptome to BAM file
##     --genomebam               Project pseudoalignments to genome sorted BAM file
## -g, --gtf                     GTF file for transcriptome information
##                               (required for --genomebam)
## -c, --chromosomes             Tab separated file with chromosome names and lengths
##                               (optional for --genomebam, but recommended)
ls /home/shared/8TB_HDD_02/cnmntgna/GitHub/chris-musselcon/output/ncbi/
## SRR13013756.fastq
## SRR13013756_fastqc.html
## SRR13013756_fastqc.zip
## SRR16771870_1.fastq
## SRR16771870_1_fastqc.html
## SRR16771870_1_fastqc.zip
## SRR16771870_2.fastq
## SRR16771870_2_fastqc.html
## SRR16771870_2_fastqc.zip
## SRR19782039.fastq
## SRR19782039_fastqc.html
## SRR19782039_fastqc.zip
## SRR7725722_1.fastq
## SRR7725722_1_fastqc.html
## SRR7725722_1_fastqc.zip
## SRR7725722_2.fastq
## SRR7725722_2_fastqc.html
## SRR7725722_2_fastqc.zip
#mkdir ../output
#mkdir ../output/kallisto_01

find /home/shared/8TB_HDD_02/cnmntgna/GitHub/chris-musselcon/output/ncbi/*_1.fastq \
| xargs basename -s _1.fastq  | xargs -I{} /home/shared/kallisto/kallisto \
quant -i ../data/MGAL_cds.index \
-o ../output/kallisto_01/{} \
-t 4 \
/home/shared/8TB_HDD_02/cnmntgna/GitHub/chris-musselcon/output/ncbi/{}_1.fastq \
/home/shared/8TB_HDD_02/cnmntgna/GitHub/chris-musselcon/output/ncbi/{}_2.fastq
LS0tCnRpdGxlOiAiUk5BLXNlcSIKYXV0aG9yOiBTdGV2ZW4gUm9iZXJ0cwpkYXRlOiAiYHIgZm9ybWF0KFN5cy50aW1lKCksICclZCAlQiwgJVknKWAiICAKb3V0cHV0OiAKICBodG1sX2RvY3VtZW50OgogICAgdGhlbWU6IHJlYWRhYmxlCiAgICBoaWdobGlnaHQ6IHplbmJ1cm4KICAgIHRvYzogdHJ1ZQogICAgdG9jX2Zsb2F0OiB0cnVlCiAgICBudW1iZXJfc2VjdGlvbnM6IHRydWUKICAgIGNvZGVfZm9sZGluZzogc2hvdwogICAgY29kZV9kb3dubG9hZDogdHJ1ZQotLS0KCmBgYHtyIHNldHVwLCBpbmNsdWRlPUZBTFNFfQpsaWJyYXJ5KGtuaXRyKQpsaWJyYXJ5KHRpZHl2ZXJzZSkKbGlicmFyeShrYWJsZUV4dHJhKQpsaWJyYXJ5KERFU2VxMikKbGlicmFyeShwaGVhdG1hcCkKbGlicmFyeShSQ29sb3JCcmV3ZXIpCmxpYnJhcnkoZGF0YS50YWJsZSkKbGlicmFyeShEVCkKbGlicmFyeShCaW9zdHJpbmdzKQprbml0cjo6b3B0c19jaHVuayRzZXQoCiAgZWNobyA9IFRSVUUsICAgICAgICAgIyBEaXNwbGF5IGNvZGUgY2h1bmtzCiAgZXZhbCA9IEZBTFNFLCAgICAgICAgICMgRXZhbHVhdGUgY29kZSBjaHVua3MKICB3YXJuaW5nID0gRkFMU0UsICAgICAjIEhpZGUgd2FybmluZ3MKICBtZXNzYWdlID0gRkFMU0UsICAgICAjIEhpZGUgbWVzc2FnZXMKICBmaWcud2lkdGggPSA2LCAgICAgICAjIFNldCBwbG90IHdpZHRoIGluIGluY2hlcwogIGZpZy5oZWlnaHQgPSA0LCAgICAgICMgU2V0IHBsb3QgaGVpZ2h0IGluIGluY2hlcwogIGZpZy5hbGlnbiA9ICJjZW50ZXIiICMgQWxpZ24gcGxvdHMgdG8gdGhlIGNlbnRlcgopCmBgYAoKCiMgKipEYXRhKioKCkV4aXN0aW5nIFJOQS1TZXEgZGF0YSB3YXMgcmV0cmlldmVkIGZyb20gdGhlIGZvbGxvd2luZyBjb21wbGV0ZSBSTkFTZXEgZGF0YSBhdmFpbGFibGUgaW4gdGhlIFtOQ0JJIGRhdGFiYXNlXShodHRwczovL3d3dy5uY2JpLm5sbS5uaWguZ292Lyk6XAotIFNSUjE5NzgyMDM5IFtFeHBvc3VyZSB0byBWYWxzYXJ0YW4gJiBDYXJiYW1hemVwaW5lXShodHRwczovL3d3dy5uY2JpLm5sbS5uaWguZ292L3NyYS9TUlgxNTgyNjI5MSU1QmFjY24lNUQpXAotIFNSUjE2NzcxODcwIFtFeHBvc3VyZSB0byBhIHN5bnRoZXRpYyBob3Jtb25lIDE3IGEtRXRoaW55bGVzdHJhZGlvbCAoRUUyKV0oaHR0cHM6Ly93d3cubmNiaS5ubG0ubmloLmdvdi9zcmEvU1JYMTI5NzE3OTIlNUJhY2NuJTVEKVwKLSBTUlI3NzI1NzIyIFtEaWFycmhldGljIFNoZWxsZmlzaCBQb2lzb25pbmcgKERTUCkgdG94aW5zIGFzc29jaWF0ZWQgd2l0aCBIYXJtZnVsIEFsZ2FsIEJsb29tcyAoSEFCcyldKGh0dHBzOi8vd3d3Lm5jYmkubmxtLm5paC5nb3Yvc3JhL1NSWDQ1ODIyMDQlNUJhY2NuJTVEKVwKLSBTUlIxMzAxMzc1NiBbSHlwb3hpYV0oaHR0cHM6Ly93d3cubmNiaS5ubG0ubmloLmdvdi9zcmEvU1JYOTQ2NDc2NiU1QmFjY24lNUQpXAotICpNeXRpbHVzIGdhbGxvcHJvdmluY2lhbGlzKiBbUmVmZXJlbmNlIEdlbm9tZV0oaHR0cHM6Ly93d3cubmNiaS5ubG0ubmloLmdvdi9kYXRhLWh1Yi9nZW5vbWUvR0NBXzkwMDYxODgwNS4xLykKCk1vdmluZyBDRFMgZmlsZSBmcm9tIG90aGVyIHBsYWNlIG9uIFJhdmVuCgpgYGB7YmFzaH0KY3AgL2hvbWUvc2hhcmVkLzhUQl9IRERfMDEvc3IzMjAvbmNiaS9uY2JpX2RhdGFzZXQvZGF0YS9HQ0FfOTAwNjE4ODA1LjEvY2RzX2Zyb21fZ2Vub21pYy5mbmEgLi4vZGF0YS9jZHNfZnJvbV9nZW5vbWljLmZuYSAKYGBgCgoKYGBge3IsIGVuZ2luZT0nYmFzaCcsIGV2YWw9VFJVRX0KaGVhZCAuLi9kYXRhL2Nkc19mcm9tX2dlbm9taWMuZm5hCmBgYAoKIyBJbmRleCBHZW5lIFNldAoKQ3JlYXRlIHRoZSBpbmRleCBmaWxlIHRvIGFsaWduIG15IHNob3J0IHJlYWQgZmlsZXMgdG8gdGhlIGdlbmVzIGZyb20gdGhlIE1HQUxfMTAKYGBge3IsIGVuZ2luZT0nYmFzaCd9Ci9ob21lL3NoYXJlZC9rYWxsaXN0by9rYWxsaXN0byBcCmluZGV4IFwKLWkgLi4vZGF0YS9NR0FMX2Nkcy5pbmRleCBcCi4uL2RhdGEvY2RzX2Zyb21fZ2Vub21pYy5mbmEKYGBgCgoKIyBSdW5ubmluZyBLYWxsaXN0bwoKYGBge3IsIGVuZ2luZT0nYmFzaCcsIGV2YWw9VFJVRX0KL2hvbWUvc2hhcmVkL2thbGxpc3RvL2thbGxpc3RvIHF1YW50CmBgYApgYGB7ciwgZW5naW5lPSdiYXNoJywgZXZhbD1UUlVFfQpscyAvaG9tZS9zaGFyZWQvOFRCX0hERF8wMi9jbm1udGduYS9HaXRIdWIvY2hyaXMtbXVzc2VsY29uL291dHB1dC9uY2JpLwpgYGAKCgpgYGB7ciwgZW5naW5lPSdiYXNoJ30KI21rZGlyIC4uL291dHB1dAojbWtkaXIgLi4vb3V0cHV0L2thbGxpc3RvXzAxCgpmaW5kIC9ob21lL3NoYXJlZC84VEJfSEREXzAyL2NubW50Z25hL0dpdEh1Yi9jaHJpcy1tdXNzZWxjb24vb3V0cHV0L25jYmkvKl8xLmZhc3RxIFwKfCB4YXJncyBiYXNlbmFtZSAtcyBfMS5mYXN0cSAgfCB4YXJncyAtSXt9IC9ob21lL3NoYXJlZC9rYWxsaXN0by9rYWxsaXN0byBcCnF1YW50IC1pIC4uL2RhdGEvTUdBTF9jZHMuaW5kZXggXAotbyAuLi9vdXRwdXQva2FsbGlzdG9fMDEve30gXAotdCA0IFwKL2hvbWUvc2hhcmVkLzhUQl9IRERfMDIvY25tbnRnbmEvR2l0SHViL2NocmlzLW11c3NlbGNvbi9vdXRwdXQvbmNiaS97fV8xLmZhc3RxIFwKL2hvbWUvc2hhcmVkLzhUQl9IRERfMDIvY25tbnRnbmEvR2l0SHViL2NocmlzLW11c3NlbGNvbi9vdXRwdXQvbmNiaS97fV8yLmZhc3RxCmBgYAo=