Data
Existing RNA-Seq data was retrieved from the following complete RNASeq data available in the NCBI database:
- SRR19782039 Exposure to Valsartan & Carbamazepine
- SRR16771870 Exposure to a synthetic hormone 17 a-Ethinylestradiol (EE2)
- SRR7725722 Diarrhetic Shellfish Poisoning (DSP) toxins associated with Harmful Algal Blooms (HABs)
- SRR13013756 Hypoxia
- Mytilus galloprovincialis Reference Genome

Moving CDS file from other place on Raven

cp /home/shared/8TB_HDD_01/sr320/ncbi/ncbi_dataset/data/GCA_900618805.1/cds_from_genomic.fna ../data/cds_from_genomic.fna 

0.0.1 Pulling my reference genome from prefab code provided by NCBI

curled genome and copied just cds file

head ../data/cds_from_genomic.fna
## >lcl|UYJE01000001.1_cds_VDH88688.1_1 [locus_tag=MGAL_10B017214] [protein=Hypothetical predicted protein] [protein_id=VDH88688.1] [location=complement(join(55975..56224,59152..59365,60239..60337,61332..61522,64535..64608))] [gbkey=CDS]
## ATGAATAGAATTACTGATAGGGACTACGACTACTATGACTTTGAAGATGACAGTGACCACGAGCCTTGCGATAGTTCTGA
## TGATGATATCGAGGTTATTTTACATGGAACACCTGAACAGAAGCGTAAATTACAGACCAAAGTCCAACAAAGACATGATT
## CTTCAAGTGAAGATGACTTTGAAAAGGAAATGAATAATGAACTTAACAAACATATTAAAGGACTGGTAAATGAAAGATCA
## AGTAATGTTGCAGAAACTGTTCAAGGTAGTAGCAAAGCTCAAGACCAAGAGAAACCAACAGAACAACAACAATTTTATGA
## TGATATTTATTACGATTCAGAAGAAGAGGAAATGGTTTTACAAGGTGATGAACGTGTCAAAAGAAGACAACCTGTTCAAA
## GCAATGATGACTTATTGTACGATCCTGACCTAGACGAAGAAGACCAGCGATGGGTTGATGCTGAACGACAAGCTTATCAG
## CTGCCTGTACCCTCAGGATCCAAATCAAAACGTCAAAACAGTGATGCAGTTTTAAACTGTCCCGCTTGTATGACATTACT
## GTGTCTTGATTGTCAGGGGCATGATGTTTATGAAAACCAGTACAGAGCTATGTTTGTTAAGAACTGTCGTGTCGATACAT
## CAGAATTATTAAAACAGCCGTTACAGAAGAAAAAACGTAAAAAAAAACAGAAGACATTGGACACTACAAATAATGAAACA

1 Create the index file to align my short read files to the genes from the MGAL_10

/home/shared/kallisto/kallisto \
index \
-i ../data/MGAL_cds.index \
../data/cds_from_genomic.fna

2 Where is data

#Runnning Kallisto

/home/shared/kallisto/kallisto quant
## kallisto 0.46.1
## Computes equivalence classes for reads and quantifies abundances
## 
## Usage: kallisto quant [arguments] FASTQ-files
## 
## Required arguments:
## -i, --index=STRING            Filename for the kallisto index to be used for
##                               quantification
## -o, --output-dir=STRING       Directory to write output to
## 
## Optional arguments:
##     --bias                    Perform sequence based bias correction
## -b, --bootstrap-samples=INT   Number of bootstrap samples (default: 0)
##     --seed=INT                Seed for the bootstrap sampling (default: 42)
##     --plaintext               Output plaintext instead of HDF5
##     --fusion                  Search for fusions for Pizzly
##     --single                  Quantify single-end reads
##     --single-overhang         Include reads where unobserved rest of fragment is
##                               predicted to lie outside a transcript
##     --fr-stranded             Strand specific reads, first read forward
##     --rf-stranded             Strand specific reads, first read reverse
## -l, --fragment-length=DOUBLE  Estimated average fragment length
## -s, --sd=DOUBLE               Estimated standard deviation of fragment length
##                               (default: -l, -s values are estimated from paired
##                                end data, but are required when using --single)
## -t, --threads=INT             Number of threads to use (default: 1)
##     --pseudobam               Save pseudoalignments to transcriptome to BAM file
##     --genomebam               Project pseudoalignments to genome sorted BAM file
## -g, --gtf                     GTF file for transcriptome information
##                               (required for --genomebam)
## -c, --chromosomes             Tab separated file with chromosome names and lengths
##                               (optional for --genomebam, but recommended)
ls /home/shared/8TB_HDD_02/cnmntgna/GitHub/chris-musselcon/output/ncbi/
## SRR13013756.fastq
## SRR13013756_fastqc.html
## SRR13013756_fastqc.zip
## SRR16771870_1.fastq
## SRR16771870_1_fastqc.html
## SRR16771870_1_fastqc.zip
## SRR16771870_2.fastq
## SRR16771870_2_fastqc.html
## SRR16771870_2_fastqc.zip
## SRR19782039.fastq
## SRR19782039_fastqc.html
## SRR19782039_fastqc.zip
## SRR7725722_1.fastq
## SRR7725722_1_fastqc.html
## SRR7725722_1_fastqc.zip
## SRR7725722_2.fastq
## SRR7725722_2_fastqc.html
## SRR7725722_2_fastqc.zip
#mkdir ../output
#mkdir ../output/kallisto_01

#OLD CODE

find /home/shared/8TB_HDD_02/cnmntgna/GitHub/chris-musselcon/output/ncbi/*_1.fastq \
| xargs basename -s _1.fastq  | xargs -I{} /home/shared/kallisto/kallisto \
quant -i ../data/MGAL_cds.index \
-o ../output/kallisto_01/{} \
-t 4 \
/home/shared/8TB_HDD_02/cnmntgna/GitHub/chris-musselcon/output/ncbi/{}_1.fastq \
/home/shared/8TB_HDD_02/cnmntgna/GitHub/chris-musselcon/output/ncbi/{}_2.fastq
LS0tCnRpdGxlOiAiUk5BLXNlcSIKYXV0aG9yOiBTdGV2ZW4gUm9iZXJ0cwpkYXRlOiAiYHIgZm9ybWF0KFN5cy50aW1lKCksICclZCAlQiwgJVknKWAiICAKb3V0cHV0OiAKICBodG1sX2RvY3VtZW50OgogICAgdGhlbWU6IHJlYWRhYmxlCiAgICBoaWdobGlnaHQ6IHplbmJ1cm4KICAgIHRvYzogdHJ1ZQogICAgdG9jX2Zsb2F0OiB0cnVlCiAgICBudW1iZXJfc2VjdGlvbnM6IHRydWUKICAgIGNvZGVfZm9sZGluZzogc2hvdwogICAgY29kZV9kb3dubG9hZDogdHJ1ZQotLS0KCmBgYHtyIHNldHVwLCBpbmNsdWRlPUZBTFNFfQpsaWJyYXJ5KGtuaXRyKQpsaWJyYXJ5KHRpZHl2ZXJzZSkKbGlicmFyeShrYWJsZUV4dHJhKQpsaWJyYXJ5KERFU2VxMikKbGlicmFyeShwaGVhdG1hcCkKbGlicmFyeShSQ29sb3JCcmV3ZXIpCmxpYnJhcnkoZGF0YS50YWJsZSkKbGlicmFyeShEVCkKbGlicmFyeShCaW9zdHJpbmdzKQprbml0cjo6b3B0c19jaHVuayRzZXQoCiAgZWNobyA9IFRSVUUsICAgICAgICAgIyBEaXNwbGF5IGNvZGUgY2h1bmtzCiAgZXZhbCA9IEZBTFNFLCAgICAgICAgICMgRXZhbHVhdGUgY29kZSBjaHVua3MKICB3YXJuaW5nID0gRkFMU0UsICAgICAjIEhpZGUgd2FybmluZ3MKICBtZXNzYWdlID0gRkFMU0UsICAgICAjIEhpZGUgbWVzc2FnZXMKICBmaWcud2lkdGggPSA2LCAgICAgICAjIFNldCBwbG90IHdpZHRoIGluIGluY2hlcwogIGZpZy5oZWlnaHQgPSA0LCAgICAgICMgU2V0IHBsb3QgaGVpZ2h0IGluIGluY2hlcwogIGZpZy5hbGlnbiA9ICJjZW50ZXIiICMgQWxpZ24gcGxvdHMgdG8gdGhlIGNlbnRlcgopCmBgYAoKCioqRGF0YSoqXApFeGlzdGluZyBSTkEtU2VxIGRhdGEgd2FzIHJldHJpZXZlZCBmcm9tIHRoZSBmb2xsb3dpbmcgY29tcGxldGUgUk5BU2VxIGRhdGEgYXZhaWxhYmxlIGluIHRoZSBbTkNCSSBkYXRhYmFzZV0oaHR0cHM6Ly93d3cubmNiaS5ubG0ubmloLmdvdi8pOlwKLSBTUlIxOTc4MjAzOSBbRXhwb3N1cmUgdG8gVmFsc2FydGFuICYgQ2FyYmFtYXplcGluZV0oaHR0cHM6Ly93d3cubmNiaS5ubG0ubmloLmdvdi9zcmEvU1JYMTU4MjYyOTElNUJhY2NuJTVEKVwKLSBTUlIxNjc3MTg3MCBbRXhwb3N1cmUgdG8gYSBzeW50aGV0aWMgaG9ybW9uZSAxNyBhLUV0aGlueWxlc3RyYWRpb2wgKEVFMildKGh0dHBzOi8vd3d3Lm5jYmkubmxtLm5paC5nb3Yvc3JhL1NSWDEyOTcxNzkyJTVCYWNjbiU1RClcCi0gU1JSNzcyNTcyMiBbRGlhcnJoZXRpYyBTaGVsbGZpc2ggUG9pc29uaW5nIChEU1ApIHRveGlucyBhc3NvY2lhdGVkIHdpdGggSGFybWZ1bCBBbGdhbCBCbG9vbXMgKEhBQnMpXShodHRwczovL3d3dy5uY2JpLm5sbS5uaWguZ292L3NyYS9TUlg0NTgyMjA0JTVCYWNjbiU1RClcCi0gU1JSMTMwMTM3NTYgW0h5cG94aWFdKGh0dHBzOi8vd3d3Lm5jYmkubmxtLm5paC5nb3Yvc3JhL1NSWDk0NjQ3NjYlNUJhY2NuJTVEKVwKLSAqTXl0aWx1cyBnYWxsb3Byb3ZpbmNpYWxpcyogW1JlZmVyZW5jZSBHZW5vbWVdKGh0dHBzOi8vd3d3Lm5jYmkubmxtLm5paC5nb3YvZGF0YS1odWIvZ2Vub21lL0dDQV85MDA2MTg4MDUuMS8pCgpNb3ZpbmcgQ0RTIGZpbGUgZnJvbSBvdGhlciBwbGFjZSBvbiBSYXZlbgoKYGBge2Jhc2h9CmNwIC9ob21lL3NoYXJlZC84VEJfSEREXzAxL3NyMzIwL25jYmkvbmNiaV9kYXRhc2V0L2RhdGEvR0NBXzkwMDYxODgwNS4xL2Nkc19mcm9tX2dlbm9taWMuZm5hIC4uL2RhdGEvY2RzX2Zyb21fZ2Vub21pYy5mbmEgCmBgYAoKCgoKIyMjIFB1bGxpbmcgbXkgcmVmZXJlbmNlIGdlbm9tZSBmcm9tIHByZWZhYiBjb2RlIHByb3ZpZGVkIGJ5IE5DQkkKY3VybGVkIGdlbm9tZSBhbmQgY29waWVkIGp1c3QgY2RzIGZpbGUKCmBgYHtyLCBlbmdpbmU9J2Jhc2gnLCBldmFsPVRSVUV9CmhlYWQgLi4vZGF0YS9jZHNfZnJvbV9nZW5vbWljLmZuYQpgYGAKCgoKIyBDcmVhdGUgdGhlIGluZGV4IGZpbGUgdG8gYWxpZ24gbXkgc2hvcnQgcmVhZCBmaWxlcyB0byB0aGUgZ2VuZXMgZnJvbSB0aGUgTUdBTF8xMApgYGB7ciwgZW5naW5lPSdiYXNoJ30KL2hvbWUvc2hhcmVkL2thbGxpc3RvL2thbGxpc3RvIFwKaW5kZXggXAotaSAuLi9kYXRhL01HQUxfY2RzLmluZGV4IFwKLi4vZGF0YS9jZHNfZnJvbV9nZW5vbWljLmZuYQpgYGAKCgoKCiMgV2hlcmUgaXMgZGF0YQoKCiNSdW5ubmluZyBLYWxsaXN0bwoKYGBge3IsIGVuZ2luZT0nYmFzaCcsIGV2YWw9VFJVRX0KL2hvbWUvc2hhcmVkL2thbGxpc3RvL2thbGxpc3RvIHF1YW50CmBgYApgYGB7ciwgZW5naW5lPSdiYXNoJywgZXZhbD1UUlVFfQpscyAvaG9tZS9zaGFyZWQvOFRCX0hERF8wMi9jbm1udGduYS9HaXRIdWIvY2hyaXMtbXVzc2VsY29uL291dHB1dC9uY2JpLwpgYGAKCgpgYGB7ciwgZW5naW5lPSdiYXNoJ30KI21rZGlyIC4uL291dHB1dAojbWtkaXIgLi4vb3V0cHV0L2thbGxpc3RvXzAxCgojT0xEIENPREUKCmZpbmQgL2hvbWUvc2hhcmVkLzhUQl9IRERfMDIvY25tbnRnbmEvR2l0SHViL2NocmlzLW11c3NlbGNvbi9vdXRwdXQvbmNiaS8qXzEuZmFzdHEgXAp8IHhhcmdzIGJhc2VuYW1lIC1zIF8xLmZhc3RxICB8IHhhcmdzIC1Je30gL2hvbWUvc2hhcmVkL2thbGxpc3RvL2thbGxpc3RvIFwKcXVhbnQgLWkgLi4vZGF0YS9NR0FMX2Nkcy5pbmRleCBcCi1vIC4uL291dHB1dC9rYWxsaXN0b18wMS97fSBcCi10IDQgXAovaG9tZS9zaGFyZWQvOFRCX0hERF8wMi9jbm1udGduYS9HaXRIdWIvY2hyaXMtbXVzc2VsY29uL291dHB1dC9uY2JpL3t9XzEuZmFzdHEgXAovaG9tZS9zaGFyZWQvOFRCX0hERF8wMi9jbm1udGduYS9HaXRIdWIvY2hyaXMtbXVzc2VsY29uL291dHB1dC9uY2JpL3t9XzIuZmFzdHEKYGBgCg==