---
title: "16-BLAST-pisaster"
output: html_document
date: "2025-03-12"
---
Rmd to run BLAST on _P. ochraceus_ genome to annotate it.
Will use the example code provided at [Roberts Lab - resources - bio-Annotation](https://robertslab.github.io/resources/bio-Annotation/)
Specifically: [https://robertslab.github.io/tusk/modules/04-blast.html](https://robertslab.github.io/tusk/modules/04-blast.html)
The genome fasta lives:
`../data/jordan-uni3279-mb-hirise-slc3v_06-07-2022_final_assembly.fasta`
```{r}
library(Biostrings)
library(reshape2)
```
# Create Database
```{r}
current_time <- format(Sys.time(), "%B %d, %Y %H:%M:%S")
cat("current date and time is ", current_time)
```
```{bash}
pwd
```
this has been done in 15-BLAST-pycno.Rmd
```{bash}
#cd ../data
#curl -O https://ftp.uniprot.org/pub/databases/uniprot/current_release/knowledgebase/complete/uniprot_sprot.fasta.gz
#mv uniprot_sprot.fasta.gz uniprot_sprot_r2023_04.fasta.gz
#gunzip -k uniprot_sprot_r2023_04.fasta.gz
```
this has also been done in 15-BLAST-pycno.Rmd
make the database
```{bash}
#mkdir ../blastdb
#/home/shared/ncbi-blast-2.11.0+/bin/makeblastdb \
#-in ../data/uniprot_sprot_r2023_04.fasta \
#-dbtype prot \
#-out ../blastdb/uniprot_sprot_r2023_01
```
# Query Fasta File
as noted above, it lives: `../data/jordan-uni3279-mb-hirise-slc3v__06-07-2022__final_assembly.fasta`
```{bash}
head -3 ../data/jordan-uni3279-mb-hirise-slc3v__06-07-2022__final_assembly.fasta
```
```{bash}
echo "How many sequences are there?"
grep -c ">" ../data/jordan-uni3279-mb-hirise-slc3v__06-07-2022__final_assembly.fasta
```
What? only 122?