Tag Archives: BLAST FASTA compressed GZIP search sequence

How to run BLAST on a compressed FASTA file

FASTA files with sequences from next-generation sequencing projects can be large and are usually stored as compressed files using compression algorithms such as GZIP or ZIP. If you do not want to extract all the data or create another copy of the file when performing a BLAST search, you can combine the file extraction and the BLAST search using a pipe in Unix-based operating systems and tell the BLAST program to use the standard input. Here is one example on how to do that (assuming you have a GZIP compressed FASTA file): Continue reading