We developed PhiSpy several years ago to identify prophages, and during that process we used 41 genomes to validate our approach.
Here is the validation data so you can test other prophage finding tools.
At the moment we provide links to either genbank files or SEED format directories. We have not yet annotated the locations of the prophages, but we are working on it!
Genbank Files
You can download a single tarball with all the genbank files. This is 150M
SEED Directories
You can download each of the SEED directories as a single tarball. Each one varies between 2.5 and 17M.
- 71421.1.tar.gz
- 83331.1.tar.gz
- 83332.1.tar.gz
- 83333.1.tar.gz
- 83334.1.tar.gz
- 100226.1.tar.gz
- 122586.1.tar.gz
- 122587.1.tar.gz
- 155864.1.tar.gz
- 158878.1.tar.gz
- 160488.1.tar.gz
- 160490.1.tar.gz
- 160492.1.tar.gz
- 169963.1.tar.gz
- 183190.1.tar.gz
- 186103.1.tar.gz
- 187410.1.tar.gz
- 190486.1.tar.gz
- 190650.1.tar.gz
- 195102.1.tar.gz
- 196620.1.tar.gz
- 198214.1.tar.gz
- 198466.1.tar.gz
- 199310.1.tar.gz
- 206672.1.tar.gz
- 208435.1.tar.gz
- 208964.1.tar.gz
- 211586.1.tar.gz
- 212717.1.tar.gz
- 214092.1.tar.gz
- 220341.1.tar.gz
- 224308.1.tar.gz
- 224914.1.tar.gz
- 243230.1.tar.gz
- 243277.1.tar.gz
- 266835.1.tar.gz
- 267608.1.tar.gz
- 272558.1.tar.gz
- 272623.1.tar.gz
- 272626.1.tar.gz
- 272843.1.tar.gz