Skip to main content

Usage

StrainID compares a database of VCF files against an aligned BAM file to check for the presence of SNPs in order to determine a likely cell line/strain used in the experiment.

Figure1C

bash identify-Strain.sh -i /path/to/BAM -g /path/to/genome/fASTA -v /path/to/VCF/files -o /path/to/output [ -s intSeed (default=None) ]
eg: bash identify-Strain.sh -i /input -g /sacCer3.fa -v /sacCer3_VCF -o /output -s 5

Dependencies

The following dependencies are needed to run StrainID.

You can install these dependencies as a conda environment if you have conda setup on your machine using the following command:

conda create -n strainid-env -c bioconda -c conda-forge scipy pysam wget samtools

...and then turn on the environment with:

conda activate strainid-env