The BayesPI-BAR2 package is a set of command line tools residing in the python folder. Run python <tool_name.py> --help command to see the full usage information for a particular tool. The purpose of each tool and the data involved in the processing are described below:
differential_expression.py: computes differentially expressed genes between two groups of samples, usually the patient samples and normal tissue samples. This implements a relatively simple and fast method based on Kolmogorov-Smirnov test comparing two sets of normalized RPKM values. For a more precise analysis, the baySeq R package could be used instead.gene_regions.py: extracts gene regions based on offsets from transcription start sites.intersect_vcf.py: computes intersection between VCF files.mussd.py: implements the Mutation filtering based on the Space and Sample Distribution (MuSSD) algorithm which finds DNA regions that are highly mutated in multiple patients (mutation blocks).bayespi_bar.pyhighly_mutated_blocks.py: a simple method to find which blocks appear to have higher than average mutation rate, used to discard blocks with few mutations.mussd.pybayespi_bar.py: implements the BayesPI-BAR [3] method to compute the protein-DNA binding affinity changes due to sequence variants.data/pwm folderchoose_background_parameters.py: selects appropriate parameters (sequence size and mutation count distribution) for the background model based on chosen foreground (i.e. patient) mutation blocks.mussd.py and bayespi_bar.py, optionally a mutation signature file or a set of background mutations to usebackground_affinity_changes.py with chosen parametersbackground_affinity_changes.py: selects random mutation blocks for the background model and computes the background model using bayespi_bar.py.bayespi_bar.py)affinity_change_significance_test.py: performs the Wilcoxon rank-sum testing, comparing the patient affinity changes in mutation blocks to the background model.bayespi_bar.py, for the background model and a mutation blockplot_result.py: produces heatmaps displaying predicted affinity changes across transcription factors and patient samples.affinity_change_significance_test.py and bayespi_bar.py