Running AltAnalyze from Command Line Interface

In addition to the graphical user interface (GUI), AltAnalyze can be easily run by command-line. This includes jobs run locallly, on a remote Linux server or cluster. This works fine given that the user knows the file paths of the directories containing input files, the output directory and has already created files containing the groups and comparisons for all samples analyzed.

Creating Groups and Comparison Files - Creating groups and comparison files is needed beforehand, but is fairly easy. Just follow the directions listed here. This can be done in an automated fashion as well, if input files have a defined naming structure.

Running Command Line from Source or Compiled Versions - The command-line can be run from the source code or OS-specific binaries. The binaries are recommended since these already contain graphical, statistical and webservice dependencies that need to be separately installed for the source code (see more information here).

When running with OS-specific binaries of AltAnalyze directly call the binary files themselves:

  • Windows OS AltAnalyze.exe
  • Mac OS X AltAnalyze.app/Contents/MacOS/AltAnalyze
  • PiPy (pip) installed altanalyze or AltAnalyze
  • Python source code python AltAnalyze.py

Example Options

Downloading and installing a species specific database (mouse)

python AltAnalyze.py --species Mm --update Official --version EnsMart72
  --additional all

Analyzing RNA-Seq files – FASTQ file directory using ICGS Population Discovery

python AltAnalyze.py --runICGS yes --platform "RNASeq" --species Mm 
  --column_method hopach --rho 0.2 --ExpressionCutoff 1 --FoldDiff 4 
  --SamplesDiffering 4 --excludeCellCycle conservative --output "C:/FASTQ_Files/" 
  --expname "Mm_HSCs" --fastq_dir "C:/FASTQ_Files/" --runKallisto yes

Analyzing RNA-Seq files – BAM files using default options and GO-Elite

python AltAnalyze.py --species Hs --platform RNASeq --bedDir "C:/BAMFiles"
  --groupdir "C:/BAMFiles/groups.YourExperiment.txt"
  --compdir "C:/BAMFiles/comps.YourExperiment.txt" --output "C:/BAMFiles"
  --expname "YourExperiment --runGOElite yes" --returnPathways all

Analyzing CEL files – Affymetrix 3’ array using default options and GO-Elite

python AltAnalyze.py --species Mm --platform "3'array" --celdir "C:/CELFiles"
  --groupdir "C:/CELFiles/groups.YourExperiment.txt"
  --compdir "C:/CELFiles/comps.YourExperiment.txt" --output "C:/CELFiles"
  --expname "YourExperiment" --runGOElite yes --returnPathways all

Analyzing RNA-Seq files – TPM text file using ICGS Population Discovery

python AltAnalyze.py --platform RNASeq --species Mm --column_method hopach
  --ExpressionCutoff 1 --FoldDiff 4 --SamplesDiffering 4 --restrictBy protein_coding
  --excludeCellCycle conservative --removeOutliers no --row_method hopach
  --expdir tests/demo_data/Fluidigim_TPM/input/BoneMarrow-scRNASeq.txt
  --output tests/demo_data/Fluidigim_TPM/output/ --rho 0.2
  --runICGS yes --expname BoneMarrow-scRNASeq --column_metric cosine

Analyzing RNA-Seq files – BAM file directory using ICGS Population Discovery

python AltAnalyze.py --platform RNASeq --species Hs --column_method hopach 
  --column_metric cosine --rho 0.2 --removeOutliers no --row_method hopach
  --SamplesDiffering 3 --restrictBy protein_coding --excludeCellCycle no --FoldDiff 4
  --bedDir tests/demo_data/BAM/input/ --expname cancer --ExpressionCutoff 1 
  --output /tests/demo_data/BAM/input/ --runICGS yes

Analyzing RNA-Seq files – 10X Genomics Sparse Matrix file using ICGS Population Discovery

python AltAnalyze.py --platform RNASeq --species Hs --column_method hopach 
  --column_metric cosine --rho 0.2 --removeOutliers no --row_method hopach
  --SamplesDiffering 3 --restrictBy protein_coding --excludeCellCycle no --FoldDiff 4
  --ChromiumSparseMatrix tests/demo_data/10X/input/hg19/matrix.mtx --expname cancer 
  --ExpressionCutoff 1 --output /tests/demo_data/FASTQ/output/ --runICGS yes

Details

Many more additional example workflow analysis options and detailed option descriptions for various AltAnalyze functions are provided in the below links.

Full AltAnalyze Workflows

Pathway Enrichment Analysis and Visualization

Clustering, QC, and Alternative Exons Visualization

File comparison, ID translation and visualization

LineageProfiler and Sample Classification