MetaCerberus Options

Important

If the MetaCerberus environment is not used, make sure the dependencies are in PATH or specified in the config file.

Run metacerberus.py with the options required for your project.

Usage of `metacerberus.py`:

Note

The following are different options/arguments to modify the execution of MetaCerberus.

• Setup arguments:

Argument/Option	Function [Default]	Usage Format	Accepted format	Example (Type as one line)
`--setup`	Setup additional dependencies [False]	`--setup`	N/A	`metacerberus.py --setup`
`--update`	Update downloaded databases [False]	`--update`	N/A	`metacerberus.py --update`
`--list-db`	List available and downloaded databases [False]	`--list-db`	N/A	`metacerberus.py --list-db`
`--download`	Downloads selected HMMs. Use the option `--list-db` for a list of available databases, default is to download all available databases	`--download [DOWNLOAD ...]`	`--download [.HMM FILE]`	`--download path/to/example/directory.hmm`
`--uninstall`	Remove downloaded databases and FragGeneScan+ [False]	`--uninstall`	N/A	`metacerberus.py --uninstall`

• Input File Arguments:

Important

At least one sequence is required.

Accepted formats: [.fastq, .fq, .fasta, .fa, .fna, .ffn, .faa]

Example:

metacerberus.py --prodigal file1.fasta
metacerberus.py --config file.config

If a sequence is given in [.fastq, .fq] format, one of --nanopore, --illumina, or --pacbio is required.:

Option format interpretation:

--setup = accepts no additional options
--download DOWNLOAD = accepts one option, (represented by capitalized command 'DOWNLOAD')
--fraggenescan FRAGGENESCAN [FRAGGENESCAN...] = accepts one or greater options (represented by capitalized commands)

Argument/Option	Function	Usage Format	Accepted format	# Options Accepted	Example (Type as one line)
`-c` or `--config`	Path to config file, command line takes priority	`-c CONFIG` or `--config CONFIG`	Path to config file	1	`-c path/to/config/file`
`--prodigal`	Prokaryote nucleotide sequence (includes microbes, bacteriophage)	`--prodigal PRODIGAL [PRODIGAL ...]`	Sequence file	=>1	`--prodigal FILE1 FILE2...`
`--fraggenescan`	Eukaryote nucleotide sequence (includes other viruses, works all around for everything)	`--fraggenescan FRAGGENESCAN [FRAGGENESCAN ...]`	Sequence file	=>1	`--fraggenescan FILE1 FILE2...`
`--super`	Run sequence in both `--prodigal` and `--fraggenescan` modes	`--super SUPER [SUPER ...]`	Sequence file	=>1	`--super FILE1 FILE2...`
`--prodigalgv`	Giant virus nucleotide sequence	`--prodigalgv PRODIGALGV [PRODIGALGV ...]`	Sequence file	=>1	`--prodigalgv FILE1 FILE2...`
`--phanotate`	Phage sequence	`--phanotate PHANOTATE [PHANOTATE ...]`	Sequence file	=>1	`--phanotate FILE1 FILE2...`
`--protein` or `--amino`	Protein Amino Acid sequence	`--protein PROTEIN [PROTEIN ...]` or `--amino PROTEIN [PROTEIN ...]`	Sequence file	=>1	`--protein FILE1 FILE2...` or `--amino FILE1 FILE2...`
`--hmmer-tsv`	Annotations tsv file from HMMER (experimental)	`--hmmer-tsv HMMER_TSV [HMMER_TSV ...]`	Sequence file	=>1	`--hmmer-tsv FILE1 FILE2...`
`--class`	path to a tsv file which has class information for the samples. If this file is included, scripts will be included to run Pathview in R	`--class CLASS`	Path to TSV file	1	`--class TSV_FILE1
`--illumina`	Specifies that the given FASTQ files are from Illumina	`--illumina`	N/A	N/A	`metacerberus.py --illumina`
`--nanopore`	Specifies that the given FASTQ files are from Nanopore	`--nanopore`	N/A	N/A	`metacerberus.py --nanopore`
`--pacbio`	Specifies that the given FASTQ files are from PacBio	`--pacbio`	N/A	N/A	`metacerberus.py --pacbio`

• Output options:

Argument/Option	Function [DEFAULT]	Usage Format	Accepted format	# Options Accepted	Example (Type as one line)
`--dir-out`	path to output directory, defaults to "results-metacerberus" in current directory. [./results-metacerberus]	`--dir-out DIR_OUT`	output file path	1	`--dir-out path/to/output/file`
`--replace`	Flag to replace existing files. [False]	`--replace`	`metacerberus.py` option	N/A	`metacerberus.py --replace`
`--keep`	Flag to keep temporary files. [False]	`--keep`	`metacerberus.py` option	N/A	`metacerberus.py --keep`
`--tmpdir`	Temp directory for RAY (experimental) [system tmp dir]	`--tmpdir TMPDIR`	`metacerberus.py` option	1	`--tmpdir TEMPFILE1`

• Database options:

Argument/Option	Function [DEFAULT]	Usage Format	Accepted format	# Options Accepted	Example (Type as one line)
`--hmm`	A list of databases for HMMER. Use the option `--list-db` for a list of available databases [KOFam_all]	`--hmm HMM [HMM ...]`	`metacerberus.py` option	=>1	`metacerberus.py --hmm DATABASE1 DATABASE2...`
`--db-path`	Path to folder of databases [Default: under the library path of MetaCerberus]	`--db-path DB_PATH`	path to databases folder	1	`--db-path path/to/databases/folder`