Skip to main content

Synopsis

hhmake -i <file> -o <file> [options]

Description

HHmake builds an HMM from an input alignment in A2M, A3M, or FASTA format, or converts between HMMER format (.hmm) and HHsearch format (.hhm).

Required Parameters

-i
file
required
Query alignment (A2M, A3M, or FASTA), or query HMM

Output Options

-o
file
HMM file to be written to (default: <infile.hhm>)
-a
file
HMM file to be appended to
-v
integer
default:"2"
Verbose mode:
  • 0: no screen output
  • 1: only warnings
  • 2: verbose
-seq
integer
default:"10"
Max. number of query/template sequences displayed. Beware of overflows! All these sequences are stored in memory.
-add_cons
flag
Make consensus sequence master sequence of query MSA
-name
string
Use this name for HMM (default: use name of first sequence)

Filter Query Multiple Sequence Alignment

-id
integer
default:"90"
Maximum pairwise sequence identity (%) (range: 0-100)
-diff
integer
default:"100"
Filter MSA by selecting most diverse set of sequences, keeping at least this many seqs in each MSA block of length 50
-cov
integer
default:"0"
Minimum coverage with query (%) (range: 0-100)
-qid
integer
default:"0"
Minimum sequence identity with query (%) (range: 0-100)
-qsc
float
default:"-20.0"
Minimum score per column with query
-neff
float
Target diversity of alignment (default: off, range: 1-inf)

Input Alignment Format

-M
string
default:"a2m"
Input alignment format:
  • a2m or a3m: upper case = Match; lower case = Insert; ’-’ = Delete; ’.’ = gaps aligned to inserts
  • first: FASTA format where columns with residue in 1st sequence are match states
  • [0-100]: FASTA format where columns with fewer than X% gaps are match states

Pseudocount Options

Context-specific HHM pseudocounts

-pc_hhm_contxt_mode
integer
default:"0"
Position dependence of pc admixture ‘tau’ (pc mode):
  • 0: no pseudo counts: tau = 0
  • 1: constant: tau = a
  • 2: diversity-dependent: tau = a/(1+((Neff[i]-1)/b)^c)
  • 3: CSBlast admixture: tau = a(1+b)/(Neff[i]+b)
-pc_hhm_contxt_a
float
default:"0.9"
Overall pseudocount admixture (range: 0-1)
-pc_hhm_contxt_b
float
default:"4.0"
Neff threshold value for mode 2 (range: 1-inf)
-pc_hhm_contxt_c
float
default:"1.0"
Extinction exponent c for mode 2 (range: 0-3)

Context-independent HHM pseudocounts

-pc_hhm_nocontxt_mode
integer
default:"2"
Position dependence of pc admixture ‘tau’ (used for templates; used for query if contxt file is not available)
-pc_hhm_nocontxt_a
float
default:"1.0"
Overall pseudocount admixture (range: 0-1)
-pc_hhm_nocontxt_b
float
default:"1.5"
Neff threshold value for mode 2 (range: 1-inf)
-pc_hhm_nocontxt_c
float
default:"1.0"
Extinction exponent c for mode 2 (range: 0-3)

Context-specific pseudocounts

-nocontxt
flag
Use substitution-matrix instead of context-specific pseudocounts
-contxt
file
Context file for computing context-specific pseudocounts

Other Options

-maxseq
integer
default:"65535"
Max number of input rows
-maxres
integer
default:"20000"
Max number of HMM columns

Examples

Build HMM from alignment

hhmake -i test.a3m -o stdout

Build HMM with custom name

hhmake -i alignment.a3m -o model.hhm -name MyProtein

Exit Codes

  • 0: Success
  • 1: File format error
  • 2: File access error
  • 3: Memory error
  • 4: Command line error

See Also

Build docs developers (and LLMs) love