Description

Annotates carbohydrate-active enzyme (CAZyme) families from protein sequences using protein language model (ESM) embeddings and FAISS-based nearest-neighbour search. Performs three-level hierarchical classification: binary CAZyme detection (Level 0), CAZy class assignment (Level 1), and CAZy family assignment (Level 2).

Input

Name
Description
Pattern

0 ()

1 ()

0 ()

1 ()

2 ()

Output

Name
Description
Pattern

0 ()

0 ()

0 ()

0 ()

0 ()

0 ()

0 ()

0 ()

0 ()

0 ()

0 ()

Tools

caalm Documentation

CAALM (Carbohydrate Activity Annotation with protein Language Models) predicts CAZyme class and family membership from protein FASTA sequences using ESM-based embeddings and FAISS nearest-neighbour retrieval.

License: MIT