Datasets & Models | FLAIR Lab

Model

A family of efficient protein language models pre-trained on large-scale sequence data.

Protein LM ESM-compatible

Model

Structure-aligned variants of AMPLIFY, enriched with protein structural knowledge through lightweight post-training.

Protein LM ESM-compatible

Model

Coming soon.

Protein LM ESM-compatible

Dataset

A curated large-scale protein sequence dataset built from UniProt, SCOP, and OAS, used to pre-train the AMPLIFY family of models.

Proteins Sequences Pre-training

Dataset

Coming soon.

Proteins Sequences Pre-training

Code

A unified codebase for pre-training, fine-tuning, and evaluating the FLAIR Lab's protein language models.

Python PyTorch Hugging Face