P

Prostt5

Developed by Rostlab
ProstT5 is a protein language model capable of translating between protein sequences and structures.
Downloads 252.91k
Release Time : 7/21/2023

Model Overview

ProstT5 (Protein Structure Sequence T5) is based on ProtT5-XL-U50 and achieves bidirectional translation between protein sequences and 3D structures through fine-tuning. It supports predicting 3D structures from amino acid sequences (folding) and generating amino acid sequences from 3D structures (inverse folding).

Model Features

Bidirectional translation ability
Supports bidirectional translation between protein sequences (AA) and structures (3Di), including folding (AA→3Di) and inverse folding (3Di→AA)
Fine-tuned based on ProtT5-XL-U50
Fine-tuned on 17 million high-quality 3D structure-predicted proteins, inheriting the powerful representation ability of ProtT5-XL-U50
Structural feature extraction
Capable of extracting features from 3D structures represented by 3Di tokens, expanding the functions of traditional protein language models

Model Capabilities

Protein sequence-to-structure translation
Protein structure-to-sequence translation
Protein sequence feature extraction
Protein structure feature extraction

Use Cases

Bioinformatics
Remote homology detection
Combined with Foldseek through the predicted 3Di strings, remote homology detection can be performed without explicitly calculating 3D structures.
Protein design
Generate possible amino acid sequences from 3D structures through inverse folding to assist protein design.
Computational biology
Protein structure prediction
Predict a simplified representation (3Di tokens) of 3D structures from amino acid sequences.
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase