C

Caduceus Ps Seqlen 131k D Model 256 N Layer 16

Developed by kuleshov-group
Caduceus-PS is a DNA sequence modeling model with reverse-complement equivariance, designed for processing long sequences.
Downloads 2,618
Release Time : 2/29/2024

Model Overview

This model is used for masked language modeling of DNA sequences, capable of handling long sequences and supporting reverse-complement equivariance.

Model Features

Reverse-complement equivariance
The model has reverse-complement equivariance and does not require RC data augmentation during training.
Long sequence processing
Supports processing DNA sequences up to 131,072 base pairs in length.
Efficient training
Pre-trained on the human reference genome for 50k steps, with each step containing approximately 1 million base pairs/tokens.

Model Capabilities

DNA sequence modeling
Long sequence processing
Masked language modeling

Use Cases

Genome research
DNA sequence prediction
Used to predict masked portions in DNA sequences.
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase