Caduceus Ph Seqlen 131k D Model 256 N Layer 16
C
Caduceus Ph Seqlen 131k D Model 256 N Layer 16
Developed by kuleshov-group
Caduceus-Ph is a DNA sequence modeling model based on the MambaDNA architecture, with a hidden dimension of 256 and a 16-layer structure.
Downloads 5,455
Release Time : 2/26/2024
Model Overview
This model is specifically designed for DNA sequence modeling, pre-trained through masked language modeling tasks, and suitable for sequence analysis tasks in the field of bioinformatics.
Model Features
Long sequence processing capability
Supports sequence lengths of up to 131,072 base pairs
Bidirectional equivariant training
Pre-trained with reverse complement (RC) data augmentation to enhance model performance
Efficient inference
Utilizes the MambaDNA architecture, which is more efficient compared to traditional Transformers
Model Capabilities
DNA sequence modeling
Masked language modeling
Long sequence analysis
Use Cases
Genomics research
DNA sequence feature extraction
Extracts meaningful feature representations from long DNA sequences
Genome annotation
Assists in identifying and annotating functional regions of the genome
Featured Recommended AI Models
Š 2025AIbase