Hyenadna Small 32k Seqlen Hf

Developed by LongSafari
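HyenaDNA is a long-range genomic foundation model pre-trained at single-nucleotide resolution, with context lengths of up to 1 million tokens across the model family. As its name indicates, this checkpoint is the small variant with a 32k-token sequence length.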
Downloads 2,885
Release Time: 11/3/2023

Model Overview

HyenaDNA is a long-range genomic foundation model built on Hyena operators. It can process context lengths of up to 1 million tokens at single-nucleotide resolution. Because Hyena operators scale sub-quadratically with sequence length, it models genomic sequences more efficiently than attention-based Transformers, whose cost grows quadratically.
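To make the sub-quadratic claim concrete, the sketch below computes a causal long convolution with an FFT in O(L log L) time and checks it against the direct O(L^2) sum. It is a minimal illustration of the general technique, not HyenaDNA's implementation: the real Hyena operator also uses implicitly parameterized filters and gating, both omitted here, and the function names are hypothetical.

```python
import cmath

def fft(x, inverse=False):
    """Recursive radix-2 FFT (unscaled); len(x) must be a power of two."""
    n = len(x)
    if n == 1:
        return list(x)
    even = fft(x[0::2], inverse)
    odd = fft(x[1::2], inverse)
    sign = 1 if inverse else -1
    out = [0j] * n
    for i in range(n // 2):
        tw = cmath.exp(sign * 2j * cmath.pi * i / n) * odd[i]
        out[i] = even[i] + tw
        out[i + n // 2] = even[i] - tw
    return out

def fft_causal_conv(u, k):
    """Causal convolution of u with filter k (len(k) <= len(u)) in O(L log L)."""
    L = len(u)
    n = 1
    while n < 2 * L:  # zero-pad so circular convolution equals linear convolution
        n *= 2
    U = fft(list(u) + [0.0] * (n - L))
    K = fft(list(k) + [0.0] * (n - len(k)))
    y = fft([a * b for a, b in zip(U, K)], inverse=True)
    return [(v / n).real for v in y[:L]]  # divide by n to finish the inverse FFT

def direct_causal_conv(u, k):
    """Reference O(L^2) implementation: y[t] = sum_j k[j] * u[t - j]."""
    return [sum(k[j] * u[t - j] for j in range(min(t + 1, len(k))))
            for t in range(len(u))]

u = [0.5, -1.0, 2.0, 0.25, 3.0, -0.75, 1.5, 0.0]
k = [1.0, 0.5, 0.25, 0.125, 0.0625, 0.0, 0.0, 0.0]
fast = fft_causal_conv(u, k)
slow = direct_causal_conv(u, k)
assert all(abs(a - b) < 1e-9 for a, b in zip(fast, slow))
```

Since the filter here is as long as the input, every output position can depend on every earlier input position, which is the sense in which a long convolution gives a layer a global receptive field.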

Model Features

Ultra-long context processing
Supports context lengths of up to 1 million tokens, up to 500 times longer than previous dense-attention-based genomic models
Single-nucleotide resolution
Uses single-character tokenization for precise modeling at the single-nucleotide level
Efficient training
Trains up to 160x faster than Flash Attention at a sequence length of 1M
Global receptive field
Implicit long convolutions give each layer a global receptive field
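Single-character tokenization, listed above, can be sketched as a one-to-one mapping from nucleotides to token ids. The vocabulary and ids below are assumptions chosen for illustration; the tokenizer released with the model defines its own ids and special tokens.

```python
# Hypothetical vocabulary: ids are illustrative, not those of the released tokenizer.
VOCAB = {"A": 0, "C": 1, "G": 2, "T": 3, "N": 4}
ID_TO_BASE = {i: b for b, i in VOCAB.items()}

def encode(seq):
    """Map each nucleotide to exactly one token id; unknown characters become N."""
    return [VOCAB.get(base, VOCAB["N"]) for base in seq.upper()]

def decode(ids):
    """Map token ids back to an uppercase nucleotide string."""
    return "".join(ID_TO_BASE[i] for i in ids)

tokens = encode("ACGTNacgt")
assert len(tokens) == 9              # one token per nucleotide
assert decode(tokens) == "ACGTNACGT"
```

Because each base is its own token, a single-nucleotide change alters exactly one token, and the sequence length in tokens equals the sequence length in base pairs.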

Model Capabilities

Long-sequence genomic modeling
Regulatory element prediction
Chromatin profile analysis
Species classification
In-context learning
Instruction fine-tuning

Use Cases

Genomic research
Regulatory element prediction
Predicting the location of regulatory elements in the genome
Reported to set a new state of the art (SotA) on 23 downstream tasks, including regulatory element prediction
Species classification
Species classification based on genomic sequences
Biomedical research
Chromatin profile analysis
Analyzing chromatin structural features