HyenaDNA Large 1M Seqlen HF

Developed by LongSafari
HyenaDNA is a long-range genomic foundation model with a pre-training context length of up to 1 million tokens and single-nucleotide resolution.
Downloads 775
Release Time: 11/3/2023

Model Overview

HyenaDNA is a genomic foundation model that processes sequences of up to 1 million tokens at single-nucleotide resolution.

Model Features

Long-range context processing
Processes sequences of up to 1 million tokens, up to 500 times longer than the contexts of earlier dense-attention Transformer models.
Single-nucleotide resolution
Uses a single-character tokenizer, so each nucleotide maps to its own token.
Efficient training
Trains up to 160 times faster than a Transformer with Flash Attention at a sequence length of 1 million tokens.
Global receptive field
Implicit long convolutions give each layer a global receptive field.
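
To make the global receptive field concrete, here is a minimal sketch in plain Python of a causal convolution whose filter is as long as the input. It is not the model's implementation. The function names implicit_filter and causal_long_conv are hypothetical, and the damped cosine is only a stand-in for the learned filter; it illustrates a filter generated from position rather than stored as free weights. The direct sum below costs O(L^2) and is used only for readability. Long convolutions of this kind are normally evaluated with FFTs in O(L log L).

```python
import math

def implicit_filter(length, decay=0.01, freq=0.3):
    # Hypothetical stand-in for a learned filter: each value is computed
    # from its position t, so the filter can be as long as the sequence.
    return [math.exp(-decay * t) * math.cos(freq * t) for t in range(length)]

def causal_long_conv(u, k):
    # y[t] = sum over s <= t of k[t - s] * u[s].
    # Direct O(L^2) evaluation, kept simple for illustration.
    n = len(u)
    return [sum(k[t - s] * u[s] for s in range(t + 1)) for t in range(n)]

# An impulse at the first position: every later output "sees" it.
u = [0.0] * 64
u[0] = 1.0
k = implicit_filter(len(u))
y = causal_long_conv(u, k)

# The last output is still influenced by the first input.
print(y[-1] != 0.0)
```

Because the filter spans the whole sequence, a single layer lets the final position depend on the first one, which is what "global receptive field" means here.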

Model Capabilities

Genomic sequence analysis
Sequence classification
Long sequence processing
Single-nucleotide resolution analysis
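
Single-nucleotide resolution analysis follows from tokenizing one character at a time. The sketch below shows the idea under stated assumptions: the VOCAB mapping and the tokenize helper are hypothetical and do not reproduce the token ids or special tokens of the released tokenizer.

```python
# Hypothetical vocabulary: one id per nucleotide, with N for unknown bases.
# The released tokenizer's actual ids and special tokens may differ.
VOCAB = {"A": 0, "C": 1, "G": 2, "T": 3, "N": 4}

def tokenize(sequence):
    # One token per character; anything outside A/C/G/T maps to N.
    return [VOCAB.get(base, VOCAB["N"]) for base in sequence.upper()]

ids = tokenize("ACGTNacgt")
print(ids)
```

The token count equals the sequence length, so a 1-million-nucleotide input becomes 1 million tokens and a single-base change alters exactly one token.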

Use Cases

Genomics research
Regulatory element prediction
Predicts the location and function of regulatory elements in the genome.
Set a new state of the art (SotA) on 23 downstream tasks.
Chromatin profiling
Analyzes chromatin structure and function.
Species classification
Classifies species based on genomic sequences.