N

Nucleotide Transformer V2 50m Multi Species

Developed by InstaDeepAI
The Nucleotide Transformer is a set of foundational language models pre-trained on whole-genome DNA sequences, integrating genomic data from over 3,200 human genomes and 850 diverse species.
Downloads 18.72k
Release Time : 7/27/2023

Model Overview

This model is a transformer with 50 million parameters, designed to process DNA sequences and is more accurate than existing methods in molecular phenotype prediction.

Model Features

Multi-genome integration
Integrates genomic data from over 3,200 different human genomes and 850 genomes from diverse species
High-precision prediction
More accurate than existing methods in molecular phenotype prediction
Large-scale pre-training
Pre-trained on a dataset of 174 billion nucleotides (approximately 29 billion tokens)

Model Capabilities

DNA sequence embedding
Molecular phenotype prediction
DNA sequence feature extraction

Use Cases

Genomics research
Molecular phenotype prediction
Predict molecular phenotype features associated with DNA sequences
More accurate than existing methods
Genomic comparative analysis
Compare genomic sequence features across species
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase