Nucleotide Transformer 500m 1000g

Developed by InstaDeepAI
A 500-million-parameter DNA sequence analysis model pre-trained on 3,202 genetically diverse human genomes
Downloads 8,341
Release Date: 4/4/2023

Model Overview

A Transformer model designed for genomics. The Nucleotide Transformer collection integrates DNA sequence information from over 3,200 human genomes and 850 species; this checkpoint is pre-trained on the human genomes and is aimed at accurate molecular phenotype prediction.

Model Features

Multi-source Genome Integration
Pre-trained on DNA sequence data from 3,202 genetically diverse human genomes; other models in the collection also draw on genomes from 850 species
Large-scale Pre-training
Trained on 300 billion tokens, drawn from a dataset of about 19.212 trillion nucleotides
Precision Prediction Capability
Reported to predict molecular phenotypes more accurately than existing methods
Dual Framework Support
Available in both TensorFlow and PyTorch implementations

Model Capabilities

DNA sequence analysis
Molecular phenotype prediction
Genomic feature extraction
Masked nucleotide prediction
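
To make the masked nucleotide prediction task concrete, the following minimal sketch prepares a masked input in plain Python. It is not the model's tokenizer: the non-overlapping 6-mer split, the 15% mask rate, the `<mask>` string, and the helper names `kmer_tokenize` and `mask_tokens` are all assumptions made for illustration.

```python
import random

MASK = "<mask>"  # assumed mask symbol, for illustration only


def kmer_tokenize(seq, k=6):
    """Split a DNA sequence into non-overlapping k-mers.

    Leftover bases at the end become single-nucleotide tokens
    (an assumption of this sketch, not a documented behavior).
    """
    seq = seq.upper()
    cut = len(seq) - len(seq) % k
    tokens = [seq[i:i + k] for i in range(0, cut, k)]
    tokens.extend(seq[cut:])  # one token per trailing base
    return tokens


def mask_tokens(tokens, mask_rate=0.15, seed=0):
    """Replace a random subset of tokens with MASK.

    Returns the masked token list and a dict mapping each masked
    position to the original token the model would have to predict.
    """
    if not tokens:
        return [], {}
    rng = random.Random(seed)
    n_mask = min(len(tokens), max(1, round(len(tokens) * mask_rate)))
    positions = sorted(rng.sample(range(len(tokens)), n_mask))
    masked = list(tokens)
    labels = {}
    for p in positions:
        labels[p] = masked[p]
        masked[p] = MASK
    return masked, labels


tokens = kmer_tokenize("ATTCCGATTCCGATTCCGATTCCGAT")
masked, labels = mask_tokens(tokens)
print(tokens)
print(masked)
print(labels)
```

A masked language model is trained to recover the entries of `labels` from `masked`; the same forward pass also yields per-token embeddings that can serve as genomic features.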

Use Cases

Genomic Research
Human Genome Variation Analysis
Uses the model to analyze genomic variation across different populations
The training data contains 125 million mutations, 111 million of which are single-nucleotide polymorphisms (SNPs)
Cross-species Genome Comparison
Analyzes conserved regions in DNA sequences of 850 species
Biomedical
Disease-related Gene Prediction
Predicts disease-related gene loci based on DNA sequence features
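
For the variation analysis use case, one common approach with sequence models is to compare the model's outputs on a reference sequence and on the same sequence carrying a variant. The page does not describe a specific procedure, so the sketch below only shows the input preparation step; the helper name `apply_snp` and the example sequence are hypothetical.

```python
def apply_snp(sequence, position, ref, alt):
    """Return the sequence with one base substituted at a 0-based position.

    Raises ValueError if the position is out of range or the reference
    base does not match, which guards against coordinate mistakes.
    """
    sequence = sequence.upper()
    ref, alt = ref.upper(), alt.upper()
    if len(ref) != 1 or len(alt) != 1:
        raise ValueError("a SNP replaces exactly one base with one base")
    if not 0 <= position < len(sequence):
        raise ValueError("position is outside the sequence")
    if sequence[position] != ref:
        raise ValueError(
            f"expected {ref} at position {position}, found {sequence[position]}"
        )
    return sequence[:position] + alt + sequence[position + 1:]


# Hypothetical example sequence and variant.
reference = "ACGTACGTACGTACGTACGT"
alternate = apply_snp(reference, 9, "C", "T")

# Count positions where the two sequences differ.
n_diff = sum(a != b for a, b in zip(reference, alternate))
print(reference)
print(alternate)
print(n_diff)
```

Both sequences would then be passed through the model, and the difference between their embeddings or predicted token probabilities can be used as a signal for the variant's effect.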