D

Dna2vec

Developed by roychowdhuryresearch
DNA sequence embedding model based on Transformer architecture, supporting sequence alignment and genomics applications
Downloads 557
Release Time : 2/16/2025

Model Overview

DNA2Vec is an innovative DNA sequence embedding model that uses Transformer architecture to map DNA sequences into a shared vector space, enabling efficient similarity search and sequence alignment.

Model Features

Reference-Free Genome Embedding
Innovatively achieves DNA sequence embedding without requiring a reference genome
Contrastive Loss Training
Uses self-supervised contrastive loss to ensure robust sequence similarity learning
Dual Version Support
Provides both Hugging Face model and locally trainable versions
Efficient Vector Search
Transforms whole-genome alignment into a local search problem through DNA vector databases

Model Capabilities

DNA sequence vectorization
Sequence similarity calculation
Cross-species sequence alignment
Genomic variant detection

Use Cases

Genomics Research
Sequence Alignment
Efficiently aligns sequencing reads with reference genomes
High-quality read recall rate >99%
Cross-Species Analysis
Analyzes DNA sequence similarity across different species
Successfully aligned sequences from species like Thermus aquaticus and Rattus norvegicus
Bioinformatics Tools
Variant Detection
Identifies insertions and deletions in DNA sequences
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase