C2S Pythia 410m Cell Type Prediction
C
C2S Pythia 410m Cell Type Prediction
Developed by vandijklab
A cell type prediction model based on the Pythia-410m architecture, fine-tuned by converting scRNA-seq data into 'cell sentences' through the Cell2Sentence method, focusing on cell type prediction in single-cell RNA sequencing data.
Downloads 266
Release Time : 9/3/2024
Model Overview
This model leverages the natural language processing capabilities of large language models, transforming single-cell RNA sequencing data into sequences of gene names via the innovative Cell2Sentence method to achieve cell type prediction.
Model Features
Cell2Sentence Innovative Method
Converts scRNA-seq data into sequences of gene names sorted by expression levels ('cell sentences'), enabling LLMs to adapt to single-cell biology research.
Large-scale Training Data
Trained on over 57 million human and mouse cells from more than 800 single-cell RNA sequencing datasets, covering various tissues and cell types.
Cross-species Applicability
Training data includes human and mouse cells, potentially enabling cross-species cell type prediction.
Model Capabilities
Single-cell RNA sequencing data analysis
Cell type prediction
Gene expression pattern recognition
Use Cases
Biomedical Research
Cell Atlas Construction
Helps researchers quickly classify and annotate cell types in large-scale single-cell sequencing data
Accelerates the cell annotation process for projects like the Human Cell Atlas
Disease Research
Identifies abnormal cell types or states in disease samples
May discover new disease-related cell subpopulations
Featured Recommended AI Models