C

C2S Scale Pythia 1b Pt

Developed by vandijklab
A model pre-trained on the Pythia-1b architecture and fine-tuned on single-cell RNA sequencing data via the Cell2Sentence framework, suitable for various single-cell and multicellular analysis tasks.
Downloads 156
Release Time : 4/16/2025

Model Overview

This model employs the Cell2Sentence method to transform scRNA-seq data into ordered sequences of gene names, enabling large language models to adapt to single-cell biology research and perform a wide range of single-cell and multicellular tasks.

Model Features

Cell2Sentence Framework
Transforms scRNA-seq data into ordered sequences of gene names based on expression levels, enabling LLMs to adapt to single-cell biology research.
Large-scale Training Data
Training data covers over 800 single-cell RNA sequencing datasets, encompassing more than 57 million human and mouse cells.
Multitasking Capability
Capable of performing various tasks, including single-cell and multicellular analysis, gene set analysis, and more.
Extended Context Length
Extends the default Pythia model's context length to 8192 tokens via rotary position embedding technology.

Model Capabilities

Unconditional single-cell generation
Cell type prediction
Cell type conditional generation
Unconditional multicellular generation
Tissue origin prediction
Tissue conditional multicellular generation
Cell type conditional multicellular generation
Multicellular to summary generation
Summary to multicellular generation
Gene set name to gene generation
Gene to gene set name generation

Use Cases

Biomedical Research
Single-cell RNA Sequencing Data Analysis
Analyze single-cell RNA sequencing data to predict cell types or states.
Cell Generation
Generate simulated single-cell or multicellular data based on specific conditions.
Gene Set Analysis
Generate gene lists from gene set names or vice versa.
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase