G

Galactica 125m

Developed by facebook
GALACTICA is a series of language models trained on large-scale scientific corpora, specializing in scientific task processing.
Downloads 193.82k
Release Time : 11/16/2022

Model Overview

The GALACTICA model is designed to perform scientific tasks, including citation prediction, scientific Q&A, mathematical reasoning, summarization, document generation, molecular property prediction, and entity extraction.

Model Features

Scientific specialization training
Trained on 106 billion tokens of open-source scientific texts and data, covering professional content such as papers, textbooks, and scientific websites.
Multimodal support
Supports processing scientific data formats such as SMILES molecular formulas and amino acid sequences.
Low-toxicity output
Exhibits significantly lower toxicity rates compared to other large language models.

Model Capabilities

Citation prediction
Scientific Q&A
Mathematical reasoning
Summarization
Document generation
Molecular property prediction
Entity extraction

Use Cases

Academic research
Literature citation prediction
Predicts potential literature citations for given text passages.
Larger models exhibit citation behaviors close to real-world patterns.
Scientific concept explanation
Generates explanatory notes for scientific terms and concepts.
Trained on high-quality academic corpora, explanations demonstrate professionalism.
Education
Mathematical problem-solving
Solves physics and mathematics problems.
Capable of handling complex problems involving formulas and calculations.
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase