Electra Base Gc4 64k 500000 Cased Generator
A massive German language model trained on the cleaned German Common Crawl corpus (GC4), totaling approximately 844GB, which may contain biases.
Downloads 16
Release Time : 3/2/2022
Model Overview
This model is a large language model trained specifically for German, primarily intended for research purposes, especially in bias identification and prevention.
Model Features
Large-scale German Corpus Training
Trained on a cleaned German Common Crawl corpus (GC4) totaling 844GB.
Research-Oriented
Primarily aimed at advancing research on large-scale German pretrained language models, particularly in bias identification and prevention.
Contains Biases
Due to training data sourced from internet-crawled text, the model may encode stereotypical associations related to gender, race, ethnicity, and disability status.
Model Capabilities
German Text Generation
German Text Understanding
Use Cases
Research
Bias Identification Research
Used to identify and prevent bias issues in language models.
German Language Model Research
Advancing research on large-scale German pretrained language models.
Featured Recommended AI Models
Š 2025AIbase