E

Electra Base Gc4 64k 500000 Cased Generator

Developed by stefan-it
A massive German language model trained on the cleaned German Common Crawl corpus (GC4), totaling approximately 844GB, which may contain biases.
Downloads 16
Release Time : 3/2/2022

Model Overview

This model is a large language model trained specifically for German, primarily intended for research purposes, especially in bias identification and prevention.

Model Features

Large-scale German Corpus Training
Trained on a cleaned German Common Crawl corpus (GC4) totaling 844GB.
Research-Oriented
Primarily aimed at advancing research on large-scale German pretrained language models, particularly in bias identification and prevention.
Contains Biases
Due to training data sourced from internet-crawled text, the model may encode stereotypical associations related to gender, race, ethnicity, and disability status.

Model Capabilities

German Text Generation
German Text Understanding

Use Cases

Research
Bias Identification Research
Used to identify and prevent bias issues in language models.
German Language Model Research
Advancing research on large-scale German pretrained language models.
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase