mT5-small-sum-de-mit-v1
This is a German summarization model based on the multilingual T5 model google/mt5-small. Unlike many other models, it is released under a permissive open source license (MIT), which, among other things, allows commercial use.
This model is provided by the One Conversation team of Deutsche Telekom AG.
Quick Start
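This model summarizes German text out of the box. Because it is a fine-tuned version of google/mt5-small, it can be used like any other mT5 model. Inputs should carry the same "summarize: " source prefix that was used during training.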
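A minimal sketch of how the model might be called with the Hugging Face `transformers` library. The model identifier `deutsche-telekom/mt5-small-sum-de-mit-v1` is an assumption and is not stated in this document; the helper names `build_input` and `summarize` are illustrative. The prefix and the length limits are taken from the training settings in the Training section, and greedy decoding mirrors the "no beams" evaluation setup.

```python
SOURCE_PREFIX = "summarize: "  # source prefix from the training settings
# Assumed Hugging Face model identifier; adjust if the model lives elsewhere.
MODEL_ID = "deutsche-telekom/mt5-small-sum-de-mit-v1"


def build_input(text: str) -> str:
    """Collapse whitespace and prepend the source prefix."""
    return SOURCE_PREFIX + " ".join(text.split())


def summarize(text: str, max_source_length: int = 800, max_target_length: int = 96) -> str:
    """Generate a summary with greedy decoding (no beam search)."""
    # Imported lazily so the helper above works without transformers installed.
    from transformers import AutoModelForSeq2SeqLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(MODEL_ID)
    model = AutoModelForSeq2SeqLM.from_pretrained(MODEL_ID)
    inputs = tokenizer(
        build_input(text),
        max_length=max_source_length,
        truncation=True,
        return_tensors="pt",
    )
    output_ids = model.generate(**inputs, max_length=max_target_length, num_beams=1)
    return tokenizer.decode(output_ids[0], skip_special_tokens=True)
```

Calling `summarize("...")` with a German article downloads the weights on first use. For repeated calls, loading the tokenizer and model once outside the function would be the more economical design.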
Features
- German Summarization: Specifically designed for German text summarization.
- Permissive License: Licensed under the MIT license, allowing commercial use.
Documentation
Training
The training was conducted with the following hyperparameters:
- Base Model: google/mt5-small
- Source Prefix: "summarize: "
- Batch Size: 3 (6)
- Max Source Length: 800
- Max Target Length: 96
- Warmup Ratio: 0.3
- Number of Train Epochs: 10
- Gradient Accumulation Steps: 2
- Learning Rate: 5e-5
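The settings above can be collected in a small configuration sketch. It reads the "(6)" next to the batch size as the effective batch size after gradient accumulation on a single device, which is an assumption; the dictionary keys and the helper names `effective_batch_size` and `schedule` are illustrative, not part of any released training script.

```python
import math

# Values copied from the hyperparameter list above.
HYPERPARAMS = {
    "base_model": "google/mt5-small",
    "source_prefix": "summarize: ",
    "batch_size": 3,
    "max_source_length": 800,
    "max_target_length": 96,
    "warmup_ratio": 0.3,
    "num_train_epochs": 10,
    "gradient_accumulation_steps": 2,
    "learning_rate": 5e-5,
}


def effective_batch_size(hp: dict) -> int:
    """Examples per optimizer step, assuming a single device."""
    return hp["batch_size"] * hp["gradient_accumulation_steps"]


def schedule(hp: dict, num_train_examples: int) -> tuple:
    """Return (total optimizer steps, warmup steps) for a given dataset size."""
    steps_per_epoch = math.ceil(num_train_examples / effective_batch_size(hp))
    total_steps = steps_per_epoch * hp["num_train_epochs"]
    warmup_steps = round(total_steps * hp["warmup_ratio"])
    return total_steps, warmup_steps
```

Under this reading, 3 examples per forward pass times 2 accumulation steps gives 6 examples per optimizer step, and the learning rate warms up over the first 30% of all steps.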
Datasets and Preprocessing
The datasets were preprocessed as follows:
The summary was tokenized with the google/mt5-small tokenizer. Then only the records with no more than 94 summary tokens were selected.
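The length filter described above could look roughly like this. The record layout (a dict with a `"summary"` key) and the function name are hypothetical, and the token counter is passed in as a callable so the sketch does not depend on a particular tokenizer.

```python
from typing import Callable, Dict, Iterable, List

MAX_SUMMARY_TOKENS = 94  # threshold stated above


def filter_by_summary_length(
    records: Iterable[Dict[str, str]],
    count_tokens: Callable[[str], int],
    max_tokens: int = MAX_SUMMARY_TOKENS,
) -> List[Dict[str, str]]:
    """Keep only records whose summary has at most max_tokens tokens."""
    return [r for r in records if count_tokens(r["summary"]) <= max_tokens]
```

With `transformers`, the counter could be `lambda s: len(tokenizer(s)["input_ids"])` for a tokenizer loaded from google/mt5-small. Whether the limit of 94 counts special tokens such as the end-of-sequence token is not stated here, so that detail is left open.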
This model was trained on the Swisstext dataset. We have permission to use this dataset and to release the resulting summarization model under the MIT license (see permission-declaration-swisstext.pdf).
Evaluation on MLSUM German Test Set (no beams)
License
Copyright (c) 2021 Philip May, Deutsche Telekom AG
Licensed under the MIT License (the "License"); you may not use this work except in compliance with the License. You may obtain a copy of the License by reviewing the file LICENSE in the repository.