Deberta Xxlarge Fixed
This is an updated version of deberta-v2-xxlarge that implements the AutoModelForCausalLM class, enabling text generation. It fixes three issues in the original implementation: incorrect output embedding weight names, the Enhanced Mask Decoder (EMD) implementation, and the handling of position embedding truncation.
Downloads 203
Release Time: 4/22/2024
Model Overview
An improved version of DeBERTa-v2-xxlarge that adds text generation and fixes several issues in the original implementation. It is suitable for natural language processing tasks.
Model Features
Fixed Original Implementation Issues
Corrected the output embedding weight names in the checkpoint files, fixed the implementation of the Enhanced Mask Decoder (EMD), and handled position embedding truncation so that long sequences are supported.
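The description above does not say how position embedding truncation is handled. One common approach for models with a fixed-size relative position table is to clamp every relative offset into the range the table covers, so that sequences longer than the table never index out of bounds. The sketch below illustrates that general idea only; the function name and the `max_distance` parameter are hypothetical and are not taken from this model's code.

```python
from typing import List


def clamp_relative_positions(seq_len: int, max_distance: int) -> List[List[int]]:
    """Map each (query i, key j) pair to an index into a relative position
    table with 2 * max_distance rows.

    Illustrative only: offsets i - j are clamped to
    [-max_distance, max_distance - 1] and then shifted to be non-negative.
    """
    indices = []
    for i in range(seq_len):
        row = []
        for j in range(seq_len):
            offset = i - j
            # Clamp offsets that fall outside the table's range.
            clamped = max(-max_distance, min(max_distance - 1, offset))
            # Shift so the smallest offset maps to row 0 of the table.
            row.append(clamped + max_distance)
        indices.append(row)
    return indices
```

With this scheme, all token pairs farther apart than `max_distance` share the same table row, which keeps lookups valid for any sequence length at the cost of no longer distinguishing very distant positions.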
Supports Text Generation
Implemented the AutoModelForCausalLM class, enabling the DeBERTa model, originally designed only for mask filling, to generate text.
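How the wrapper turns a mask-filling model into a generator is not described here. One simple way to do it, shown purely as an assumption, is to append a mask token to the text, ask the model to fill it, append the prediction, and repeat. In the sketch below, `fill_mask` is a hypothetical callable standing in for the model, and the `[MASK]` and `[SEP]` token strings are illustrative.

```python
from typing import Callable, List, Optional

MASK = "[MASK]"  # illustrative mask token string


def generate_with_mask_filler(
    prompt_tokens: List[str],
    fill_mask: Callable[[List[str]], str],
    max_new_tokens: int,
    stop_token: Optional[str] = None,
) -> List[str]:
    """Greedy left-to-right generation driven by a mask-filling predictor.

    fill_mask receives the tokens so far with one trailing MASK and returns
    the predicted token for that masked position.
    """
    tokens = list(prompt_tokens)
    for _ in range(max_new_tokens):
        next_token = fill_mask(tokens + [MASK])
        if stop_token is not None and next_token == stop_token:
            break
        tokens.append(next_token)
    return tokens


# Toy predictor: looks only at the token just before the mask.
_TOY_TABLE = {"the": "cat", "cat": "sat"}


def toy_fill_mask(tokens: List[str]) -> str:
    return _TOY_TABLE.get(tokens[-2], "[SEP]")


completion = generate_with_mask_filler(["the"], toy_fill_mask, 5, stop_token="[SEP]")
```

A real predictor would run the model on the token IDs and take the most likely token at the masked position; the loop itself would stay the same.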
Large Model Capacity
Based on the DeBERTa-v2-xxlarge architecture, it is a large-scale model with 1.5 billion parameters.
Model Capabilities
Text Generation
Mask Filling
Language Understanding
Machine Translation
Use Cases
Natural Language Processing
Text Completion
Generates complete sentences or paragraphs based on given partial text.
Machine Translation
Translates text between languages, for example from German to English, and can produce fluent and accurate translations.
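The German-to-English use case can be sketched with the Hugging Face transformers library, which provides `AutoModelForCausalLM`. Several details below are assumptions: the prompt layout (`German: ...` followed by `English:`), the need for `trust_remote_code=True`, and the decoding settings. The model repository ID is not given on this page, so the caller must supply it as `model_id`. The `translate` function needs `transformers` and `torch` installed and downloads the full 1.5 billion parameter model.

```python
def build_translation_prompt(
    text: str, source_lang: str = "German", target_lang: str = "English"
) -> str:
    """Build a two-line prompt that ends where the translation should start.

    The layout is an assumed convention, not a documented requirement.
    """
    return f"{source_lang}: {text.strip()}\n{target_lang}:"


def translate(model_id: str, text: str, max_new_tokens: int = 64) -> str:
    """Generate a translation with greedy decoding.

    model_id is supplied by the caller; no repository name is assumed here.
    """
    # Imported lazily so the prompt helper works without transformers installed.
    from transformers import AutoModelForCausalLM, AutoTokenizer

    # trust_remote_code=True is assumed to be needed for a custom model class.
    tokenizer = AutoTokenizer.from_pretrained(model_id, trust_remote_code=True)
    model = AutoModelForCausalLM.from_pretrained(model_id, trust_remote_code=True)
    model.eval()

    prompt = build_translation_prompt(text)
    input_ids = tokenizer(prompt, return_tensors="pt").input_ids
    output_ids = model.generate(
        input_ids, max_new_tokens=max_new_tokens, do_sample=False
    )
    # Keep only the tokens produced after the prompt.
    new_tokens = output_ids[0, input_ids.shape[1]:]
    return tokenizer.decode(new_tokens, skip_special_tokens=True).strip()
```

Calling `build_translation_prompt("Hallo, wie geht es Ihnen heute?")` yields a prompt whose last line is `English:`, so the model's continuation is read as the translation. Prepending a few translated example pairs in the same layout is a common way to steer the output format.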