D

Deberta Xxlarge Fixed

Developed by ltg
This is an updated version of deberta-v2-xxlarge that implements the AutoModelForCausalLM class, enabling text generation. It fixes three issues in the original implementation, including the output embedding weight names, enhanced mask decoder implementation, and position embedding truncation handling.
Downloads 203
Release Time : 4/22/2024

Model Overview

An improved version based on DeBERTa-v2-xxlarge, supporting text generation functionality and addressing several issues in the original implementation, suitable for natural language processing tasks.

Model Features

Fixed Original Implementation Issues
Corrected the wrong output embedding weight names in checkpoint files, fixed the implementation of the Enhanced Mask Decoder (EMD), and handled position embedding truncation to support long sequences.
Supports Text Generation
Implemented the AutoModelForCausalLM class, enabling the DeBERTa model, originally designed only for mask filling, to generate text.
Large Model Capacity
Based on the DeBERTa-v2-xxlarge architecture, it is a large-scale model with 1.5 billion parameters.

Model Capabilities

Text Generation
Mask Filling
Language Understanding
Machine Translation

Use Cases

Natural Language Processing
Text Completion
Generates complete sentences or paragraphs based on given partial text.
Machine Translation
Translates text between different languages, such as the example translation from German to English.
Can produce fluent and accurate translation results
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase