B

Bart Base Cantonese

Developed by Ayaka
This is a Cantonese model based on the base version of BART, obtained through second-phase pre-training on the LIHKG dataset.
Downloads 42
Release Time : 10/25/2022

Model Overview

This model is primarily used for Cantonese masked language modeling tasks and can generate coherent Cantonese sentences.

Model Features

Cantonese support
Specially trained for Cantonese, capable of understanding and generating authentic Cantonese text.
Based on BART architecture
Adopts the BART base architecture, featuring powerful sequence-to-sequence modeling capabilities.
Second-phase pre-training
Second-phase pre-training on the LIHKG dataset enhances the model's understanding of Cantonese.

Model Capabilities

Text generation
Masked language modeling
Cantonese text processing

Use Cases

Text processing
Cantonese sentence completion
Automatically completes incomplete Cantonese sentences
Example: Input '聽日就要返香港,我激動到[MASK]唔着', output '聽日就要返香港,我激動到瞓唔着'
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase