Faster Whisper Large V3 Zh TW
This model is a CTranslate2 format model converted from JacobLinCool/whisper-large-v3-turbo-common_voice_19_0-zh-TW, used to achieve efficient automatic speech recognition in the faster-whisper library.
Downloads 120
Release Time : 12/21/2024
Model Overview
This is an automatic speech recognition (ASR) model optimized for Traditional Chinese (zh-TW). By converting it to the CTranslate2 format, it can achieve faster inference speed in the faster-whisper library.
Model Features
Efficient Inference
Through CTranslate2 format conversion, achieve faster speech recognition speed in the faster-whisper library
Optimized for Traditional Chinese
Specifically optimized for Traditional Chinese speech recognition
Easy to Use
Provide a simple API interface, and speech recognition function can be implemented with just a few lines of code
Model Capabilities
Speech to Text
Automatic Speech Recognition
Support for audio file processing
Use Cases
Speech Transcription
Meeting Record Transcription
Automatically convert meeting recordings into text records
Media Content Subtitle Generation
Automatically generate Traditional Chinese subtitles for video or podcast content
Voice Assistant
Traditional Chinese Voice Command Recognition
Used for voice command recognition in Traditional Chinese voice assistant applications
Featured Recommended AI Models
Qwen2.5 VL 7B Abliterated Caption It I1 GGUF
Apache-2.0
Quantized version of Qwen2.5-VL-7B-Abliterated-Caption-it, supporting multilingual image description tasks.
Image-to-Text
Transformers Supports Multiple Languages

Q
mradermacher
167
1
Nunchaku Flux.1 Dev Colossus
Other
The Nunchaku quantized version of the Colossus Project Flux, designed to generate high-quality images based on text prompts. This model minimizes performance loss while optimizing inference efficiency.
Image Generation English
N
nunchaku-tech
235
3
Qwen2.5 VL 7B Abliterated Caption It GGUF
Apache-2.0
This is a static quantized version based on the Qwen2.5-VL-7B model, focusing on image captioning generation tasks and supporting multiple languages.
Image-to-Text
Transformers Supports Multiple Languages

Q
mradermacher
133
1
Olmocr 7B 0725 FP8
Apache-2.0
olmOCR-7B-0725-FP8 is a document OCR model based on the Qwen2.5-VL-7B-Instruct model. It is fine-tuned using the olmOCR-mix-0225 dataset and then quantized to the FP8 version.
Image-to-Text
Transformers English

O
allenai
881
3
Lucy 128k GGUF
Apache-2.0
Lucy-128k is a model developed based on Qwen3-1.7B, focusing on proxy-based web search and lightweight browsing, and can run efficiently on mobile devices.
Large Language Model
Transformers English

L
Mungert
263
2
Š 2025AIbase