Medra4b-GGUF Open-Source Medical Language Model - Free Deployment, Supports English and Romanian Text Generation and Summarization

Medra4b GGUF

Developed by mradermacher

Medra4b is a quantized language model specialized in the medical field, supporting English and Romanian, with a focus on text generation and summarization tasks.

Large Language Model

Transformers

Supports Multiple LanguagesOpen Source License:Apache-2.0 #Medical Text Generation #Multilingual Medical AI #Dermatology Summarization

Downloads 152

Release Time : 5/25/2025

Model Overview

Medra4b is a language model tailored for the medical field, particularly suited for dermatology-related text generation and summarization tasks. Fine-tuned to support English and Romanian, it offers multiple quantized versions to accommodate various hardware requirements.

Model Features

Multilingual Support

Supports English and Romanian, ideal for multilingual medical text processing.

Multiple Quantized Versions

Offers various quantized versions from Q2_K to f16 to meet different hardware and performance needs.

Medical Domain Optimization

Fine-tuned specifically for the medical field (especially dermatology), making it suitable for professional medical text generation and summarization.

Model Capabilities

Text Generation

Medical Text Summarization

Multilingual Processing

Use Cases

Medical Text Processing

Dermatology Report Generation

Generates detailed medical reports based on input descriptions of dermatological symptoms.

Medical Literature Summarization

Automatically summarizes lengthy medical literature to extract key information.

🚀 Medra4b Quantized Model

This project provides static quantizations of the Medra4b model, offering various quantized versions for different use - cases.

🚀 Quick Start

If you are new to using this quantized model, the following sections will guide you through its basic information, usage, and available quantized versions.

✨ Features

Multi - language Support: Supports both English and Romanian.
Multiple Use - cases: Suitable for text generation, medical AI, summarization, dermatology, and more.
Fine - tuned: Based on the Gemma - 3 architecture and fine - tuned for better performance.

📦 Installation

No specific installation steps are provided in the original document.

📚 Documentation

📄 Model Information

Property	Details
Base Model	drwlf/Medra4b
Datasets	nicoboss/medra - medical
Language	English, Romanian
Library Name	transformers
License	apache - 2.0
Quantized By	mradermacher
Tags	text - generation, medical - ai, summarization, dermatology, gemma - 3, fine - tuned

🔍 About

These are static quants of https://huggingface.co/drwlf/Medra4b. Weighted/imatrix quants are available at [https://huggingface.co/mradermacher/Medra4b - i1 - GGUF](https://huggingface.co/mradermacher/Medra4b - i1 - GGUF).

💻 Usage Examples

If you are unsure how to use GGUF files, refer to one of [TheBloke's READMEs](https://huggingface.co/TheBloke/KafkaLM - 70B - German - V0.1 - GGUF) for more details, including on how to concatenate multi - part files.

📋 Provided Quants

(sorted by size, not necessarily quality. IQ - quants are often preferable over similar sized non - IQ quants)

Link	Type	Size/GB	Notes
[GGUF](https://huggingface.co/mradermacher/Medra4b - GGUF/resolve/main/Medra4b.Q2_K.gguf)	Q2_K	1.8
[GGUF](https://huggingface.co/mradermacher/Medra4b - GGUF/resolve/main/Medra4b.Q3_K_S.gguf)	Q3_K_S	2.0
[GGUF](https://huggingface.co/mradermacher/Medra4b - GGUF/resolve/main/Medra4b.Q3_K_M.gguf)	Q3_K_M	2.2	lower quality
[GGUF](https://huggingface.co/mradermacher/Medra4b - GGUF/resolve/main/Medra4b.Q3_K_L.gguf)	Q3_K_L	2.3
[GGUF](https://huggingface.co/mradermacher/Medra4b - GGUF/resolve/main/Medra4b.IQ4_XS.gguf)	IQ4_XS	2.4
[GGUF](https://huggingface.co/mradermacher/Medra4b - GGUF/resolve/main/Medra4b.Q4_K_S.gguf)	Q4_K_S	2.5	fast, recommended
[GGUF](https://huggingface.co/mradermacher/Medra4b - GGUF/resolve/main/Medra4b.Q4_K_M.gguf)	Q4_K_M	2.6	fast, recommended
[GGUF](https://huggingface.co/mradermacher/Medra4b - GGUF/resolve/main/Medra4b.Q5_K_S.gguf)	Q5_K_S	2.9
[GGUF](https://huggingface.co/mradermacher/Medra4b - GGUF/resolve/main/Medra4b.Q5_K_M.gguf)	Q5_K_M	2.9
[GGUF](https://huggingface.co/mradermacher/Medra4b - GGUF/resolve/main/Medra4b.Q6_K.gguf)	Q6_K	3.3	very good quality
[GGUF](https://huggingface.co/mradermacher/Medra4b - GGUF/resolve/main/Medra4b.Q8_0.gguf)	Q8_0	4.2	fast, best quality
[GGUF](https://huggingface.co/mradermacher/Medra4b - GGUF/resolve/main/Medra4b.f16.gguf)	f16	7.9	16 bpw, overkill

Here is a handy graph by ikawrakow comparing some lower - quality quant types (lower is better):

And here are Artefact2's thoughts on the matter: https://gist.github.com/Artefact2/b5f810600771265fc1e39442288e8ec9

❓ FAQ / Model Request

See https://huggingface.co/mradermacher/model_requests for some answers to questions you might have and/or if you want some other model quantized.

🙏 Thanks

I thank my company, nethype GmbH, for letting me use its servers and providing upgrades to my workstation to enable this work in my free time. Additional thanks to @nicoboss for giving me access to his private supercomputer, enabling me to provide many more imatrix quants, at much higher quality, than I would otherwise be able to.

Featured Recommended AI Models

Empowering the Future, Your AI Solution Knowledge Base

English 简体中文繁體中文にほんご