Q

Qwen GLOCON Reasoning

Developed by shreyasmeher
A reinforcement learning model based on Qwen2.5-3B-Instruct, specifically designed for conflict event classification, optimized using the GRPO method for multi-reward signals and structured reasoning formats.
Downloads 51
Release Time : 2/18/2025

Model Overview

This model is a text classification model optimized via GRPO reinforcement learning, specifically designed to identify and classify social conflict events. It can analyze news reports, identify event triggers, participants, locations, and the nature of violence, and categorize them into one of five predefined classes.

Model Features

GRPO Reinforcement Learning Optimization
Uses the GRPO method to achieve synchronous optimization of multi-reward signals, enforcing structured reasoning formats through reinforcement signals.
Structured XML Output
Forces the model to adhere to a specific XML output format, including detailed reasoning processes and final classification results.
Multilingual Support
Supports conflict event classification in 13 languages.
Memory Optimization
Utilizes 4-bit quantization, gradient checkpointing, and vLLM acceleration for inference, with GPU memory usage capped at 60%.

Model Capabilities

Conflict Event Classification
Structured Reasoning
Multilingual Text Analysis
XML Format Output

Use Cases

Social Research
Civil Conflict Event Classification
Analyzes news reports to identify and classify social events such as protests and armed conflicts.
Accurately categorizes into one of five major event classes.
Academic Research
Transparent Decision Process Analysis
Provides classification results with reasoning processes, facilitating verification in academic research.
Classification results include detailed reasoning steps.
Education
RL Classification Teaching Demo
Serves as a demonstration case for reinforcement learning applications in text classification.
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase