R

Rootsignals Judge Llama 70B

Developed by root-signals
Root Judge is a powerful medium-sized large language model designed for reliable and customizable LLM system evaluation. Fine-tuned based on Llama-3.3-70B-Instruct, it excels in pairwise preference judgment and multi-round instruction following tasks with source references.
Downloads 620
Release Time : 2/5/2025

Model Overview

Root Judge is a medium-sized model focused on large language model evaluation, performing excellently in hallucination detection and instruction following, and supporting local deployment and low-cost applications.

Model Features

High-performance hallucination detection
Detect context-related hallucinations in RAG settings, outperforming leading closed-source models
Powerful instruction following ability
Performs excellently in various benchmark tests and supports complex user-defined scoring criteria
Low-cost and efficient deployment
FP8 weights are provided for free, suitable for research and commercial applications, with costs only a fraction of similar models
Long context support
Can handle long inputs up to 32k tokens and provide detailed structured justifications
Local deployment support
Suitable for privacy-sensitive scenarios and supports running in a local environment

Model Capabilities

Large language model evaluation
Hallucination detection
Instruction following evaluation
Preference judgment
Structured output generation
Long context processing

Use Cases

Model evaluation
RAG system hallucination detection
Detect context-related hallucinations in retrieval-augmented generation systems
Achieved an 86.3% pass rate on the HaluBench test set
Instruction following evaluation
Evaluate the model's ability to follow complex instructions
Performed excellently in benchmark tests such as IFEval
Content moderation
Political content recognition
Identify politically relevant content and terms in text
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase