
Flow Judge v0.1

Developed by flowaicom
Flow Judge v0.1 is a lightweight 3.8-billion-parameter model that performs customized evaluations of large language model (LLM) systems across multiple domains.
Downloads 6,094
Release Date: 9/15/2024

Model Overview

Flow Judge v0.1 is a lightweight evaluation model based on the Phi-3.5-mini instruct model architecture. It focuses on customized evaluation of LLM system performance.

Model Features

Customizable Evaluation
Users can define their own evaluation criteria and scoring rubrics, so Flow Judge can be tailored to specific needs when assessing the performance of LLM systems.
Support for Multiple Scoring Systems
Supports three scoring scales: binary pass/fail, 3-point Likert, and 5-point Likert, covering evaluation needs at different levels of granularity.
Structured Evaluation Results
Generates structured evaluation results with <feedback> and <score> tags, including qualitative feedback and numerical scores.
Lightweight and High-performance
Despite its small size, it performs comparably to larger models on held-out datasets and out-of-domain benchmarks.
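
The structured output and multiple scoring scales described above suggest a simple post-processing step. The following is a minimal sketch, not part of Flow Judge itself: it extracts the <feedback> and <score> tags from a raw model response and checks the score against the chosen scale. The names parse_evaluation, Evaluation, and SCALES are hypothetical, and the numeric ranges (0 to 1, 1 to 3, 1 to 5) are assumptions, since the description names the scales but not their values.

```python
import re
from dataclasses import dataclass

# Assumed numeric ranges for each scale (not stated in the model description).
SCALES = {
    "binary": (0, 1),   # fail / pass
    "likert3": (1, 3),  # 3-point Likert
    "likert5": (1, 5),  # 5-point Likert
}


@dataclass
class Evaluation:
    feedback: str
    score: int


def parse_evaluation(text: str, scale: str) -> Evaluation:
    """Extract <feedback> and <score> from a judge response and validate the score."""
    feedback = re.search(r"<feedback>(.*?)</feedback>", text, re.DOTALL)
    score = re.search(r"<score>\s*(-?\d+)\s*</score>", text)
    if feedback is None or score is None:
        raise ValueError("missing <feedback> or <score> tag")

    low, high = SCALES[scale]
    value = int(score.group(1))
    if not low <= value <= high:
        raise ValueError(f"score {value} is outside the {scale} range {low}-{high}")

    return Evaluation(feedback=feedback.group(1).strip(), score=value)
```

Rejecting out-of-range scores at parse time keeps malformed generations from silently entering aggregate metrics. Whether to raise or to retry the generation is a design choice left to the caller.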

Model Capabilities

Large Language Model System Evaluation
Customized Scoring
Structured Feedback Generation
Multi-scale Scoring

Use Cases

Customer Service
Customer Complaint Handling Evaluation
Evaluate the quality of an AI system's responses to customer complaint emails
Provide detailed feedback and scores that point out the strengths and weaknesses of each response
Content Generation
Generated Content Quality Evaluation
Evaluate the accuracy, relevance, and fluency of AI-generated content
Provide structured scores and feedback according to custom criteria
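
To make the use cases above more concrete, here is a rough sketch of how custom criteria and a scoring rubric might be represented as data and rendered into a judge prompt. Everything in it is illustrative: Rubric, render_prompt, the section headings, and the complaint-handling rubric text are hypothetical, and the layout is not Flow Judge's official prompt template.

```python
from dataclasses import dataclass


@dataclass
class Rubric:
    """A user-defined evaluation criterion with one description per score."""
    criteria: str
    options: dict  # maps each allowed score (int) to its description (str)


def render_prompt(rubric: Rubric, task_input: str, response: str) -> str:
    """Build a plain-text judge prompt (hypothetical layout, not the official template)."""
    rubric_lines = [
        f"- Score {score}: {text}" for score, text in sorted(rubric.options.items())
    ]
    return "\n".join(
        [
            "# EVALUATION CRITERIA",
            rubric.criteria,
            "",
            "# SCORING RUBRIC",
            *rubric_lines,
            "",
            "# INPUT",
            task_input,
            "",
            "# RESPONSE TO EVALUATE",
            response,
            "",
            "Reply with <feedback>...</feedback> followed by <score>...</score>.",
        ]
    )


# Illustrative 3-point rubric for the customer complaint use case.
complaint_rubric = Rubric(
    criteria="Does the reply acknowledge the complaint and offer a concrete next step?",
    options={
        1: "Ignores the complaint or offers no next step.",
        2: "Acknowledges the complaint but the next step is vague.",
        3: "Acknowledges the complaint and offers a clear, concrete next step.",
    },
)
```

Calling render_prompt(complaint_rubric, email_text, reply_text) yields a single string that can be sent to the judge model. Keeping the rubric as data rather than hard-coded text makes it easy to swap in a binary or 5-point scale for the same criterion.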