xFinder-llama38it Open-source Model - Accurately and Efficiently Extract Key Answers from Large Language Models

Xfinder Llama38it

Developed by IAAR-Shanghai

xFinder-llama38it is a fine-tuned key answer extraction model based on Llama3-8B-Instruct, designed to improve the accuracy and robustness of key answer extraction from large language model outputs.

Large Language Model

Transformers

English#Key Answer Extraction #LLM Evaluation Enhancement #Multi-task Adaptation

Downloads 189

Release Time : 5/20/2024

Model Overview

This model is specifically designed to accurately extract key answers from the outputs of large language models (LLMs), addressing the limitations of traditional regex-based methods and suitable for diverse and complex output scenarios.

Model Features

High Accuracy

Significantly improves the accuracy of key answer extraction through fine-tuning, outperforming traditional methods.

Strong Robustness

Capable of handling diverse and complex LLM outputs, adaptable to various task scenarios.

High-Quality Training Data

Training data is meticulously annotated by GPT-4 and human experts to ensure high quality.

Model Capabilities

Key Answer Extraction

Text Generation

Model Evaluation Enhancement

Use Cases

Model Evaluation

Automated Evaluator

Used to extract key answers from LLM outputs, enhancing the reliability of model evaluation.

Significantly improves evaluation accuracy and robustness.

🚀 xFinder-llama38it

xFinder-llama38it is a specialized model for extracting key answers from large language models (LLMs), fine-tuned from Llama3-8B-Instruct.

🚀 Quick Start

This README provides a comprehensive overview of xFinder-llama38it, including its features, installation, usage, and more.

✨ Features

Key Answer Extraction: Specifically designed for extracting key answers from large language models (LLMs).
Enhanced Evaluation: Addresses the limitations of traditional regular expression (RegEx)-based extraction methods, improving the reliability of model assessments across various tasks.
Fine-tuning: Fine-tuned from Llama3-8B-Instruct, trained on approximately 26.9K samples from the Key Answer Finder (KAF) dataset.

📦 Installation

No installation steps are provided in the original document.

💻 Usage Examples

No code examples are provided in the original document.

📚 Documentation

📋 Model Details

xFinder-llama38it is a model specifically designed for key answer extraction in large language models (LLMs). It is trained by fine-tuning Llama3-8B-Instruct.

Developed by: IAAR
Fine-tuned from Model: Llama3-8B-Instruct

🌐 Model Sources

Repository: https://github.com/IAAR-Shanghai/xFinder
Paper: https://openreview.net/forum?id=7UqQJUKaLM

📖 Uses

xFinder is primarily used to enhance the evaluation of LLMs by accurately extracting key answers from their outputs. It addresses the limitations of traditional regular expression (RegEx)-based extraction methods, which often fail to handle the diverse and complex outputs generated by LLMs. xFinder improves the reliability of model assessments across various tasks.

📈 Training Details

xFinder-llama38it is fine-tuned from Llama3-8B-Instruct. The training data consists of approximately 26.9K samples from the Key Answer Finder (KAF) dataset. This dataset is designed to enhance the accuracy and robustness of key answer extraction and includes a variety of tasks. It has been meticulously annotated by GPT-4 and human experts to ensure high-quality training and evaluation. For more details, see this paper and try it with code.

🧪 Evaluation

xFinder is evaluated on the fully human-annotated test and generalization sets of the KAF dataset. The results demonstrate significant improvements in extraction accuracy and robustness compared to traditional methods. For more details, please refer to the paper and try it out using the provided code.

📄 License

The model is released under the cc-by-nc-nd-4.0 license.

📜 Citation

@inproceedings{
    xFinder,
    title={xFinder: Large Language Models as Automated Evaluators for Reliable Evaluation},
    author={Qingchen Yu and Zifan Zheng and Shichao Song and Zhiyu li and Feiyu Xiong and Bo Tang and Ding Chen},
    booktitle={The Thirteenth International Conference on Learning Representations},
    year={2025},
    url={https://openreview.net/forum?id=7UqQJUKaLM}
}

Featured Recommended AI Models

Empowering the Future, Your AI Solution Knowledge Base

English 简体中文繁體中文にほんご