S

Sciworld MPO

Developed by xwm
A reinforcement learning model fine-tuned based on Llama-3.1-8B-Instruct, utilizing meta plan optimization technology to enhance agent planning capabilities
Downloads 96
Release Time : 2/17/2025

Model Overview

This model provides high-level general guidance through meta-planning and continuously optimizes based on feedback from agent task execution, demonstrating excellent performance in ALFWorld and SciWorld benchmarks

Model Features

Meta Plan Optimization Technology
Utilizes MPO technology to enhance the planning capabilities of large language model agents
High-Performance Benchmarking
Achieves an average accuracy of 83.1% in ALFWorld and SciWorld benchmarks
Feedback-Driven Optimization
Continuously optimizes based on feedback from agent task execution

Model Capabilities

Agent Planning Optimization
Meta Plan Generation
Task Execution Feedback Analysis
Reinforcement Learning Decision Making

Use Cases

Agent Development
Virtual Assistant Planning Optimization
Enhances the planning capabilities of virtual assistants in complex tasks
Demonstrates excellent performance in ALFWorld benchmarks
Scientific Experiment Planning
Optimizes the planning process for scientific experiment steps
Achieves high accuracy in SciWorld benchmarks
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase