Sciworld MPO
A reinforcement learning model fine-tuned based on Llama-3.1-8B-Instruct, utilizing meta plan optimization technology to enhance agent planning capabilities
Downloads 96
Release Time : 2/17/2025
Model Overview
This model provides high-level general guidance through meta-planning and continuously optimizes based on feedback from agent task execution, demonstrating excellent performance in ALFWorld and SciWorld benchmarks
Model Features
Meta Plan Optimization Technology
Utilizes MPO technology to enhance the planning capabilities of large language model agents
High-Performance Benchmarking
Achieves an average accuracy of 83.1% in ALFWorld and SciWorld benchmarks
Feedback-Driven Optimization
Continuously optimizes based on feedback from agent task execution
Model Capabilities
Agent Planning Optimization
Meta Plan Generation
Task Execution Feedback Analysis
Reinforcement Learning Decision Making
Use Cases
Agent Development
Virtual Assistant Planning Optimization
Enhances the planning capabilities of virtual assistants in complex tasks
Demonstrates excellent performance in ALFWorld benchmarks
Scientific Experiment Planning
Optimizes the planning process for scientific experiment steps
Achieves high accuracy in SciWorld benchmarks
Featured Recommended AI Models
Š 2025AIbase