TinyAgent-7B Open-Source Small-Sized Language Model - Available for Edge Devices, Supports Function Calls and Complex Reasoning

Tinyagent 7B

Developed by squeeze-ai-lab

TinyAgent is a small language model designed for edge devices, focusing on function calling and complex reasoning capabilities, capable of running securely and privately on resource-constrained devices.

Large Language Model

Transformers

English#Edge Device Function Calling #Low-resource SLM #Privacy-first

Downloads 63

Release Time : 5/27/2024

Model Overview

TinyAgent addresses privacy, connectivity, and latency issues when deploying traditional large language models on edge devices through high-quality data training and LLMCompiler-enabled function calling.

Model Features

Edge Device Optimization

Designed specifically for resource-constrained edge devices, capable of operating in low-power environments

Function Calling Capability

Utilizes LLMCompiler for efficient function calling, supporting complex task planning

ToolRAG Enhancement

Significantly improves task accuracy by retrieving the most relevant tools and examples through ToolRAG

Privacy Protection

Local deployment prevents data leakage, ensuring user privacy and security

Model Capabilities

Function calling

Task planning

Text generation

Tool usage

Multi-app integration

Use Cases

Office Automation

Email Composition

Automatically generates and sends emails

Meeting Scheduling

Manages calendar events and Zoom meetings

Personal Assistant

Contact Management

Automatically organizes and updates contact information

🚀 TinyAgent: Function Calling at the Edge

TinyAgent is designed to empower Small Language Models (SLMs) with complex reasoning and function - calling capabilities. These SLMs can be securely and privately deployed at the edge, addressing the limitations of traditional Large Language Models (LLMs) such as GPT - 4 and Gemini - 1.5, which are often too large and resource - intensive for edge deployment.

Get the desktop app‎ ‎ | Read the blog post

Thumbnail

✨ Features

Edge Deployment: Enables complex reasoning and function calling in SLMs for secure and private edge deployment.
Function Calling Focus: Trains SLMs with curated data and focuses on function calling using LLMCompiler.
MacOS Interaction: Can interact with various MacOS applications to assist users with daily tasks like email composition, contact management, calendar scheduling, and Zoom meeting organization.

Model Developers: Squeeze AI Lab at University of California, Berkeley.

Variations: TinyAgent models come in 2 sizes: TinyAgent - 1.1B and TinyAgent - 7B

License: MIT

🎥 Demo

📚 Documentation

💻 Usage Examples

Please see our Github for details on how to use TinyAgent models. TinyAgent models can be used programmatically or through our user interface.

🔧 Technical Details

Dataset

We curated a dataset of 40,000 real - life use cases. We use GPT - 3.5 - Turbo to generate real - world instructions. These are then used to obtain synthetic execution plans using GPT - 4 - Turbo. Please check out our blog post for more details on our dataset.

Fine - tuning Procedure

TinyAgent models are fine - tuned from base models. Below is a table of each TinyAgent model with its base counterpart:

Model	Success Rate
GPT - 3.5 - turbo	65.04%
GPT - 4 - turbo	79.08%
[TinyLLama - 1.1B - 32K - Instruct](https://huggingface.co/Doctor - Shotgun/TinyLlama - 1.1B - 32k - Instruct)	12.71%
[WizardLM - 2 - 7b](https://huggingface.co/MaziyarPanahi/WizardLM - 2 - 7B - GGUF)	41.25%
TinyAgent - 1.1B + ToolRAG / [[hf](https://huggingface.co/squeeze - ai - lab/TinyAgent - 1.1B)] [[gguf](https://huggingface.co/squeeze - ai - lab/TinyAgent - 1.1B - GGUF)]	80.06%
TinyAgent - 7B + ToolRAG / [[hf](https://huggingface.co/squeeze - ai - lab/TinyAgent - 7B)] [[gguf](https://huggingface.co/squeeze - ai - lab/TinyAgent - 7B - GGUF)]	84.95%

Using the synthetic data generation process described above, we use parameter - efficient fine - tuning with LoRA to fine - tune the base models for 3 epochs. Please check out our [blog post](https://bair.berkeley.edu/blog/2024/05/29/tiny - agent/) for more details on our fine - tuning procedure.

🛠️ ToolRAG

When faced with challenging tasks, SLM agents require appropriate tools and in - context examples to guide them. If the model sees irrelevant examples, it can hallucinate. Likewise, if the model sees the descriptions of the tools that it doesn’t need, it usually gets confused, and these tools take up unnecessary prompt space. To tackle this, TinyAgent uses ToolRAG to retrieve the best tools and examples suited for a given query. This process has minimal latency and increases the accuracy of TinyAgent substantially. Please take a look at our [blog post](https://bair.berkeley.edu/blog/2024/05/29/tiny - agent/) and our [ToolRAG model](https://huggingface.co/squeeze - ai - lab/TinyAgent - ToolRAG) for more details.

🔗 Links

Blog Post: https://bair.berkeley.edu/blog/2024/05/29/tiny - agent/
Github: https://github.com/SqueezeAILab/TinyAgent

📄 License

This project is licensed under the MIT license.

Featured Recommended AI Models

Empowering the Future, Your AI Solution Knowledge Base

English 简体中文繁體中文にほんご