Lucy Open-Source Model - A lightweight and powerful tool that supports independent web search and lightweight browsing

Lucy

Developed by Menlo

Lucy is a lightweight but powerful model with 1.7 billion parameters, focusing on autonomous web search and lightweight browsing.

Large Language Model

Transformers

EnglishOpen Source License:Apache-2.0 #Mobile web search #Lightweight browsing #CPU optimization

Downloads 138

Release Time : 7/18/2025

Model Overview

A 1.7 - billion - parameter model built on Qwen3 - 1.7B, optimized for efficient operation on mobile devices, supporting autonomous web search and basic browsing functions.

Model Features

Autonomous search ability

Supported by tools that support MCP (such as Serper integrated with Google Search)

Basic browsing function

Web browsing is achieved through Crawl4AI, Serper, etc.

Optimized for mobile devices

The model is lightweight and can run efficiently on CPUs or mobile devices

Focus on reasoning

The machine - generated task vector optimizes the thinking process of search tasks

Model Capabilities

Autonomous web search

Lightweight web browsing

Text generation

Inference on mobile devices

Use Cases

Information retrieval

Autonomous network search

Users can autonomously conduct network searches through the model to obtain information

The accuracy on SimpleQA is higher than that of DeepSeek - v3

Mobile applications

Search assistant on mobile devices

Provides efficient search and browsing functions on mobile devices

Can run at a good speed on CPUs or mobile devices

🚀 Lucy: Edgerunning Agentic Web Search on Mobile with a 1.7B model.

Lucy is a 1.7B model designed for agentic web search and lightweight browsing. Built on Qwen3-1.7B, it combines the research capabilities of larger models and can run efficiently on mobile devices, even with only a CPU.

Authors: Alan Dao, Bach Vu Dinh, Alex Nguyen, Norapat Buppodom

image/gif

🚀 Quick Start

Lucy is a compact yet powerful 1.7B model that specializes in agentic web search and lightweight browsing. Based on Qwen3-1.7B, it inherits the in - depth research capabilities of larger models and is optimized to run efficiently on mobile devices, even in CPU - only configurations.

✨ Features

🔍 Strong Agentic Search: Empowered by MCP - enabled tools (e.g., Serper with Google Search).
🌐 Basic Browsing Capabilities: Via Crawl4AI (MCP server to be released), Serper, etc.
📱 Mobile - Optimized: Lightweight enough to run on CPU or mobile devices at a decent speed.
🎯 Focused Reasoning: Machine - generated task vectors optimize thinking processes for search tasks.

📚 Documentation

Overview

Lucy is a compact but capable 1.7B model focused on agentic web search and lightweight browsing. Built on Qwen3-1.7B, Lucy inherits deep research capabilities from larger models while being optimized to run efficiently on mobile devices, even with CPU - only configurations. We achieved this through machine - generated task vectors that optimize thinking processes, smooth reward functions across multiple categories, and pure reinforcement learning without any supervised fine - tuning.

Evaluation

Following the same MCP benchmark methodology used for [Jan - Nano](https://huggingface.co/Menlo/Jan - nano) and [Jan - Nano - 128k](https://huggingface.co/Menlo/Jan - nano - 128k), Lucy demonstrates impressive performance despite being only a 1.7B model, achieving higher accuracy than DeepSeek - v3 on [SimpleQA](https://openai.com/index/introducing - simpleqa/).

![image/png](https://cdn - uploads.huggingface.co/production/uploads/65713d70f56f9538679e5a56/lG2FqLCWXq1N8lh7wlJgW.png)

📦 Installation

Deployment

Lucy can be deployed using various methods including vLLM, llama.cpp, or through local applications like Jan, LMStudio, and other compatible inference engines. The model supports integration with search APIs and web browsing tools through the MCP.

Deploy using VLLM

vllm serve Menlo/Lucy-128k \
    --host 0.0.0.0 \
    --port 1234 \
    --enable-auto-tool-choice \
    --tool-call-parser hermes

Or `llama-server` from `llama.cpp`

llama-server ...

Recommended Sampling Parameters

Temperature: 0.7
Top-p: 0.9
Top-k: 20
Min-p: 0.0

🤝 Community & Support

Discussions: HuggingFace Community

📄 License

This project is licensed under the Apache 2.0 license.

📄 Citation

Paper (coming soon): Lucy: edgerunning agentic web search on mobile with machine generated task vectors.

📋 Information Table

Property	Details
Model Type	Compact 1.7B model for agentic web search and lightweight browsing
Base Model	Qwen/Qwen3 - 1.7B
Pipeline Tag	text - generation
Library Name	transformers
License	Apache 2.0

Featured Recommended AI Models

Empowering the Future, Your AI Solution Knowledge Base

English 简体中文繁體中文にほんご