đ Lucy: Edgerunning Agentic Web Search on Mobile with a 1.7B model.
Lucy is a 1.7B model designed for agentic web search and lightweight browsing. Built on Qwen3-1.7B, it combines the research capabilities of larger models and can run efficiently on mobile devices, even with only a CPU.

Authors: Alan Dao, Bach Vu Dinh, Alex Nguyen, Norapat Buppodom

đ Quick Start
Lucy is a compact yet powerful 1.7B model that specializes in agentic web search and lightweight browsing. Based on Qwen3-1.7B, it inherits the in - depth research capabilities of larger models and is optimized to run efficiently on mobile devices, even in CPU - only configurations.
⨠Features
- đ Strong Agentic Search: Empowered by MCP - enabled tools (e.g., Serper with Google Search).
- đ Basic Browsing Capabilities: Via Crawl4AI (MCP server to be released), Serper, etc.
- đą Mobile - Optimized: Lightweight enough to run on CPU or mobile devices at a decent speed.
- đ¯ Focused Reasoning: Machine - generated task vectors optimize thinking processes for search tasks.
đ Documentation
Overview
Lucy is a compact but capable 1.7B model focused on agentic web search and lightweight browsing. Built on Qwen3-1.7B, Lucy inherits deep research capabilities from larger models while being optimized to run efficiently on mobile devices, even with CPU - only configurations. We achieved this through machine - generated task vectors that optimize thinking processes, smooth reward functions across multiple categories, and pure reinforcement learning without any supervised fine - tuning.
Evaluation
Following the same MCP benchmark methodology used for [Jan - Nano](https://huggingface.co/Menlo/Jan - nano) and [Jan - Nano - 128k](https://huggingface.co/Menlo/Jan - nano - 128k), Lucy demonstrates impressive performance despite being only a 1.7B model, achieving higher accuracy than DeepSeek - v3 on [SimpleQA](https://openai.com/index/introducing - simpleqa/).

đĻ Installation
Deployment
Lucy can be deployed using various methods including vLLM, llama.cpp, or through local applications like Jan, LMStudio, and other compatible inference engines. The model supports integration with search APIs and web browsing tools through the MCP.
Deploy using VLLM
vllm serve Menlo/Lucy-128k \
--host 0.0.0.0 \
--port 1234 \
--enable-auto-tool-choice \
--tool-call-parser hermes
Or llama-server
from llama.cpp
llama-server ...
Recommended Sampling Parameters
Temperature: 0.7
Top-p: 0.9
Top-k: 20
Min-p: 0.0
đ¤ Community & Support
đ License
This project is licensed under the Apache 2.0 license.
đ Citation
Paper (coming soon): Lucy: edgerunning agentic web search on mobile with machine generated task vectors.
đ Information Table
Property |
Details |
Model Type |
Compact 1.7B model for agentic web search and lightweight browsing |
Base Model |
Qwen/Qwen3 - 1.7B |
Pipeline Tag |
text - generation |
Library Name |
transformers |
License |
Apache 2.0 |