Fara-7B: An Efficient Agentic Small Language Model for Computer Use

Microsoft Research has released Fara-7B, a 7-billion-parameter agentic small language model (SLM) that interacts with computers via mouse/keyboard actions. It achieves 73.5% task success on WebVoyager, surpassing models like GPT-4o and UI-TARS-1.5-7B.

Why This Matters

Agentic models like Fara-7B operate in real-world environments, facing challenges that idealized benchmarks ignore. While Fara-7B excels in automated web tasks (e.g., booking tickets, price comparisons), it still struggles with complex instructions and hallucinations. A 2025 evaluation found it failed 62% of WebVoyager tasks without retries, highlighting the gap between lab performance and real-world reliability.

Key Insights

“38.4% success rate on WebTailBench (2025)”: Microsoft’s new benchmark for underrepresented tasks like job searches and real estate.
“Synthetic data pipeline for multi-step web tasks (2025)”: Trained on 145,000 trajectories from public websites.
“Magentic-UI integrated with Fara-7B”: Enables direct testing on Copilot+ PCs via Windows 11.

Practical Applications

Use Case: Microsoft’s Fara-7B automates web tasks like booking travel and managing accounts.
Pitfall: Overreliance on model predictions without user verification may lead to unintended actions (e.g., unauthorized email sends).

References:

On This Page

Fara-7B: An Efficient Agentic Small Language Model for Computer Use