Meka Agent is an open-source, autonomous computer-using agent that delivers state-of-the-art browsing capabilities. The agent works and acts in the same way humans do, by purely using vision as its eyes and acting within a full computer context.
It is designed as a simple, extensible, and customizable framework, allowing flexibility in the choice of models, tools, and infrastructure providers.
The agent primarily focuses on web browsing today, and achieves state-of-the-art benchmark results in the WebArena Benchmark (72.7%).
If you would like to get started with browser automations without any setup, visit the Meka App to try the Meka Agent with $10 in free credits.
To get started, we choose OpenAI o3 as the model and Anchor Browser as the VM-based infrastructure provider. We are open to submissions by other infra providers with OS-level controls!
Meka is created from lessons learned from experimentation and publicly available research. Our fundamental philosophy in creating this agent is to think like how humans would, from vision to tools to memory.