| Model | Active Parameters | Main Use Cases | License | Resources |
|---|---|---|---|---|
| Holo3.1-35B-A3B | 3B | Fast, low-latency computer use across web, desktop, and mobile | Apache 2.0 | Model card |
| Holo3-35B-A3B | 3B | High-throughput, low-latency | Apache 2.0 | Model card |
| Holo3-122B-A10B | 10B | Maximum performance, complex tasks | Research only | Benchmarks |
Two ways to use Holo
| Mode | Pattern | Output | When to use |
|---|---|---|---|
| Agent loop | Multi-turn: conversation + screenshots → next tool call | {note, thought, tool_call} or native tool_calls | Holo as the brain of an autonomous browser or desktop agent |
| Element localization | Single-turn: image + target description → coordinates | {x, y} in [0, 1000] | UI grounding inside any external agent or pipeline (yours or someone else’s) |
Get started
Get an API key
Generate a key on Portal-H and export it. The free tier gives rate-limited access to
holo3-1-35b-a3b, no credit card required.Install the OpenAI client
The Models API is OpenAI-compatible, so the official client works as-is, only the
base_url changes.Make your first request
Point the client at H by overriding The same API and code paths work for all models; swap
base_url, then send a request. Holo is multimodal: you can send text, images, or both. Here is a minimal text request to confirm your key and client are working.model for holo3-122b-a10b when you need maximum performance.That is the whole setup. To use Holo on real screens, send a screenshot and continue with the agent loop or element localization below.Next steps
Agent loop
How to use Holo in your computer-use harness.
Element localization
Get click coordinates from a screenshot.
API reference
Endpoint, models, parameters, and limits.