# List models

> Programmatic discovery of the served models, their capabilities, pricing, and deprecation dates.

Lists the models currently served by the API, with capabilities, limits, pricing, and lifecycle metadata. Use it to discover model IDs at runtime and to detect upcoming removals via `deprecation_date`, rather than hardcoding model IDs and assumptions about their lifecycle.

**Returns** a list object whose `data` array contains one object per model.
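
As a minimal sketch of the runtime check described above, the helper below scans an already-parsed list response and reports every model with a non-null `deprecation_date`. The name `find_deprecated`, the model ID `example-old-model`, and the date value are illustrative assumptions; this page does not specify the date format, so the code treats it as an opaque string.

```python
from typing import Any


def find_deprecated(list_response: dict[str, Any]) -> dict[str, str]:
    """Map model ID -> deprecation_date for models scheduled for removal."""
    scheduled: dict[str, str] = {}
    for model in list_response.get("data", []):
        date = model.get("deprecation_date")
        if date is not None:  # JSON null (None) means no removal is scheduled
            scheduled[model["id"]] = date
    return scheduled


# Hypothetical payload: the second entry and its date are made up for illustration.
sample = {
    "object": "list",
    "data": [
        {"id": "holo3-1-35b-a3b", "deprecation_date": None},
        {"id": "example-old-model", "deprecation_date": "2030-01-01"},
    ],
}

for model_id, date in find_deprecated(sample).items():
    print(f"{model_id} is scheduled for removal: {date}")
```

Running a check like this at startup or on a schedule surfaces a pending removal before requests start failing with `model_not_found`.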

***

## Response

<ResponseField name="data[].id" type="string">
  The model ID to pass as `model` in [chat completions](/models-api/chat-completions), e.g. `holo3-1-35b-a3b`.
</ResponseField>

<ResponseField name="data[].context_length" type="integer">
  Total context window in tokens.
</ResponseField>

<ResponseField name="data[].max_output_length" type="integer">
  Hard ceiling on output tokens per request: 4,096 for `holo3-1-35b-a3b`, 32,768 for `holo3-122b-a10b`.
</ResponseField>

<ResponseField name="data[].input_modalities" type="array">
  `["text", "image"]` for both Holo models.
</ResponseField>

<ResponseField name="data[].supported_features" type="array">
  Capability flags. `reasoning` on both models; `tools` (native function calling) on `holo3-1-35b-a3b` only.
</ResponseField>

<ResponseField name="data[].supported_sampling_parameters" type="array">
  Accepted sampling fields, e.g. `temperature`, `top_p`, `top_k`, `max_tokens`, `stop`, `frequency_penalty`, `presence_penalty`, `seed`.
</ResponseField>

<ResponseField name="data[].pricing" type="object">
  Per-token USD rates, encoded as decimal strings: `prompt` is the rate per input token and `completion` is the rate per output token.
</ResponseField>

<ResponseField name="data[].deprecation_date" type="string">
  Set when the model is scheduled for removal; `null` otherwise. After removal, requests to the ID fail with `model_not_found`.
</ResponseField>

<ResponseField name="data[].is_active" type="boolean">
  Whether the model is currently serving traffic (also see `is_ready`).
</ResponseField>
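
The fields above can drive request construction instead of hardcoded limits. The sketch below is one possible approach, not a prescribed one: `models_with_feature` and `build_params` are hypothetical helper names, the entries in `models` contain only values stated on this page, and `logit_bias` stands in for any field a model does not list in `supported_sampling_parameters`.

```python
from typing import Any


def models_with_feature(models: list[dict[str, Any]], feature: str) -> list[str]:
    """Return the IDs of models that advertise the given capability flag."""
    return [m["id"] for m in models if feature in m.get("supported_features", [])]


def build_params(model: dict[str, Any], requested: dict[str, Any]) -> dict[str, Any]:
    """Keep only supported sampling fields and cap max_tokens at max_output_length."""
    supported = set(model.get("supported_sampling_parameters", []))
    params = {k: v for k, v in requested.items() if k in supported}
    if "max_tokens" in params:
        params["max_tokens"] = min(params["max_tokens"], model["max_output_length"])
    return params


# Entries as they might appear in `data`, reduced to the fields used here.
models = [
    {
        "id": "holo3-1-35b-a3b",
        "max_output_length": 4096,
        "supported_features": ["reasoning", "tools"],
        "supported_sampling_parameters": [
            "temperature", "top_p", "top_k", "max_tokens",
            "stop", "frequency_penalty", "presence_penalty", "seed",
        ],
    },
    {
        "id": "holo3-122b-a10b",
        "max_output_length": 32768,
        "supported_features": ["reasoning"],
    },
]

tool_models = models_with_feature(models, "tools")

# `logit_bias` is a hypothetical unsupported field, included to show filtering.
params = build_params(
    models[0],
    {"temperature": 0.2, "max_tokens": 10000, "logit_bias": {}},
)
print(tool_models, params)
```

Silently dropping unsupported fields is a design choice made for this sketch; raising an error instead may be preferable when a caller depends on a specific parameter taking effect.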

***

## Examples

<CodeGroup>
  ```python Python theme={null}
  # Assumes `client` is an SDK client that has already been configured with your API key.
  models = client.models.list()
  for m in models.data:
      print(m.id)
  ```

  ```typescript TypeScript theme={null}
  // Assumes `client` is an SDK client that has already been configured with your API key.
  const models = await client.models.list();
  for (const m of models.data) {
    console.log(m.id);
  }
  ```

  ```bash cURL theme={null}
  curl https://api.hcompany.ai/v1/models \
    -H "Authorization: Bearer $HAI_API_KEY"
  ```
</CodeGroup>

```json Response (truncated) theme={null}
{
  "object": "list",
  "data": [
    {
      "id": "holo3-1-35b-a3b",
      "object": "model",
      "name": "Holo3 1 35B A3B",
      "context_length": 65536,
      "max_output_length": 4096,
      "input_modalities": ["text", "image"],
      "supported_features": ["reasoning", "tools"],
      "supported_sampling_parameters": ["temperature", "top_p", "top_k", "max_tokens", "stop", "frequency_penalty", "presence_penalty", "seed"],
      "pricing": {"prompt": "0.00000025", "completion": "0.0000018"},
      "is_active": true,
      "deprecation_date": null
    }
  ]
}
```
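
Because `pricing` values are decimal strings, parsing them with Python's `decimal.Decimal` rather than `float` keeps cost arithmetic exact. The following sketch estimates the cost of a single request from the rates in the sample response above; `estimate_cost` is a hypothetical helper, and the token counts are made-up inputs.

```python
from decimal import Decimal


def estimate_cost(pricing: dict[str, str], prompt_tokens: int, completion_tokens: int) -> Decimal:
    """Estimate the USD cost of one request from per-token rates."""
    prompt_rate = Decimal(pricing["prompt"])          # USD per input token
    completion_rate = Decimal(pricing["completion"])  # USD per output token
    return prompt_rate * prompt_tokens + completion_rate * completion_tokens


# Rates copied from the sample response; token counts are illustrative.
pricing = {"prompt": "0.00000025", "completion": "0.0000018"}
cost = estimate_cost(pricing, prompt_tokens=1000, completion_tokens=500)
print(f"Estimated cost: ${cost}")
```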
