Methods
There are different methods and contexts in which to deploy Holo1, including:| Method | Pre-requisites | Notes |
|---|---|---|
| Local vLLM setup | Install vLLM / Machine with GPU | Uses the vLLM to download Holo1 from HuggingFace. |
| Local Docker Container | Install Docker / Machine with GPU | Uses the vllm/vllm-openai:v0.9.1 image. |
| Amazon SageMaker | Subscribe to Holo1 Models on AWS Marketplace | Deploys the Holo1 model via a prebuilt Notebook. No manual or complicated setup required. |
Environment setup
Set your environment variables using one of two methods, outlined below: Option 1: Create a.envat the root of this repo
OPENAI_API_KEY and other variables are correctly loaded by the bash scripts.
Option 2: Export in your shell profile (for global setup)Add these to your
.zshrc or .bashrc files:
vllm, you can leave the HAI_API_KEY empty (or set it to any value), and set HAI_MODEL_URL to http://localhost:PORT using the port where your local vllm instance is running..png)
