Methods
There are different methods and contexts in which to deploy Holo1, including:

Method | Pre-requisites | Notes |
---|---|---|
Local vLLM setup | Install vLLM / Machine with GPU | Uses vLLM to download Holo1 from Hugging Face. |
Local Docker Container | Install Docker / Machine with GPU | Uses the vllm/vllm-openai:v0.9.1 image. |
Amazon SageMaker | Subscribe to Holo1 Models on AWS Marketplace | Deploys the Holo1 model via a prebuilt Notebook. No manual or complicated setup required. |
Environment setup
Set your environment variables using one of the two methods below.

Option 1: Create a `.env` file at the root of this repo, so that `OPENAI_API_KEY` and other variables are correctly loaded by the bash scripts.
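As a minimal sketch of what such a `.env` file could contain and how a bash script might pick it up: the variable names are the ones mentioned in this section, while the placeholder values and the `set -a` loading pattern are assumptions, not necessarily what the repo's scripts do. The sketch writes to a temporary directory so it does not overwrite a real `.env`.

```shell
# Hypothetical example: write a sample .env into a temporary directory
# (in practice the file lives at the root of the repo).
demo_dir="$(mktemp -d)"
cat > "$demo_dir/.env" <<'EOF'
OPENAI_API_KEY=your-openai-key
HAI_API_KEY=your-hai-key
HAI_MODEL_URL=http://localhost:8000
EOF

# One common way for a bash script to load the file: with allexport on,
# every variable assigned while sourcing is exported to child processes.
set -a
. "$demo_dir/.env"
set +a

echo "HAI_MODEL_URL=$HAI_MODEL_URL"
```

Keeping the values in `.env` scopes them to this repo, which is the main reason to prefer this option over a global shell profile.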
Option 2: Export in your shell profile (for global setup). Add the variables as `export` lines to your `.zshrc` or `.bashrc` file.
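The export lines might look like the sketch below. It assumes the three variable names mentioned in this section are the ones the scripts read; all values, including the endpoint URL, are placeholders to replace with your own.

```shell
# Placeholder values (assumptions): replace with your real keys and endpoint.
export OPENAI_API_KEY="your-openai-key"
export HAI_API_KEY="your-hai-key"
export HAI_MODEL_URL="https://your-model-endpoint.example"
```

After editing the profile, open a new terminal or run `source ~/.zshrc` (or `source ~/.bashrc`) so the variables take effect in the current session.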
If you are running a local `vllm` instance, you can leave `HAI_API_KEY` empty (or set it to any value) and set `HAI_MODEL_URL` to `http://localhost:PORT`, where `PORT` is the port on which your local `vllm` instance is running.
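For the local case, the settings could be sketched as follows. The port `8000` is an assumption (it is vLLM's default serving port); use whichever port your instance was actually started on.

```shell
# Assumption: the local vLLM server listens on port 8000 (vLLM's default).
PORT=8000

export HAI_API_KEY=""                          # may stay empty for a local server
export HAI_MODEL_URL="http://localhost:${PORT}"

# Optional sanity check against vLLM's OpenAI-compatible API;
# it only succeeds while the server is running:
# curl "${HAI_MODEL_URL}/v1/models"

echo "$HAI_MODEL_URL"
```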