inference: Examples for sending inference requestsllm/chat_completions.py: Python example of sending chat completion requests to OpenAI compatible serverllm/completions.py: Python example of sending completion requests to OpenAI compatible serverllm/completions.sh: Bash example of sending completion requests to OpenAI compatible server, supports JSON modetext_embedding/embeddings.py: Python example of sending text embedding requests to OpenAI compatible servervlm/vision_completions.py: Python example of sending chat completion requests with image attached to prompt to OpenAI compatible server for vision language models
logits: Example for logits generationlogits.py: Python example of getting logits from hosted model.
api: Examples for using the Python APIbasic_usage.py: Basic Python example demonstrating the Vector Inference API
slurm_dependency: Example of launching a model withvec-infand running a downstream SLURM job that waits for the server to be ready before sending a request.
examples
Folders and files
| Name | Name | Last commit date | ||
|---|---|---|---|---|
parent directory.. | ||||