Batch Inference

batch_inference

Methods

Chat Completion -> { completion_message_batch }
post/alpha/batch-inference/chat-completion
Parameters
X-LlamaStack-Client-Version: string
Optional
X-LlamaStack-Provider-Data: string
Optional
Response fields
completion_message_batch: Array<{ content, role, stop_reason, 1 more... }>
Request example
200Example
Completion ->
post/alpha/batch-inference/completion