Inference
inference
Methods
Chat Completion -> { completion_message, logprobs } | { event }
post/alpha/inference/chat-completion
post/alpha/inference/completion
post/alpha/inference/embeddings
Parameters
X-LlamaStack-Client-Version: string
Optional
X-LlamaStack-Provider-Data: string
Optional
Response fields
Request example
200Example
Domain types
CompletionResponse = { content, stop_reason, logprobs }
EmbeddingsResponse = { embeddings }
TokenLogProbs = { logprobs_by_token }