Endpoints overview

Complete list of all API endpoints available through the Prism Gateway.

About

Prism exposes 97 endpoints across 20+ categories. All inference endpoints live under /v1/ and follow the OpenAI API format. Admin endpoints live under /-/ and require an admin token.

Base URL

All endpoints are relative to your Prism gateway URL:

https://gateway.futureagi.com

Inference endpoints use the /v1/ prefix and accept your virtual API key (sk-prism-...) as a Bearer token. Admin endpoints use the /-/ prefix and require the admin token.


Chat and completions

The primary endpoints for generating text with LLMs.

MethodPathDescription
POST/v1/chat/completionsChat completion (streaming and non-streaming)
POST/v1/completionsText completion (legacy)
POST/v1/count_tokensCount tokens for a set of messages

MethodPathDescription
POST/v1/embeddingsGenerate text embeddings
POST/v1/rerankRerank text passages by relevance
POST/v1/searchSearch API
POST/v1/ocrOptical character recognition

Audio

MethodPathDescription
POST/v1/audio/speechText-to-speech
POST/v1/audio/speech/streamStreaming text-to-speech
POST/v1/audio/transcriptionsSpeech-to-text (Whisper)
POST/v1/audio/translationsTranslate audio to English

Images and video

MethodPathDescription
POST/v1/images/generationsGenerate images from prompts
POST/v1/videosSubmit video generation job
GET/v1/videosList video jobs
GET/v1/videos/{video_id}Get video job status
DELETE/v1/videos/{video_id}Cancel video job

Files

MethodPathDescription
POST/v1/filesUpload a file
GET/v1/filesList files
GET/v1/files/{file_id}Get file metadata
GET/v1/files/{file_id}/contentDownload file content
DELETE/v1/files/{file_id}Delete a file

Vector stores

Used with the Assistants API for file-based retrieval.

MethodPathDescription
POST/v1/vector_storesCreate vector store
GET/v1/vector_storesList vector stores
GET/v1/vector_stores/{id}Get vector store
POST/v1/vector_stores/{id}Update vector store
DELETE/v1/vector_stores/{id}Delete vector store
POST/v1/vector_stores/{id}/searchSearch a vector store
POST/v1/vector_stores/{id}/filesAdd file to vector store
GET/v1/vector_stores/{id}/filesList files in vector store
DELETE/v1/vector_stores/{id}/files/{file_id}Remove file from vector store
POST/v1/vector_stores/{id}/file_batchesBatch add files

Assistants API

Full proxy for the OpenAI Assistants API. Create assistants, manage threads, send messages, and execute runs.

Assistants

MethodPathDescription
POST/v1/assistantsCreate assistant
GET/v1/assistantsList assistants
GET/v1/assistants/{id}Get assistant
POST/v1/assistants/{id}Update assistant
DELETE/v1/assistants/{id}Delete assistant

Threads

MethodPathDescription
POST/v1/threadsCreate thread
GET/v1/threads/{id}Get thread
POST/v1/threads/{id}Update thread
DELETE/v1/threads/{id}Delete thread

Messages

MethodPathDescription
POST/v1/threads/{id}/messagesAdd message
GET/v1/threads/{id}/messagesList messages
GET/v1/threads/{id}/messages/{msg_id}Get message
POST/v1/threads/{id}/messages/{msg_id}Update message
DELETE/v1/threads/{id}/messages/{msg_id}Delete message

Runs

MethodPathDescription
POST/v1/threads/{id}/runsCreate run
GET/v1/threads/{id}/runsList runs
GET/v1/threads/{id}/runs/{run_id}Get run
POST/v1/threads/{id}/runs/{run_id}Update run
POST/v1/threads/{id}/runs/{run_id}/cancelCancel run
POST/v1/threads/{id}/runs/{run_id}/submit_tool_outputsSubmit tool outputs
GET/v1/threads/{id}/runs/{run_id}/stepsList run steps
GET/v1/threads/{id}/runs/{run_id}/steps/{step_id}Get run step
POST/v1/threads/runsCreate thread and run in one call

Responses API

MethodPathDescription
POST/v1/responsesCreate response
GET/v1/responses/{id}Get response
DELETE/v1/responses/{id}Delete response

Async inference

MethodPathDescription
GET/v1/async/{job_id}Get async job status and result
DELETE/v1/async/{job_id}Cancel async job

Async jobs are created by sending a regular chat completion request with async mode enabled. The batch API is available via admin endpoints below.


Scheduled completions

MethodPathDescription
POST/v1/scheduledSchedule a completion for later
GET/v1/scheduledList scheduled jobs
GET/v1/scheduled/{job_id}Get scheduled job
DELETE/v1/scheduled/{job_id}Cancel scheduled job

Realtime (WebSocket)

MethodPathDescription
GET/v1/realtimeUpgrade to WebSocket for real-time audio/video streaming

Native format passthrough

For clients that prefer a provider’s native API format instead of the OpenAI format.

MethodPathDescription
POST/v1/messagesAnthropic Messages API (native format)
POST/v1/messages/count_tokensAnthropic token counting
POST/v1beta/models/{model}:generateContentGoogle GenAI generate content
POST/v1beta/models/{model}:streamGenerateContentGoogle GenAI streaming

Models

MethodPathDescription
GET/v1/modelsList all available models
GET/v1/models/{model}Get model details

MCP (Model Context Protocol)

Prism acts as an MCP server, aggregating tools from upstream MCP tool servers.

MethodPathDescription
POST/mcpMCP protocol endpoint
GET/mcpMCP SSE streaming endpoint

Management

MethodPathDescription
GET/-/mcp/statusMCP server status and stats
GET/-/mcp/toolsList available tools
GET/-/mcp/resourcesList MCP resources
GET/-/mcp/promptsList MCP prompts
POST/-/mcp/testTest tool execution

A2A (Agent-to-Agent)

MethodPathDescription
GET/.well-known/agent.jsonAgent capabilities card
POST/a2aA2A protocol messages
GET/v1/agentsList registered A2A agents

Admin: key management

Requires admin token.

MethodPathDescription
POST/-/keysCreate API key
GET/-/keysList keys
GET/-/keys/{key_id}Get key details
PUT/-/keys/{key_id}Update key
DELETE/-/keys/{key_id}Revoke key
POST/-/keys/{key_id}/creditsAdd credits to key

Admin: organization config

MethodPathDescription
GET/-/orgs/{org_id}/configGet org config
PUT/-/orgs/{org_id}/configSet org config
DELETE/-/orgs/{org_id}/configDelete org config
GET/-/orgs/configsList all org configs
POST/-/orgs/configs/bulkBulk load configs

Admin: operations

MethodPathDescription
GET/-/cluster/nodesList cluster nodes
POST/-/admin/providers/{id}/rotateStart key rotation
GET/-/admin/providers/{id}/rotationGet rotation status
POST/-/admin/providers/{id}/rotate/promotePromote rotated key
POST/-/admin/providers/{id}/rotate/rollbackRollback rotation
POST/-/batchesSubmit batch job
GET/-/batches/{batch_id}Get batch status
POST/-/batches/{batch_id}/cancelCancel batch
GET/-/shadow/statsShadow testing statistics

Health and diagnostics

MethodPathDescription
GET/healthzLiveness probe
GET/livezLiveness probe (alias)
GET/readyzReadiness probe
POST/-/reloadReload config from file
GET/-/configServer config summary
GET/-/metricsPrometheus metrics
GET/-/health/providersProvider health status
GET/-/health/providers/{org_id}Org-specific provider health

Next Steps

Was this page helpful?

Questions & Discussion