There are two ways to enumerate the catalog at runtime, and they cover different audiences.Documentation Index
Fetch the complete documentation index at: https://docs.flex.ai/llms.txt
Use this file to discover all available pages before exploring further.
| Endpoint | Auth | Returns | Use when |
|---|---|---|---|
GET /api/models | None (public) | Every model in the catalog — text, code, reasoning, embeddings, and vision/multimodal. Includes per-token pricing, capability tags, and live health_status. | You want one public call that surfaces the whole catalog with capabilities and pricing. |
GET /v1/models | Bearer key | Text, chat, and embedding models in OpenAI’s Model shape. | Your client already speaks OpenAI’s /v1/models. |
/v1/models returns the OpenAI-shaped list. For richer metadata — capability tags, health, and per-token pricing — use /api/models.
Calling /api/models
No auth, no headers, no params. Returns a JSON array of model objects.
category— one oftext,code,reasoning,multimodal,embedding. Coarse-grained.supports[]— capability tags likechat,tool_use,streaming,vision,reasoning,embeddings. This is what to filter on when you care about a specific capability.model_name— the value to pass asmodelin subsequent API calls. (slugis the dashboard URL form;model_nameis the API form.)input_per_mtok/output_per_mtok— per-token pricing (USD per million tokens).
Per-capability discovery flow
Each subsection: how to find the models, then the endpoint to call once you have amodel_name.
Chat & text
POST /v1/chat/completions. See streaming and tool use for the common patterns.
Vision (image input on chat)
POST /v1/chat/completions and pass image parts in the content array. The full pattern lives in the vision guide.
Embeddings
POST /v1/embeddings. See the embeddings guide.
See also
- Model catalog — human-readable table of every hosted model, with capabilities and pricing.
- OpenAI compatibility — what’s in and out of
/v1/*, including the scope of/v1/models. - Billing — how per-token pricing maps to charges.