# Quickstart

> Get an API key and make your first inference request in under two minutes.

## 1. Sign up

Visit [tokens.flex.ai](https://tokens.flex.ai) and request access. Once approved, you'll receive an invite email. Activate your account from it and you'll land in the [dashboard](https://tokens.flex.ai/dashboard).

## 2. Add billing, then create an API key

You need a billing address and a credit card on file before you can create keys. Add both from the dashboard's **Billing** page, address first, then open **API Keys** and click **Create key**. Copy the `sk-…` value right away: it's shown only once.

New accounts start with **\$10 of free credit**. Your first requests draw from it before your card is charged.

## 3. Make a request

Pick a text model from [the catalog](https://flex.ai/models). The examples below use `Llama-3.3-70B-Instruct-FP8` and read your key from the `FLEXAI_API_KEY` environment variable, so set that to your `sk-…` value first.

```bash cURL
curl https://tokens.flex.ai/v1/chat/completions \
  -H "Authorization: Bearer $FLEXAI_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "Llama-3.3-70B-Instruct-FP8",
    "messages": [
      {"role": "user", "content": "In one sentence, what is FlexAI?"}
    ]
  }'
```

```python Python
# pip install openai
import os
from openai import OpenAI

# Point the OpenAI SDK at the FlexAI endpoint.
client = OpenAI(
    base_url="https://tokens.flex.ai/v1",
    api_key=os.environ["FLEXAI_API_KEY"],
)

response = client.chat.completions.create(
    model="Llama-3.3-70B-Instruct-FP8",
    messages=[{"role": "user", "content": "In one sentence, what is FlexAI?"}],
)

print(response.choices[0].message.content)
```

```typescript TypeScript
// npm install openai
import OpenAI from "openai";

// Point the OpenAI SDK at the FlexAI endpoint.
const client = new OpenAI({
  baseURL: "https://tokens.flex.ai/v1",
  apiKey: process.env.FLEXAI_API_KEY,
});

const response = await client.chat.completions.create({
  model: "Llama-3.3-70B-Instruct-FP8",
  messages: [{ role: "user", content: "In one sentence, what is FlexAI?" }],
});

console.log(response.choices[0].message.content);
```

You should see a one-sentence description come back. That's it: your key works.

## Next steps

- Deliver tokens as they're generated instead of waiting for the full response.
- Let the model invoke your functions and work with their output.
- Filter the live catalog from code to find the right text, vision, or embedding model.
- Rate limits, credit exhaustion behavior, and the 402 response.
- Every 4xx/5xx response we return, with example bodies and how to recover.
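
As a taste of the first next step, delivering tokens as they're generated, here is a minimal sketch that extends the Python example above. It assumes the endpoint accepts `stream=True` and returns chunks shaped like those of other OpenAI-compatible APIs. This page doesn't confirm that, so check the streaming guide before relying on it. The helper names `iter_text` and `stream_reply` are hypothetical and exist only for illustration.

```python
# pip install openai
import os
from typing import Iterable, Iterator


def iter_text(chunks: Iterable) -> Iterator[str]:
    """Yield the text fragments carried by OpenAI-style streaming chunks."""
    for chunk in chunks:
        # Some chunks may carry no choices, or a delta without content; skip them.
        if not chunk.choices:
            continue
        piece = chunk.choices[0].delta.content
        if piece:
            yield piece


def stream_reply(prompt: str, model: str = "Llama-3.3-70B-Instruct-FP8") -> str:
    """Print a reply as it arrives and return the full text.

    Assumption: the endpoint honors stream=True like other
    OpenAI-compatible APIs.
    """
    # Imported here so iter_text can be used without the SDK installed.
    from openai import OpenAI

    client = OpenAI(
        base_url="https://tokens.flex.ai/v1",
        api_key=os.environ["FLEXAI_API_KEY"],
    )
    stream = client.chat.completions.create(
        model=model,
        messages=[{"role": "user", "content": prompt}],
        stream=True,
    )

    parts = []
    for piece in iter_text(stream):
        print(piece, end="", flush=True)  # show each fragment immediately
        parts.append(piece)
    print()
    return "".join(parts)
```

With `FLEXAI_API_KEY` set, calling `stream_reply("In one sentence, what is FlexAI?")` should print the answer piece by piece rather than all at once. Keeping the chunk handling in `iter_text` separates it from the network call, so it can be exercised with stand-in objects.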