The OpenAI Python SDK makes it simple to integrate powerful AI capabilities—like chat, transcription, vision, and embeddings—without needing complex frameworks or boilerplate code. In this quickstart, you’ll learn how to install the SDK, set up your API key securely, and use the core functions with practical Python examples. By the end, you’ll see how to build AI features directly with the SDK while understanding when frameworks or orchestration tools might be worth adding.
Full Program GitHub Repository
📹 Lesson Video: Embedded Below
Agentic AI is everywhere right now. Frameworks like LangChain, LangGraph, and CrewAI promise to handle orchestration, memory, and multi-step reasoning for you. In this program we use them too — but let’s be clear: you don’t need an agentic framework to build with LLMs.
At the core, all of these frameworks still call into the same SDKs that model providers release. SDKs are the “bare metal” layer: minimal abstraction, direct access to capabilities like chat, transcription, vision, and embeddings. Frameworks just wrap those SDKs to reduce boilerplate and enforce structure.
That extra scaffolding can be useful for complex systems, but sometimes all you want is a quick feature or a lightweight script. In those cases, working directly with the SDK is faster, simpler, and gives you more control.
In this lesson, we’ll focus on the OpenAI Python SDK — walking through installation, setup, and a few core functions so you can see how to build directly on top of OpenAI’s APIs. Later, we’ll also cover the OpenAI Agents SDK, which takes things further into multi-agent orchestration — OpenAI’s answer to LangGraph.
So what makes OpenAI’s SDK worth picking up first? Because it lets you drop AI into everyday workflows without dragging in a heavy framework.
Think about the small, repetitive tasks you could automate: summarizing a Slack thread so your team doesn’t miss the point. Turning a five-minute meeting recording into searchable notes. Pulling legible text out of a whiteboard photo after a brainstorming session. These are all cases where a few lines of code with the OpenAI SDK can save hours.
So why OpenAI’s SDK in particular (vs. SDKs from other LLM providers)? Two reasons stand out:

1. Breadth with minimal abstraction: chat, transcription, vision, and embeddings are all a few clean function calls away in one lightweight package.
2. Interoperability: OpenAI’s API design has become a de facto standard, and many other providers now expose OpenAI-compatible endpoints, so the same code travels well.
The result: a toolkit that’s lightweight, flexible, and increasingly interoperable. You can start small, stay close to the bare metal, and only add complexity when you really need it.
When building with LLMs, you have a choice: work close to the raw API, lean on a framework, or skip code entirely with a low-code tool. Each option comes with tradeoffs in setup, flexibility, and control.
Here’s how they stack up at a glance:

| Approach | Setup | Flexibility | Control |
| --- | --- | --- | --- |
| Raw HTTP (`requests`/`curl`) or SDK | Minimal | High | Full, down to each request |
| Agentic frameworks (LangChain, CrewAI, Agents SDK) | Heavier installs, steeper learning curve | Medium | Orchestration and memory handled for you |
| Low/no-code tools (Zapier, n8n, Make) | Fastest, no Python needed | Low | Limited to what the platform exposes |
So what does this mean in practice?
At the lowest level, some developers prefer working almost at the bare metal: sending HTTP requests with `requests` or `curl`, manually setting headers and parsing JSON. It works, but it’s tedious. SDKs take that pain away. They’re thin wrappers that give you direct access to the same endpoints with just a few clean function calls. You stay close to the metal, but without all the plumbing code.
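For contrast, here’s a rough sketch of that bare-metal approach with `requests`. It targets the chat completions endpoint the SDK wraps; the model name is just an example:

```python
# Calling the chat completions endpoint directly, no SDK involved.
import os
import requests

response = requests.post(
    "https://api.openai.com/v1/chat/completions",
    headers={
        "Authorization": f"Bearer {os.getenv('OPENAI_API_KEY')}",
        "Content-Type": "application/json",
    },
    json={
        "model": "gpt-4o-mini",
        "messages": [{"role": "user", "content": "Say hello in one short sentence."}],
    },
)
response.raise_for_status()  # fail loudly on HTTP errors
print(response.json()["choices"][0]["message"]["content"])
```

With the SDK, this entire call collapses into a single `client.chat.completions.create(...)` invocation, as you’ll see in the setup section below.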
At the other end of the spectrum are agentic frameworks like LangChain, LangGraph, AutoGen, CrewAI, or OpenAI’s own Agents SDK. They don’t just wrap the API — they add orchestration, memory, and chaining patterns. That extra scaffolding is invaluable for enterprise-grade assistants or multi-agent systems, but it comes with heavier installs, a steeper learning curve, and less flexibility in how you wire things up.
And then there’s the low/no-code world — tools like Zapier, n8n, or Make. These let you skip Python entirely. Drag a few blocks onto a canvas and you’ve got a working Slack bot that summarizes messages with GPT. Perfect for quick automation, but limited by what the platform lets you configure.
Our recommendation? Start with the SDK. You’ll learn the raw building blocks first, so when frameworks enter the picture later, you’ll know exactly what they’re doing under the hood.
Let’s get your environment ready. The setup is quick, and once you’re done you’ll be able to make your first API call with just a few lines of Python.
Step 1: Create an OpenAI account.
Go to platform.openai.com and sign up if you don’t already have an account.
Step 2: Generate an API key.
From your dashboard, create a new API key. This key is how your code authenticates with OpenAI’s servers.
⚠️ Important: Treat your API key like a password. Never paste it directly into scripts or share it in public repositories.
Step 3: Install the package.
```bash
pip install openai python-dotenv
```
Step 4: Store your key safely.
Create a `.env` file in your project root to keep secrets out of your code:

```
OPENAI_API_KEY=your-api-key-here
```
Step 5: Initialize the client.
```python
from openai import OpenAI
import os
from dotenv import load_dotenv

load_dotenv()  # Load variables from .env file

client = OpenAI(api_key=os.getenv("OPENAI_API_KEY"))
```
Step 6: Run a quick test.
```python
response = client.chat.completions.create(
    model="gpt-4o-mini",
    messages=[{"role": "user", "content": "Say hello in one short sentence."}]
)
print(response.choices[0].message.content)
```
If everything’s wired up, you should see a short reply like:
Hello! Nice to meet you.
No matter what you’re building, using the OpenAI SDK really boils down to three steps:

1. Initialize the client with your API key.
2. Call the endpoint you need, passing your input (text, audio, or an image).
3. Read the result from the response object.

That’s it. Everything else is just a variation of this pattern.
In the next sections, we’ll look at the four most common capabilities — chat completions, audio transcription, image analysis, and text embeddings — each with a simple example you can run in Python.
Imagine a Slack thread that drags on with people repeating themselves, adding side comments, and burying the actual decision. You don’t want to read 20 messages — you just want the gist. That’s where chat completions shine.
```python
from openai import OpenAI
import os
from dotenv import load_dotenv

load_dotenv()
client = OpenAI(api_key=os.getenv("OPENAI_API_KEY"))

response = client.chat.completions.create(
    model="gpt-4o-mini",
    messages=[
        {"role": "system", "content": "You are a concise assistant."},
        {"role": "user", "content": """Summarize this Slack thread:
- John: Can we push the deadline? I'm swamped.
- Priya: Maybe Monday?
- Alex: Docs are 80% ready, just need final edits.
- Sarah: I’ll handle the charts.
- Priya: Friday might be better so everyone has the weekend free.
- John: Friday works. Thanks.
"""}
    ]
)
print(response.choices[0].message.content)
```
Sample output:
Summary: Deadline moved to Friday. Docs need final edits. Sarah will handle charts.
👉 Takeaway: Chat completions are versatile. You can use the same endpoint for summarization, Q&A, translation, or even lightweight chatbots — all with just a few lines of Python.
Try it yourself: Run the code and experiment with different prompts for summarization, translation, or other tasks.
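For example, here’s a minimal variation (reusing the `client` from the setup section) that turns the same endpoint into a translator; only the prompts change:

```python
# Same endpoint, different task: translation instead of summarization.
response = client.chat.completions.create(
    model="gpt-4o-mini",
    messages=[
        {"role": "system", "content": "You translate English text into French."},
        {"role": "user", "content": "The deadline moved to Friday."},
    ],
)
print(response.choices[0].message.content)
```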
Have a quick meeting recording, a voice memo, or a podcast snippet you’d rather skim as text? The Whisper model in the OpenAI SDK makes transcription just a few lines of code.
```python
from openai import OpenAI
import os
from dotenv import load_dotenv

load_dotenv()
client = OpenAI(api_key=os.getenv("OPENAI_API_KEY"))

# Transcribe an audio file
with open("meeting_clip.mp3", "rb") as audio_file:
    transcript = client.audio.transcriptions.create(
        model="whisper-1",
        file=audio_file
    )

print(transcript.text)
```
Sample output:
"Project deadline moved to Friday. Sarah will finish the charts. Docs need final edits."
👉 Takeaway: Audio transcription is one of the easiest ways to add immediate value — turning unstructured speech into searchable, sharable notes for your team.
Whisper supports many audio formats including mp3, mp4, wav, and more. The file size limit is 25MB.
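If you want more control, the transcription endpoint also accepts optional parameters. A minimal sketch, assuming the same client and audio file as above:

```python
# Optional parameters: a language hint can improve accuracy, and
# response_format="text" returns a plain string instead of an object.
with open("meeting_clip.mp3", "rb") as audio_file:
    transcript = client.audio.transcriptions.create(
        model="whisper-1",
        file=audio_file,
        language="en",           # ISO-639-1 code for the spoken language
        response_format="text",
    )

print(transcript)  # already a plain string in this format
```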
Ever snapped a photo of a whiteboard after a brainstorm, only to forget half of what it meant? With GPT-4o’s vision capabilities, you can feed the image directly into the SDK and get a clean description or extraction of the content.
```python
from openai import OpenAI
import os, base64
from dotenv import load_dotenv

load_dotenv()
client = OpenAI(api_key=os.getenv("OPENAI_API_KEY"))

# Encode image as base64
def encode_image(image_path):
    with open(image_path, "rb") as f:
        return base64.b64encode(f.read()).decode("utf-8")

base64_image = encode_image("whiteboard.jpg")

response = client.chat.completions.create(
    model="gpt-4o-mini",
    messages=[
        {
            "role": "user",
            "content": [
                {"type": "text", "text": "Summarize the notes from this whiteboard image."},
                {"type": "image_url", "image_url": {"url": f"data:image/jpeg;base64,{base64_image}"}}
            ]
        }
    ]
)
print(response.choices[0].message.content)
```
Sample output:
"Whiteboard notes: Deadline Friday. Sarah = charts. Docs need final edits."
👉 Takeaway: Vision models let you capture ideas from physical spaces — whiteboards, sketches, or screenshots — and make them digital, searchable, and actionable.
GPT-4o and GPT-4o mini support vision capabilities. Images should be under 20MB.
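If your image is already hosted somewhere public, you can skip the base64 step and pass its URL directly. A quick sketch, reusing the client from before (the URL below is a placeholder):

```python
# Pass a hosted image by URL instead of base64-encoding a local file.
response = client.chat.completions.create(
    model="gpt-4o-mini",
    messages=[
        {
            "role": "user",
            "content": [
                {"type": "text", "text": "Describe this image in one sentence."},
                {"type": "image_url", "image_url": {"url": "https://example.com/whiteboard.jpg"}},
            ],
        }
    ],
)
print(response.choices[0].message.content)
```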
Ever wished you could search by meaning, not just keywords? Embeddings turn text into vectors that capture semantic similarity. With them, you can build smarter document search, clustering, or recommendation systems.
```python
from openai import OpenAI
import os
from dotenv import load_dotenv

load_dotenv()
client = OpenAI(api_key=os.getenv("OPENAI_API_KEY"))

response = client.embeddings.create(
    model="text-embedding-3-small",
    input="Deadline moved to Friday. Sarah will handle charts. Docs need edits."
)

embedding = response.data[0].embedding
print(f"Embedding dimension: {len(embedding)}")
print(f"First 5 values: {embedding[:5]}")
```
Sample output:
Embedding dimension: 1536
First 5 values: [-0.012, 0.021, 0.034, -0.008, 0.017]
👉 Takeaway: Embeddings give you a mathematical “fingerprint” of text, enabling semantic search and clustering that goes far beyond plain keyword matching.
Use embeddings to measure text similarity by calculating cosine similarity between vectors. The `text-embedding-3-small` model is cost-effective, while `text-embedding-3-large` offers higher accuracy.
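To make that concrete, here’s a small sketch of cosine similarity using only the standard library (reusing the client from above; the two sentences are just examples):

```python
import math

def cosine_similarity(a, b):
    # Dot product divided by the product of the vector magnitudes.
    dot = sum(x * y for x, y in zip(a, b))
    return dot / (math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(x * x for x in b)))

texts = ["Deadline moved to Friday.", "The due date is now the end of the week."]
response = client.embeddings.create(model="text-embedding-3-small", input=texts)
vec_a, vec_b = response.data[0].embedding, response.data[1].embedding

print(f"Similarity: {cosine_similarity(vec_a, vec_b):.3f}")  # closer to 1 = more similar
```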
Try combining two of the SDK functions you just learned:

1. Transcribe a short audio clip with Whisper.
2. Pass the transcript to chat completions for a concise summary.
You’ll end up with a pipeline that takes raw speech and delivers a clean, searchable summary — all in under 20 lines of code.
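Here’s one way that pipeline might look, as a sketch that assumes a local file named meeting_clip.mp3 and the client from the setup section:

```python
# Step 1: Transcribe the audio with Whisper.
with open("meeting_clip.mp3", "rb") as audio_file:
    transcript = client.audio.transcriptions.create(
        model="whisper-1",
        file=audio_file,
    )

# Step 2: Summarize the transcript with chat completions.
summary = client.chat.completions.create(
    model="gpt-4o-mini",
    messages=[
        {"role": "system", "content": "You are a concise assistant."},
        {"role": "user", "content": f"Summarize this transcript in three bullet points:\n{transcript.text}"},
    ],
)
print(summary.choices[0].message.content)
```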
👉 Once you’ve tried that, think of other combos: summarize text embeddings into clusters, describe an image and then translate it, or search across Slack threads with embeddings before asking the model to draft a recap. The SDK gives you the building blocks — your job is to connect them.
The skills you’re learning with the OpenAI SDK don’t just apply to OpenAI. Several major providers — including Google (Gemini) and Anthropic (Claude) — now expose OpenAI-compatible endpoints.
That means you can often take the same code, swap in a new API key, change the `base_url`, and run it against a different model family:
```python
from openai import OpenAI
import os

# Example 1: Call Google's Gemini through its OpenAI-compatible endpoint
gemini_client = OpenAI(
    api_key="GEMINI_API_KEY",
    base_url="https://generativelanguage.googleapis.com/v1beta/openai/"
)

gemini_response = gemini_client.chat.completions.create(
    model="gemini-2.5-flash",
    messages=[
        {"role": "system", "content": "You are a helpful assistant."},
        {"role": "user", "content": "Explain to me how AI works"}
    ]
)
print(gemini_response.choices[0].message.content)

# Example 2: Call Anthropic's Claude through its OpenAI-compatible endpoint
anthropic_client = OpenAI(
    api_key="ANTHROPIC_API_KEY",               # Your Claude API key
    base_url="https://api.anthropic.com/v1/"   # The Claude API endpoint
)

anthropic_response = anthropic_client.chat.completions.create(
    model="claude-sonnet-4-5",  # Anthropic model name
    messages=[
        {"role": "system", "content": "You are a helpful assistant."},
        {"role": "user", "content": "Who are you?"}
    ],
)
print(anthropic_response.choices[0].message.content)

# Example 3: Call a local Ollama model (e.g., Llama 3 or Mistral)
# 💡 You must have Ollama installed and running locally.
# Start the Ollama server first by running `ollama serve` in your terminal.
# Then, this code will connect to your local endpoint.
ollama_client = OpenAI(
    base_url="http://localhost:11434/v1",  # Ollama serves an OpenAI-compatible API locally
    api_key="ollama"                       # Placeholder; not required for local use
)

ollama_response = ollama_client.chat.completions.create(
    model="llama3",  # Or any model you've pulled via `ollama pull llama3`
    messages=[
        {"role": "system", "content": "You are a concise, technical explainer."},
        {"role": "user", "content": "Summarize how local inference differs from cloud-based LLMs."}
    ]
)
print(ollama_response.choices[0].message.content)
```
And it doesn’t stop there — platforms like OpenRouter aggregate dozens of providers behind this same API spec, so the exact same code can talk to many different models.
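As a sketch (the base URL is OpenRouter’s OpenAI-compatible endpoint; the model name is illustrative, so check their catalog):

```python
from openai import OpenAI
import os

openrouter_client = OpenAI(
    api_key=os.getenv("OPENROUTER_API_KEY"),
    base_url="https://openrouter.ai/api/v1",  # OpenAI-compatible aggregator endpoint
)

response = openrouter_client.chat.completions.create(
    model="meta-llama/llama-3-8b-instruct",  # OpenRouter uses provider/model names
    messages=[{"role": "user", "content": "Say hello in one short sentence."}],
)
print(response.choices[0].message.content)
```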
👉 Takeaway: OpenAI’s API design has quietly become the lingua franca of LLMs. Knowing the SDK doesn’t tie you to one provider — it gives you a portable skillset you can reuse across the industry.
Working with the OpenAI SDK is straightforward, but a few habits will save you headaches (and money):
- Keep secrets out of code: load your API key from a `.env` file, never hard-code it in scripts.
- Right-size the model: for most everyday tasks, `gpt-4o-mini` is more than enough.
- Handle failures: wrap API calls in `try/except` so errors don’t crash your app (see the sketch below).
- Watch output costs: a common mistake is forgetting `max_tokens` and accidentally generating a huge (and costly) output.

👉 Pro Tip: Start small, test often, then scale up. It keeps your apps fast, cheap, and reliable.
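To make the error-handling and cost habits concrete, here’s a minimal sketch (assuming the client from the setup section; `OpenAIError` is the SDK’s base exception class):

```python
from openai import OpenAIError

try:
    response = client.chat.completions.create(
        model="gpt-4o-mini",
        max_tokens=100,  # cap output length so a runaway response can't surprise you
        messages=[{"role": "user", "content": "Summarize: deadline moved to Friday."}],
    )
    print(response.choices[0].message.content)
except OpenAIError as e:
    # Network issues, rate limits, and auth problems all surface here.
    print(f"API call failed: {e}")
```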
You’ve seen how quickly the OpenAI Python SDK gets you going: install it in minutes, connect your key, and start building with chat, transcription, vision, and embeddings. Along the way, we covered security basics, cost control, and common mistakes to avoid, so you can build confidently without surprises.
Where do you go from here? Start experimenting. Swap prompts, models, and parameters to see how outputs change. Try combining features — for example, transcribe audio with Whisper, then summarize it with chat completions. Explore multimodal inputs with images or audio. Or just build something small and useful, like a Slack summarizer or searchable document index.
Frameworks like LangChain or OpenAI’s own Agents SDK add orchestration and multi-agent design patterns — but now you know the raw building blocks they sit on. Start simple, then layer on complexity when the project truly calls for it.