OpenAI Media
Generate images and transcribe audio using OpenAI models (GPT-Image, DALL-E, Whisper)
The OpenAI Media integration lets your agents generate images and transcribe audio using OpenAI's models.
Prerequisites
You need:
- OpenAI API Key — Get one from the OpenAI Platform.
Setup
- Open your agent in Agent Studio
- In the Integrations panel, click Add Integration Config
- Select OpenAI Media from the integration list
- Give the config a display name (e.g., "OpenAI Media")
- Paste your API Key
- Save the agent revision
Using in Your Agent
- In an Agent Step, attach OpenAI Media tools from the tools panel
- The agent uses your API key for all OpenAI Media API calls
- Tools are referenced as
openai_media.gpt-image-1, etc.
Available Tools
Image Generation
| Tool | Description |
|---|---|
gpt-image-1 | Generate images with GPT Image 1 |
gpt-image-1-mini | Generate images with GPT Image 1 Mini (faster, lower cost) |
gpt-image-1.5 | Generate images with GPT Image 1.5 |
Audio
| Tool | Description |
|---|---|
transcribe_audio | Transcribe audio to text using Whisper |