Google Media
Generate images, videos, and transcribe audio using Google models (Imagen, Veo, Gemini)
The Google Media integration lets your agents generate images, create videos, and transcribe audio using Google's AI models.
Prerequisites
You need:
- Google Cloud API Key — Create one in the Google Cloud Console with access to the Gemini / Imagen APIs.
Setup
- Open your agent in Agent Studio
- In the Integrations panel, click Add Integration Config
- Select Google Media from the integration list
- Give the config a display name (e.g., "Google Media - Production")
- Paste your API Key
- Save the agent revision
Using in Your Agent
- In an Agent Step, attach Google Media tools from the tools panel
- The agent uses your API key for all Google Media API calls
- Tools are referenced as
google_media.imagen-4.0-generate-001, etc.
Available Tools
Image Generation
| Tool | Description |
|---|---|
imagen-4.0-generate-001 | Generate images with Imagen 4 |
imagen-4.0-ultra-generate-001 | Generate images with Imagen 4 Ultra (highest quality) |
imagen-4.0-fast-generate-001 | Generate images with Imagen 4 Fast |
gemini-2.5-flash-image | Generate images with Gemini 2.5 Flash |
gemini-3-pro-image-preview | Generate images with Gemini 3 Pro |
Video Generation
| Tool | Description |
|---|---|
veo-2.0-generate-001 | Generate videos with Veo 2 |
veo-3.0-generate-001 | Generate videos with Veo 3 |
veo-3.0-fast-generate-001 | Generate videos with Veo 3 Fast |
veo-3.1-generate-preview | Generate videos with Veo 3.1 |
veo-3.1-fast-generate-preview | Generate videos with Veo 3.1 Fast |
Audio
| Tool | Description |
|---|---|
transcribe_audio | Transcribe audio to text using Google models |