Google Media

Generate images, videos, and transcribe audio using Google models (Imagen, Veo, Gemini)

The Google Media integration lets your agents generate images, create videos, and transcribe audio using Google's AI models.

Prerequisites

You need:

Google Cloud API Key — Create one in the Google Cloud Console with access to the Gemini / Imagen APIs.

Setup

Open your agent in Agent Studio
In the Integrations panel, click Add Integration Config
Select Google Media from the integration list
Give the config a display name (e.g., "Google Media - Production")
Paste your API Key
Save the agent revision

Using in Your Agent

In an Agent Step, attach Google Media tools from the tools panel
The agent uses your API key for all Google Media API calls
Tools are referenced as google_media.imagen-4.0-generate-001, etc.

Available Tools

Image Generation

Tool	Description
`imagen-4.0-generate-001`	Generate images with Imagen 4
`imagen-4.0-ultra-generate-001`	Generate images with Imagen 4 Ultra (highest quality)
`imagen-4.0-fast-generate-001`	Generate images with Imagen 4 Fast
`gemini-2.5-flash-image`	Generate images with Gemini 2.5 Flash
`gemini-3-pro-image-preview`	Generate images with Gemini 3 Pro

Video Generation

Tool	Description
`veo-2.0-generate-001`	Generate videos with Veo 2
`veo-3.0-generate-001`	Generate videos with Veo 3
`veo-3.0-fast-generate-001`	Generate videos with Veo 3 Fast
`veo-3.1-generate-preview`	Generate videos with Veo 3.1
`veo-3.1-fast-generate-preview`	Generate videos with Veo 3.1 Fast

Audio

Tool	Description
`transcribe_audio`	Transcribe audio to text using Google models

Google Maps

Search places, geocode addresses, get directions, and calculate distances using Google Maps Platform

Grafana

Grafana observability platform integration for dashboards, data sources, alert rules, and annotations

On this page

Using in Your Agent

Available Tools

Image Generation

Video Generation