FruxonDocs

Google Media

Generate images, videos, and transcribe audio using Google models (Imagen, Veo, Gemini)

The Google Media integration lets your agents generate images, create videos, and transcribe audio using Google's AI models.

Prerequisites

You need:

  • Google Cloud API Key — Create one in the Google Cloud Console with access to the Gemini / Imagen APIs.

Setup

  1. Open your agent in Agent Studio
  2. In the Integrations panel, click Add Integration Config
  3. Select Google Media from the integration list
  4. Give the config a display name (e.g., "Google Media - Production")
  5. Paste your API Key
  6. Save the agent revision

Using in Your Agent

  1. In an Agent Step, attach Google Media tools from the tools panel
  2. The agent uses your API key for all Google Media API calls
  3. Tools are referenced as google_media.imagen-4.0-generate-001, etc.

Available Tools

Image Generation

ToolDescription
imagen-4.0-generate-001Generate images with Imagen 4
imagen-4.0-ultra-generate-001Generate images with Imagen 4 Ultra (highest quality)
imagen-4.0-fast-generate-001Generate images with Imagen 4 Fast
gemini-2.5-flash-imageGenerate images with Gemini 2.5 Flash
gemini-3-pro-image-previewGenerate images with Gemini 3 Pro

Video Generation

ToolDescription
veo-2.0-generate-001Generate videos with Veo 2
veo-3.0-generate-001Generate videos with Veo 3
veo-3.0-fast-generate-001Generate videos with Veo 3 Fast
veo-3.1-generate-previewGenerate videos with Veo 3.1
veo-3.1-fast-generate-previewGenerate videos with Veo 3.1 Fast

Audio

ToolDescription
transcribe_audioTranscribe audio to text using Google models

On this page