Gemini Audio Mcp

Name: Gemini Audio Mcp
Author: jxoesneon

jxoesneon/gemini-audio-mcp

Generate voiceovers, music beds, and sound effects from your agent using Google Gemini 2.5 and Lyria 3, without hand-rolling audio API clients.

Overview

io.github.jxoesneon/gemini-audio-mcp is a Build-phase MCP server that lets AI assistants generate audio, music, and voice through Google Gemini 2.5 and Lyria 3.

What is this MCP server?

MCP tools for audio, music, and voice generation on Gemini 2.5 and Lyria 3
Distributed as OCI image ghcr.io/jxoesneon/gemini-audio-mcp:0.1.0 with stdio transport
Requires one secret environment variable: GEMINI_API_KEY, obtained from Google AI Studio
Positioned as a high-performance generation server for agent-driven creative pipelines
MCP server and registry version 0.1.0: an early release, so expect the API surface to evolve
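
To make "stdio transport" concrete: an MCP client talks to the server by writing newline-delimited JSON-RPC 2.0 messages to the server's stdin and reading replies from its stdout. The sketch below only builds the first few messages a client would send; it does not start the container. The protocolVersion string and the clientInfo values are assumptions chosen for illustration, and the listing does not name the server's tools, so the example stops at the generic tools/list request.

```python
import json


def jsonrpc_request(request_id, method, params=None):
    """Build one newline-terminated JSON-RPC 2.0 request line."""
    message = {"jsonrpc": "2.0", "id": request_id, "method": method}
    if params is not None:
        message["params"] = params
    return json.dumps(message) + "\n"


def jsonrpc_notification(method):
    """Build one newline-terminated JSON-RPC 2.0 notification (no id)."""
    return json.dumps({"jsonrpc": "2.0", "method": method}) + "\n"


# 1. Handshake request. The version string and client name are
#    illustrative assumptions, not values taken from this server.
initialize_line = jsonrpc_request(
    1,
    "initialize",
    {
        "protocolVersion": "2025-03-26",
        "capabilities": {},
        "clientInfo": {"name": "example-client", "version": "0.0.1"},
    },
)

# 2. Notification sent once the server has answered the handshake.
initialized_line = jsonrpc_notification("notifications/initialized")

# 3. Ask the server which tools it exposes.
list_tools_line = jsonrpc_request(2, "tools/list")

if __name__ == "__main__":
    for line in (initialize_line, initialized_line, list_tools_line):
        print(line, end="")
```

In practice an MCP host such as Claude Code or Cursor sends these messages for you; the sketch is only meant to show what travels over the pipe.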

Compatible agents: Claude Code, Cursor, Codex, or any other MCP-compatible agent

What problem does it solve?

You need custom audio for demos and product UI, but bouncing between Google AI Studio, download folders, and your repo breaks an agent-centric flow.

Who is it for?

Indie builders already on Google AI Studio who want Cursor or Claude Code to produce Lyria music and Gemini speech in one integration step.

Skip if: your team is barred from using cloud generative audio, you need pro DAW mastering workflows, or your product only needs speech-to-text without generation.

What do I get? / Deliverables

Your agent can request generated voice, music, and sound assets through MCP so files land ready for app integration or marketing drafts.

Generated voice, music, or sound clips produced through MCP tool calls
Agent-ready audio assets for prototypes, notifications, and marketing drafts

Journey fit

Primary fit

Build: Integrations & version control

Audio generation is wired in while building demos, onboarding flows, and marketing assets, which makes it an integration task rather than distribution analytics. The server fronts the Gemini and Lyria APIs via MCP (OCI image plus GEMINI_API_KEY), so it fits agent tooling for multimedia product surfaces.

How it compares

A Gemini/Lyria generative audio MCP server, not a local Whisper transcription tool or a stock-music marketplace.

Common Questions / FAQ

Who is io.github.jxoesneon/gemini-audio-mcp for?

Solo builders shipping agent-driven apps or content who want Gemini 2.5 and Lyria 3 audio generation inside MCP-compatible coding tools.

When should I use io.github.jxoesneon/gemini-audio-mcp?

Use it while building prototypes, app sound design, or launch creatives, when cloud-generated voice and music speed up iteration ahead of final production assets.

How do I add io.github.jxoesneon/gemini-audio-mcp to my agent?

Run the OCI image ghcr.io/jxoesneon/gemini-audio-mcp:0.1.0 as a stdio MCP server, set GEMINI_API_KEY from Google AI Studio, and register the server in Claude Code, Cursor, or your MCP host.
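
The registration step can be sketched as a small script that produces a host configuration entry. This assumes Docker is the container runtime and that the host reads the common "mcpServers" JSON layout (a command, its args, and an env map); check your host's documentation for the exact file and schema. The server name "gemini-audio" and the placeholder key are hypothetical. Only the image reference and the GEMINI_API_KEY variable name come from the listing.

```python
import json

# Image reference as published in the listing.
IMAGE = "ghcr.io/jxoesneon/gemini-audio-mcp:0.1.0"


def build_server_entry(api_key):
    """Return one server entry that launches the image over stdio."""
    if not api_key:
        raise ValueError("GEMINI_API_KEY must not be empty")
    return {
        "command": "docker",
        # -i keeps stdin open for the stdio transport, --rm removes the
        # container on exit, and "-e GEMINI_API_KEY" forwards the variable
        # from the launching process into the container.
        "args": ["run", "-i", "--rm", "-e", "GEMINI_API_KEY", IMAGE],
        "env": {"GEMINI_API_KEY": api_key},
    }


def build_host_config(api_key, server_name="gemini-audio"):
    """Wrap the entry in the assumed top-level 'mcpServers' mapping."""
    return {"mcpServers": {server_name: build_server_entry(api_key)}}


# Placeholder value; substitute the key issued by Google AI Studio.
config = build_host_config("YOUR_GEMINI_API_KEY")
print(json.dumps(config, indent=2))
```

The printed JSON can be pasted into the host's MCP configuration file. Keep the real key out of version control, for example by having the host read it from your shell environment where that is supported.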
