11 MCP ToolsCloud LibraryEditing EngineAgentic QA Loopsx402 Payments

Generate, refine & manage
media through one endpoint

Composable media generation for AI agents — with a cloud library to manage what they create, an editing engine to refine it, and analysis tools to close the quality loop.

Agents build through MCP. Humans review in the web workspace. Teams collaborate with shared projects. x402 users authenticate with just a wallet.

https://mcp.soundside.ai/mcp·300 free credits · No card required

Iterative Agentic Design

Agents that critique their own work

analyze_media with Gemini vision QA lets agents evaluate their output against an intent checklist, then decide whether to re-generate, edit in post, or approve.

1
Generate
Agent creates image, video, or audio from a prompt
2
Analyze
analyze_media evaluates quality, checks intent, scores output
3
Decide
Agent compares result against spec — pass or iterate?
4
Refine
Re-generate with adjusted prompt, or edit_video to fix in post
5
Store
Approved asset saved to library with lineage and tags
analyze_media(resource_id, analysis_type="vision_qa", intent_checklist={"no_pillarboxing": true})
Spec-driven QA: each checklist item becomes an explicit pass/fail check

Collaborative by Design

Three ways in. One shared workspace.

Agents connect via MCP. Humans work in the web UI with the Crew chatbot. x402 users authenticate with a crypto wallet — no account needed. All three share the same projects, assets, and permissions.

🤖MCP Agents
JSON-RPC 2.0 endpoint
12 composable tools
Async with status polling
Resource ID chaining
Works with Claude, Cursor, any MCP client
💬Web + Crew Chat
Project dashboard with media grid
Type filters & collections
Crew chatbot with full tool access
Review agent output & iterate
Brand kit management
x402 Wallet
USDC on Base via Coinbase
No email, no password, no account
Pay per call at the endpoint
Wallet-based login to web UI
Same projects & permissions
lib_share grants view, edit, or admin access by email — enabling team workflows where agents produce, humans review, and collaborators contribute.

Tool Catalog

11 composable MCP tools

6 generation, 2 editing & analysis, 3 library management. Every tool returns a resource ID that chains into any other.

🖼Generate
create_image
Text or reference → image across 5 providers
MiniMax · Grok · Luma · Runway · Vertex
🎬Generate
create_video
Text or image → video with character consistency
MiniMax · Luma · Runway · Grok · Vertex
🔊Generate
create_audio
TTS, transcription, voice cloning, sound effects
MiniMax · Vertex · Runway · CosyVoice
🎵Generate
create_music
Lyrics + style prompt → full music track
MiniMax · Creative Freedom
📝Generate
create_text
LLM completions with optional structured JSON output
Vertex · MiniMax · Grok
📊Generate
create_artifact
Presentations, charts, documents, diagrams
Plotly · PPTX · Mermaid
✂️Edit
edit_video
21+ post-production actions in-pipeline
soundside.ai engine
🔍Edit
analyze_media
Technical metadata, AI vision QA, quality scoring
FFprobe · Gemini
📂Library
lib_list
Browse projects, collections, resources, lineage, brand kits
soundside.ai
⚙️Library
lib_manage
CRUD for projects, collections, resources, brand kits
soundside.ai
🔗Library
lib_share
Share projects with view/edit/admin permissions
soundside.ai

Editing Engine

Post-production in the pipeline

Agents don't just generate — they edit, composite, and deliver finished media. Humans can step in via the web UI to fine-tune anything the agent produced.

trimconcatcrossfademix_audioreplace_audioadd_textcolor_gradeken_burnsspeed_rampfilm_grainvignettesplit_screenoverlayextract_frameextract_audioburn_subtitlesadjust_speedlooppad_audioextract_framescustom

Cloud Resource Management

A persistent library agents can build on

Other platforms return a URL that expires. Soundside gives agents — and the humans who work with them — a persistent cloud library with projects, collections, brand kits, and full derivation history.

📁
Projects & Collections
Organize assets into projects with nested collections. Filter by type — images, video, audio, text, documents. Every resource gets a persistent cloud URL.
🎨
Brand Kits
Define colors, fonts, logos once. Every artifact — decks, docs, diagrams — inherits brand identity. Consistent output without repeating style instructions.
🧬
Lineage Tracking
Every resource knows its parents. Trace a final video back through its narration, music, key frames, and original character references. Full provenance chain.
🔎
Search, Tags & Metadata
Full-text search across names, tags, prompts, and provider metadata. Tag resources during generation. Filter across projects.

Competitive Landscape

The only platform with all seven

Multi-ProviderEditingCloud LibraryCollaborationSelf-Critique Loopx402MCP
Replicate
Fal.ai
Runway API
OpenAI
soundside.ai

Honest Pricing

Platform is our product. Providers pass through at cost.

We don't mark up generation. Our revenue comes from the tools we build and host — editing, library, orchestration — at $0.01 per write. Reads are free.

Platform Tools — Our Product
lib_list — Free (read-only)
analyze_media — 3¢/call
edit_video, lib_manage, lib_share — 1¢/call
Editing, library, brand kits, sharing, analysis
Generation — Pass-Through
Images — 2–8¢ per image
Video — $0.28–$3.20 per clip
Zero markup. Provider cost only.
🎁300 free credits to start
x402 — pay with USDC, no account

Start building in 5 minutes

Connect your MCP client · Get 300 free credits · Ship agent media workflows