Tool catalog
15 tools. All available to all users. Version: 2026-03-16
When to use
Use this to scope your agent surface
Use the catalog to understand which tools are available, their sync/async behavior, and which providers each tool supports. Always call tools/list at runtime for the canonical schemas.
| Tool | Category | Async | Providers | Description |
|---|---|---|---|---|
| analyze_media | editing | mixed | soundside.ai | Technical analysis, AI vision QA, transcription, segment detection, and EDL export. |
| apply_effect | editing | sync | soundside.ai | Apply Ken Burns, speed ramp, film grain, or vignette effects. |
| compose_media | editing | sync | soundside.ai | Add text, overlay media, or build split-screen composites. |
| create_artifact | generation | sync | plotly, pptx, weasyprint, mermaid, gamma | Create charts, presentations, documents, and diagrams. |
| create_audio | generation | mixed | minimax, vertex, runway, creative_freedom | Text-to-speech, voice cloning, voice design, sound effects, and a deprecated transcribe compatibility shim. |
| create_image | generation | mixed | vertex, grok, runway, minimax, luma | Generate images from text prompts with character reference support. |
| create_music | generation | async | minimax | Generate music from lyrics and style prompts. |
| create_text | generation | sync | vertex, minimax, grok | LLM chat completions with optional structured JSON output. |
| create_video | generation | async | vertex, runway, minimax, luma, grok | Generate video from text or image. Supports I2V, video extension, and character references. |
| edit_audio | editing | sync | soundside.ai | Mix, replace, or pad audio on existing media. |
| edit_video | editing | sync | soundside.ai | Core video transforms: trim, concat, crossfade, speed, loop, color grade, burn subtitles, and custom FFmpeg. |
| extract_media | editing | sync | soundside.ai | Extract frames, frame sets, or audio tracks from media. |
| lib_list | library | sync | soundside.ai | Browse projects, collections, resources, lineage, and brand kits. |
| lib_manage | library | sync | soundside.ai | CRUD for projects, collections, resources, and brand kits. |
| lib_share | library | sync | soundside.ai | Share projects with other users by email with permission levels. |
Pricing
Live pricing
Soundside aims to break even on provider pass-through costs with a minimal margin (~10%). Editing and library tools are $0.01/call. `analyze_media` is mode-priced: $0.01 for `technical` and `export_edl`, $0.02 for `transcribe` and `detect_segments`, and $0.03 for `vision_qa`.
Machine-readable pricing is always available at: GET /api/x402/status
Prices are DB-driven and may change without redeployment. Always check the endpoint rather than hardcoding.
Developer docs
Detailed tool reference
For full parameter documentation, examples, and tips, see the Tool Reference on GitHub.
Operational notes
Legacy references
- • batch_dispatch and batch_wait are not part of the current public runtime tool surface.
- • Provider-specific legacy tools remain available in some environments but are intentionally omitted from this catalog.