Docs / Tool Catalog

Tool catalog

15 tools. All available to all users. Version: 2026-03-16

When to use

Use this to scope your agent surface

Use the catalog to understand which tools are available, their sync/async behavior, and which providers each tool supports. Always call tools/list at runtime for the canonical schemas.

ToolCategoryAsyncProvidersDescription
analyze_mediaeditingmixedsoundside.aiTechnical analysis, AI vision QA, transcription, segment detection, and EDL export.
apply_effecteditingsyncsoundside.aiApply Ken Burns, speed ramp, film grain, or vignette effects.
compose_mediaeditingsyncsoundside.aiAdd text, overlay media, or build split-screen composites.
create_artifactgenerationsyncplotly, pptx, weasyprint, mermaid, gammaCreate charts, presentations, documents, and diagrams.
create_audiogenerationmixedminimax, vertex, runway, creative_freedomText-to-speech, voice cloning, voice design, sound effects, and a deprecated transcribe compatibility shim.
create_imagegenerationmixedvertex, grok, runway, minimax, lumaGenerate images from text prompts with character reference support.
create_musicgenerationasyncminimaxGenerate music from lyrics and style prompts.
create_textgenerationsyncvertex, minimax, grokLLM chat completions with optional structured JSON output.
create_videogenerationasyncvertex, runway, minimax, luma, grokGenerate video from text or image. Supports I2V, video extension, and character references.
edit_audioeditingsyncsoundside.aiMix, replace, or pad audio on existing media.
edit_videoeditingsyncsoundside.aiCore video transforms: trim, concat, crossfade, speed, loop, color grade, burn subtitles, and custom FFmpeg.
extract_mediaeditingsyncsoundside.aiExtract frames, frame sets, or audio tracks from media.
lib_listlibrarysyncsoundside.aiBrowse projects, collections, resources, lineage, and brand kits.
lib_managelibrarysyncsoundside.aiCRUD for projects, collections, resources, and brand kits.
lib_sharelibrarysyncsoundside.aiShare projects with other users by email with permission levels.

Pricing

Live pricing

Soundside aims to break even on provider pass-through costs with a minimal margin (~10%). Editing and library tools are $0.01/call. `analyze_media` is mode-priced: $0.01 for `technical` and `export_edl`, $0.02 for `transcribe` and `detect_segments`, and $0.03 for `vision_qa`.

Machine-readable pricing is always available at: GET /api/x402/status

Prices are DB-driven and may change without redeployment. Always check the endpoint rather than hardcoding.

Developer docs

Detailed tool reference

For full parameter documentation, examples, and tips, see the Tool Reference on GitHub.

Operational notes

Legacy references

  • batch_dispatch and batch_wait are not part of the current public runtime tool surface.
  • Provider-specific legacy tools remain available in some environments but are intentionally omitted from this catalog.