Tool catalog

19 tools. All available to all users. Version: 2026-04-19

When to use

Use this to scope your agent surface

Use the catalog to see which tools are available, whether each runs synchronously or asynchronously, and which providers each supports. Always call `tools/list` at runtime for the canonical schemas.
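As a minimal sketch of the runtime lookup, the snippet below builds a JSON-RPC 2.0 `tools/list` request as defined by the Model Context Protocol and indexes the tools in a response by name. The transport is omitted, and the sample response is a made-up placeholder for illustration, not real Soundside output; the helper names `build_tools_list_request` and `index_tools` are hypothetical.

```python
import json


def build_tools_list_request(request_id, cursor=None):
    """Build a JSON-RPC 2.0 request for the MCP `tools/list` method."""
    request = {"jsonrpc": "2.0", "id": request_id, "method": "tools/list"}
    if cursor is not None:
        # MCP paginates with an opaque cursor returned by the previous page.
        request["params"] = {"cursor": cursor}
    return request


def index_tools(response):
    """Map tool name -> input schema from a `tools/list` response."""
    if "error" in response:
        raise RuntimeError(f"tools/list failed: {response['error']}")
    return {
        tool["name"]: tool.get("inputSchema", {})
        for tool in response["result"]["tools"]
    }


# Placeholder response for illustration only; real schemas come from the server.
sample_response = {
    "jsonrpc": "2.0",
    "id": 1,
    "result": {
        "tools": [
            {
                "name": "edit_video",
                "description": "placeholder",
                "inputSchema": {"type": "object"},
            }
        ]
    },
}

request = build_tools_list_request(1)
schemas = index_tools(sample_response)
print(json.dumps(request))
print(sorted(schemas))
```

Send the request over whatever MCP transport your client already uses, and treat the returned schemas, not this page, as the source of truth for parameters.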

| Tool | Category | Async | Providers | Description |
| --- | --- | --- | --- | --- |
| analyze_media | analysis | mixed | anthropic, grok, openai, qwen, soundside.ai, vertex | Technical ffprobe analysis, AI vision QA, transcription, segment detection, and EDL export. |
| apply_effect | editing | sync | soundside.ai | Apply Ken Burns, speed ramp, film grain, or vignette effects. |
| compose_media | editing | sync | soundside.ai | Add text, overlay media, or build split-screen composites. |
| compose_video | composition | sync | soundside.ai | Server-side video composition pipeline: enrich plan, generate assets in parallel, assemble with transitions, audio ducking, and overlays. |
| create_artifact | generation | sync | docx, gamma, mermaid, plotly, pptx, weasyprint | Create charts, presentations, documents, and diagrams. |
| create_audio | generation | mixed | minimax, runway, vertex | Text-to-speech, voice cloning, voice design, and sound effects. |
| create_image | generation | mixed | alibaba, grok, luma, minimax, runway, vertex | Generate images from text prompts with character reference support. |
| create_music | generation | async | minimax | Generate music from lyrics and style prompts. |
| create_text | generation | sync | grok, minimax, vertex | LLM chat completions with optional structured JSON output. |
| create_video | generation | async | alibaba, grok, luma, minimax, runway, vertex | Generate video from text or image. Supports I2V, video extension, and character references. |
| edit_audio | editing | sync | soundside.ai | Mix, replace, or pad audio on existing media. |
| edit_video | editing | sync | soundside.ai | Core video transforms: trim, concat, crossfade, speed, loop, color grade, burn subtitles, and custom FFmpeg. |
| extract_media | editing | sync | soundside.ai | Extract frames, frame sets, or audio tracks from media. |
| lib_list | library | sync | soundside.ai | Browse projects, collections, resources, lineage, and brand kits (free). |
| lib_manage | library | sync | soundside.ai | CRUD for projects, collections, resources, and brand kits. |
| lib_share | library | sync | soundside.ai | Share projects with other users by email with permission levels. |
| list_adapters | adapters | sync | soundside.ai | List LoRA adapters mirrored into the Soundside library (free). |
| manage_adapter | adapters | sync | soundside.ai | Inspect, deploy, undeploy, delete, or select a checkpoint for a LoRA adapter. |
| train_adapter | adapters | async | dashscope, modal | Train a LoRA adapter from library media on DashScope (Wan) or Modal (Hunyuan/LTX) backends. |
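One way to scope an agent surface from the catalog is to filter on the Category and Async columns. The sketch below hardcodes a handful of rows copied from the catalog purely for illustration; `scope_tools` is a hypothetical helper, and in practice the tool list would come from `tools/list` at runtime.

```python
# A few rows copied from the catalog: (tool, category, async behavior).
CATALOG_ROWS = [
    ("analyze_media", "analysis", "mixed"),
    ("create_music", "generation", "async"),
    ("create_text", "generation", "sync"),
    ("edit_video", "editing", "sync"),
    ("lib_list", "library", "sync"),
    ("train_adapter", "adapters", "async"),
]


def scope_tools(rows, categories, allow_async=False):
    """Return tool names in the given categories.

    Tools marked "async" or "mixed" are skipped unless allow_async is True,
    since an agent that cannot poll for results should only see sync tools.
    """
    selected = []
    for name, category, mode in rows:
        if category not in categories:
            continue
        if mode != "sync" and not allow_async:
            continue
        selected.append(name)
    return selected


sync_only = scope_tools(CATALOG_ROWS, {"generation", "editing"})
with_async = scope_tools(CATALOG_ROWS, {"generation", "editing"}, allow_async=True)
print(sync_only)   # ['create_text', 'edit_video']
print(with_async)  # ['create_music', 'create_text', 'edit_video']
```

Treating "mixed" as non-sync is a conservative choice made for this sketch: a mixed tool may return asynchronously depending on the call, so it is excluded unless the agent can handle async results.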

Pricing

Live pricing

Soundside aims to roughly break even: provider costs are passed through with a minimal margin (~10%). Editing and library tools are $0.01/call, except those the catalog marks as free. `analyze_media` is mode-priced: $0.01 for `technical` and `export_edl`, $0.02 for `transcribe` and `detect_segments`, and $0.03 for `vision_qa`.

Machine-readable pricing is always available at: GET /api/x402/status

Prices are DB-driven and may change without redeployment. Always check the endpoint rather than hardcoding.
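Because prices can change at any time, a client might re-read the endpoint periodically instead of baking numbers in. The sketch below wraps the fetch in a small time-to-live cache. Only the path `/api/x402/status` comes from this page; the host is a placeholder, the names `fetch_status` and `PricingCache` are hypothetical, and the shape of the JSON payload is not assumed here.

```python
import json
import time
import urllib.request

# Assumption: placeholder host; substitute the real Soundside base URL.
BASE_URL = "https://example.invalid"


def fetch_status(base_url=BASE_URL):
    """GET /api/x402/status and return the decoded JSON body."""
    with urllib.request.urlopen(f"{base_url}/api/x402/status", timeout=10) as resp:
        return json.load(resp)


class PricingCache:
    """Cache the status payload for ttl_seconds so prices are re-read periodically."""

    def __init__(self, fetch=fetch_status, ttl_seconds=300, clock=time.monotonic):
        self._fetch = fetch
        self._ttl = ttl_seconds
        self._clock = clock
        self._value = None
        self._expires_at = None

    def get(self):
        """Return the cached payload, refetching once the TTL has elapsed."""
        now = self._clock()
        if self._expires_at is None or now >= self._expires_at:
            self._value = self._fetch()
            self._expires_at = now + self._ttl
        return self._value
```

A caller would create one `PricingCache()` and call `get()` before each paid request. Inspect the returned payload before indexing into it, since its structure is not documented on this page; the 300-second default is an arbitrary illustration, not a recommended value.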

Developer docs

Detailed tool reference

For full parameter documentation, examples, and tips, see the Tool Reference on GitHub.

Operational notes

Legacy references

  • creative_freedom is a self-hosted GPU provider available via API key only; it is intentionally omitted from the public x402 catalog.
  • batch_dispatch and batch_wait are internal runtime concerns, not user-facing MCP tools.