
Transloadit Media Processing
Wire Transloadit pipelines for encode, transcode, thumbnails, image transforms, OCR, and chained media robots via MCP server or Node CLI.
Overview
transloadit-media-processing is an agent skill for the Build phase that integrates Transloadit’s 86+ media robots for encode, transform, and pipeline workflows via MCP or Node CLI.
Install
npx skills add https://github.com/github/awesome-copilot --skill transloadit-media-processingWhat is this skill?
- 86+ specialized Transloadit robots for video, audio, image, and document transforms
- Video encode to HLS/MP4/WebM, thumbnails, GIFs, concat, subtitles, and overlays
- Image resize, crop, watermark, and format conversion including WebP and AVIF
- Audio extract/transcode plus speech-to-text, TTS, OCR, and moderation-style AI steps
- Setup via @transloadit/mcp-server (recommended) or @transloadit/node CLI with free account signup
- 86+ specialized processing robots
Adoption & trust: 8.5k installs on skills.sh; 34.6k GitHub stars; 1/3 security scanners passed (skills.sh audits).
What problem does it solve?
You need production-grade video, audio, image, or document processing but do not want to maintain bespoke ffmpeg and worker infrastructure yourself.
Who is it for?
Indie SaaS or content products adding uploads, transcoding, or asset derivatives with a free Transloadit account and Node-friendly tooling.
Skip if: Fully offline-only media editing with no cloud account, or trivial single-image resize that local libraries already cover.
When should I use this skill?
When asked to encode video to HLS/MP4, generate thumbnails, resize or watermark images, extract audio, concatenate clips, add subtitles, OCR documents, or run any media processing pipeline.
What do I get? / Deliverables
You define Transloadit assemblies and integration steps so uploads trigger the right encode, thumbnail, OCR, or chained robot pipelines in the cloud.
- Transloadit assembly or pipeline configuration for the requested transforms
- MCP server or Node CLI integration steps for the agent or app
- Documented robot chain for multi-step media jobs
Recommended Skills
Journey fit
Media processing is implemented by integrating a third-party transformation API during product build, before you stress-test delivery in Ship. Integrations subphase matches cloud robots, MCP/CLI setup, and multi-step assembly pipelines—not local Flutter or pure unit-test work.
How it compares
Cloud pipeline orchestration skill—not a local CLI ffmpeg cheat sheet or a generic S3 upload-only integration.
Common Questions / FAQ
Who is transloadit-media-processing for?
Solo builders and small teams shipping features that need scalable video, audio, image, or document transforms through Transloadit.
When should I use transloadit-media-processing?
During Build when wiring upload backends—e.g., HLS encoding, thumbnails, watermarks, OCR, or multi-step pipelines—before you harden perf and security in Ship.
Is transloadit-media-processing safe to install?
It assumes external API credentials and network access; review the Security Audits panel on this Prism page and scope secrets carefully in your agent environment.
SKILL.md
READMESKILL.md - Transloadit Media Processing
# Transloadit Media Processing Process, transform, and encode media files using Transloadit's cloud infrastructure. Supports video, audio, images, and documents with 86+ specialized processing robots. ## When to Use This Skill Use this skill when you need to: - Encode video to HLS, MP4, WebM, or other formats - Generate thumbnails or animated GIFs from video - Resize, crop, watermark, or optimize images - Convert between image formats (JPEG, PNG, WebP, AVIF, HEIF) - Extract or transcode audio (MP3, AAC, FLAC, WAV) - Concatenate video or audio clips - Add subtitles or overlay text on video - OCR documents (PDF, scanned images) - Run speech-to-text or text-to-speech - Apply AI-based content moderation or object detection - Build multi-step media pipelines that chain operations together ## Setup ### Option A: MCP Server (recommended for Copilot) Add the Transloadit MCP server to your IDE config. This gives the agent direct access to Transloadit tools (`create_template`, `create_assembly`, `list_assembly_notifications`, etc.). **VS Code / GitHub Copilot** (`.vscode/mcp.json` or user settings): ```json { "servers": { "transloadit": { "command": "npx", "args": ["-y", "@transloadit/mcp-server", "stdio"], "env": { "TRANSLOADIT_KEY": "YOUR_AUTH_KEY", "TRANSLOADIT_SECRET": "YOUR_AUTH_SECRET" } } } } ``` Get your API credentials at https://transloadit.com/c/-/api-credentials ### Option B: CLI If you prefer running commands directly: ```bash npx -y @transloadit/node assemblies create \ --steps '{"encoded": {"robot": "/video/encode", "use": ":original", "preset": "hls-1080p"}}' \ --wait \ --input ./my-video.mp4 ``` ## Core Workflows ### Encode Video to HLS (Adaptive Streaming) ```json { "steps": { "encoded": { "robot": "/video/encode", "use": ":original", "preset": "hls-1080p" } } } ``` ### Generate Thumbnails from Video ```json { "steps": { "thumbnails": { "robot": "/video/thumbs", "use": ":original", "count": 8, "width": 320, "height": 240 } } } ``` ### Resize and Watermark Images ```json { "steps": { "resized": { "robot": "/image/resize", "use": ":original", "width": 1200, "height": 800, "resize_strategy": "fit" }, "watermarked": { "robot": "/image/resize", "use": "resized", "watermark_url": "https://example.com/logo.png", "watermark_position": "bottom-right", "watermark_size": "15%" } } } ``` ### OCR a Document ```json { "steps": { "recognized": { "robot": "/document/ocr", "use": ":original", "provider": "aws", "format": "text" } } } ``` ### Concatenate Audio Clips ```json { "steps": { "imported": { "robot": "/http/import", "url": ["https://example.com/clip1.mp3", "https://example.com/clip2.mp3"] }, "concatenated": { "robot": "/audio/concat", "use": "imported", "preset": "mp3" } } } ``` ## Multi-Step Pipelines Steps can be chained using the `"use"` field. Each step references a previous step's output: ```json { "steps": { "resized": { "robot": "/image/resize", "use": ":original", "width": 1920 }, "optimized": { "robot": "/image/optimize", "use": "resized" }, "exported": { "robot": "/s3/store", "use": "opti