
Audio Transcription
Turn podcast, interview, or voice memo audio into durable transcripts and timestamped subtitle artifacts via PostPlus hosted Whisper endpoints.
Install
npx skills add https://github.com/postplusai/postplus-skills --skill audio-transcriptionWhat is this skill?
- Released endpoints: `transcription-whisper` and `transcription-whisper-turbo` via hosted media routes
- Default `transcription-whisper` when subtitle quality matters; turbo for cheaper rough passes without primary timestamps
- Async hosted flow persists request, response, manifest, generation handle, provider status, and downloaded outputs
- Requires `durationSeconds` in capability input for billing and preflight
- Explicit handoff: use `video-transcription` for video inputs and `video-analysis` for semantic video understanding
Adoption & trust: 1 installs on skills.sh; 8 GitHub stars; 1/3 security scanners passed (skills.sh audits); trending (+100% hot-view momentum).
Recommended Skills
Video Editagentspace-so/runcomfy-agent-skills
Image To Videoagentspace-so/runcomfy-agent-skills
Image Editagentspace-so/runcomfy-agent-skills
Flux Kontextagentspace-so/runcomfy-agent-skills
Nano Banana 2agentspace-so/runcomfy-agent-skills
Nano Banana Editagentspace-so/runcomfy-agent-skills
Journey fit
Primary fit
Speech-to-text against hosted APIs is an integration step while assembling media pipelines in the product. Whisper transcription through `media-file` / `media-generation` is an external service hook, not ideation or launch SEO work.
Common Questions / FAQ
Is Audio Transcription safe to install?
skills.sh reports 1 of 3 security scanners passed. Review the Security Audits panel on this page before installing in production.
SKILL.md
READMESKILL.md - Audio Transcription
# Audio Transcription ## Use When - The input is audio and the main job is speech-to-text, subtitle-ready timing, rough speech search, multilingual transcription, or durable transcript artifacts. - Use `video-transcription` for video inputs and `video-analysis` for semantic video understanding. ## Do Not Use When - The task belongs to ideation, QA, or another released skill listed in the handoff section. - Required inputs are missing and guessing would change the result. ## Execution Boundary - Released endpoint keys are `transcription-whisper` and `transcription-whisper-turbo` through hosted `media-file` and `media-generation`. - Default to `transcription-whisper` when subtitle quality matters. Use turbo only for an explicit cheaper rough pass or when timestamps are not primary. - Hosted transcription is async. Submit writes request, response, manifest, generation handle, provider status, provider URLs, and downloaded outputs if already completed. ## Source And Path - Include `durationSeconds` in the hosted capability request input for billing/preflight. - Use `enableTimestamps` when output will feed subtitles or edit decisions. - Start with one source file before larger batches. - Keep internal requests, responses, manifests, normalized transcripts, and downloaded artifacts under `.postplus/audio-transcription`; keep final user-facing transcript exports outside `.postplus`. ## Handoff - If status is pending, return the manifest path, `generationHandle`, not keep the conversation open just to poll. - When completed, hand off downloaded artifacts and `normalizedTranscriptPath` to `subtitle-packager` if SRT/ASS is needed. ## Stop Conditions - Stop when required user intent, source evidence, or owned input artifacts are missing and guessing would change the result. - If an owned CLI or script command fails, report the exact error and stop. Do not bypass the failure with metadata-only answers, readiness probing, local payload rewrites, fallback providers, or unpublished tools. ## Public Command Boundary - Choose the smallest matching command or workflow from the user input and run it directly. - Readiness diagnostics: `postplus doctor --skill audio-transcription`. - If an owned CLI or script command fails, report the exact error and stop. Do not bypass the failure with metadata-only answers, readiness probing, local payload rewrites, fallback providers, or unpublished tools. - Use `postplus media schema --json` only when constructing or repairing an unknown request shape. - Hosted media capability: `postplus media capability --request <hosted-capability-request.json> --output <result.json>`. - Use the capability request shape required by the selected workflow; do not call provider APIs directly. - If the CLI returns a quote-confirmation challenge, run `postplus quote confirm --json --challenge-file <challenge.json>` and retry with the returned token.