
Translate Book Parallel
Translate a full PDF, DOCX, or EPUB into another language with resumable chunked parallel subagents and merged output formats.
Overview
Translate Book (Parallel Subagents) is an agent skill most often used in Build (also Grow content, Launch distribution) that runs a resumable chunked pipeline to translate whole books across PDF, DOCX, and EPUB outputs.
Install
npx skills add https://github.com/aradotso/trending-skills --skill translate-book-parallelWhat is this skill?
- End-to-end pipeline: Calibre to HTMLZ/Markdown, chunked translation, validate, Pandoc merge, multi-format export
- Parallel subagents (~6000 characters per chunk, 8 concurrent by default) with isolated context windows
- Resumable manifest.json with SHA-256 hashes and 1:1 source-to-output chunk validation
- Triggers cover PDF, DOCX, and EPUB targets such as Spanish, Chinese, and Japanese
- Chunks target ~6000 characters each
- 8 parallel subagents concurrent by default
- manifest.json SHA-256 tracking with 1:1 source-to-output chunk validation
Adoption & trust: 1.2k installs on skills.sh; 31 GitHub stars; 2/3 security scanners passed (skills.sh audits).
What problem does it solve?
Translating an entire book in one agent session truncates text, loses place, and mixes languages as context grows.
Who is it for?
Indie authors localizing a rights-cleared ebook, course creators cloning documentation corpora, or builders batching long PDF manuals to another language.
Skip if: Short strings, real-time UI i18n, or copyrighted material you are not allowed to process with external LLMs.
When should I use this skill?
User asks to translate a book (PDF/DOCX/EPUB), convert a PDF to another language, or run parallel book translation with subagents.
What do I get? / Deliverables
You get validated per-chunk translations merged into TOC-backed HTML and optional DOCX, EPUB, or PDF with a resumable manifest if the run stops mid-pipeline.
- Per-chunk translated markdown files and validated manifest
- Merged HTML with TOC plus optional DOCX, EPUB, and PDF exports
Recommended Skills
Journey fit
Spans multiple journey phases - primary shelf plus alternate fits below.
Long-form manuscript translation is treated as a Build docs/content pipeline because input is structured documents and output is publishable artifacts, not live ops. Docs is the shelf for book-scale markdown pipelines, Calibre and Pandoc conversion, and merged HTML/DOCX/EPUB/PDF deliverables.
Where it fits
Convert a technical PDF manual to markdown chunks and fan out eight translation workers before merging a single localized DOCX.
Produce a Spanish EPUB clone of an English course book to expand the same SKU for another market.
Generate print-ready translated PDF and EPUB bundles for storefront upload after validation passes.
Resume an interrupted run using manifest hashes without re-translating completed chunks.
How it compares
Use for chunked parallel book pipelines instead of one-shot chat translation of large files.
Common Questions / FAQ
Who is translate-book-parallel for?
Solo builders and creators using Claude Code who have full-length PDF, DOCX, or EPUB sources and want automated multi-format translated outputs.
When should I use translate-book-parallel?
Use it in Build when producing localized manuscript files; in Grow when expanding content libraries; in Launch when preparing translated ebooks for distribution—after Calibre and Pandoc are installed.
Is translate-book-parallel safe to install?
It runs local conversion and agent translation over your files; review the Security Audits panel on this page and only process content you may send through your agent provider.
SKILL.md
READMESKILL.md - Translate Book Parallel
# Translate Book (Parallel Subagents) > Skill by [ara.so](https://ara.so) — Daily 2026 Skills collection. A Claude Code skill that translates entire books (PDF/DOCX/EPUB) into any language using parallel subagents. Each chunk gets an isolated context window — preventing truncation and context accumulation that plague single-session translation. ## Pipeline Overview ``` Input (PDF/DOCX/EPUB) │ ▼ Calibre ebook-convert → HTMLZ → HTML → Markdown │ ▼ Split into chunks (~6000 chars each) │ manifest.json tracks SHA-256 hashes ▼ Parallel subagents (8 concurrent by default) │ each: read chunk → translate → write output_chunk*.md ▼ Validate (manifest hash check, 1:1 source↔output match) │ ▼ Merge → Pandoc → HTML (with TOC) → Calibre → DOCX / EPUB / PDF ``` ## Prerequisites ```bash # 1. Calibre (provides ebook-convert) # macOS brew install --cask calibre # Linux sudo apt-get install calibre # Or download from https://calibre-ebook.com/ # 2. Pandoc brew install pandoc # macOS sudo apt-get install pandoc # Linux # 3. Python dependencies pip install pypandoc beautifulsoup4 ``` Verify all tools are available: ```bash ebook-convert --version pandoc --version python3 -c "import pypandoc; print('pypandoc ok')" ``` ## Installation **Option A: npx (recommended)** ```bash npx skills add deusyu/translate-book -a claude-code -g ``` **Option B: ClawHub** ```bash clawhub install translate-book ``` **Option C: Git clone** ```bash git clone https://github.com/deusyu/translate-book.git ~/.claude/skills/translate-book ``` ## Usage in Claude Code Once the skill is installed, use natural language inside Claude Code: ``` translate /path/to/book.pdf to Chinese ``` ``` translate ~/Downloads/mybook.epub to Japanese ``` ``` /translate-book translate /path/to/book.docx to French ``` The skill orchestrates the full pipeline automatically. ## Supported Languages | Code | Language | |------|-----------| | `zh` | Chinese | | `en` | English | | `ja` | Japanese | | `ko` | Korean | | `fr` | French | | `de` | German | | `es` | Spanish | Language codes are extensible — add new ones in the skill definition. ## Running Pipeline Steps Manually ### Step 1: Convert to Markdown Chunks ```bash python3 scripts/convert.py /path/to/book.pdf --olang zh ``` This produces inside `{book_name}_temp/`: - `chunk0001.md`, `chunk0002.md`, ... (source chunks, ~6000 chars each) - `manifest.json` (SHA-256 hashes for validation) ```bash # For EPUB input python3 scripts/convert.py /path/to/book.epub --olang ja # For DOCX input python3 scripts/convert.py /path/to/book.docx --olang fr ``` ### Step 2: Translate (Parallel Subagents) The skill handles this step — it launches 8 concurrent subagents per batch, each translating one chunk independently: ``` # Each subagent receives exactly this task: Read chunk0042.md → translate to target language → write output_chunk0042.md ``` **Resumable:** Already-translated chunks (valid `output_chunk*.md` files) are skipped on re-run. ### Step 3: Merge and Build All Formats ```bash python3 scripts/merge_and_build.py \ --temp-dir book_name_temp \ --title "《Book Title in Target Language》" ``` Before merging, validation checks: - Every source chunk has a matching output file (1:1) - Source chunk hashes match `manifest.json` (no stale outputs) - No output files are empty Outputs produced: | File | Description | |------|-------------| | `output.md` | Merged translated Markdown | | `book.html` | Web version wit