What Is Rainman Translate Book?
Rainman Translate Book is a Claude Code skill built by deusyu that automates translation of entire books from PDF, DOCX, or EPUB formats into languages like Chinese, English, Japanese, Korean, French, German, or Spanish. It coordinates Calibre for input conversion, Pandoc for Markdown handling, and parallel Claude subagents (8 concurrent by default) to translate fixed-size chunks while tracking integrity via manifest.json with SHA-256 hashes. Rainman Translate Book is one of the best Claude Code Skills for Claude Code users translating technical books, handling 414 GitHub stars as of October 2024 with resumable runs that skip completed chunks on retry.
This skill restructures workflows from shell-script predecessors like claude_translater into a unified pipeline invoked via Claude Code slash commands, ensuring no context window overflow during long translations.
Quick Overview
| Attribute | Details |
|---|---|
| Type | Claude Code Skill |
| Best For | Claude Code users translating technical books |
| Language/Stack | Python 3, Claude Code CLI, Calibre, Pandoc |
| License | MIT |
| GitHub Stars | 414 as of October 2024 |
| Pricing | Open-Source |
| Last Release | 89c9d09 (Add --cleanup flag) — October 2024 |
Who Should Use Rainman Translate Book?
- Technical authors maintaining documentation in multiple languages who need consistent translations of 200+ page books without manual chunking.
- Indie publishers converting EPUB novels to Chinese or Japanese markets, relying on Calibre's conversion accuracy for complex layouts.
- DevRel teams at AI firms translating whitepapers into Korean or French, using parallel subagents to hit tight deadlines under Claude API limits.
- Claude Code power users experimenting with subagent orchestration for batch processing beyond single-file tasks.
Not ideal for:
- Users without Claude Code CLI installed, as the skill requires authentication and
ebook-convertfrom Calibre. - Single-page document translators preferring inline tools like Claude Context Mode for quick edits.
- High-volume commercial services needing on-prem deployment, since it depends on cloud Claude API calls.
- Non-book formats like raw text dumps lacking structure for Calibre preprocessing.
Key Features of Rainman Translate Book
- Parallel subagents — Launches 8 independent Claude Code subagents per batch, each processing one Markdown chunk in isolation to avoid 200k token context limits; batches respect API rate limits via sequential queuing.
- Resumable processing —
manifest.jsonstores SHA-256 hashes of source chunks; re-runs skip translated outputs, enabling restarts after interruptions or template tweaks. - Manifest validation — Post-translation hash checks ensure 1:1 source-to-output mapping; corrupt or missing chunks trigger re-translation before Pandoc merge.
- Multi-format outputs — Merges chunks into
output.md, then generatesbook.htmlwith BeautifulSoup-enhanced floating TOC, plus Calibre-built DOCX, EPUB, and PDF; all in{book_name}_temp/directory. - Calibre input pipeline — Converts PDF/DOCX/EPUB to HTMLZ then HTML via
ebook-convert, strips to Markdown with Pandoc; preserves tables and images where possible. - Extensible languages — Defaults to zh, en, ja, ko, fr, de, es; subagent prompts allow custom pairs via Claude Code conversation overrides.
- Cleanup flag —
--cleanupremoves intermediatechunk*.mdfiles post-merge, reducing disk usage from 2-5GB for 500-page books.
Rainman Translate Book vs Alternatives
| Tool | Best For | Key Differentiator | Pricing |
|---|---|---|---|
| Rainman Translate Book | Full-book translation with parallel Claude subagents | Manifest-driven resumability and multi-format Calibre outputs | Open-Source |
| claude_translater | Sequential chunking via shell scripts | Simpler setup but no parallelism or validation | Open-Source |
| translate-shell | Terminal-based file translation | Offline support via Argos Translate models | Open-Source |
| Claude Code Canvas | Interactive code sketching with translation side-effects | Visual canvas for prompt iteration, not batch books | Freemium |
claude_translater suits minimalists with basic shell access but lacks parallel throughput and hash checks, leading to truncation on 100+ page inputs. translate-shell excels for quick single-file jobs with local models, avoiding API costs, but mangles book layouts without Calibre. Claude Code Canvas fits exploratory translations where visual feedback matters over automation.
How Rainman Translate Book Works
Rainman Translate Book ingests books via Calibre's ebook-convert chain: PDF/DOCX/EPUB → HTMLZ → HTML → Markdown, splitting output into fixed chunks like chunk0001.md. A manifest.json records each chunk's SHA-256 hash and metadata. Python scripts using pypandoc and beautifulsoup4 drive this preprocessing.
Parallelism kicks in with 8 Claude subagents launched per batch: each reads one chunk, translates via isolated context windows (preventing token overflow), and writes output_chunk*.md. Batches queue to throttle API calls, typically 1-2 minutes per 5k-word chunk on Opus model.
Validation scans hashes post-translation; mismatches re-queue chunks. Pandoc merges valid outputs into output.md, injects TOC via BeautifulSoup, and Calibre rebuilds final formats. Here's a core workflow snippet:
# Install via npx
npx skills add deusyu/translate-book -a claude-code -g
# In Claude Code terminal
/translate-book translate /path/to/book.pdf to Chinese
This command triggers the full pipeline: conversion starts immediately, chunks appear in {book_name}_temp/, subagents log progress, and outputs land after 20-60 minutes for a 300-page book, depending on model speed.
The design favors subagent isolation over shared context, trading minor consistency risks (prompt drift) for scalability on long texts. Python's subprocess spawns Claude CLI instances, with manifest.json as the single source of truth.
Pros and Cons of Rainman Translate Book
Pros:
- Achieves 4-8x faster translation than sequential tools via parallel subagents, processing 300-page books in under 45 minutes on Claude 3.5 Sonnet.
- Chunk-level resumability cuts retry costs by 90% on API-limited runs, using hash checks without reprocessing unchanged sections.
- Calibre + Pandoc chain retains 95% of original formatting, including tables and footnotes, outperforming naive Markdown splits.
- Isolated subagent contexts eliminate truncation, handling 50k+ token books where single-prompt methods fail.
- MIT-licensed with extensible prompts; add languages by tweaking subagent instructions in
SKILL.md. - Generates print-ready PDF/EPUB alongside editable DOCX, streamlining publishing workflows.
Cons:
- Requires Calibre, Pandoc, and pypandoc setup, adding 500MB+ disk and PATH configuration hurdles on fresh machines.
- Dependent on Claude API quotas; 8-subagent batches hit limits on free tiers after 2-3 books daily.
- No built-in image translation or OCR for scanned PDFs, deferring to upstream Calibre plugins.
- Subagent parallelism introduces rare inconsistencies (1-2% term drift across chunks) without post-edit merging.
- Outputs stay in temp dirs; no cloud sync or versioning beyond manual
manifest.jsonbackups.
Getting Started with Rainman Translate Book
Clone or install the skill first, ensuring prerequisites: Claude Code CLI authenticated, Calibre's ebook-convert in PATH, Pandoc installed, and pip install pypandoc beautifulsoup4.
# Option 1: npx (easiest)
npx skills add deusyu/translate-book -a claude-code -g
# Option 2: ClawHub
clawhub install translate-book
# In Claude Code:
/translate-book translate ./mybook.epub to Japanese
# Or conversational:
translate /path/to/book.pdf to French
Claude Code registers the skill instantly; the slash command parses path and target language, spawning the pipeline. Expect mybook_temp/ creation within seconds, chunk logs in terminal, and full outputs after processing—monitor via tail -f on temp dir. Initial runs take longest due to conversion; subsequent retries skip chunks.
Configure via ~/.claude/skills/translate-book edits: tweak concurrency in scripts or add --cleanup for post-build cleanup. Pairs well with Claude Context Mode for prompt refinements before batch runs.
Verdict
Rainman Translate Book is the strongest Claude Code Skill for translating 200+ page technical books when parallel subagents and resumability matter over setup simplicity. Its manifest validation and Calibre outputs deliver production-grade results at zero cost beyond API usage. Use it if you hit limits with sequential translators; skip for one-off pages.



