This page summarises the main changes introduced in each version ofDocumentation Index
Fetch the complete documentation index at: https://mintlify.com/HUANGCHIHHUNGLeo/claude-real-video/llms.txt
Use this file to discover all available pages before exploring further.
claude-real-video. For the very latest updates and patch releases, see the GitHub releases page.
0.3.0
New flags
-
--why— Write your viewing intent directly intoMANIFEST.txt. The model reads this up front and analyses the frames and transcript through that lens rather than producing a generic summary. -
--kb— Save the analysis as a dated markdown note (e.g.2025-01-15-video-slug.md) into a folder you specify — an Obsidian vault, a plain notes directory, anywhere. The note lives on aftercrv-outis overwritten. -
--keep-audio— Preserve the full original soundtrack asaudio.m4aalongside the frames and transcript. The audio is copied losslessly where possible, and re-encoded to AAC otherwise. This lets audio-capable models (Gemini, GPT-4o, …) actually hear the video — not just read the words. -
--report— Generatereport.htmlin the output directory, showing every extracted frame — kept, dropped as a duplicate, or removed by the--max-framescap — along with its pixel-diff percentage. Open in a browser to tune--dedup-thresholdand--scenevisually.
Subtitle priority chain
claude-real-video now checks for existing subtitles before falling back to Whisper, in this order:
- A sidecar
.srtor.vttfile sitting next to a local source file. - An embedded subtitle stream inside the video container.
- Whisper transcription of the audio track (unchanged fallback).
Sliding-window deduplication (--dedup-window, default 4)
Previously, deduplication only compared each frame against its immediate predecessor. Version 0.3.0 compares against the last N kept frames (default 4). This catches A-B-A cuts — a shot the model has already seen won’t be sent again just because a different frame appeared in between.
Set --dedup-window 1 to restore the old consecutive-only behaviour.
Honest transcript notes
MANIFEST.txt now distinguishes clearly between two “no transcript” situations that were previously conflated:
- Silent video (no audio track):
(none — this video has no subtitles and no audio track) - Whisper not installed:
(none — no existing subtitles; install whisper to transcribe: pip install openai-whisper)
The current release is 0.3.0. Install or upgrade via pip: pypi.org/project/claude-real-video.