Overview
AI scanning is the core workflow of Flashcard AI. Point the desktop app at a folder of images or a video file, and Gemini extracts every multiple-choice question as a structured flashcard — no manual typing required. Images are batched into PDFs (up to 50 pages per batch) and sent to Gemini with a structured extraction prompt. Gemini returns one JSON object per page containing the question, answer options, correct answers, question type, and whether it inferred the answer.Supported input types
| Type | Extensions |
|---|---|
| Images | .jpg, .jpeg, .png, .webp, .bmp |
| Video files | .mp4, .avi |
| Direct PDF | Sent as-is in batch mode |
Video files are processed by extracting frames first. Each distinct frame is treated as an image page before being batched.
How scanning works
Select a folder
Choose a folder containing your exam images (or a video file) from the desktop app. The app reads all supported image files in order.
Images are merged into PDFs
Images are grouped into batches of up to 50 pages each. Each batch is merged into a single in-memory PDF and sent to Gemini in one API call, which is faster and cheaper than sending images one by one.
Gemini extracts questions
Gemini processes every page and returns a JSON array — one object per page. The extraction prompt instructs Gemini to capture the question stem, all answer options, the correct answer(s), question type, and whether the answer was inferred.
Flashcards are created
Each valid JSON object becomes a
Flashcard. Pages with no question (NOT_A_QUESTION) are silently skipped. Cards with inferred answers get an automatic warning note.Gemini JSON output format
Each page Gemini processes returns one object with this structure:Model fallback chain
If a model is unavailable (HTTP 404) the app automatically falls back to the next model in the list:gemini-2.5-flash(default, recommended)gemini-2.5-flash-litegemini-3-flash-previewgemini-3.1-flash-lite-previewgemini-flash-latestgemini-flash-lite-latest
Inferred answers
When no explicit answer clue is visible in an image (no highlight, checkmark, filled bubble, or solution section), Gemini reasons using its domain knowledge and marks the card with"inferred": true.
These cards receive an automatic note:
⚠ Đáp án do AI suy luận (không có đáp án rõ trong ảnh)
Skipped pages
Gemini automatically skips pages that do not contain a question:- Blank or mostly blank pages
- Title pages and course headers
- Logo or watermark-only pages
- Diagrams or charts with no question stem
- Answer explanation pages without a question
"question": "NOT_A_QUESTION" and are not added to the deck.
Multi-key parallel mode
Adding multiple Gemini API keys enables parallel processing. The image list is divided into equal-sized packs — one pack per key — and each pack runs on a dedicated worker thread simultaneously.Keys are validated in parallel before the scan starts. Dead or invalid keys are excluded automatically, and the scan begins from a key that has had recovery time after validation.
Real-time log output
The scanning UI streams live progress messages:Scanning tips
Image quality matters
Image quality matters
Gemini reads the pixel data directly. Blurry, low-resolution, or heavily compressed images reduce extraction accuracy. Aim for at least 150 DPI for scanned documents.
Keep one question per image
Keep one question per image
The batch-PDF mode assumes one question per page. If an image contains multiple questions, only the first will be reliably extracted.
Rate limits and key count
Rate limits and key count
The default safe rate is 8 requests per minute per key. With one key this means roughly 7.5 seconds between batches. Add more keys to speed up scanning proportionally.
Special characters and code snippets
Special characters and code snippets
Gemini preserves code syntax, math formulas, Greek letters, and special symbols exactly as they appear. Indentation, operators (
==, !=, >=), and arrows (→, ≥) are retained in the output.Next steps
Study mode
Review your extracted deck with keyboard shortcuts and track mastery.
Quiz mode
Practice with multiple-choice questions and save your progress.