Skip to main content
Mega Brain accepts any format of expert content and transforms it into structured knowledge. This guide walks you through ingesting different material types into the system.

Overview

The ingestion process automatically:
  • Downloads and transcribes content
  • Extracts metadata (author, title, date)
  • Saves files with source traceability
  • Detects duplicates to prevent reprocessing
  • Prepares materials for the processing pipeline
All ingested materials are saved to the inbox/ directory organized by person and content type.

Supported Material Types

Ingesting YouTube Videos

The most common source. JARVIS automatically downloads transcriptions using the YouTube Transcript API.
/ingest https://www.youtube.com/watch?v=EXAMPLE_ID
What happens:
  1. Video metadata is fetched (title, author, duration)
  2. Transcript is downloaded automatically
  3. File is saved as inbox/[PERSON]/[TYPE]/video-title.txt
  4. A unique SOURCE_ID is generated (e.g., CG003 for Cole Gordon video 3)
Example output:
JARVIS: Material received.

  Source:    YouTube
  Title:     "How to Create Irresistible Offers"
  Author:    Alex Hormozi
  Duration:  42:15
  Words:     6,230

  Saved to: inbox/alex-hormozi/MASTERCLASS/how-to-create-irresistible-offers.txt

  Next step: execute /process-jarvis to process.

Advanced Ingestion Options

Manual Metadata Override

Override auto-detected metadata:
# Specify person explicitly
/ingest https://youtube.com/watch?v=abc123 --person "Cole Gordon"

# Set content type
/ingest /path/to/file.txt --type MASTERCLASS

# Ingest and immediately process
/ingest https://youtube.com/watch?v=abc123 --process

Content Type Classification

TypeAuto-Detection KeywordsUse Case
PODCAST”podcast”, “episode”, “interview”Conversational content
MASTERCLASS”masterclass”, “mastermind”, “training”Expert workshops
COURSE”course”, “module”, “lesson”Structured programs
BLUEPRINT”blueprint”, “pdf”, “playbook”Strategic documents
VSL”vsl”, “webinar”, “sales letter”Sales presentations
SCRIPT”script”, “template”, “copy”Templates and frameworks

Known Source Detection

The system recognizes expert sources automatically:
1

Path Analysis

Analyzes the file path or URL to detect known experts:
inbox/COLE GORDON/MASTERMINDS/video.txt

SOURCE_PERSON: Cole Gordon
SOURCE_COMPANY: Cole Gordon
CORPUS: closers_io
2

Known Sources Library

Detected PatternPersonCompanyCorpus
”hormozi”, “acquisition”Alex HormoziAlex Hormoziacquisition_com
”cole gordon”, “closers”Cole GordonCole Gordonclosers_io
”leila”Leila HormoziAlex Hormoziacquisition_com
”setterlun”, “sam ovens”Sam OvensSetterlun Universitysetterlun
3

Automatic Corpus Assignment

Related materials from the same expert are grouped into a corpus for cross-referencing.

Duplicate Detection

The pipeline includes 6 levels of duplicate detection to prevent reprocessing:

Detection Levels

  1. Exact MD5 Match - Same file already processed
  2. Content Hash - Different file, same content
  3. Partial Match - Content is substring of processed material
  4. YouTube ID - Same video from different source
  5. File Registry - Cross-reference with processing history
  6. Chunk Fingerprint - First 1000 characters comparison
When duplicates are found:
┌─────────────────────────────────────────────────────────┐
│  ⛔ DUPLICATE DETECTED - PROCESSING INTERRUPTED        │
├─────────────────────────────────────────────────────────┤
│                                                         │
│  Current file:  inbox/new-video.txt                     │
│  MD5:           a1b2c3d4e5f6...                         │
│                                                         │
│  Duplicate of:  inbox/old-video.txt                     │
│  Registered:    2026-02-15                              │
│  SOURCE_ID:     CG005                                   │
│                                                         │
│  ACTION: File will NOT be processed.                    │
│                                                         │
└─────────────────────────────────────────────────────────┘

Batch Ingestion

Process multiple materials at once:
# List of YouTube videos
/ingest https://youtube.com/watch?v=video1
/ingest https://youtube.com/watch?v=video2
/ingest https://youtube.com/watch?v=video3

Verification

Check Inbox Status

View pending materials:
/inbox
Output:
═══════════════════════════════════════════════
                INBOX STATUS
═══════════════════════════════════════════════

📥 PENDING: 3 files

  Cole Gordon (2 files)
  ├─ masterclass-closing.txt (6,234 words)
  └─ podcast-interview.txt (8,901 words)

  Alex Hormozi (1 file)
  └─ offer-creation-framework.txt (4,567 words)

⭐ NEXT: /process-jarvis to process pending files

Check System Status

/jarvis-briefing
Provides:
  • Health score (0-100)
  • Processed materials count
  • Pending inbox files
  • Active agents status

Next Steps

Process Ingested Materials

Learn how to run the 5-phase processing pipeline on ingested materials

View Processing Results

Check processing logs and generated artifacts

Build docs developers (and LLMs) love