Skip to main content

Documentation Index

Fetch the complete documentation index at: https://mintlify.com/practical-tutorials/project-based-learning/llms.txt

Use this file to discover all available pages before exploring further.

OpenCV is the go-to library for computer vision in Python, and these tutorials show you how to put it to work on real-world problems. You’ll build document scanners, face detection and recognition systems, barcode readers, people counters, OCR pipelines, semantic and instance segmentation tools, and more — often combining OpenCV with deep learning backends, dlib, and Tesseract. Whether you’re a beginner looking to understand perspective transforms or an experienced practitioner exploring Mask R-CNN instance segmentation, there’s a project here for you.
Most OpenCV tutorials from PyImageSearch require opencv-python and imutils. Some deep learning tutorials also need TensorFlow or Keras.

Face Detection & Recognition

Detect faces in images and video streams, cluster them by identity, and build end-to-end recognition pipelines.

Object Detection & Tracking

Detect, localize, and track single or multiple objects across frames using classical and deep-learning-based approaches.

Image Processing & Analysis

Apply OpenCV’s image analysis capabilities — from saliency maps and semantic segmentation to barcode scanning, image stitching, and neural style transfer.

Pose & Landmark Detection

Detect facial landmarks, hand keypoints, and other structural features in images using dlib and deep learning.

Document & Text Processing

Scan physical documents, recognize text with OCR, detect text regions in natural images, and correct skewed text lines.

Miscellaneous OpenCV

Additional OpenCV projects that span multiple application areas or don’t fit neatly into the categories above.
Start with ‘Build A Document Scanner’ — it demonstrates OpenCV’s core perspective transform workflow in just a few dozen lines of Python.

Build docs developers (and LLMs) love