Overview
The Reciclaje AI model has been trained on a diverse dataset of waste imagery, focusing on common recyclable materials found in household and commercial waste streams. The training data ensures the model can accurately identify and classify different types of waste materials.The model was trained specifically for the 5 core waste categories: Metal, Glass, Plastic, Carton, and Medical waste.
Dataset Composition
Waste Categories
The training dataset includes images across all 5 detection classes:Metal (Class 0)
Metal (Class 0)
Examples include:
- Aluminum cans (soda, beer)
- Metal food containers (tin cans)
- Metal bottle caps
- Small metallic objects and packaging
Glass (Class 1)
Glass (Class 1)
Examples include:
- Glass bottles (clear, green, brown)
- Glass jars and containers
- Broken glass pieces
- Various glass packaging materials
Plastic (Class 2)
Plastic (Class 2)
Examples include:
- PET bottles (water, soft drinks)
- Plastic containers and packaging
- Plastic bags and film
- Various plastic product types
Carton (Class 3)
Carton (Class 3)
Examples include:
- Cardboard boxes (various sizes)
- Paper cartons (milk, juice)
- Corrugated cardboard
- Paperboard packaging
Medical (Class 4)
Medical (Class 4)
Examples include:
- Disposable masks
- Medical gloves
- Healthcare-related waste items
- Sanitization materials
Data Characteristics
Image Variety
The training dataset was designed to include:- Multiple Angles: Objects photographed from different perspectives
- Varying Lighting: Images captured under different lighting conditions
- Different Backgrounds: Various environmental contexts
- Mixed Conditions: Clean and dirty/damaged items
- Scale Variation: Objects at different distances and sizes
Real-World Scenarios
The model was trained on real-world waste imagery to ensure practical applicability in actual recycling and waste management scenarios.
- Items in recycling bins
- Waste on conveyor belts
- Mixed waste situations
- Individual items against various backgrounds
Data Preprocessing
Standard YOLOv8 preprocessing techniques were applied:Dataset Limitations
While comprehensive, the training data has some limitations:Underrepresented Scenarios
Certain conditions may be underrepresented:- Extreme lighting conditions (very dark or bright)
- Unusual perspective angles
- Heavily damaged or degraded items
- Uncommon waste item variations
- Items with significant occlusion
For detailed information about model limitations, see the Limitations page.
Continuous Improvement
The model can be improved through:- Additional Training Data: Expanding the dataset with more diverse examples
- Fine-tuning: Adjusting the model for specific use cases or environments
- Active Learning: Incorporating feedback from real-world deployments
Contributing Data
If you have waste imagery that could improve the model:Technical Details
Annotation Format
The model uses YOLO format annotations:Class Mapping
Next Steps
Model Overview
Learn about the model architecture and specifications
Limitations
Understand known limitations and edge cases