Overview
The taxonomy system classifies creator content using 6 super-concept dimensions with 3,208 pre-classified tags. Two collections power this:taxonomy_dimensions for the classification schema, and taxonomy_mapping for tag assignments.
taxonomy_dimensions
Defines the 6 super-concept dimensions used for content classification.Purpose
- Define classification schema for content tagging
- Store dimension names, descriptions, and allowed values
- Enable AI-powered content discovery and recommendations
- Support multi-dimensional content search
The 6 Dimensions
| Dimension | Description | Example Values |
|---|---|---|
body_type | Physical appearance and build | athletic, curvy, petite, muscular, slim |
performance_style | Content tone and presentation | sensual, playful, dominant, submissive, artistic |
setting | Location and environment | bedroom, outdoor, bathroom, office, studio |
attire | Clothing and accessories | lingerie, casual, cosplay, fetish_wear, nude |
audience_appeal | Target audience and niche | mainstream, fetish, couples, solo, group |
production_quality | Technical production level | professional, amateur, selfie, cinematic |
Key Fields
| Field | Type | Description |
|---|---|---|
id | UUID | Primary key |
dimension_name | String | Unique dimension identifier (slug) |
display_name | String | Human-readable name |
description | Text | Dimension purpose and usage |
allowed_values | JSON | Array of valid tag values for this dimension |
is_multi_select | Boolean | Whether multiple values can be selected |
sort_order | Integer | Display order in UI |
created_at | DateTime | Dimension creation timestamp |
Example Queries
List All Dimensions
Get Dimension with Values
taxonomy_mapping
Stores 3,208 pre-classified tags mapped across the 6 dimensions.Purpose
- Map platform-specific tags to taxonomy dimensions
- Enable fast AI-powered content classification
- Support content search and discovery
- Provide training data for taxonomy classifiers
Classification System
Each tag can have assignments across multiple dimensions:Key Fields
| Field | Type | Description |
|---|---|---|
id | UUID | Primary key |
tag | String | Original tag text (normalized lowercase) |
platform_source | String | Platform where tag originated |
dimension_mappings | JSON | Object mapping dimension names to value arrays |
confidence_score | Decimal | Classification confidence (0.0-1.0) |
usage_count | Integer | How many times this tag has been applied |
created_at | DateTime | Mapping creation timestamp |
updated_at | DateTime | Last classification update |
Example Queries
Search for Tag
Get Tags by Dimension
Create New Mapping
AI Classification
The taxonomy system powers AI-powered content classification:Action Flow
Use thetaxonomy-tag action flow to auto-classify content:
Custom Taxonomy Model
Thescout-fast-tag:latest Ollama model is fine-tuned on the 3,208 taxonomy mappings:
Related Collections
scraped_media- Content items tagged with taxonomy classificationsaction_flows- Thetaxonomy-tagflow that applies classificationsagent_audits- Logs of taxonomy classification executions
Workflow Integration
Content Classification Flow
- User uploads or scrapes new media with caption
- AI chat interface or dashboard triggers classification
- Agent emits
[ACTION:taxonomy-tag:{"media_id":"..."}] - Action runner fetches media caption + existing tags
- Calls Ollama
scout-fast-tagmodel with caption - Model returns relevant tags from 3,208 mapping corpus
- System looks up
taxonomy_mappingfor each tag - Aggregates
dimension_mappingsinto final classification - Updates
scraped_media.taxonomy_tagswith result - Content now discoverable via dimension-based search
Search and Discovery Flow
- User searches for “athletic outdoor content”
- System queries
taxonomy_mappingfor matching tags - Finds tags with
body_type: ["athletic"]ANDsetting: ["outdoor"] - Queries
scraped_mediawheretaxonomy_tagscontains those values - Returns ranked results by engagement metrics
Best Practices
- Normalize tags before lookup - Convert to lowercase, trim whitespace
- Use confidence thresholds - Only apply mappings with
confidence_score >= 0.7 - Track usage - Increment
usage_countwhen applying classifications - Handle multi-select - Check
taxonomy_dimensions.is_multi_selectbefore applying - Batch classify - Process multiple media items in single taxonomy flow execution
- Update existing classifications - Re-run classification when captions are edited
Data Sources
The 3,208 taxonomy mappings were sourced from:- OnlyFans tag corpus (2,400+ tags)
- Fansly category system (600+ tags)
- Manual classification of top 200 industry terms
- Community-submitted classifications
Model Training
Thescout-fast-tag model was fine-tuned using:
- Base model: SmolLM 135M parameters
- Training data:
Nodes/Universe/taxonomy_graph.json(3,205 nodes) - Training script:
Nodes/Universe/apply_taxonomy_assignments.py - Fine-tuning method: HITL (Human-in-the-loop) corrections
- Accuracy: 87% on validation set
See Also
- Media Collections - Content that receives classifications
- Ollama MCP Tools - AI model inference for classification
- Action Runner - Flow execution system
