Documentation Index
Fetch the complete documentation index at: https://mintlify.com/MilesONerd/neurenix/llms.txt
Use this file to discover all available pages before exploring further.
Overview
Neurenix provides native Kubernetes integration for deploying, scaling, and managing ML models in production. The framework includes support for:- Deployments: Scalable model serving with rolling updates
- Pods: Individual container instances
- Services: Load balancing and service discovery
- ConfigMaps & Secrets: Configuration and credential management
- Jobs: Batch inference and training
Prerequisites
- Kubernetes cluster (1.19+)
- kubectl configured
- Docker images built and pushed to a registry
Quick Start
Deploy a Model
Expose via Service
Deployments
DeploymentConfig
Comprehensive deployment configuration:Deployment Operations
Neurenix-Specific Deployment
Simplified deployment creation:GPU Deployments
Pods
PodConfig
Pod Operations
Create Neurenix Pod
Services
ServiceConfig
Service Operations
Create Neurenix Service
Complete Production Deployment
YAML Export
Export configurations to YAML files:Best Practices
- Resource Limits: Always set CPU and memory limits to prevent resource exhaustion
- Health Checks: Implement liveness and readiness probes for reliability
- Rolling Updates: Use rolling updates with maxUnavailable=0 for zero-downtime deployments
- Horizontal Pod Autoscaling: Configure HPA for automatic scaling based on metrics
- Pod Disruption Budgets: Protect availability during cluster maintenance
- Namespaces: Use separate namespaces for different environments
- Labels and Selectors: Use consistent labeling for service discovery and monitoring
- Secrets Management: Use Kubernetes secrets or external secret managers
- Monitoring: Integrate with Prometheus and Grafana for observability
- Logging: Use structured logging with centralized log aggregation