Quick Start

Welcome

This guide will help you make your first diabetes prediction using the easiest deployment method - the REST API (Phase 3). You’ll have a working prediction endpoint in minutes.

Time Required: ~10 minutesPrerequisites: Docker installed on your system

Quick Start with REST API

Clone or Navigate to Source

First, ensure you have the project source code:

cd ~/workspace/source/fase-3

Build Docker Image

Build the FastAPI container:

docker build -t apirest .

This creates a Docker image with all dependencies installed:

Python 3.12
FastAPI 0.111.0
scikit-learn 1.4.1
imbalanced-learn 0.12.0
All other required packages

Run API Container

Start the API server in detached mode:

docker run -d --name apirest-container -p 80:80 apirest

The API is now running and accessible at http://localhost

Copy Training Data

Copy your training dataset into the container:

# From the resources directory
docker cp train.csv apirest-container:/app

Make sure you have train.csv from the Kaggle dataset. See Dataset Documentation for download instructions.

Access Swagger UI

Open your browser and navigate to:

http://localhost/docs

You’ll see the interactive Swagger API documentation with two endpoints:

POST /train - Train the model
POST /predict - Make predictions

Train the Model

In Swagger UI:

Click on the POST /train endpoint
Click “Try it out”
Click “Execute”

The API will:

Load train.csv
Encode categorical features
Scale features with StandardScaler
Apply SMOTEENN resampling
Train RandomForestClassifier
Save model to model.pkl

Response:

{
  "message": "Model successfully trained"
}

Make Your First Prediction

Now predict diabetes for a sample patient:

Click on the POST /predict endpoint
Click “Try it out”
Replace the request body with this patient data:

{
  "gender": "Female",
  "age": 36,
  "hypertension": 0,
  "heart_disease": 0,
  "smoking_history": "current",
  "bmi": 32.27,
  "HbA1c_level": 6.2,
  "blood_glucose_level": 220
}

Click “Execute”

Response:

{
  "message": "Tiene diabetes"
}

Try different patient values to see how the prediction changes! Patients with higher HbA1c levels (≥6.5%) and blood glucose (≥126 mg/dL) are more likely to have diabetes.

Test with Different Patient Profiles

High Risk Patient
Low Risk Patient
Moderate Risk Patient

{
  "gender": "Female",
  "age": 36,
  "hypertension": 0,
  "heart_disease": 0,
  "smoking_history": "current",
  "bmi": 32.27,
  "HbA1c_level": 6.2,
  "blood_glucose_level": 220
}

Expected: “Tiene diabetes” (Has diabetes)Why: High BMI (obese range), elevated HbA1c (prediabetic), very high blood glucose

{
  "gender": "Male",
  "age": 28,
  "hypertension": 0,
  "heart_disease": 0,
  "smoking_history": "never",
  "bmi": 22.5,
  "HbA1c_level": 5.0,
  "blood_glucose_level": 90
}

Expected: “No tiene diabetes” (No diabetes)Why: Young, healthy BMI, normal HbA1c and glucose levels, no smoking

{
  "gender": "Female",
  "age": 54,
  "hypertension": 1,
  "heart_disease": 0,
  "smoking_history": "former",
  "bmi": 27.32,
  "HbA1c_level": 5.8,
  "blood_glucose_level": 105
}

Expected: Variable (depends on model)Why: Overweight, hypertension present, prediabetic HbA1c range

Using cURL

You can also interact with the API using command-line tools:

curl -X POST "http://localhost/train" \
  -H "accept: application/json"

Verify Container Status

Check that your container is running properly:

# View running containers
docker ps

# Expected output:
CONTAINER ID   IMAGE      COMMAND                  PORTS                NAMES
abc123def456   apirest    "fastapi run apirest..."  0.0.0.0:80->80/tcp   apirest-container

# View container logs
docker logs apirest-container

# Enter container shell (optional)
docker exec -it apirest-container /bin/bash

Stopping and Cleaning Up

When you’re done:

# Stop the container
docker stop apirest-container

# Remove the container
docker rm apirest-container

# (Optional) Remove the image
docker rmi apirest

Understanding the API Response

The predict endpoint returns a simple JSON response:

{
  "message": "Tiene diabetes"  // or "No tiene diabetes"
}

The messages are in Spanish:

“Tiene diabetes” = Has diabetes (prediction: 1)
“No tiene diabetes” = No diabetes (prediction: 0)

Troubleshooting

Port 80 already in use

If port 80 is occupied, use a different port:

docker run -d --name apirest-container -p 8080:80 apirest

Then access at http://localhost:8080/docs

Model file does not exist error

This means the model hasn’t been trained yet. Make sure to:

Copy train.csv into the container
Call the /train endpoint before /predict

docker cp train.csv apirest-container:/app

Something went wrong message

Check the container logs for detailed error messages:

docker logs apirest-container

Common issues:

Invalid patient data format
Missing required fields
Invalid categorical values (e.g., wrong gender or smoking_history)

Cannot download train.csv

You need Kaggle credentials to download the dataset:

Create a Kaggle account
Generate API token (kaggle.json)
Download dataset:

kaggle datasets download -d iammustafatz/diabetes-prediction-dataset
unzip diabetes-prediction-dataset.zip

See Dataset Documentation for detailed instructions.

What’s Next?

Phase 1: Notebook

Explore the data and model in an interactive Jupyter notebook

Phase 2: CLI

Use command-line tools for batch predictions

API Deployment

Advanced API deployment and integration guide

Patient Features

Understand what each patient feature means

Alternative Start: CLI (Phase 2)

If you prefer command-line tools over REST API:

# Navigate to fase-2
cd ~/workspace/source/fase-2

# Build and run container
docker build -t ai-proyecto-sustituto .
docker run -it --name ai-container ai-proyecto-sustituto /bin/bash

# In another terminal, copy data files
docker cp train.csv ai-container:/app
docker cp test.csv ai-container:/app

# Back in the container shell
python train.py --model_file model.pkl --data_file train.csv --overwrite_model
python predict.py --model_file model.pkl --input_file test.csv --predictions_file predictions.csv

# View predictions
cat predictions.csv

See Phase 2: CLI Tools for detailed documentation.

Overview

Getting Started

Core Concepts

Deployment

Welcome

Quick Start with REST API

Test with Different Patient Profiles

Using cURL

Verify Container Status

Stopping and Cleaning Up

Understanding the API Response

Troubleshooting

What’s Next?

Phase 1: Notebook

Phase 2: CLI

API Deployment

Patient Features

Alternative Start: CLI (Phase 2)

Build docs developers (and LLMs) love

Overview

Getting Started

Core Concepts

Deployment

Documentation Index

​Welcome

​Quick Start with REST API

​Test with Different Patient Profiles

​Using cURL

​Verify Container Status

​Stopping and Cleaning Up

​Understanding the API Response

​Troubleshooting

​What’s Next?

Phase 1: Notebook

Phase 2: CLI

API Deployment

Patient Features

​Alternative Start: CLI (Phase 2)

Build docs developers (and LLMs) love

Welcome

Quick Start with REST API

Test with Different Patient Profiles

Using cURL

Verify Container Status

Stopping and Cleaning Up

Understanding the API Response

Troubleshooting

What’s Next?

Alternative Start: CLI (Phase 2)