run

Overview

The run function executes a bootstrapped Syft-Flwr project in simulation mode, allowing you to test your federated learning workflow locally using mock datasets before deploying to real datasites.

Function Signature

def run(
    project_dir: Union[str, Path],
    mock_dataset_paths: list[Union[str, Path]]
) -> Union[bool, asyncio.Task]

Parameters

project_dir

Union[str, Path]

required

Path to the bootstrapped Syft-Flwr project directory. Must contain main.py and pyproject.toml.

mock_dataset_paths

list[Union[str, Path]]

required

List of paths to mock datasets for each datasite. The number of paths must match the number of datasites configured during bootstrap.

Returns

Synchronous Mode

bool

Returns True if the simulation succeeded, False otherwise. Used when running in scripts or standard Python environments.

Async Mode

asyncio.Task

Returns a task handle if running in an async environment (e.g., Jupyter). Callers can await this task.
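
Because the return type depends on the environment, calling code may want to handle both cases. The following is a minimal sketch that assumes only what is stated above: a bool in synchronous mode, an awaitable task otherwise. resolve_run_result is a hypothetical helper, not part of syft_flwr. The value the task resolves to is not documented here, so the sketch passes it through unchanged.

```python
import asyncio
from typing import Any


async def resolve_run_result(result: Any) -> Any:
    """Normalize the return value of run() from inside async code.

    Hypothetical helper for illustration; not part of syft_flwr.
    """
    if isinstance(result, bool):
        # Synchronous mode: run() has already finished and reported the outcome.
        return result
    # Async mode: run() handed back a task, so wait for it to finish.
    return await result
```

In a notebook cell this could be used as outcome = await resolve_run_result(syft_flwr.run(project_dir, mock_paths)).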

What It Does

1. Validates the project - Ensures the project was bootstrapped correctly
2. Sets up mock RDS clients - Creates a simulated SyftBox network in a temporary directory
3. Bootstraps encryption keys - Generates E2E encryption keys for all participants (if enabled)
4. Runs simulation - Executes server and client code concurrently:
   - DS (Data Scientist) runs the server/aggregator code
   - Each DO (Data Owner) runs client code on their mock dataset
5. Collects logs - Saves execution logs to {project_dir}/simulation_logs/
6. Cleans up - Removes the temporary network directory and encryption keys

Usage Example

From notebooks/fl-diabetes-prediction/local/ds.ipynb:323-326:

import syft_flwr
from pathlib import Path

SYFT_FLWR_PROJECT_PATH = Path("../fl-diabetes-prediction")

# Get mock dataset paths from the datasites.
# do_clients comes from earlier notebook cells (not shown here).
mock_paths = []
for client in do_clients:
    dataset = client.dataset.get(name="pima-indians-diabetes-database")
    mock_paths.append(dataset.get_mock_path())

# Run simulation
print(f"Running syft_flwr simulation with mock paths: {mock_paths}")
syft_flwr.run(SYFT_FLWR_PROJECT_PATH, mock_paths)

Simulation Logs

Logs for each participant are saved to:

{project_dir}/simulation_logs/
├── ds@openmined.org.log      # Aggregator logs
├── do1@openmined.org.log     # Data Owner 1 logs
└── do2@openmined.org.log     # Data Owner 2 logs
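
To inspect these files after a run, something like the sketch below could be used. It relies only on the directory layout shown above (one .log file per participant under simulation_logs/). read_simulation_logs is a hypothetical helper, not part of syft_flwr.

```python
from pathlib import Path
from typing import Dict, Union


def read_simulation_logs(project_dir: Union[str, Path]) -> Dict[str, str]:
    """Map each participant to the text of its log file.

    Hypothetical helper for illustration; not part of syft_flwr.
    """
    log_dir = Path(project_dir) / "simulation_logs"
    logs: Dict[str, str] = {}
    for log_file in sorted(log_dir.glob("*.log")):
        # "ds@openmined.org.log" -> "ds@openmined.org"
        logs[log_file.stem] = log_file.read_text(encoding="utf-8", errors="replace")
    return logs
```

If the directory does not exist yet (for example, before the first run), the function returns an empty dict, because globbing a missing directory yields no matches.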

Environment Variables

SYFT_FLWR_ENCRYPTION_ENABLED

str

default:"true"

Set to "false" to disable end-to-end encryption during simulation.

SYFT_FLWR_SKIP_MODULE_CHECK

str

default:"false"

Set to "true" to skip module validation (useful for parallel testing).

DATA_DIR

str

Automatically set for each client to point to their mock dataset path.

SYFTBOX_CLIENT_CONFIG_PATH

str

Automatically set for each client to point to their simulated config.
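
The two user-facing variables can be set from Python before calling run. The sketch below wraps that in a context manager so the previous values are restored afterwards. simulation_env is a hypothetical helper, not part of syft_flwr, and the sketch assumes the variables are read when the simulation starts, which this page does not state explicitly.

```python
import os
from contextlib import contextmanager
from typing import Iterator


@contextmanager
def simulation_env(encryption: bool = True, skip_module_check: bool = False) -> Iterator[None]:
    """Temporarily set the documented simulation variables.

    Hypothetical helper for illustration; not part of syft_flwr.
    """
    overrides = {
        "SYFT_FLWR_ENCRYPTION_ENABLED": "true" if encryption else "false",
        "SYFT_FLWR_SKIP_MODULE_CHECK": "true" if skip_module_check else "false",
    }
    # Remember the current values (None means "not set") so they can be restored.
    previous = {name: os.environ.get(name) for name in overrides}
    os.environ.update(overrides)
    try:
        yield
    finally:
        for name, value in previous.items():
            if value is None:
                os.environ.pop(name, None)
            else:
                os.environ[name] = value
```

In async mode the returned task may still be running when the with block exits, so it is safer to await the task inside the block.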

Exceptions

FileNotFoundError

Raised if the project directory, main.py, or pyproject.toml does not exist.

NotADirectoryError

Raised if the project path is not a directory.

ValueError

Raised if any mock dataset path doesn’t exist.
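
The conditions above can be checked ahead of time, for example in a test suite, before starting a simulation. The sketch below mirrors the documented exceptions. preflight_check is a hypothetical helper and is not the library's own validation code; the order of the checks is an assumption.

```python
from pathlib import Path
from typing import Sequence, Union

PathLike = Union[str, Path]


def preflight_check(project_dir: PathLike, mock_dataset_paths: Sequence[PathLike]) -> None:
    """Raise the documented exception types for invalid inputs.

    Hypothetical helper for illustration; not part of syft_flwr.
    """
    project = Path(project_dir)
    if not project.exists():
        raise FileNotFoundError(f"Project directory not found: {project}")
    if not project.is_dir():
        raise NotADirectoryError(f"Project path is not a directory: {project}")
    # A bootstrapped project must contain both of these files.
    for required in ("main.py", "pyproject.toml"):
        if not (project / required).exists():
            raise FileNotFoundError(f"Missing {required} in {project}")
    for mock_path in mock_dataset_paths:
        if not Path(mock_path).exists():
            raise ValueError(f"Mock dataset path does not exist: {mock_path}")
```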

Notes

- Simulation runs in a temporary directory under /tmp/{project_name}
- The server task completes first; the client tasks are then cancelled automatically
- All temporary files and encryption keys are cleaned up after simulation
- In Jupyter environments, the function returns a task that can be awaited
- DS logs are printed to stdout after the server completes
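
The "server completes, then clients are cancelled" behavior noted above follows a common asyncio pattern. The sketch below illustrates that pattern with stand-in coroutines. It is not the syft_flwr implementation, and the names server, client, and simulate are hypothetical.

```python
import asyncio
from typing import List, Tuple


async def server(rounds: int) -> str:
    """Stand-in for the DS aggregator: runs a fixed number of rounds, then stops."""
    for _ in range(rounds):
        await asyncio.sleep(0.01)
    return "server done"


async def client(name: str, cancelled: List[str]) -> None:
    """Stand-in for a DO client: keeps serving until it is cancelled."""
    try:
        while True:
            await asyncio.sleep(0.01)
    except asyncio.CancelledError:
        cancelled.append(name)
        raise


async def simulate(client_names: List[str]) -> Tuple[str, List[str]]:
    cancelled: List[str] = []
    client_tasks = [asyncio.create_task(client(n, cancelled)) for n in client_names]
    # Wait only for the server; the clients would otherwise run forever.
    result = await asyncio.create_task(server(rounds=3))
    for task in client_tasks:
        task.cancel()
    # Let the cancellations settle without propagating CancelledError.
    await asyncio.gather(*client_tasks, return_exceptions=True)
    return result, sorted(cancelled)
```

Calling asyncio.run(simulate(["do1", "do2"])) returns the server result together with the names of the clients that were cancelled.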
