Most issues with USB-Uncensored-LLM fall into three categories: engine not found (ranDocumentation Index
Fetch the complete documentation index at: https://mintlify.com/techjarves/USB-Uncensored-LLM/llms.txt
Use this file to discover all available pages before exploring further.
start before install), insufficient RAM (model too large for the host machine), or platform-specific permission problems. The solutions below cover every known issue across Windows, macOS, Linux, and Android.
Windows Issues
Script closes instantly on Windows
Script closes instantly on Windows
python or winget commands typed at a terminal and opens the Microsoft Store instead of executing the binary. When a .bat file that calls python hits this interception, Windows opens the Store pop-up and immediately exits the script.Fix:- Run the
.batfile from a Command Prompt (cmd.exe) window rather than double-clicking it from Explorer. - Alternatively, right-click the
.batfile and select Run as Administrator. - To permanently fix the issue: go to Settings → Apps → Advanced app settings → App execution aliases and disable the aliases for
python.exeandpython3.exe.
Shared/python/python.exe and downloads one automatically on first run if no system Python is found. The App Execution Alias issue only affects the brief moment before the bundled Python is located."Ollama Engine Not Found" error
"Ollama Engine Not Found" error
Windows/start-fast-chat.bat was run before Windows/install.bat completed. The start script checks for the engine binary at Shared/bin/ollama-windows.exe and exits with this error if it is not present.Fix: Run Windows/install.bat first. The installer downloads the Ollama engine (step 6 of 7) and imports your selected models into the registry. The start script cannot function without the engine binary.If install.bat was interrupted mid-download, re-run it — the installer skips files that already exist and only re-downloads what is missing.PowerShell execution policy error
PowerShell execution policy error
Restricted, which blocks unsigned scripts from running.Fix: The install.bat file already passes -ExecutionPolicy Bypass to every PowerShell invocation, which should bypass this restriction without changing system settings. If the error still appears, run the following in an Administrator PowerShell session:macOS Issues
"cannot be opened because the developer cannot be verified"
"cannot be opened because the developer cannot be verified"
xattr -dr com.apple.quarantine on the engine binaries during setup. If the error persists after installation (for example, if you copied the folder from another machine), remove the quarantine attribute manually:install.command closes without running
install.command closes without running
.command file in Finder opens a new Terminal window and runs the script, but if macOS cannot determine a working directory or the script exits immediately due to a permission error, the window closes before you can read the output.Fix: Open Terminal manually, drag the Mac/install.command file into the Terminal window to paste its full path, and press Enter. This keeps the Terminal session open so you can read any error messages.If the script reports a permission denied error on the script itself:Linux Issues
"zstd: command not found" during extraction
"zstd: command not found" during extraction
.tar.zst archive. The installer extracts it using tar --use-compress-program=zstd, which requires the zstd binary to be available on the host system. Many minimal Linux installations do not include it by default.Fix: Install zstd using your distribution’s package manager, then re-run bash Linux/install.sh:llama-server missing after install
llama-server missing after install
Shared/bin/ollama-linux and Shared/lib/ollama/llama-server — if either is missing, it reports the engine as incomplete.Fix: Re-run the Linux installer. It detects the incomplete install and re-downloads the full archive:Android Issues
Termux process killed during compilation
Termux process killed during compilation
llama.cpp from source takes 10–30 minutes, which is long enough for Android to kill the process multiple times.Fix: Acquire a wake lock before starting the installer to prevent Android from suspending Termux:clang cmake git wget ninja python. If any of these packages fail to install, the compilation step will fail. Run apt update && apt full-upgrade -y first if you see package errors.The Android/start.sh script also calls termux-wake-lock automatically at launch to prevent the inference process from being killed during use."pkg command not found" or stale package errors
"pkg command not found" or stale package errors
pkg can also have corrupted package caches that cause install failures.Fix:- Uninstall Termux from the Play Store.
- Install Termux from F-Droid — this is the maintained, up-to-date version.
apt directly (instead of pkg) and runs apt full-upgrade before installing build tools to avoid cache-related issues with outdated Termux installations.Performance Issues
Very slow text generation (less than 1 token/sec)
Very slow text generation (less than 1 token/sec)
GPU not being used
GPU not being used
Shared/bin/ auto-detects available hardware accelerators at startup and prints the detected configuration in the terminal window.NVIDIA (Windows/Linux):- Verify that
nvidia-smiruns and shows your GPU. - Ensure CUDA drivers are installed and up to date.
- CUDA 11.x and 12.x are both supported by the bundled Ollama engine.
- Metal acceleration is detected and enabled automatically on M1/M2/M3/M4 chips. No action needed.
- ROCm support depends on the Ollama build. Check the terminal output at startup for
rocmin the device list.
cpu only despite a GPU being present, ensure your GPU driver is not blocked by Gatekeeper (macOS) or that the CUDA runtime is on the system PATH (Linux).Chat Server Issues
Port 3333 already in use
Port 3333 already in use
CHAT_SERVER_PORT in Shared/chat_server.py.Chat history not saving
Chat history not saving
Shared/chat_data/ directory does not exist or is not writable. This can happen on USB drives formatted as FAT32 if the physical write-protect switch on the drive is engaged, or if the drive was ejected and re-mounted as read-only.Checklist:- Confirm
Shared/chat_data/exists. The server creates it automatically on startup, so if it is missing, check that the server actually launched without errors. - On USB drives, check for a small physical write-protect switch on the side of the drive and ensure it is in the unlocked position.
- On Linux/macOS, check mount flags:
mount | grep <drive-name>. Ifroappears, the drive was mounted read-only. Remount with write permissions or use a different USB port. - On Windows, right-click the drive in Explorer → Properties → check that “Read-only” is not selected.
os.replace() after writing to a .tmp file) so partial saves cannot corrupt existing history — if a save fails, the previous chats.json remains intact.Ollama is already running — engine started twice
Ollama is already running — engine started twice
http://127.0.0.1:11434/api/tags before attempting to start the engine. If Ollama is already responding, the scripts skip the startup step and go straight to launching the chat server.You will see [OK] Ollama engine is already running! in the terminal when this happens — this is normal and not an error.