Setting up Ollama for local AI job summaries in InfoJobs

Ollama is a local LLM runtime that InfoJobs DevBoard uses to generate job summaries entirely on your machine. Because everything runs locally, there are no API keys to manage, no per-request costs, and no data sent to a third-party service. The Express backend communicates with Ollama over localhost:11434 whenever a user requests a job summary.

Installation

Download Ollama

Go to https://ollama.com and download the installer for your operating system (macOS, Linux, or Windows). Follow the platform-specific installation instructions on the site.

Pull the model

Open a terminal and download the qwen2.5:3b model. This is approximately 2 GB, so the download time will depend on your connection speed.

ollama pull qwen2.5:3b

Start Ollama

Start the Ollama server. It will listen on localhost:11434 by default.

ollama serve

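Keep this terminal open while running InfoJobs DevBoard; the backend needs the server to be reachable at that address. If Ollama is already running in the background (for example, started by the desktop app or as a system service), ollama serve reports that the address is already in use and this step can be skipped.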
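The reachability requirement can also be checked from Node before the backend starts. The following is a minimal sketch, not code from the project: the function name isOllamaUp and the injectable fetchImpl parameter are hypothetical, and it assumes Node 18 or later for the global fetch. It relies only on the Ollama server answering a plain GET on its root URL.

```javascript
// Minimal reachability probe for a local Ollama server.
// `fetchImpl` is injectable so the function can be exercised without a network.
// isOllamaUp is a hypothetical helper, not part of InfoJobs DevBoard.
async function isOllamaUp(baseUrl = 'http://localhost:11434', fetchImpl = fetch) {
  try {
    const res = await fetchImpl(baseUrl);
    return res.ok; // true for any 2xx status
  } catch {
    // Connection refused, DNS failure, and similar network errors.
    return false;
  }
}
```

Calling isOllamaUp() at startup and logging a hint to run ollama serve when it resolves to false is one way to surface the problem early.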

Verify the installation

In a second terminal, confirm that the model was downloaded successfully. The qwen2.5:3b entry should appear in the output.

ollama list
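The same check can be done programmatically through Ollama's /api/tags endpoint, which returns the locally installed models. This is a sketch under assumptions: hasModel and isModelInstalled are hypothetical helper names, and Node 18 or later is assumed for the global fetch. Names without an explicit tag are treated as :latest, which is how Ollama lists them.

```javascript
// Ollama stores untagged names as "<name>:latest".
const withTag = (name) => (name.includes(':') ? name : `${name}:latest`);

// Returns true when `modelName` appears in an /api/tags payload.
function hasModel(tagsPayload, modelName) {
  const models = Array.isArray(tagsPayload?.models) ? tagsPayload.models : [];
  return models.some((m) => withTag(m.name) === withTag(modelName));
}

// Fetches the tag list from a local Ollama server and checks for the model.
// `fetchImpl` is injectable for testing; both helpers are hypothetical.
async function isModelInstalled(modelName, baseUrl = 'http://localhost:11434', fetchImpl = fetch) {
  const res = await fetchImpl(`${baseUrl}/api/tags`);
  if (!res.ok) throw new Error(`Ollama responded with HTTP ${res.status}`);
  return hasModel(await res.json(), modelName);
}
```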

Model options

qwen2.5:3b is the default model used by InfoJobs DevBoard, but Ollama supports several others. The table below lists models that are compatible with the same ollama.chat interface used by the backend:

Model	Download size	RAM required	Quality
qwen2.5:3b	~2 GB	~4 GB	Good (default)
llama3.2	~4 GB	~6 GB	Excellent
mistral	~4 GB	~6 GB	Good
phi3	~2 GB	~4 GB	Basic

Choose a larger model for higher-quality summaries if your machine has the available RAM, or stick with qwen2.5:3b or phi3 on machines with limited resources.
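The selection rule above can be expressed as a small helper. This is only an illustrative sketch: pickModel is a hypothetical function, the RAM figures are copied from the table, and the numeric quality ranking is an assumption derived from the Quality column.

```javascript
// RAM figures come from the table above; `rank` is an assumed ordering
// of the Quality column (3 = Excellent, 2 = Good, 1 = Basic).
const MODELS = [
  { name: 'llama3.2', ramGb: 6, rank: 3 },
  { name: 'qwen2.5:3b', ramGb: 4, rank: 2 },
  { name: 'mistral', ramGb: 6, rank: 2 },
  { name: 'phi3', ramGb: 4, rank: 1 },
];

// Returns the best-ranked model that fits in `availableRamGb`, or null.
function pickModel(availableRamGb) {
  const fitting = MODELS.filter((m) => m.ramGb <= availableRamGb);
  if (fitting.length === 0) return null;
  // Highest rank first; ties go to the smaller RAM requirement.
  fitting.sort((a, b) => b.rank - a.rank || a.ramGb - b.ramGb);
  return fitting[0].name;
}
```

In Node, total memory in GB can be obtained with os.totalmem() / 2 ** 30 from the built-in os module and passed to the helper.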

Changing the model

To swap the model, open backend/routes/ai.js and update the model field in the ollama.chat call:

// With stream: true, ollama.chat resolves to an async iterable of partial messages.
const response = await ollama.chat({
  model: 'qwen2.5:3b',  // change to another pulled model, e.g. 'llama3.2' or 'mistral'
  messages: [{ role: 'user', content: prompt }],
  stream: true,
});

After saving the file, restart the backend. Pull the new model with ollama pull <model-name> beforehand; the backend cannot use a model that is not installed locally.
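Because the call uses stream: true, the value returned by ollama.chat is consumed with for await rather than read as a single object. The sketch below shows one way to do that; collectSummary and its onChunk callback are hypothetical names, and how the real route forwards chunks to the client is not shown in this guide.

```javascript
// Concatenates the text chunks of a streamed chat response.
// `stream` is the async iterable returned by ollama.chat({ ..., stream: true }).
// `onChunk` is an optional callback, e.g. to forward each piece to the client.
async function collectSummary(stream, onChunk = () => {}) {
  let text = '';
  for await (const part of stream) {
    const chunk = part.message?.content ?? '';
    text += chunk;
    onChunk(chunk);
  }
  return text;
}
```

Keeping the accumulation separate from the transport makes it possible to exercise the logic with a plain async generator in place of a live model.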

Troubleshooting

Error: Ollama server not found

The backend could not reach Ollama at localhost:11434. Make sure the server is running:

ollama serve

Then verify that the model is installed:

ollama list

If qwen2.5:3b does not appear, pull it again with ollama pull qwen2.5:3b.
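The two checks above correspond to two different failure modes, which a backend could tell apart when reporting errors. The following sketch is an assumption-heavy illustration: describeOllamaError is a hypothetical helper, it assumes a refused connection surfaces with the code ECONNREFUSED on the error or its cause (as Node's fetch does), and it assumes the missing-model message contains the words "model" and "not found".

```javascript
// Maps a low-level failure to a user-facing hint.
// describeOllamaError is hypothetical; the matched error shapes are assumptions.
function describeOllamaError(err) {
  const code = err?.cause?.code ?? err?.code;
  if (code === 'ECONNREFUSED') {
    return 'Ollama server not found. Start it with "ollama serve".';
  }
  if (/model.*not found/i.test(err?.message ?? '')) {
    return 'Model not installed. Run "ollama pull <model-name>" first.';
  }
  return `Unexpected Ollama error: ${err?.message ?? 'unknown'}`;
}
```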

Error: Job Not Found

The summary endpoint returns a 404 with { "error": "Job Not Found" } when the requested job ID does not exist in the database. This is expected behavior. Check that the job ID in the URL corresponds to a real listing.
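For orientation, a handler that produces this response might look roughly like the sketch below. It is not the project's actual route: makeSummaryHandler, findJob, and summarize are hypothetical names, the id route parameter is an assumption, and the non-streaming { summary } success body is a simplification.

```javascript
// Express-style handler factory.
// `findJob` is a hypothetical lookup resolving to a job object or null.
// `summarize` is a hypothetical function resolving to the summary text.
function makeSummaryHandler(findJob, summarize) {
  return async function summaryHandler(req, res) {
    const job = await findJob(req.params.id);
    if (!job) {
      // Unknown job ID: answer before any model call is made.
      return res.status(404).json({ error: 'Job Not Found' });
    }
    const summary = await summarize(job);
    return res.json({ summary });
  };
}
```

Returning the 404 before calling the model means an invalid ID never costs any generation time.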

CORS error in the browser

If the browser blocks the request to the backend, verify that the Express CORS middleware is configured and applied before the AI router. Check backend/middlewares/cors.js and confirm it is imported in backend/index.js.
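To make the check concrete, here is a dependency-free sketch of what a CORS middleware with the Express (req, res, next) signature does. It is not the contents of backend/middlewares/cors.js: corsMiddleware is a hypothetical name, and ALLOWED_ORIGIN is an assumed frontend dev-server address that should be replaced with the origin the browser actually reports.

```javascript
// ALLOWED_ORIGIN is an assumption; use the origin your frontend is served from.
const ALLOWED_ORIGIN = 'http://localhost:5173';

// Hypothetical CORS middleware using only Node's ServerResponse methods.
function corsMiddleware(req, res, next) {
  if (req.headers.origin === ALLOWED_ORIGIN) {
    res.setHeader('Access-Control-Allow-Origin', ALLOWED_ORIGIN);
    res.setHeader('Vary', 'Origin');
    res.setHeader('Access-Control-Allow-Methods', 'GET,POST,OPTIONS');
    res.setHeader('Access-Control-Allow-Headers', 'Content-Type');
  }
  // Preflight requests end here; everything else continues to the routers.
  if (req.method === 'OPTIONS') {
    res.statusCode = 204;
    return res.end();
  }
  return next();
}
```

Order matters: a middleware like this only affects routes registered after it, so it has to be passed to app.use before the AI router is mounted.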

Slow response or timeout

Larger models take more time to produce output, especially on machines without a GPU. Try switching to a lighter model:

ollama pull qwen2.5:3b
# or
ollama pull phi3

Then update the model field in backend/routes/ai.js as described in the Changing the model section above.
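If switching models is not enough, the backend can at least fail predictably instead of hanging. The sketch below is a generic pattern rather than project code: withTimeout is a hypothetical helper that races any promise against a timer. It does not cancel the underlying generation; it only stops waiting for it.

```javascript
// Rejects if `promise` has not settled within `ms` milliseconds.
// withTimeout is a hypothetical helper; it does not abort the wrapped work.
function withTimeout(promise, ms, label = 'operation') {
  let timer;
  const timeout = new Promise((_, reject) => {
    timer = setTimeout(
      () => reject(new Error(`${label} timed out after ${ms} ms`)),
      ms,
    );
  });
  // Clear the timer either way so it does not keep the process alive.
  return Promise.race([promise, timeout]).finally(() => clearTimeout(timer));
}
```

Wrapping the promise that produces the summary, for example with a limit of 60000 ms, lets the route return an error to the client once the limit is exceeded; the limit itself is a value to tune for your hardware.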
