Overview

Splat uses Groq as its AI inference provider for analyzing Python errors and generating debugging suggestions. Groq provides ultra-fast LLM inference, making error analysis near-instantaneous.

Why Groq?

Groq was chosen for Splat because of:
  • Speed: Industry-leading inference speeds for real-time debugging
  • Quality: Access to powerful models like Llama 3 70B
  • Reliability: High uptime and consistent performance
  • Developer-friendly: Simple API and generous free tier

Getting a Groq API Key

1. Sign up for Groq

Visit console.groq.com and create an account.
Groq offers a generous free tier suitable for most development needs.
2. Navigate to API Keys

Once logged in, go to the API Keys section in the console.
3. Create new API key

Click Create API Key, give it a descriptive name (e.g., “Splat Development”), and copy the generated key.
Save your API key immediately. You won’t be able to see it again after closing the dialog.
4. Add to environment

Add the key to your .env file:
.env
# Used by the error-processing pipeline (process/process.py)
API=gsk_your_api_key_here
# Used by the agent system (agents/agents.py)
API_KEY=gsk_your_api_key_here
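Before running Splat, it can help to confirm both variables are actually set. The `missing_keys` helper below is a hypothetical sanity-check sketch, not part of Splat:

```python
import os

def missing_keys(required=("API", "API_KEY"), env=None):
    """Return the names of required variables that are unset or empty."""
    env = os.environ if env is None else env
    return [key for key in required if not env.get(key)]

# Example with a hypothetical environment that only defines API
print(missing_keys(env={"API": "gsk_your_api_key_here"}))  # -> ['API_KEY']
```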

Available Models

Splat uses different models for different components:

Error Processing (process/process.py:36)

The main error analysis pipeline uses:
llama3-70b-8192 (string, default)

Llama 3 70B - The default model for error analysis.
  • Context window: 8,192 tokens
  • Best for: Complex error analysis, multi-file tracebacks
  • Performance: High accuracy, slower inference
  • Cost: Higher token usage
This model is hardcoded in process/process.py but provides the best error analysis quality.

Agent System (agents/agents.py:37)

The agent system model is configurable via the MODEL environment variable:
MODEL (string, default: "llama3-70b-8192")
Configure which model the agent system uses:
.env
MODEL=llama3-70b-8192
Recommended options:
  • llama3-70b-8192 - Best quality (default)
  • llama3-8b-8192 - Faster responses, good for simple errors
  • mixtral-8x7b-32768 - Larger context window for complex tracebacks
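The fallback behavior can be sketched as follows. Whether agents/agents.py applies this exact fallback itself is an assumption; the sketch only mirrors the documented default:

```python
import os

DEFAULT_MODEL = "llama3-70b-8192"

def resolve_model(env=None):
    """Return the agent model: MODEL from the environment, else the default."""
    env = os.environ if env is None else env
    return env.get("MODEL") or DEFAULT_MODEL

print(resolve_model(env={}))                           # -> llama3-70b-8192
print(resolve_model(env={"MODEL": "llama3-8b-8192"}))  # -> llama3-8b-8192
```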

Model Comparison

Model                Context Window   Speed      Accuracy    Use Case
llama3-70b-8192      8,192 tokens     Moderate   Excellent   Complex errors (recommended)
llama3-8b-8192       8,192 tokens     Fast       Good        Simple errors, quick fixes
mixtral-8x7b-32768   32,768 tokens    Moderate   Very Good   Large tracebacks, multi-file analysis
For most use cases, the default llama3-70b-8192 provides the best balance of quality and performance.

API Configuration in Code

Splat configures the Groq API in two locations:

1. Error Processing Pipeline

File: process/process.py:11-36
from groq import Groq
import os
from dotenv import load_dotenv

load_dotenv()

def process(traceback_message: str, original_error_information: str, context: str):
    client = Groq(api_key=os.getenv("API"))
    
    chat_completion = client.chat.completions.create(
        messages=[...],
        model="llama3-70b-8192",
        response_format={"type": "json_object"}
    )
    return chat_completion.choices[0].message.content
Key features:
  • Uses the API environment variable
  • Hardcoded to use llama3-70b-8192
  • Returns structured JSON responses
  • Includes detailed system prompts for error analysis

2. Agent System

File: agents/agents.py:36-45
from pydantic_ai import Agent
import os
from dotenv import load_dotenv

load_dotenv()

bug_zapper = Agent(
    model_name=os.getenv('MODEL'),
    api_key=os.getenv('API_KEY'),
    deps_type=FlagDependencies,
    result_type=ZapperResult,
    system_prompt='You are an expert software debugging assistant...'
)
Key features:
  • Uses the API_KEY and MODEL environment variables
  • Configurable model selection
  • Structured output with Pydantic models
  • Tool-based architecture for file operations

Structured Output Format

Splat uses Groq’s JSON mode to ensure consistent, parseable responses:
{
  "where": {
    "repository_path": "/path/to/repo",
    "file_name": "example.py",
    "line_number": "42"
  },
  "what": {
    "error_type": "SyntaxError",
    "description": "Unclosed parenthesis in print statement"
  },
  "how": {
    "error_origination": "42",
    "suggested_code_solution": "print(hello)"
  }
}
This structured format enables automated error fixing and clear user feedback.
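Because JSON mode guarantees parseable output, consumers can load and validate the response directly. This is a minimal sketch using a hypothetical raw response in the format shown above:

```python
import json

# A hypothetical raw response in Splat's structured format
raw = """{
  "where": {"repository_path": "/path/to/repo", "file_name": "example.py", "line_number": "42"},
  "what": {"error_type": "SyntaxError", "description": "Unclosed parenthesis in print statement"},
  "how": {"error_origination": "42", "suggested_code_solution": "print(hello)"}
}"""

report = json.loads(raw)
for section in ("where", "what", "how"):
    assert section in report, f"missing section: {section}"
print(report["what"]["error_type"])  # -> SyntaxError
```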

Rate Limits and Quotas

Groq provides generous rate limits:
  • Free tier: 30 requests per minute
  • Token limits: Varies by model and account type
For typical debugging workflows, the free tier is usually sufficient. Monitor your usage in the Groq Console.

Handling Rate Limits

If you hit rate limits:
  1. Reduce frequency: Space out error analysis requests
  2. Use smaller models: Switch to llama3-8b-8192 for simple errors
  3. Upgrade account: Contact Groq for higher limits
  4. Implement retry logic: Add exponential backoff in your code
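Step 4 can be sketched as a small wrapper. The `RateLimited` class below is a stand-in for the SDK's 429 error, and the delays are illustrative, not tuned values:

```python
import random
import time

class RateLimited(Exception):
    """Stand-in for the SDK's 429 rate-limit error class."""

def with_backoff(call, max_retries=5, base=0.5, sleep=time.sleep):
    """Retry `call` with exponential backoff plus jitter on rate-limit errors."""
    for attempt in range(max_retries):
        try:
            return call()
        except RateLimited:
            if attempt == max_retries - 1:
                raise
            sleep(base * 2 ** attempt + random.uniform(0, 0.1))

# Demo: a fake API call that succeeds on the third attempt
attempts = {"count": 0}
def flaky():
    attempts["count"] += 1
    if attempts["count"] < 3:
        raise RateLimited("429 Too Many Requests")
    return "ok"

print(with_backoff(flaky, sleep=lambda s: None))  # -> ok
```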

API Usage Best Practices

Monitor usage

Track your API usage in the Groq Console to avoid unexpected limits.

Use appropriate models

Choose faster models for simple errors to save tokens and time.

Secure your keys

Never commit API keys. Use environment variables and .gitignore.

Handle errors gracefully

Implement error handling for API failures and network issues.
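One way to sketch this is a wrapper that returns a fallback message instead of crashing when the analysis call fails. The function names here are hypothetical, not Splat's API:

```python
def analyze_safely(analyze, traceback_text, fallback="Analysis unavailable"):
    """Run the analysis call, degrading gracefully on API or network failure."""
    try:
        return analyze(traceback_text)
    except Exception as exc:  # the SDK's error classes would be caught here
        return f"{fallback}: {exc}"

# Demo: a stand-in analysis function that simulates a network failure
def broken(_):
    raise ConnectionError("network down")

print(analyze_safely(broken, "Traceback (most recent call last): ..."))
# -> Analysis unavailable: network down
```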

Security Considerations

Your Groq API key provides access to your account and incurs costs. Protect it carefully.

Key Security Measures

  1. Environment variables only
    # Good
    API=gsk_xxx
    
    # Bad - Never hardcode
    client = Groq(api_key="gsk_xxx")
    
  2. Restrict file permissions
    chmod 600 .env
    
  3. Use .gitignore
    .gitignore
    .env
    .env.*
    !.env.example
    
  4. Rotate keys regularly
    • Create new keys monthly
    • Delete old keys immediately
  5. Separate keys per environment
    • Development key for local work
    • Production key for deployed services

Troubleshooting

Authentication Failed

Error: 401 Unauthorized or Invalid API key
Solutions:
  • Verify the API key is correct in .env
  • Ensure the key starts with gsk_
  • Check that the key hasn’t been deleted in the console
  • Generate a new key if needed

Model Not Available

Error: Model not found or similar
Solutions:
  • Check the model name spelling
  • Verify the model is available in your Groq account tier
  • Use the default by removing the MODEL variable
  • Check Groq’s model documentation

Rate Limit Exceeded

Error: 429 Too Many Requests
Solutions:
  • Wait before making more requests
  • Implement exponential backoff
  • Switch to a smaller/faster model
  • Upgrade your Groq account

Timeout Errors

Error: Request timeouts or slow responses
Solutions:
  • Check your internet connection
  • Try a faster model (e.g., llama3-8b-8192)
  • Reduce the size of your error context
  • Check Groq’s status page

Advanced Configuration

Custom Model Parameters

While not currently exposed in Splat’s configuration, the Groq API supports additional parameters:
completion = client.chat.completions.create(
    model="llama3-70b-8192",
    messages=[...],
    temperature=0.7,      # Creativity (0-2)
    max_tokens=1024,      # Response length
    top_p=0.9,           # Nucleus sampling
    response_format={"type": "json_object"}
)
To use these, you would need to modify process/process.py.

Using Different Providers

While Splat is designed for Groq, you could adapt it for other OpenAI-compatible APIs by:
  1. Installing the appropriate SDK
  2. Updating the client initialization in process/process.py and agents/agents.py
  3. Adjusting model names and parameters
Using alternative providers is not officially supported and may require code modifications.
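As one unsupported adaptation path, an OpenAI-compatible provider can often be reached by pointing the official openai SDK at its endpoint. This is a configuration sketch only; the base_url and model name are placeholders:

```python
import os
from openai import OpenAI

# Placeholder endpoint and model - substitute your provider's actual values
client = OpenAI(
    api_key=os.getenv("API"),
    base_url="https://api.example-provider.com/v1",
)

completion = client.chat.completions.create(
    model="provider-model-name",
    messages=[{"role": "user", "content": "Analyze this traceback..."}],
    response_format={"type": "json_object"},
)
```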

Related Resources

  • Environment Variables - Complete environment variable reference
  • Initial Setup - Step-by-step configuration guide
  • Groq Documentation - Official Groq API documentation
  • Groq Console - Manage your API keys and usage
