Documentation Index
Fetch the complete documentation index at: https://mintlify.com/perplexityai/modelcontextprotocol/llms.txt
Use this file to discover all available pages before exploring further.
General Questions
What is MCP?
What is MCP?
MCP (Model Context Protocol) is an open protocol that enables AI assistants to securely connect to external tools and data sources. The Perplexity MCP Server implements this protocol to provide AI assistants with access to Perplexity’s web-grounded search, research, and reasoning capabilities.Benefits of MCP:
- Standardized way to integrate tools with AI assistants
- Works across multiple clients (Cursor, Claude Desktop, VS Code, etc.)
- Secure and controlled access to external APIs
- Easy to install and configure
What is the Perplexity MCP Server?
What is the Perplexity MCP Server?
The Perplexity MCP Server is the official integration that brings Perplexity’s AI-powered search and research capabilities to your favorite AI coding assistants and development tools.What it provides:
- Real-time web search with ranked results
- AI-powered question answering with citations
- Deep research with comprehensive multi-source analysis
- Advanced reasoning for complex analytical tasks
Tool Selection
Which tool should I use?
Which tool should I use?
Choose the right tool based on your needs:
perplexity_search
Best for: Finding URLs, checking recent news, verifying facts, discovering sourcesReturns: Ranked list of results with titles, URLs, snippets, and dates (no AI synthesis)Speed: Fast (< 2 seconds)Example use cases:- “Find official documentation for React hooks”
- “Search for recent articles about TypeScript 5.0”
- “Get URLs for Python asyncio tutorials”
perplexity_ask
Best for: Quick factual questions, summaries, explanations, general Q&AReturns: AI-generated answer with numbered citationsSpeed: Fast (< 3 seconds)Model: sonar-proExample use cases:- “What are the main features of Next.js 14?”
- “Explain how OAuth 2.0 works”
- “Summarize recent changes to the AWS SDK”
perplexity_research
Best for: Literature reviews, comprehensive overviews, investigative queries needing many sourcesReturns: Detailed multi-source analysis with numbered citationsSpeed: Slow (30+ seconds)Model: sonar-deep-researchExample use cases:- “Compare modern web frameworks for building SaaS applications”
- “Research best practices for microservices architecture in 2026”
- “Comprehensive analysis of GraphQL vs REST API design”
perplexity_reason
Best for: Math, logic, comparisons, complex arguments, chain-of-thought tasksReturns: Step-by-step reasoning with numbered citationsSpeed: Medium (3-10 seconds)Model: sonar-reasoning-proExample use cases:- “Analyze the trade-offs between PostgreSQL and MongoDB”
- “Calculate the complexity of this sorting algorithm”
- “Compare the security implications of different authentication methods”
Features
How do citations work?
How do citations work?
Most Perplexity tools return responses with numbered citations that reference the sources used to generate the answer.Citation format:Which tools include citations:
- ✅
perplexity_ask- Yes - ✅
perplexity_research- Yes - ✅
perplexity_reason- Yes - ❌
perplexity_search- No (returns raw search results instead)
- Verify the accuracy of information
- Explore sources for deeper understanding
- Assess the credibility of the response
- Reference sources in your own documentation
Citations are numbered in the order they appear in the response. The same source may be cited multiple times.
Can I use a custom base URL?
Can I use a custom base URL?
Yes! You can configure a custom base URL using the Default value:
PERPLEXITY_BASE_URL environment variable. This is useful for:- Using a proxy or gateway
- Testing against a staging environment
- Routing through a custom endpoint
https://api.perplexity.aiThe custom URL should provide API-compatible endpoints at
/chat/completions and /search.What is strip_thinking?
What is strip_thinking?
strip_thinking is an optional parameter for perplexity_reason and perplexity_research that removes internal reasoning tokens from the response.Purpose:- Some models include
<think>...</think>tags showing their internal reasoning process - These tags can consume significant context tokens in your AI assistant
- Setting
strip_thinking: trueremoves these tags while keeping the final answer
false (thinking tokens are included)When to use it:- ✅ You only need the final answer, not the reasoning process
- ✅ You’re running low on context tokens
- ✅ The thinking tokens are cluttering your chat history
- ❌ You want to understand how the model arrived at its conclusion
- ❌ You’re debugging unexpected responses
How do I increase the timeout?
How do I increase the timeout?
The default timeout is 5 minutes (300,000ms), which works for most queries. For longer research tasks, increase it using Recommended timeouts:
PERPLEXITY_TIMEOUT_MS:Configuration:perplexity_search: 60000ms (1 minute)perplexity_ask: 180000ms (3 minutes)perplexity_reason: 300000ms (5 minutes) - defaultperplexity_research: 600000ms+ (10+ minutes)
- If the API doesn’t respond within the timeout period, the request is aborted
- You’ll receive an error: “Request timeout: Perplexity API did not respond within Xms”
- The timeout is checked on each request, so you can change it without restarting
The
perplexity_research tool using the sonar-deep-research model typically takes 30+ seconds and may take several minutes for complex queries.Is there a rate limit?
Is there a rate limit?
Rate limits are determined by your Perplexity API subscription plan, not by the MCP server.Check your limits:
- Visit the Perplexity API Portal
- View your current plan and rate limits
- Monitor your usage and remaining quota
- You’ll receive a 429 error from the API
- Wait before making additional requests
- Consider upgrading your API plan for higher limits
- Optimize your queries to use fewer API calls
- Use
perplexity_searchinstead ofperplexity_askwhen you only need URLs - Use
perplexity_askinstead ofperplexity_researchfor simple questions - Cache responses when appropriate
- Implement exponential backoff for retries
The MCP server does not implement client-side rate limiting or caching. All requests are sent directly to the Perplexity API.
Advanced Features
Can I filter search results by recency or domain?
Can I filter search results by recency or domain?
Yes! Several tools support filtering parameters:Use cases:
Include specific domains:Exclude domains (use
Recency Filtering
Available for:perplexity_ask, perplexity_reason"hour"- Breaking news and very recent updates"day"- Today’s developments"week"- Recent announcements and releases"month"- Current trends and recent changes"year"- Exclude older information
Domain Filtering
Available for:perplexity_ask, perplexity_reason- prefix):Search Context Size
Available for:perplexity_ask, perplexity_reason"low"(default) - Fastest, less context"medium"- Balanced speed and comprehensiveness"high"- Most comprehensive, slower
What is the reasoning_effort parameter?
What is the reasoning_effort parameter?
reasoning_effort controls the depth of analysis for the perplexity_research tool using the sonar-deep-research model.Available values:"minimal"- Quick overview with basic analysis"low"- Standard research with moderate depth"medium"- Thorough analysis with multiple sources"high"- Comprehensive deep dive with extensive investigation
- Higher effort = More thorough analysis but slower response time
- Lower effort = Faster results but less comprehensive coverage
- Complex topics requiring multiple perspectives
- Critical decisions needing thorough research
- Comprehensive comparisons and evaluations
- Simple overview or introduction to a topic
- Time-sensitive queries
- When you already have some context
The
reasoning_effort parameter only applies to perplexity_research. Other tools use fixed reasoning strategies optimized for their use cases.Need More Help?
Can’t find your question here? Check out these resources:- Troubleshooting Guide - Common issues and solutions
- Community Forum - Ask questions and share solutions
- GitHub Issues - Report bugs and request features
- API Documentation - Detailed tool reference