Skip to main content

Overview

Perplexity’s APIs are fully compatible with OpenAI’s SDKs. You can use your existing OpenAI client libraries with both the Chat Completions API and Agentic Research API by simply changing the base URL and providing your Perplexity API key.
We recommend using the Perplexity SDK for the best experience with full type safety, enhanced features, and preset support. Use OpenAI SDKs if you’re already integrated and need drop-in compatibility.

Quick Start

Chat Completions API

Use the OpenAI SDK with Perplexity’s Chat Completions API:
from openai import OpenAI

client = OpenAI(
    api_key="YOUR_API_KEY",
    base_url="https://api.perplexity.ai/v2"
)

completion = client.chat.completions.create(
    model="sonar-pro",
    messages=[
        {"role": "user", "content": "What are the latest developments in AI?"}
    ]
)

print(completion.choices[0].message.content)

Agentic Research API

Use the OpenAI SDK with Perplexity’s Agentic Research API:
from openai import OpenAI

client = OpenAI(
    api_key="YOUR_API_KEY",
    base_url="https://api.perplexity.ai/v2"
)

response = client.responses.create(
    model="openai/gpt-5-mini",
    input="What are the latest developments in AI?"
)


print(response.output_text)
        break

Configuration

Setting Up the OpenAI SDK

Configure OpenAI SDKs to work with Perplexity by setting the base_url to https://api.perplexity.ai/v2:
from openai import OpenAI

client = OpenAI(
    api_key="YOUR_PERPLEXITY_API_KEY",
    base_url="https://api.perplexity.ai/v2"
)
Important: Use base_url="https://api.perplexity.ai/v2" (with /v2) for both Chat Completions and Agentic Research APIs.

Chat Completions API

Perplexity’s Chat Completions API is fully compatible with OpenAI’s Chat Completions interface.

Basic Usage

from openai import OpenAI

client = OpenAI(
    api_key="YOUR_API_KEY",
    base_url="https://api.perplexity.ai/v2"
)

completion = client.chat.completions.create(
    model="sonar-pro",
    messages=[
        {"role": "user", "content": "What are the latest developments in AI?"}
    ]
)

print(completion.choices[0].message.content)

Streaming

Streaming works exactly like OpenAI’s API:
from openai import OpenAI

client = OpenAI(
    api_key="YOUR_API_KEY",
    base_url="https://api.perplexity.ai/v2"
)

stream = client.chat.completions.create(
    model="sonar-pro",
    messages=[
        {"role": "user", "content": "What are the latest developments in AI?"}
    ],
    stream=True
)

for chunk in stream:
    if chunk.choices[0].delta.content:
        print(chunk.choices[0].delta.content, end="", flush=True)

Perplexity-Specific Parameters

Add Perplexity-specific search parameters using extra_body (Python) or direct parameters (TypeScript):
from openai import OpenAI

client = OpenAI(
    api_key="YOUR_API_KEY",
    base_url="https://api.perplexity.ai/v2"
)

completion = client.chat.completions.create(
    model="sonar-pro",
    messages=[
        {"role": "user", "content": "Latest climate research findings"}
    ],
    extra_body={
        "search_domain_filter": ["nature.com", "science.org"],
        "search_recency_filter": "month"
    }
)

print(completion.choices[0].message.content)
print(f"Sources: {len(completion.search_results)} articles found")

Agentic Research API

Perplexity’s Agentic Research API is fully compatible with OpenAI’s Agentic Research API interface.

Basic Usage

from openai import OpenAI

client = OpenAI(
    api_key="YOUR_API_KEY",
    base_url="https://api.perplexity.ai/v2"
)

response = client.responses.create(
    model="openai/gpt-5-mini",
    input="What are the latest developments in AI?"
)


print(response.output_text)
        break
print(f"Response ID: {response.id}")

Using Presets

Presets are pre-configured setups optimized for specific use cases. Use the extra_body parameter (Python) or cast the parameter (TypeScript) to pass presets:
from openai import OpenAI

client = OpenAI(
    api_key="YOUR_API_KEY",
    base_url="https://api.perplexity.ai/v2"
)

# Use a preset instead of specifying model and parameters
response = client.responses.create(
    input="What are the latest developments in AI?",
    extra_body={
        "preset": "pro-search"
    }
)


print(response.output_text)
        break
See Agentic Research API Presets for available presets and their configurations.

Using Third-Party Models

You can also specify third-party models directly instead of using presets:
from openai import OpenAI

client = OpenAI(
    api_key="YOUR_API_KEY",
    base_url="https://api.perplexity.ai/v2"
)

response = client.responses.create(
    model="openai/gpt-5-mini",
    input="What are the latest developments in AI?"
)


print(response.output_text)
        break

Streaming Responses

Streaming works with the Agentic Research API:
from openai import OpenAI

client = OpenAI(
    api_key="YOUR_API_KEY",
    base_url="https://api.perplexity.ai/v2"
)

response = client.responses.create(
    model="openai/gpt-5-mini",
    input="Write a bedtime story about a unicorn.",
    stream=True
)

for event in response:
    if event.type == "response.output_text.delta":
        print(event.delta, end="", flush=True)

Using Tools

The Agentic Research API supports tools, including web search:
from openai import OpenAI

client = OpenAI(
    api_key="YOUR_API_KEY",
    base_url="https://api.perplexity.ai/v2"
)

response = client.responses.create(
    model="openai/gpt-5-mini",
    input="What are the latest developments in AI?",
    tools=[
        {
            "type": "web_search",
            "filters": {
                "search_domain_filter": ["techcrunch.com", "wired.com"]
            }
        }
    ],
    instructions="You have access to a web_search tool. Use it for current information."
)


print(response.output_text)
        break

API Compatibility

Standard OpenAI Parameters

These parameters work exactly the same as OpenAI’s API: Chat Completions API:
  • model - Model name (use Perplexity model names like sonar-pro)
  • messages - Chat messages array
  • temperature - Sampling temperature (0-2)
  • max_tokens - Maximum tokens in response
  • top_p - Nucleus sampling parameter
  • frequency_penalty - Frequency penalty (-2.0 to 2.0)
  • presence_penalty - Presence penalty (-2.0 to 2.0)
  • stream - Enable streaming responses
Agentic Research API:
  • model - Model name (use 3rd party models like openai/gpt-5.2)
  • input - Input text or message array
  • instructions - System instructions
  • max_output_tokens - Maximum tokens in response
  • stream - Enable streaming responses
  • tools - Array of tools including web_search

Perplexity-Specific Parameters

Chat Completions API:
  • search_domain_filter - Limit or exclude specific domains
  • search_recency_filter - Filter by content recency
  • return_images - Include image URLs in response
  • return_related_questions - Include related questions
  • search_mode - “web” (default) or “academic” mode selector
Agentic Research API:
  • preset - Preset name (use Perplexity presets like pro-search)
  • tools[].filters - Search filters within web_search tool
  • tools[].user_location - User location for localized results
See Chat Completions API Reference and Agentic Research API Reference for complete parameter details.

Response Structure

Chat Completions API

Perplexity responses match OpenAI’s format exactly:
  • choices[0].message.content - The AI-generated response
  • model - The model name used
  • usage - Token consumption details
  • id, created, object - Standard response metadata
  • search_results - Array of web sources (Perplexity-specific)
  • citations - Array of citation URLs (Perplexity-specific)

Agentic Research API

Perplexity Agentic Research API matches OpenAI’s Agentic Research API format:
  • output - Structured output array containing messages with content[].text
  • model - The model name used
  • usage - Token consumption details
  • id, created_at, status - Response metadata

Best Practices

1

Use the correct base URL

Always use https://api.perplexity.ai/v2 (with /v2) for both APIs.
client = OpenAI(
    api_key="YOUR_API_KEY",
    base_url="https://api.perplexity.ai/v2"  # Correct
)
2

Choose the right API

  • Chat Completions API: Best for web-grounded conversations with built-in search
  • Agentic Research API: Best for structured outputs, third-party models, and tool use
# Chat Completions - web search built-in
completion = client.chat.completions.create(
    model="sonar-pro",
    messages=[{"role": "user", "content": "Latest AI news"}]
)

# Responses - explicit tool control
response = client.responses.create(
    model="openai/gpt-5-mini",
    input="Latest AI news",
    tools=[{"type": "web_search"}]
)
3

Handle errors gracefully

Use the OpenAI SDK’s error handling:
from openai import OpenAI, APIError, RateLimitError

try:
    completion = client.chat.completions.create(...)
except RateLimitError:
    print("Rate limit exceeded, please retry later")
except APIError as e:
    print(f"API error: {e.message}")
4

Use streaming for better UX

Stream responses for real-time user experience:
stream = client.chat.completions.create(
    model="sonar-pro",
    messages=[{"role": "user", "content": "Long query..."}],
    stream=True
)

for chunk in stream:
    if chunk.choices[0].delta.content:
        print(chunk.choices[0].delta.content, end="", flush=True)

Recommended: Perplexity SDK

We recommend using Perplexity’s native SDKs for the best developer experience:
  • Cleaner preset syntax - Use preset="pro-search" directly instead of extra_body={"preset": "pro-search"}
  • Type safety - Full TypeScript/Python type definitions for all parameters
  • Enhanced features - Direct access to all Perplexity-specific features
  • Better error messages - Perplexity-specific error handling
  • Simpler setup - No need to configure base URLs
See the Perplexity SDK Guide for details.

Next Steps

Migrating to the Perplexity SDK

Switch to the Perplexity SDK for enhanced features and cleaner syntax. With the Perplexity SDK, you can use presets directly without extra_body and get full type safety:
1

Install the Perplexity SDK

pip install perplexityai
2

Update the import and client

# Before (OpenAI SDK)
from openai import OpenAI
client = OpenAI(
    api_key="pplx-...",
    base_url="https://api.perplexity.ai/v2"
)

# After (Perplexity SDK)
from perplexity import Perplexity
client = Perplexity(api_key="pplx-...")
# Or just: client = Perplexity() if PERPLEXITY_API_KEY env var is set
No base URL needed - The Perplexity SDK automatically uses the correct endpoint.
3

Update the API calls

The API calls are very similar:
# Chat Completions API - same interface
completion = client.chat.completions.create(
    model="sonar-pro",
    messages=[{"role": "user", "content": "Hello!"}]
)

# Agentic Research API - same interface
response = client.responses.create(
    model="openai/gpt-5-mini",
    input="Hello!"
)
4

Use presets with cleaner syntax

The Perplexity SDK supports presets with cleaner syntax compared to OpenAI SDK:
# Before (OpenAI SDK) - extra_body required
response = client.responses.create(
    input="What are the latest developments in AI?",
    extra_body={"preset": "pro-search"}
)

# After (Perplexity SDK) - direct parameter
response = client.responses.create(
    preset="pro-search",
    input="What are the latest developments in AI?"
)