Models
Does the API Currently Support Web Browsing?
Does the API Currently Support Web Browsing?
What Are the Context Windows for Each Model?
What Are the Context Windows for Each Model?
Do You Support Fine-Tuning?
Do You Support Fine-Tuning?
How Are Models Versioned and Deprecated?
How Are Models Versioned and Deprecated?
Do You Expose CoTs If I Use Your Reasoning APIs or Deep Research API?
Do You Expose CoTs If I Use Your Reasoning APIs or Deep Research API?
Output & Capabilities
Can I Get Structured (JSON) Output?
Can I Get Structured (JSON) Output?
response_format parameter to constrain a response to a JSON schema. See Structured Outputs for Sonar and Structured Outputs for the Agent API. For reasoning models, see the note below about <think> tokens preceding the JSON.Why Isn't the `response_format` Parameter Working for Reasoning Models?
Why Isn't the `response_format` Parameter Working for Reasoning Models?
sonar-reasoning-pro model is designed to output a <think> section containing reasoning tokens, immediately followed by a valid JSON object. As a result, the response_format parameter does not remove these reasoning tokens from the output.We recommend using a custom parser to extract the valid JSON portion. An example implementation can be found here.Does the API Support Streaming Responses?
Does the API Support Streaming Responses?
stream: true to receive the response incrementally as it is generated, instead of waiting for the full result. See Streaming Responses for Sonar and Streaming Responses for the Agent API. The official SDKs expose streaming directly.Does the API Support Tool or Function Calling?
Does the API Support Tool or Function Calling?
To What Extent Is the API OpenAI Compatible?
To What Extent Is the API OpenAI Compatible?
id, model, and usage—and supports analogous parameters like model, messages, and stream.Key Differences from the standard OpenAI response include:-
Response Object Structure:
- OpenAI responses typically have an
objectvalue of"chat.completion"and acreatedtimestamp, whereas our response usesobject: "response"and acreated_atfield. - Instead of a
choicesarray, our response content is provided under anoutputarray that contains detailed message objects.
- OpenAI responses typically have an
-
Message Details:
- Each message in our output includes a
type(usually"message"), a uniqueid, and astatus. - The actual text is nested within a
contentarray that contains objects withtype,text, and anannotationsarray for additional context.
- Each message in our output includes a
-
Additional Fields:
- Our API response provides extra meta-information (such as
status,error,instructions, andmax_output_tokens) that are not present in standard OpenAI responses. - The
usagefield also differs, offering detailed breakdowns of input and output tokens (including fields likeinput_tokens_detailsandoutput_tokens_details).
- Our API response provides extra meta-information (such as
Are the Reasoning Tokens in Deep Research Same as CoTs in the Answer?
Are the Reasoning Tokens in Deep Research Same as CoTs in the Answer?
Search & Results
Why Are the Results from the API Different from the UI?
Why Are the Results from the API Different from the UI?
- The API uses the same search system as the UI with differences in configuration—so their outputs may differ.
- The underlying AI model might differ between the API and the UI for a given query.
Is the Internet Data Access Provided by the API Identical to That of Perplexity's Web Interface?
Is the Internet Data Access Provided by the API Identical to That of Perplexity's Web Interface?
Does the API Use Content Filtering or SafeSearch?
Does the API Use Content Filtering or SafeSearch?
Account, Billing & Limits
How Does API Pricing Work?
How Does API Pricing Work?
How Do I Add Credits or Set Up Billing?
How Do I Add Credits or Set Up Billing?
How Can I Upgrade to the Next Usage Tier?
How Can I Upgrade to the Next Usage Tier?
| Tier | Credit Purchase (all time) |
|---|---|
| Tier 0 | - |
| Tier 1 | $50 |
| Tier 2 | $250 |
| Tier 3 | $500 |
| Tier 4 | $1000 |
| Tier 5 | $5000 |
What Are the Limitations to the Number of API Calls?
What Are the Limitations to the Number of API Calls?
How Can I Track My Spend/Usage per API Key?
How Can I Track My Spend/Usage per API Key?
Errors & Troubleshooting
How Should I Respond to 401: Authorization Errors?
How Should I Respond to 401: Authorization Errors?
How Should I Handle 429: Too Many Requests Errors?
How Should I Handle 429: Too Many Requests Errors?
What Should I Do About 5xx or Connection and Timeout Errors?
What Should I Do About 5xx or Connection and Timeout Errors?
X-Request-ID response header to include when you contact support. The Error Handling guide covers the SDK exception types and recovery patterns.Where Can I Check API Status and Incidents?
Where Can I Check API Status and Incidents?
Data, Privacy & Security
Will User Data Submitted Through the API Be Used for Model Training or Other Purposes?
Will User Data Submitted Through the API Be Used for Model Training or Other Purposes?
Where Are Perplexity's Language Models Hosted?
Where Are Perplexity's Language Models Hosted?
What Security and Compliance Certifications Does the API Have?
What Security and Compliance Certifications Does the API Have?
Support & Reliability
How Do I File a Bug Report and What Happens Afterward?
How Do I File a Bug Report and What Happens Afterward?
How Do I Request a New Feature?
How Do I Request a New Feature?
- Requesting support for a new model or capability (e.g., image processing, fine-tuning options)
- Asking for new API parameters (e.g., additional filters, search options)
- Suggesting performance improvements (e.g., faster response times, better citation handling)
- Enhancing existing API features (e.g., improving streaming reliability, adding new output formats)
What's the Best Way to Stay Up to Date with API Updates?
What's the Best Way to Stay Up to Date with API Updates?
Does Perplexity Provide Service Quality Assurances Such as Service Uptime, Frequency of Failures, and Target Recovery Time in the Event of a Failure?
Does Perplexity Provide Service Quality Assurances Such as Service Uptime, Frequency of Failures, and Target Recovery Time in the Event of a Failure?
I Have Another Question or an Issue
I Have Another Question or an Issue