Supported Models

Perplexity Models

ModelParameter CountContext LengthModel Type
llama-3-sonar-small-32k-chat8B32768Chat Completion
llama-3-sonar-small-32k-online8B28000Chat Completion
llama-3-sonar-large-32k-chat70B32768Chat Completion
llama-3-sonar-large-32k-online70B28000Chat Completion

Open-Source Models

Where possible, we try to match the Hugging Face implementation.

ModelParameter CountContext LengthModel Type
llama-3-8b-instruct8B8192Chat Completion
llama-3-70b-instruct70B8192Chat Completion
mixtral-8x7b-instruct8x7B16384Chat Completion

Special Tokens

We do not raise any exceptions if your chat inputs contain messages with special tokens. If avoiding prompt injections is a concern for your use case, it is recommended that you check for special tokens prior to calling the API. For more details, read Meta's recommendations for Llama.

Online LLMs

Note that the search subsystem of the Online LLMs do not attend to the system prompt. You can use the system prompt to provide instructions related to style, tone, and language of the response.

Access to citations and images via API is in closed beta. To request access to citations, fill out this form and send an email describing your use case to [email protected].