Image Attachments

Overview

The Agent API supports image analysis through direct image uploads. Images can be provided either as base64 encoded strings within a data URI or as standard HTTPS URLs.

When using base64 encoding, the API currently only supports images up to 50 MB per image.
Supported formats for base64 encoded images: PNG (image/png), JPEG (image/jpeg), WEBP (image/webp), and GIF (image/gif).
When using an HTTPS URL, the model will attempt to fetch the image from the provided URL. Ensure the URL is publicly accessible.

Examples

Base64 Encoded Data
HTTPS URL

Use this method when you have the image file locally and want to embed it directly into the request payload. Remember the 50MB size limit and supported formats (PNG, JPEG, WEBP, GIF).

import base64
from perplexity import Perplexity

client = Perplexity()

# Read and encode image as base64
def encode_image(image_path):
    with open(image_path, "rb") as image_file:
        return base64.b64encode(image_file.read()).decode("utf-8")

image_path = "image.png"
base64_image = encode_image(image_path)

# Analyze the image
response = client.responses.create(
    model="openai/gpt-5.6-sol",
    input=[
        {
            "role": "user",
            "content": [
                {"type": "input_text", "text": "what's in this image?"},
                {
                    "type": "input_image",
                    "image_url": f"data:image/png;base64,{base64_image}",
                },
            ],
        }
    ],
)

print(response.output_text)

import Perplexity from '@perplexity-ai/perplexity_ai';
import * as fs from 'fs';

const client = new Perplexity();

// Read and encode image as base64
const imageBuffer = fs.readFileSync('image.png');
const base64Image = imageBuffer.toString('base64');
const imageDataUri = `data:image/png;base64,${base64Image}`;

// Analyze the image
const response = await client.responses.create({
    model: 'openai/gpt-5-mini',
    input: [
        {
            role: 'user',
            content: [
                { type: 'input_text', text: "What's in this image?" },
                { type: 'input_image', image_url: imageDataUri }
            ]
        }
    ],
} as any);

console.log(response.output_text);

curl https://api.perplexity.ai/v1/agent \
  -H "Authorization: Bearer $PERPLEXITY_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "openai/gpt-5-mini",
    "input": [
      {
        "role": "user",
        "content": [
          {
            "type": "input_text",
            "text": "What'\''s in this image?"
          },
          {
            "type": "input_image",
            "image_url": "data:image/png;base64,$BASE64_ENCODED_IMAGE"
          }
        ]
      }
    ]
  }' | jq

Use this method when you have a publicly accessible image URL. The model will fetch the image from the provided URL.

from perplexity import Perplexity

client = Perplexity()

image_url = "https://upload.wikimedia.org/wikipedia/commons/thumb/d/dd/Gfp-wisconsin-madison-the-nature-boardwalk.jpg/2560px-Gfp-wisconsin-madison-the-nature-boardwalk.jpg"

# Analyze the image
response = client.responses.create(
    model="openai/gpt-5.6-sol",
    input=[
        {
            "role": "user",
            "content": [
                {"type": "input_text", "text": "Can you describe the image at this URL?"},
                {
                    "type": "input_image",
                    "image_url": image_url,
                },
            ],
        }
    ],
)

print(response.output_text)

import Perplexity from '@perplexity-ai/perplexity_ai';

const client = new Perplexity();

const imageHttpsUrl = "https://upload.wikimedia.org/wikipedia/commons/thumb/d/dd/Gfp-wisconsin-madison-the-nature-boardwalk.jpg/2560px-Gfp-wisconsin-madison-the-nature-boardwalk.jpg";

// Analyze the image
const response = await client.responses.create({
    model: 'openai/gpt-5-mini',
    input: [
        {
            role: 'user',
            content: [
                { type: 'input_text', text: 'Can you describe the image at this URL?' },
                { type: 'input_image', image_url: imageHttpsUrl }
            ]
        }
    ],
} as any);

console.log(response.output_text);

curl https://api.perplexity.ai/v1/agent \
  -H "Authorization: Bearer $PERPLEXITY_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "openai/gpt-5-mini",
    "input": [
      {
        "role": "user",
        "content": [
          {
            "type": "input_text",
            "text": "Can you describe the image at this URL?"
          },
          {
            "type": "input_image",
            "image_url": "https://upload.wikimedia.org/wikipedia/commons/thumb/d/dd/Gfp-wisconsin-madison-the-nature-boardwalk.jpg/2560px-Gfp-wisconsin-madison-the-nature-boardwalk.jpg"
          }
        ]
      }
    ]
  }' | jq

Request Format

Agent API

Images must be embedded in the input array when using message array format. Each image should be provided using the following structure:

{
  "role": "user",
  "content": [
    {
      "type": "input_text",
      "text": "What's in this image?"
    },
    {
      "type": "input_image",
      "image_url": "<IMAGE_URL_OR_BASE64_DATA>"
    }
  ]
}

The image_url field accepts either:

A URL of the image: A publicly accessible HTTPS URL pointing directly to the image file
The base64 encoded image data: A data URI in the format data:image/{format};base64,{base64_content}

Pricing

Images are tokenized based on their pixel dimensions using the following formula:

tokens = (width px × height px) / 750

Examples:

A 1024×768 image would consume: (1024 × 768) / 750 = 1,048 tokens
A 512×512 image would consume: (512 × 512) / 750 = 349 tokens

These image tokens are then priced according to the input token pricing of the model you’re using. The image tokens are added to your total token count for the request alongside any text tokens.

Next Steps

Agent API Quickstart

Get started with the Agent API

Web Search

Learn about the web_search tool.

Getting Started

Agent API

Search API

Sonar API

Embeddings API

Perplexity SDK

Admin & Management

Resources

Overview

Examples

Request Format

Agent API

Pricing

Next Steps

Agent API Quickstart

Web Search

​Overview

​Examples

​Request Format

​Agent API

​Pricing

​Next Steps

Agent API Quickstart

Web Search

Overview

Examples

Request Format

Agent API

Pricing

Next Steps