perplexity/sonar
Model Card
Sonar is Perplexity's lightweight, cost-effective search model built on Llama 3.3 70B, optimized for speed (1200 tokens/second) and quick factual queries. It provides real-time web search with grounding and citations, ideal for simple Q&A and straightforward integrations. Best for everyday use cases where fast, accurate answers are needed without complex reasoning.
Context Window
N/A
tokens
Max Output
N/A
tokens
Input Cost
$1
per million tokens
Output Cost
$1
per million tokens
API Usage Example
Add Perplexity: Sonar to your app with just a few lines of code.
No API keys, no backend, no configuration required.
<html>
<body>
<script src="https://js.puter.com/v2/"></script>
<script>
puter.ai.chat("Explain quantum computing in simple terms", {
model: "perplexity/sonar"
}).then(response => {
document.body.innerHTML = response.message.content;
});
</script>
</body>
</html>
Get started with Puter.js
Add Perplexity: Sonar to your app without worrying about API keys or setup.
Read the Docs View Tutorials