x-ai/grok-2-vision-1212
Model Card
Grok 2 Vision 1212 is xAI's updated multimodal vision model released December 2024, featuring improved accuracy, instruction-following, and multilingual capabilities over the original Grok 2 Vision. It combines advanced visual comprehension with text understanding, excelling at object recognition, style analysis, and document-based question answering with a 32K context window.
Context Window
33K
tokens
Max Output
33K
tokens
Input Cost
$2
per million tokens
Output Cost
$10
per million tokens
Release Date
Dec 12, 2024
API Usage Example
Add Grok 2 Vision 1212 to your app with just a few lines of code.
No API keys, no backend, no configuration required.
<html>
<body>
<script src="https://js.puter.com/v2/"></script>
<script>
puter.ai.chat("Explain quantum computing in simple terms", {
model: "x-ai/grok-2-vision-1212"
}).then(response => {
document.body.innerHTML = response.message.content;
});
</script>
</body>
</html>
Get started with Puter.js
Add Grok 2 Vision 1212 to your app without worrying about API keys or setup.
Read the Docs View Tutorials