Allen AI API
Access Allen AI instantly with Puter.js, and add AI to any app in a few lines of code without backend or API keys.
// npm install @heyputer/puter.js
import { puter } from '@heyputer/puter.js';
puter.ai.chat("Explain AI like I'm five!", {
model: "allenai/olmo-3-32b-think"
}).then(response => {
console.log(response);
});
<html>
<body>
<script src="https://js.puter.com/v2/"></script>
<script>
puter.ai.chat("Explain AI like I'm five!", {
model: "allenai/olmo-3-32b-think"
}).then(response => {
console.log(response);
});
</script>
</body>
</html>
List of Allen AI Models
AllenAI: Molmo2 8B
allenai/molmo-2-8b
Molmo 2 8B is an open vision-language model from AI2 built on Qwen3-8B with SigLIP 2 vision backbone, supporting image, video, and multi-image understanding with state-of-the-art grounding and tracking capabilities. It outperforms larger models on video pointing benchmarks and excels at counting and captioning tasks.
ChatAllenAI: Olmo 3.1 32B Instruct
allenai/olmo-3.1-32b-instruct
OLMo 3.1 32B Instruct is AI2's most capable fully open chat model, optimized for multi-turn conversation, instruction-following, and tool use. It represents the strongest fully open 32B-scale instruct model in AI2's evaluations, built on Dolma 3 with extended RL training.
ChatAllenAI: Olmo 3.1 32B Think
allenai/olmo-3.1-32b-think
OLMo 3.1 32B Think is the updated flagship reasoning model from AI2 with an additional three weeks of reinforcement learning training beyond OLMo 3. It uses long chain-of-thought thinking for complex reasoning tasks and remains fully open with all training artifacts publicly available.
ChatAllenAI: Olmo 3 32B Think
allenai/olmo-3-32b-think
OLMo 3 32B Think is AI2's flagship open reasoning model that uses long chain-of-thought thinking to improve performance on math and coding tasks. Trained on the Dolma 3 dataset with reinforcement learning from verifiable rewards, it's one of the strongest fully open reasoning models in its class.
ChatAllenAI: Olmo 3 7B Instruct
allenai/olmo-3-7b-instruct
OLMo 3 7B Instruct is a lightweight fully open chat model from AI2, optimized for multi-turn dialogue, instruction-following, and tool use. Pre-trained on Dolma 3 and post-trained with SFT, DPO, and RLVR, it delivers strong performance while running on modest hardware.
ChatAllenAI: Olmo 3 7B Think
allenai/olmo-3-7b-think
OLMo 3 7B Think is AI2's efficient reasoning model that surfaces intermediate thinking steps for complex prompts in math and coding. It matches Qwen 3 8B on MATH benchmarks and leads on HumanEvalPlus for coding while remaining fully open and accessible on consumer hardware.
ChatAllenAI: Olmo 2 32B Instruct
allenai/olmo-2-0325-32b-instruct
OLMo 2 32B Instruct (March 2025) is a fully open instruction-tuned language model from AI2, trained on the Dolma dataset with supervised fine-tuning, DPO, and RLVR. It's the first fully open model to outperform GPT-3.5 Turbo and GPT-4o mini on academic benchmarks while providing complete transparency into training data, code, and weights.
Frequently Asked Questions
The Allen AI API gives you access to models for AI chat. Through Puter.js, you can start using Allen AI models instantly with zero setup or configuration.
Puter.js supports a variety of Allen AI models, including AllenAI: Molmo2 8B, AllenAI: Olmo 3.1 32B Instruct, AllenAI: Olmo 3.1 32B Think, and more. Find all AI models supported by Puter.js in the AI model list.
With the User-Pays model, users cover their own AI costs through their Puter account. This means you can build apps without worrying about infrastructure expenses.
Puter.js is a JavaScript library that provides access to AI, storage, and other cloud services directly from a single API. It handles authentication, infrastructure, and scaling so you can focus on building your app.
Yes — the Allen AI API through Puter.js works with any JavaScript framework, Node.js, or plain HTML. Just include the library and start building. See the documentation for more details.