Blog

Perceptron Mk1 Is Now Available in Puter.js

On this page

Puter.js now supports Perceptron Mk1, Perceptron AI's new vision-language model built for video understanding and embodied reasoning—priced 80–90% below comparable frontier vision-language models.

What is Perceptron Mk1?

Perceptron Mk1 is the first model from Perceptron AI, a startup founded by former Meta multimodal researchers and launched on May 12, 2026. Unlike text-first models that bolt on image inputs, Mk1 is designed from the ground up for "physical AI"—reasoning about the real world through video. Key features include:

  • Native video understanding: Processes video at up to 2 frames per second within a 32K token context window, maintaining object identity and temporal continuity across frames
  • Structured outputs: Returns timecodes for events, bounding boxes for objects, and point coordinates that are parseable at scale—not just free-form prose
  • State-of-the-art spatial reasoning: Scores 88.5 on VSI-Bench (the highest recorded among compared models), 85.1 on EmbSpatialBench, and 72.4 on RefSpatialBench—outperforming frontier models from OpenAI, Google, and Anthropic on these benchmarks
  • Frontier capabilities at a fraction of the cost: $0.15 per million input tokens, 80–90% cheaper than comparable vision-language models from major labs
  • Built for real-world workloads: Video QA, event detection, open-vocabulary object detection and counting, OCR on documents and signage, hand pose estimation, and point-by-example grounding

Examples

Video question answering

puter.ai.chat(
  "Summarize what happens in this video and return timecodes for key events.",
  "https://assets.puter.site/puppy.mp4",
  { model: "perceptron/perceptron-mk1" }
);

Image understanding

puter.ai.chat(
  "What do you see in this image?",
  "https://assets.puter.site/doge.jpeg",
  { model: "perceptron/perceptron-mk1" }
);

Object detection and counting

puter.ai.chat(
  "List every object in this scene with its approximate location.",
  "https://assets.puter.site/doge.jpeg",
  { model: "perceptron/perceptron-mk1" }
);

Document OCR

puter.ai.chat(
  "Extract all text from this document and return it as plain text.",
  "https://assets.puter.site/letter.png",
  { model: "perceptron/perceptron-mk1" }
);

Get Started Now

Just add one library to your project:

// npm install @heyputer/puter.js
import { puter } from '@heyputer/puter.js';

Or add one script tag to your HTML:

<script src="https://js.puter.com/v2/"></script>

No API keys needed. Start building with Perceptron Mk1 immediately.

Learn more:

Free, Serverless AI and Cloud

Start creating powerful web applications with Puter.js in seconds!

Get Started Now

Read the Docs Try the Playground