InclusionAI Ling and Ring 2.6 Models Are Now Available in Puter.js

Reynaldi Chernando

May 14, 2026

Puter.js now supports the full Ling and Ring 2.6 lineup from InclusionAI (Ant Group) — a high-efficiency Flash tier, a trillion-parameter fast-thinking flagship, and a dedicated reasoning model purpose-built for autonomous agents.

What is Ling 2.6 Flash?

Ling 2.6 Flash is the efficiency-focused tier in the Ling family — a Mixture-of-Experts model with 104B total parameters and only 7.4B active per token, delivering strong performance at a fraction of the compute cost.

262K context window with up to 32K output tokens
Intelligence Index of 26 on Artificial Analysis: double the open-weight non-reasoning median of 13, and a 10-point jump over Ling-flash-2.0
59.3% on GPQA Diamond
Trained with Agentic Reinforcement Learning for tool use, terminal operations, and multi-step agent tasks
Well suited for cost-sensitive agent pipelines and high-throughput automation
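
The context and output limits listed above suggest a quick pre-flight check before sending a very large prompt. The following is only a rough sketch: the helper name `fitsContext` is hypothetical, the 4-characters-per-token ratio is a common rule of thumb rather than the model's actual tokenizer, and it assumes the prompt and the reserved output share a single context window, which the source does not state.

```javascript
// Hypothetical helper: estimates whether a prompt leaves room for the reply.
// Assumptions (not taken from the model's documentation):
//   - roughly 4 characters per token
//   - prompt tokens plus reserved output tokens must fit in one context window
//   - "262K" and "32K" are treated as 262,000 and 32,000 tokens
function fitsContext(
    prompt,
    { contextTokens = 262000, outputTokens = 32000, charsPerToken = 4 } = {}
) {
    const estimatedPromptTokens = Math.ceil(prompt.length / charsPerToken);
    return {
        estimatedPromptTokens,
        fits: estimatedPromptTokens + outputTokens <= contextTokens,
    };
}
```

For a model with a different output limit, pass it explicitly, for example `fitsContext(prompt, { outputTokens: 66000 })`. Treat the result as an estimate only; real token counts depend on the tokenizer and the language of the text.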

What is Ling 2.6 1T?

Ling 2.6 1T is InclusionAI's trillion-parameter non-reasoning flagship, built by Ant Group's AGI initiative. It uses a Mixture-of-Experts architecture with roughly 50B active parameters per token, employing a "fast thinking" approach that cuts token costs to about a quarter of comparable models while keeping flagship-tier output quality.

262K context window with up to 32K output tokens
Intelligence Index of 34 on Artificial Analysis, more than 2.5 times the open-weight non-reasoning median of 13
State-of-the-art on AIME 2025 and SWE-bench Verified, and #1 among open-source models on ArtifactsBench for front-end code generation
Flagship-level reasoning without a dedicated thinking model
Strong fit for autonomous agents, large-scale coding workflows, and cost-sensitive pipelines

What is Ring 2.6 1T?

Ring 2.6 1T is the reasoning counterpart — a trillion-parameter MoE with 63B active parameters per token, purpose-built for agentic workflows like coding agents, tool use, and long-horizon multi-step execution. It introduces adaptive reasoning effort with high and xhigh modes that dynamically allocate thinking budget based on task complexity.

262K context window with up to 66K output tokens
95.83 on AIME 2026 and 88.27 on GPQA Diamond
87.60 on PinchBench in agent mode — ahead of GPT-5.4 and Gemini 3.1 Pro
Leading scores on ClawEval, TAU2-Bench, and GAIA2-search
A strong pick for autonomous agent systems and reasoning-heavy pipelines

Examples

High-throughput agent work with Ling 2.6 Flash

puter.ai.chat("Write a Node.js script that watches a directory, parses any new JSON files, validates them against a schema, and moves invalid files to a quarantine folder", {
    model: "inclusionai/ling-2.6-flash"
}).then(response => puter.print(response)); // show the reply once it arrives

Fast-thinking flagship with Ling 2.6 1T

puter.ai.chat("Refactor this Express handler into a typed Fastify route with input validation, structured logging, and proper error responses", {
    model: "inclusionai/ling-2.6-1t"
}).then(response => puter.print(response)); // show the reply once it arrives

Long-horizon reasoning with Ring 2.6 1T

puter.ai.chat("Design an autonomous agent that monitors a Postgres database for anomalies, opens a triage ticket, and proposes a remediation plan. Walk through the architecture and failure modes.", {
    model: "inclusionai/ring-2.6-1t"
}).then(response => puter.print(response)); // show the reply once it arrives

Agentic coding across a long context

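// repoDump is a placeholder: a string you assemble yourself containing the
// repository tree and the relevant file contents.
puter.ai.chat(`Given this repository tree and these files, identify where authentication
state is leaking between requests and propose a minimal fix:\n\n${repoDump}`, {
    model: "inclusionai/ring-2.6-1t"
}).then(response => puter.print(response)); // show the reply once it arrives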
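
The examples above pair each model with a kind of workload, and that pairing can be captured in a small lookup so call sites do not repeat model IDs. This is a sketch, not part of Puter.js: the model IDs are the ones used in the examples, while the workload names (`throughput`, `flagship`, `reasoning`) and the helper `chatOptionsFor` are made up for illustration.

```javascript
// Model IDs as they appear in the examples above.
// The keys are illustrative labels, not official tier names.
const MODELS = {
    throughput: "inclusionai/ling-2.6-flash", // cost-sensitive, high-volume work
    flagship: "inclusionai/ling-2.6-1t",      // fast-thinking, non-reasoning flagship
    reasoning: "inclusionai/ring-2.6-1t",     // long-horizon, reasoning-heavy tasks
};

// Hypothetical helper: builds an options object for puter.ai.chat.
// `extra` carries any other options you want to pass through (e.g. stream: true).
function chatOptionsFor(workload, extra = {}) {
    if (!Object.prototype.hasOwnProperty.call(MODELS, workload)) {
        const known = Object.keys(MODELS).join(", ");
        throw new Error(`Unknown workload "${workload}"; expected one of: ${known}`);
    }
    return { ...extra, model: MODELS[workload] };
}
```

A call would then look like `puter.ai.chat(prompt, chatOptionsFor("reasoning", { stream: true }))`. Failing loudly on an unknown label avoids silently falling back to a default model.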

Streaming with reasoning traces

const response = await puter.ai.chat(
    "Prove that the sum of the first n odd numbers equals n², then generalize the result",
    { model: "inclusionai/ring-2.6-1t", stream: true }
);

// Print reasoning as it arrives, then the answer text; skip parts that carry neither.
for await (const part of response) {
    if (part?.reasoning) puter.print(part.reasoning);
    else if (part?.text) puter.print(part.text);
}
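
If you would rather keep the reasoning trace and the final answer apart instead of printing them interleaved, the same loop can fold parts into two buffers. This is a minimal sketch that assumes each streamed part has the shape used above (an optional `reasoning` string and an optional `text` string); `appendPart` and `collectStream` are hypothetical names, not Puter.js functions.

```javascript
// Folds one streamed part into an accumulator, keeping reasoning and answer text apart.
// Assumes parts shaped like { reasoning?: string, text?: string }; anything else is ignored.
function appendPart(acc, part) {
    return {
        reasoning: acc.reasoning + (part?.reasoning ?? ""),
        text: acc.text + (part?.text ?? ""),
    };
}

// Consumes any async iterable of parts (for example, a streamed chat response)
// and resolves with the full reasoning trace and the full answer.
async function collectStream(stream) {
    let acc = { reasoning: "", text: "" };
    for await (const part of stream) {
        acc = appendPart(acc, part);
    }
    return acc;
}
```

With the streamed `response` from the example above, `const { reasoning, text } = await collectStream(response);` would give you both strings once the stream ends, so the trace can be logged or hidden separately from the answer.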

Get Started Now

Just add one library to your project:

// npm install @heyputer/puter.js
import { puter } from '@heyputer/puter.js';

Or add one script tag to your HTML:

<script src="https://js.puter.com/v2/"></script>

No API keys or account needed. Start building with Ling 2.6 Flash, Ling 2.6 1T, and Ring 2.6 1T immediately.

Learn more:

Puter.js AI Documentation
