Free, Unlimited NVIDIA Nemotron API

Reynaldi Chernando

Updated: January 20, 2026

On this page

This tutorial will show you how to use Puter.js to access NVIDIA Nemotron models, including Nemotron 3, Nemotron 2, Nano, Vision, and Super for free, without any API keys or usage restrictions.

Puter.js uses the User-Pays model, where users of your application cover their own AI costs. This means you as a developer don't pay anything for your users' usage, making your app practically free to run. You can scale to unlimited users and pay nothing for the AI or server usage.

Getting Started

You can use Puter.js without any API keys. To start using Puter.js, include the following script tag in your HTML file, either in the <head> or <body> section:

<script src="https://js.puter.com/v2/"></script>

Nothing else is required to start using Puter.js for free access to NVIDIA Nemotron models and capabilities.

Example 1: Regular chat with Nemotron 3 Nano

To generate text using Nemotron 3 Nano, use the puter.ai.chat() function:

puter.ai.chat("Explain the concept of neural networks in simple terms", { model: "nvidia/nemotron-3-nano-30b-a3b:free" })
.then(response => {
    puter.print(response);
});

Full code example:

<html>
<body>
    <script src="https://js.puter.com/v2/"></script>
    <script>
        puter.ai.chat("Explain the concept of neural networks in simple terms", { model: "nvidia/nemotron-3-nano-30b-a3b:free" })
            .then(response => {
                puter.print(response);
            });
    </script>
</body>
</html>

Example 2: Image analysis with Nemotron Nano 12B V2 VL

To analyze images, simply provide an image URL to puter.ai.chat() using the vision model Nemotron Nano 12B V2 VL:

<html>
<body>
    <script src="https://js.puter.com/v2/"></script>
    <script>
        puter.ai.chat(
            "Describe this image in detail and identify any objects you see.",
            "https://assets.puter.site/doge.jpeg",
            { model: 'nvidia/nemotron-nano-12b-v2-vl' }
        ).then(response => {
            puter.print(response);
        });
    </script>
</body>
</html>

Example 3: Complex reasoning with Nemotron Super

Nemotron Super excels at complex reasoning and problem-solving tasks. Here's how to use it for advanced analytical tasks:

<html>
<body>
    <script src="https://js.puter.com/v2/"></script>
    <script>
        puter.ai.chat(
            "Analyze the potential impacts of quantum computing on current encryption methods and suggest strategies for post-quantum cryptography.",
            { model: "nvidia/llama-3.3-nemotron-super-49b-v1.5" }
        )
        .then(response => {
            puter.print(response);
        });
    </script>
</body>
</html>

Example 4: Stream responses for longer queries

For longer responses, use streaming to get results in real-time:

async function streamResponse() {
    const response = await puter.ai.chat(
        "Explain the complete process of training a large language model, from data collection to deployment",
        { model: "nvidia/nemotron-3-nano-30b-a3b:free", stream: true }
    );

    for await (const part of response) {
        puter.print(part?.text);
    }
}

streamResponse();

Full code example:

<html>
<body>
    <script src="https://js.puter.com/v2/"></script>
    <script>
        async function streamResponse() {
            const response = await puter.ai.chat(
                "Explain the complete process of training a large language model, from data collection to deployment",
                { model: "nvidia/nemotron-3-nano-30b-a3b:free", stream: true }
            );

            for await (const part of response) {
                if(part?.reasoning)
                    puter.print(part?.reasoning);
                else
                    puter.print(part?.text);
            }
        }

        streamResponse();
    </script>
</body>
</html>

List of supported models

The following NVIDIA Nemotron models are supported by Puter.js:

nvidia/llama-3.1-nemotron-70b-instruct
nvidia/llama-3.1-nemotron-ultra-253b-v1
nvidia/llama-3.3-nemotron-super-49b-v1.5
nvidia/nemotron-3-nano-30b-a3b
nvidia/nemotron-3-nano-30b-a3b:free
nvidia/nemotron-nano-12b-v2-vl
nvidia/nemotron-nano-12b-v2-vl:free
nvidia/nemotron-nano-9b-v2
nvidia/nemotron-nano-9b-v2:free

Conclusion

Using Puter.js, you can gain access to NVIDIA Nemotron without having to set up the AI server yourself. And thanks to the User-Pays model, your users cover their own AI usage, not you as the developer. This means you can build powerful applications without worrying about AI usage costs.

You can find all AI features supported by Puter.js in the documentation.

Free, Serverless AI and Cloud

Start creating powerful web applications with Puter.js in seconds!

Get Started Now

Read the Docs • Try the Playground