NVIDIA: Nemotron Nano 12B 2 VL API

Access NVIDIA: Nemotron Nano 12B 2 VL from NVIDIA using Puter.js AI API.

Get Started

Model Card

Nemotron Nano 12B V2 VL is a 12.6B parameter multimodal vision-language model built on a hybrid Mamba-Transformer architecture for document intelligence and video understanding. It processes multiple images, documents, and videos while achieving leading results on OCRBench v2 with up to 2.5x higher throughput using Efficient Video Sampling.

Context Window

N/A

tokens

Max Output

N/A

tokens

Input Cost

$0.2

per million tokens

Output Cost

$0.6

per million tokens

API Usage Example

Add NVIDIA: Nemotron Nano 12B 2 VL to your app with just a few lines of code.
No API keys, no backend, no configuration required.

<html>
<body>
    <script src="https://js.puter.com/v2/"></script>
    <script>
        puter.ai.chat("Explain quantum computing in simple terms", {
            model: "nvidia/nemotron-nano-12b-v2-vl"
        }).then(response => {
            document.body.innerHTML = response.message.content;
        });
    </script>
</body>
</html>

View full documentation →

Get started with Puter.js

Add NVIDIA: Nemotron Nano 12B 2 VL to your app without worrying about API keys or setup.

Read the Docs View Tutorials