Nous: Hermes 4 70B API

Access Nous: Hermes 4 70B from Nous Research using Puter.js AI API.

Get Started

Model Card

Hermes 4 70B is a hybrid reasoning model based on Llama-3.1-70B with toggleable deep thinking mode using think tags. It offers major improvements in math, code, STEM, logic, and creative writing while supporting JSON schema adherence, function calling, and reduced refusal rates compared to other models.

Context Window

N/A

tokens

Max Output

131,072

tokens

Input Cost

$0.11

per million tokens

Output Cost

$0.38

per million tokens

API Usage Example

Add Nous: Hermes 4 70B to your app with just a few lines of code.
No API keys, no backend, no configuration required.

<html>
<body>
    <script src="https://js.puter.com/v2/"></script>
    <script>
        puter.ai.chat("Explain quantum computing in simple terms", {
            model: "nousresearch/hermes-4-70b"
        }).then(response => {
            document.body.innerHTML = response.message.content;
        });
    </script>
</body>
</html>

View full documentation →

Get started with Puter.js

Add Nous: Hermes 4 70B to your app without worrying about API keys or setup.

Read the Docs View Tutorials