NousResearch: Hermes 2 Pro - Llama-3 8B API

Access NousResearch: Hermes 2 Pro - Llama-3 8B from Nous Research using Puter.js AI API.

Get Started

nousresearch/hermes-2-pro-llama-3-8b

Model Card

Hermes 2 Pro Llama 3 8B is an 8B parameter model fine-tuned on Meta's Llama 3, optimized for function calling (90% accuracy) and structured JSON outputs (84% accuracy). It features dedicated tool-call parsing tokens for agentic capabilities and outperforms Llama-3 8B Instruct on AGIEval, TruthfulQA, and BigBench benchmarks.

Context Window N/A

tokens

Max Output 8K

tokens

Input Cost $0.14

per million tokens

Output Cost $0.14

per million tokens

Release Date Apr 30, 2024

API Usage Example

Add NousResearch: Hermes 2 Pro - Llama-3 8B to your app with just a few lines of code.
No API keys, no backend, no configuration required.

// npm install @heyputer/puter.js
import { puter } from '@heyputer/puter.js';

puter.ai.chat("Explain quantum computing in simple terms", {
    model: "nousresearch/hermes-2-pro-llama-3-8b"
}).then(response => {
    document.body.innerHTML = response.message.content;
});

<html>
<body>
    <script src="https://js.puter.com/v2/"></script>
    <script>
        puter.ai.chat("Explain quantum computing in simple terms", {
            model: "nousresearch/hermes-2-pro-llama-3-8b"
        }).then(response => {
            document.body.innerHTML = response.message.content;
        });
    </script>
</body>
</html>

View full documentation →

More Models from Nous Research

Chat

Nous: Hermes 4 70B

Hermes 4 70B is a hybrid reasoning model based on Llama-3.1-70B with toggleable deep thinking mode u...

Chat

Nous: Hermes 4 405B

Hermes 4 405B is a frontier hybrid-mode reasoning model based on Llama-3.1-405B, trained on a 60B to...

Chat

Nous: Hermes 3 405B Instruct

Hermes 3 Llama 3.1 405B is a frontier-level 405B parameter full fine-tune of Llama-3.1-405B, focused...

View all Nous Research models →

Frequently Asked Questions

What is this NousResearch: Hermes 2 Pro - Llama-3 8B API about?

The NousResearch: Hermes 2 Pro - Llama-3 8B API gives you access to Nous Research's chat model through Puter.js. With just a few lines of JavaScript, you can integrate NousResearch: Hermes 2 Pro - Llama-3 8B into any web app or Node.js project — no API keys, no backend, and no configuration required.

Who created NousResearch: Hermes 2 Pro - Llama-3 8B?

NousResearch: Hermes 2 Pro - Llama-3 8B was created by Nous Research and released on Apr 30, 2024.

What is the max output length of NousResearch: Hermes 2 Pro - Llama-3 8B?

NousResearch: Hermes 2 Pro - Llama-3 8B can generate up to 8K tokens in a single response.

How much does it cost?

The NousResearch: Hermes 2 Pro - Llama-3 8B API is available through the User-Pays Model. As a developer, you can add the NousResearch: Hermes 2 Pro - Llama-3 8B API to your app for free — your users pay for their own AI costs directly.

	Price per 1M tokens
Input	$0.14
Output	$0.14

How do I access the NousResearch: Hermes 2 Pro - Llama-3 8B API?

You can access the NousResearch: Hermes 2 Pro - Llama-3 8B API with just a few lines of JavaScript — no API keys, no backend, and no configuration required. Include the Puter.js library in your project and start making calls right away. For more details, check out our documentation.

Does the NousResearch: Hermes 2 Pro - Llama-3 8B API work with React / Vue / Vanilla JS / Node / etc.?

Yes — the NousResearch: Hermes 2 Pro - Llama-3 8B API works with any JavaScript framework, Node.js, or plain HTML through Puter.js. Just include the library and start building. See the documentation for more details.

Get started with Puter.js

Add NousResearch: Hermes 2 Pro - Llama-3 8B to your app without worrying about API keys or setup.

Read the Docs View Tutorials