Nous Research: DeepHermes 3 Mistral 24B Preview
This model is no longer available.Add AI to your application with Puter.js.
Explore Other ModelsModel Card
DeepHermes 3 Mistral 24B Preview is a 24B parameter instruction-tuned model based on Mistral-Small-24B, featuring a dual-mode system that toggles between intuitive chat responses and deep reasoning mode with extended chains of thought. It excels at function calling, structured JSON outputs, and multi-turn reasoning with the ability to use up to 13,000 tokens for complex problems.
Context Window N/A
tokens
Max Output 33K
tokens
Input Cost $0.02
per million tokens
Output Cost $0.1
per million tokens
Release Date Mar 2, 2025
Code Example
Add AI to your app with the Puter.js AI API — no API keys or setup required.
// npm install @heyputer/puter.js
import { puter } from '@heyputer/puter.js';
puter.ai.chat("Explain quantum computing in simple terms").then(response => {
document.body.innerHTML = response.message.content;
});
<html>
<body>
<script src="https://js.puter.com/v2/"></script>
<script>
puter.ai.chat("Explain quantum computing in simple terms").then(response => {
document.body.innerHTML = response.message.content;
});
</script>
</body>
</html>
More AI Models From Nous Research
Find other Nous Research models →
Hermes 4 70B
Hermes 4 70B is a hybrid reasoning model based on Llama-3.1-70B with toggleable deep thinking mode using think tags. It offers major improvements in math, code, STEM, logic, and creative writing while supporting JSON schema adherence, function calling, and reduced refusal rates compared to other models.
ChatHermes 4 405B
Hermes 4 405B is a frontier hybrid-mode reasoning model based on Llama-3.1-405B, trained on a 60B token dataset with verified reasoning traces. It features toggleable deep reasoning via think tags, massive improvements in math, code, STEM, and logic, and achieves state-of-the-art on RefusalBench for reduced censorship.
ChatHermes 3 405B Instruct
Hermes 3 Llama 3.1 405B is a frontier-level 405B parameter full fine-tune of Llama-3.1-405B, focused on user alignment with powerful steering capabilities. It features advanced agentic capabilities, roleplaying, reasoning, multi-turn conversation, and improved code generation, competitive with or superior to Llama-3.1 Instruct models.
Frequently Asked Questions
You can access DeepHermes 3 Mistral 24B Preview by Nous Research through Puter.js AI API. Include the library in your web app or Node.js project and start making calls with just a few lines of JavaScript — no backend and no configuration required. You can also use it with Python or cURL via Puter's OpenAI-compatible API.
Yes, it is free if you're using it through Puter.js. With the User-Pays Model, you can add DeepHermes 3 Mistral 24B Preview to your app at no cost — your users pay for their own AI usage directly, making it completely free for you as a developer.
| Price per 1M tokens | |
|---|---|
| Input | $0.02 |
| Output | $0.1 |
DeepHermes 3 Mistral 24B Preview was created by Nous Research and released on Mar 2, 2025.
DeepHermes 3 Mistral 24B Preview can generate up to 33K tokens in a single response.
Yes — the DeepHermes 3 Mistral 24B Preview API works with any JavaScript framework, Node.js, or plain HTML through Puter.js. Just include the library and start building. See the documentation for more details.
Get started with Puter.js
Add AI to your application without worrying about API keys or setup.
Explore Models View Tutorials