OpenGVLab: InternVL3 78B API
Access InternVL3 78B from OpenGVLab using the Puter.js AI API.
Model ID: opengvlab/internvl3-78b
Model Card
InternVL3 78B is an open-source multimodal large language model from OpenGVLab that combines a 6B vision encoder with a 72B language model to process images, videos, documents, and text. It achieves state-of-the-art performance among open-source MLLMs on both perception and reasoning benchmarks, rivaling closed-source models like GPT-4o. Key innovations include Variable Visual Position Encoding (V2PE), native multimodal pre-training, and Mixed Preference Optimization.
Context Window: N/A
Max Output: 32,768 tokens
Input Cost: $0.10 per million tokens
Output Cost: $0.39 per million tokens
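To make the per-token pricing concrete, here is a minimal sketch of a cost estimate for a single request. The helper `estimateCostUSD` is a hypothetical name and is not part of Puter.js. It assumes the listed prices apply linearly to input and output tokens, with no minimums or other fees.

```javascript
// Prices copied from the model card (USD per million tokens).
const INPUT_USD_PER_MILLION = 0.10;
const OUTPUT_USD_PER_MILLION = 0.39;

// Hypothetical helper (not part of Puter.js): estimates the cost of one
// request, assuming simple linear per-token billing.
function estimateCostUSD(inputTokens, outputTokens) {
    if (inputTokens < 0 || outputTokens < 0) {
        throw new RangeError("Token counts must be non-negative");
    }
    const inputCost = (inputTokens / 1_000_000) * INPUT_USD_PER_MILLION;
    const outputCost = (outputTokens / 1_000_000) * OUTPUT_USD_PER_MILLION;
    return inputCost + outputCost;
}

// Example: a 2,000-token prompt with a 500-token reply.
console.log(estimateCostUSD(2000, 500).toFixed(6));
```

Under these assumptions, a 2,000-token prompt with a 500-token reply costs roughly $0.0004.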
API Usage Example
Add OpenGVLab: InternVL3 78B to your app with just a few lines of code.
No API keys, no backend, no configuration required.
<html>
<body>
<script src="https://js.puter.com/v2/"></script>
<script>
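// Send a text prompt to InternVL3 78B through Puter.js.
puter.ai.chat("Explain quantum computing in simple terms", {
    model: "opengvlab/internvl3-78b"
}).then(response => {
    // textContent renders the reply as plain text instead of parsing it as HTML.
    document.body.textContent = response.message.content;
}).catch(error => {
    console.error("Chat request failed:", error);
});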
</script>
</body>
</html>
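Since InternVL3 78B is a multimodal model, the same call can in principle carry an image as well as text. The following is a sketch only. It assumes the `(prompt, imageURL, options)` form of `puter.ai.chat` is available for this model and that the reply has the same `response.message.content` shape as in the example above. `describeImage` is a hypothetical helper, and the image URL in the usage comment is a placeholder.

```javascript
const MODEL_ID = "opengvlab/internvl3-78b";

// Hypothetical helper: `puterClient` is the global `puter` object loaded from
// https://js.puter.com/v2/. Passing it in as an argument keeps the function
// easy to test outside the browser.
async function describeImage(puterClient, imageUrl, prompt = "Describe this image.") {
    // Assumption: the (prompt, imageURL, options) form of puter.ai.chat
    // is supported for this model.
    const response = await puterClient.ai.chat(prompt, imageUrl, { model: MODEL_ID });
    return response.message.content;
}

// Usage inside the page, after the Puter.js script tag (placeholder URL):
// describeImage(puter, "https://example.com/photo.jpg")
//     .then(text => { document.body.textContent = text; })
//     .catch(error => console.error("Image request failed:", error));
```

Taking the client as a parameter is a design choice for testability. In a real page you would pass the global `puter` object.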
Get started with Puter.js
Add OpenGVLab: InternVL3 78B to your app without worrying about API keys or setup.