inceptron

nvidia/llama-3.3-70b-instruct-fp8

Llama 3.3 70B Instruct FP8 by NVIDIA/Meta, a high-performance open-source model with 131K context window. Supports function calling for agentic workflows.

Learn more

Provider:

inceptron

Model type:

chat

Location:

europe

Context Window

131072

Intelligence Rating

Speed Rating

Cost Efficiency Rating

Pricing

$

0.12

Input tokens per million

$

0.38

Output tokens per million

Features

Tool Calling

Supported

JSON Mode

Supported

Create an account and start building today.

Book a demo

Explore docs

Create an account and start building today.

Book a demo

Explore docs