ModelMeta
493 model profiles·24 providers·Refresh cadence hourly
Back to models
Metallama-3-1

Llama 3.1 405B Instruct

FlagshipRecommendedActive

The world's largest publicly available LLM, matching the frontier performance of GPT-4o.

Context window

128K

128,000 tokens

Max output

4K

4,096 tokens

Input price

Not published

No official price captured yet

Output price

Not published

No official price captured yet

Modalities

Text

Input: Text; output: Text

API surface

Chat Completions

1 supported endpoint

Overview

Where this model fits best

Use this section to quickly decide whether the model belongs in chat, coding, reasoning, embedding, rerank, vision, audio, or agent workflows.

Use cases

What this model should be considered for

Selection signal
text-chat
function-calling

Best fit

Use this model when you need a well-documented, structured option inside the registry and want a single place to inspect pricing, capabilities, and operational limits.

Capabilities

What you can actually do with it

Feature flags, API endpoints, and tool support are separated so integration constraints are easy to scan.

Capabilities

High-level features exposed by the model runtime.

StreamingFunction CallingJson ModeSystem Prompt

Endpoints

APIs and surfaces this model can be called through.

Chat Completions

Pricing

Pricing and billing signals

Known prices are shown per 1M tokens. Missing official prices are marked as not published instead of being treated as free.

No official pricing has been captured for this model yet. ModelMeta keeps unknown pricing separate from free pricing, so missing data is not shown as $0.

Check official pricing source

Controls

Runtime knobs worth knowing

Supported request parameters help developers understand sampling, output limits, reasoning controls, and structured output behavior.

Temperature

1

0 to 2

Top P

1

0 to 1

Presence Penalty

0

-2 to 2

Frequency Penalty

0

-2 to 2

Response Format

text

text, json_object

Specifications

Technical reference

Canonical identifiers, family, modalities, token limits, training information, and update metadata.

Model ID

llama-3.1-405b-instruct

Use this exact identifier in API calls and SDK configuration.

Provider

Meta

Family

llama-3-1

Access type

Open Weights

Input modalities

text

Output modalities

text

Max output tokens

4,096