Qwen

Qwen 2.5 72B Instruct

Alibaba's flagship open-source model. Exceptional multilingual performance (especially Chinese), strong coding, and 128K context.

open source, multilingual


Pricing via Neureus

Save 10% vs OpenRouter

Context window: 128K tokens
Max output: 8K tokens

Price per 1M tokens    OpenRouter    Neureus
Input                  $0.35         $0.32
Output                 $0.40         $0.36

Neureus prices all models at 10% below published OpenRouter rates, updated monthly. The free tier includes 5M tokens.
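To make the price difference concrete, the sketch below estimates the bill for a given token volume from the per-million rates listed above. The `Rates` type, the `estimateCost` helper, and the example workload are illustrative assumptions, not part of any Neureus SDK, and the estimate ignores the free tier and any cached responses.

```typescript
// Per-million-token rates in USD, copied from the pricing listed above.
interface Rates {
  inputPerM: number;
  outputPerM: number;
}

const OPENROUTER: Rates = { inputPerM: 0.35, outputPerM: 0.40 };
const NEUREUS: Rates = { inputPerM: 0.32, outputPerM: 0.36 };

// Hypothetical helper: cost in USD for a given number of input and output tokens.
function estimateCost(rates: Rates, inputTokens: number, outputTokens: number): number {
  return (inputTokens * rates.inputPerM + outputTokens * rates.outputPerM) / 1_000_000;
}

// Assumed workload for illustration: 10M input tokens and 2M output tokens.
const viaOpenRouter = estimateCost(OPENROUTER, 10_000_000, 2_000_000);
const viaNeureus = estimateCost(NEUREUS, 10_000_000, 2_000_000);

console.log(`OpenRouter: $${viaOpenRouter.toFixed(2)}`);
console.log(`Neureus:    $${viaNeureus.toFixed(2)}`);
```

Because the listed input rate is rounded to the cent ($0.35 less 10% is $0.315, shown as $0.32), an estimate built from the displayed rates can come out slightly under a full 10% saving.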

Use this model

One API. Every model.

Swap any model ID in a single field. Neureus handles routing, caching, and tenant isolation automatically.

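  chat.ts
// Assumes the API key is exposed as the NEUREUS_API_KEY environment variable.
const NEUREUS_API_KEY = process.env.NEUREUS_API_KEY;

const res = await fetch('https://app.neureus.ai/ai/chat', {
  method: 'POST',
  headers: {
    'Authorization': `Bearer ${NEUREUS_API_KEY}`,
    'Content-Type': 'application/json',
  },
  body: JSON.stringify({
    // Swap this ID to target a different model.
    model: 'qwen/qwen-2.5-72b-instruct',
    messages: [{ role: 'user', content: 'Hello!' }],
  }),
});
// Surface HTTP errors instead of parsing an error body as a chat message.
if (!res.ok) throw new Error(`Neureus request failed: ${res.status}`);
const { message } = await res.json();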
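Since only the `model` field changes between models, the request can be wrapped in a small helper that takes the model ID as a parameter. This is a minimal sketch under stated assumptions: `buildChatRequest` and `chat` are hypothetical names, the `{ message }` response shape is taken from the snippet above, and `'another/model-id'` is a placeholder rather than a real model ID.

```typescript
// Message shape as used in the snippet above; roles other than 'user' are assumed.
interface ChatMessage {
  role: string;
  content: string;
}

interface ChatRequest {
  method: string;
  headers: Record<string, string>;
  body: string;
}

// Hypothetical helper: builds the fetch options; only `model` varies per call.
function buildChatRequest(apiKey: string, model: string, messages: ChatMessage[]): ChatRequest {
  return {
    method: 'POST',
    headers: {
      Authorization: `Bearer ${apiKey}`,
      'Content-Type': 'application/json',
    },
    body: JSON.stringify({ model, messages }),
  };
}

// Hypothetical wrapper around the endpoint shown above.
async function chat(apiKey: string, model: string, messages: ChatMessage[]): Promise<unknown> {
  const res = await fetch('https://app.neureus.ai/ai/chat', buildChatRequest(apiKey, model, messages));
  if (!res.ok) throw new Error(`Neureus request failed: ${res.status}`);
  const { message } = await res.json();
  return message;
}

// Same prompt, two model IDs; the second ID is a placeholder, not a real model.
const prompt: ChatMessage[] = [{ role: 'user', content: 'Hello!' }];
const qwenRequest = buildChatRequest('YOUR_API_KEY', 'qwen/qwen-2.5-72b-instruct', prompt);
const otherRequest = buildChatRequest('YOUR_API_KEY', 'another/model-id', prompt);
```

Calling `await chat(key, 'qwen/qwen-2.5-72b-instruct', prompt)` would then send the request. Keeping the request builder separate from the network call makes the payload easy to inspect or unit-test without contacting the API.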

Semantic response caching — repeated queries served free from edge

Per-tenant spend caps — set a monthly limit per customer

PII guardrails — optional PHI/PII detection before each request

Automatic failover — falls back to an alternate provider on error

Start using Qwen 2.5 72B Instruct for free.