Qwen

Qwen 2.5 Coder 32B

Qwen's code-specialized model. Competitive with GPT-4o on HumanEval at a fraction of the cost. Supports 40+ programming languages.

open source, code, cost effective


Pricing via Neureus

Save 10% vs OpenRouter

Context window: 33K tokens

Max output: 8K tokens

Price per 1M tokens   OpenRouter   Neureus
Input                 $0.070       $0.063
Output                $0.16        $0.14

Neureus prices all models at 10% below published OpenRouter rates, updated monthly. The free tier includes 5M tokens.
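To make the pricing concrete, here is a minimal sketch that estimates the cost of a workload at the rates listed above. The `estimateCost` helper and the example token counts are assumptions for illustration, and the sketch ignores the free tier.

```typescript
// Per-million-token prices in USD, copied from the pricing listed above.
interface Rates {
  inputPerM: number;
  outputPerM: number;
}

const OPENROUTER: Rates = { inputPerM: 0.070, outputPerM: 0.16 };
const NEUREUS: Rates = { inputPerM: 0.063, outputPerM: 0.14 };

// Hypothetical helper: cost in USD for a given number of input and output tokens.
function estimateCost(rates: Rates, inputTokens: number, outputTokens: number): number {
  return (inputTokens / 1_000_000) * rates.inputPerM
    + (outputTokens / 1_000_000) * rates.outputPerM;
}

// Assumed example workload: 10M input tokens and 2M output tokens.
const viaOpenRouter = estimateCost(OPENROUTER, 10_000_000, 2_000_000);
const viaNeureus = estimateCost(NEUREUS, 10_000_000, 2_000_000);

console.log(viaOpenRouter.toFixed(2)); // 1.02
console.log(viaNeureus.toFixed(2));    // 0.91
```

Because the listed output price is rounded to two decimals, the saving on this workload comes out slightly above 10%.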

Use this model

One API. Every model.

Swap any model ID in a single field. Neureus handles routing, caching, and tenant isolation automatically.

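chat.ts
// Assumes the API key is supplied through the NEUREUS_API_KEY environment variable.
const NEUREUS_API_KEY = process.env.NEUREUS_API_KEY;

const res = await fetch('https://app.neureus.ai/ai/chat', {
  method: 'POST',
  headers: {
    'Authorization': `Bearer ${NEUREUS_API_KEY}`,
    'Content-Type': 'application/json',
  },
  body: JSON.stringify({
    // Swap this ID to target a different model.
    model: 'qwen/qwen-2.5-coder-32b',
    messages: [{ role: 'user', content: 'Hello!' }],
  }),
});
// Fail loudly on a non-2xx status instead of parsing an error body as a reply.
if (!res.ok) {
  throw new Error(`Neureus request failed with status ${res.status}`);
}
const { message } = await res.json();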
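The single-field model swap can be sketched with a small request builder. `buildChatRequest`, the `ChatMessage` and `ChatRequest` types, and the `another/model-id` placeholder are hypothetical names introduced for illustration; only the endpoint, headers, and body shape come from the example above.

```typescript
interface ChatMessage {
  role: string;
  content: string;
}

// Shape of the options object passed to fetch() in the example above.
interface ChatRequest {
  method: 'POST';
  headers: Record<string, string>;
  body: string;
}

// Hypothetical helper: builds the fetch options for a given model ID.
function buildChatRequest(model: string, messages: ChatMessage[], apiKey: string): ChatRequest {
  return {
    method: 'POST',
    headers: {
      'Authorization': `Bearer ${apiKey}`,
      'Content-Type': 'application/json',
    },
    body: JSON.stringify({ model, messages }),
  };
}

const chatMessages: ChatMessage[] = [{ role: 'user', content: 'Hello!' }];

// Same messages and key; only the model ID differs between the two requests.
const coderRequest = buildChatRequest('qwen/qwen-2.5-coder-32b', chatMessages, 'YOUR_API_KEY');
const otherRequest = buildChatRequest('another/model-id', chatMessages, 'YOUR_API_KEY'); // placeholder ID
```

Either object can then be passed as the second argument to `fetch('https://app.neureus.ai/ai/chat', ...)`.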

Semantic response caching — repeated queries served free from edge

Per-tenant spend caps — set a monthly limit per customer

PII guardrails — optional PHI/PII detection before each request

Automatic failover — falls back to an alternate provider on error
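As a rough illustration of what a per-tenant monthly spend cap means, the sketch below tracks spend per tenant in memory and rejects charges that would exceed the limit. It is purely conceptual: `SpendCapTracker` and its methods are hypothetical, and nothing here describes how Neureus implements or configures the feature.

```typescript
// Illustrative only: an in-memory monthly spend tracker keyed by tenant ID.
class SpendCapTracker {
  private spent = new Map<string, number>();
  private monthlyCapUsd: number;

  constructor(monthlyCapUsd: number) {
    this.monthlyCapUsd = monthlyCapUsd;
  }

  // Records the charge and returns true if it fits under the tenant's cap;
  // otherwise returns false and records nothing.
  tryCharge(tenantId: string, costUsd: number): boolean {
    const current = this.spent.get(tenantId) ?? 0;
    if (current + costUsd > this.monthlyCapUsd) {
      return false;
    }
    this.spent.set(tenantId, current + costUsd);
    return true;
  }

  // Clears all recorded spend, e.g. at the start of a new billing month.
  reset(): void {
    this.spent.clear();
  }
}

const tracker = new SpendCapTracker(5); // assumed cap: $5 per tenant per month
const firstCharge = tracker.tryCharge('tenant-a', 3);  // fits under the cap
const secondCharge = tracker.tryCharge('tenant-a', 3); // would reach $6, so it is rejected
const otherTenant = tracker.tryCharge('tenant-b', 3);  // tenants are tracked independently
```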


Start using Qwen 2.5 Coder 32B for free.
