Qwen

Qwen 2.5 Coder 32B

Qwen's code-specialized model. Competitive with GPT-4o on HumanEval at a fraction of the cost. Supports 40+ programming languages.

open source · code · cost effective
Pricing via Neureus
Save 10% vs OpenRouter
Context window: 33K tokens
Max output: 8K tokens
Input (OpenRouter): $0.070 per 1M tokens
Input (Neureus): $0.063 per 1M tokens
Output (OpenRouter): $0.16 per 1M tokens
Output (Neureus): $0.14 per 1M tokens
Neureus prices all models at 10% below published OpenRouter rates, updated monthly. Free tier includes 5M tokens.
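To make the pricing concrete, here is a rough sketch of a cost estimate using the per-million-token rates listed above. The `Rates` type, the `estimateCostUSD` helper, and the example workload are illustrative assumptions, not part of any Neureus SDK. The estimate is linear and ignores caching and the free tier.

```typescript
// Per-million-token rates in USD, copied from the pricing listed above.
interface Rates {
  inputPerM: number;
  outputPerM: number;
}

const OPENROUTER_RATES: Rates = { inputPerM: 0.07, outputPerM: 0.16 };
const NEUREUS_RATES: Rates = { inputPerM: 0.063, outputPerM: 0.14 };

// Hypothetical helper: linear cost estimate, ignoring caching and the free tier.
function estimateCostUSD(inputTokens: number, outputTokens: number, rates: Rates): number {
  return (inputTokens / 1_000_000) * rates.inputPerM
       + (outputTokens / 1_000_000) * rates.outputPerM;
}

// Assumed example workload: 20M input tokens and 5M output tokens in a month.
const viaOpenRouter = estimateCostUSD(20_000_000, 5_000_000, OPENROUTER_RATES);
const viaNeureus = estimateCostUSD(20_000_000, 5_000_000, NEUREUS_RATES);
console.log(viaOpenRouter.toFixed(2), viaNeureus.toFixed(2));
```

For this assumed workload the estimate comes to roughly $2.20 at OpenRouter rates and $1.96 at Neureus rates. The difference is slightly more than 10% because the listed output price is rounded to two decimals.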
Code generation
Code completion
Debugging
40+ languages
HumanEval 90.2%
LiveCodeBench 51.3%

One API. Every model.

Swap any model ID in a single field. Neureus handles routing, caching, and tenant isolation automatically.

chat.ts
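// Assumes the API key is exposed as an environment variable (Node.js).
const NEUREUS_API_KEY = process.env.NEUREUS_API_KEY;

const res = await fetch('https://app.neureus.ai/ai/chat', {
  method: 'POST',
  headers: {
    'Authorization': `Bearer ${NEUREUS_API_KEY}`,
    'Content-Type': 'application/json',
  },
  body: JSON.stringify({
    // Swap this ID to target a different model.
    model: 'qwen/qwen-2.5-coder-32b',
    messages: [{ role: 'user', content: 'Hello!' }],
  }),
});
// fetch does not reject on HTTP error statuses, so check explicitly.
if (!res.ok) throw new Error(`Request failed with status ${res.status}`);
const { message } = await res.json();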
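The "swap any model ID in a single field" idea can be sketched as a small request builder. This is a minimal illustration that assumes the request shape shown in chat.ts. The `buildChatRequest` helper, the `ChatRequest` type, and the `example/other-model` ID are hypothetical placeholders, not documented Neureus names.

```typescript
// Endpoint taken from the chat.ts example.
const NEUREUS_CHAT_URL = 'https://app.neureus.ai/ai/chat';

interface ChatRequest {
  method: 'POST';
  headers: Record<string, string>;
  body: string;
}

// Hypothetical helper: builds fetch options for a single-turn prompt.
// Only the `model` argument changes when targeting a different model.
function buildChatRequest(model: string, prompt: string, apiKey: string): ChatRequest {
  return {
    method: 'POST',
    headers: {
      'Authorization': `Bearer ${apiKey}`,
      'Content-Type': 'application/json',
    },
    body: JSON.stringify({
      model,
      messages: [{ role: 'user', content: prompt }],
    }),
  };
}

const prompt = 'Write a binary search in Go.';
const coderRequest = buildChatRequest('qwen/qwen-2.5-coder-32b', prompt, 'YOUR_API_KEY');
// 'example/other-model' is a placeholder ID, not a real model listing.
const otherRequest = buildChatRequest('example/other-model', prompt, 'YOUR_API_KEY');
```

Either object can then be passed to `fetch(NEUREUS_CHAT_URL, coderRequest)`. Keeping the builder free of network calls makes the single-field difference between the two requests easy to inspect.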
Semantic response caching — repeated queries served free from edge
Per-tenant spend caps — set a monthly limit per customer
PII guardrails — optional PHI/PII detection before each request
Automatic failover — falls back to an alternate provider on error

Start using Qwen 2.5 Coder 32B for free.

Free tier includes 5M AI tokens, RAG pipeline, agents, and all 7 composition patterns — no credit card required.