Qwen2.5-Coder-0.5B-Instruct 

Introduction

Qwen2.5-Coder-0.5B-Instruct is Code-Specific Qwen large language models, Significantly improvements in code generation, code reasoning and code fixing. Key highlights of this model include:

Type: Causal Language Model
Training Stage: Pretraining & Post-training
Architecture: Transformers with RoPE, SwiGLU, RMSNorm, Attention QKV bias, and tied word embeddings
Number of Parameters: 0.49B (0.36B non-embedding)
Number of Layers: 24
Number of Attention Heads (GQA): 14 for Q and 2 for KV
Context Length: Full 32,768 tokens and generation up to 8,192 tokens

Available NPU Models

Base Model

qwen2.5-Coder-0.5B-ax630c

The base model providing a 128 context window and a maximum output of 1,024 tokens.

Support Platforms: LLM630 Compute Kit, Module LLM, and Module LLM Kit

128 context window
1,024 max output tokens

Install

apt install llm-model-qwen2.5-coder-0.5b-ax630c

Manual installation: Click here to download llm-model-qwen2.5-coder-0.5b-ax630c