Models
======

Explore all available models and compare their capabilities.

.. _Reasoning_models:

Reasoning models
----------------

- :doc:`Qwen3-0.6B `
- :doc:`DeepSeek-R1-Distill-Qwen-1.5B `

.. _Flagship_chat_models:

Flagship chat models
--------------------

- :doc:`Qwen2.5-0.5B-Instruct `
- :doc:`Qwen2.5-1.5B-Instruct `
- :doc:`Qwen2.5-Coder-0.5B-Instruct `
- :doc:`Llama-3.2-1B-Instruct `
- :doc:`openbuddy-llama3.2-1b-v23.1-131k `

.. _Multimodal_models:

Multimodal models
-----------------

- :doc:`InternVL2_5-1B-MPO `
- :doc:`SmolVLM-256M-Instruct `
- :doc:`SmolVLM-500M-Instruct `

Text-to-speech
--------------

Models that can convert text into natural-sounding spoken audio.

- :doc:`MeloTTS-English `
- :doc:`MeloTTS-Chinese `
- :doc:`MeloTTS-Japanese `

Transcription
-------------

Models that can transcribe and translate audio into text.

- :doc:`Whisper-tiny `
- :doc:`Whisper-base `
- :doc:`Whisper-small `

Keyword spotting
----------------

Models that can detect specific keywords in audio streams.

- :doc:`Keyword spotting `

Voice activity detection
------------------------

Models that can detect whether there is speech in an audio stream.

- :doc:`Silero-vad `

Automatic Speech Recognition
----------------------------

Models that can convert spoken language into text.

- :doc:`Automatic Speech Recognition `

Vision
------

Models that can process images and perform tasks such as object detection and depth estimation.

- :doc:`Yolo11n `
- :doc:`Depth-Anything-V2 `

Model Pages
-----------

.. toctree::
   :maxdepth: 1
   :caption: Model Pages:
   :titlesonly:
   :hidden:

   /model_pages/qwen3-0.6b
   /model_pages/deepseek-r1-distill-qwen-1.5b
   /model_pages/qwen2.5-0.5b-instruct
   /model_pages/qwen2.5-1.5b-instruct
   /model_pages/qwen2.5-coder-0.5b-instruct
   /model_pages/llama-3.2-1b-instruct
   /model_pages/openbuddy-llama3.2-1b-v23.1-131k
   /model_pages/internvl2_5-1b-mpo
   /model_pages/smoivlm-256m-instruct
   /model_pages/smoivlm-500m-instruct
   /model_pages/melotts-english
   /model_pages/melotts-chinese
   /model_pages/melotts-japanese
   /model_pages/whisper-tiny
   /model_pages/whisper-base
   /model_pages/whisper-small
   /model_pages/yolo
   /model_pages/depthanything
   /model_pages/kws
   /model_pages/silero-vad
   /model_pages/asr