Model Finder

Find the perfect AI model for your task and hardware. Filter by use case, VRAM requirements, and licensing.

Found 190 models

Llama 3.1 8B

Meta

Popular
8Bllm4.4GB Q4

Most popular local LLM - runs on consumer hardware with excellent performance

chatcoding
OllamaHuggingFaceCommercial OK
MMLU:73%
HumanEval:72.6%

Claude 3.5 Sonnet

Anthropic

Popular
175Bllm96.3GB Q4

Anthropic's most intelligent model - excels at complex reasoning, coding, and analysis

reasoningcodinganalysis
Commercial OK
MMLU:88.7%
HumanEval:92%

GPT-4o

OpenAI

Popular
200Bllm110.0GB Q4

OpenAI's flagship omni model - text, vision, and audio in one

multimodalreasoningcoding
Commercial OK
MMLU:88.7%
HumanEval:90.2%

Whisper Large v3

OpenAI

Popular
1.55Baudio0.9GB Q4

Best open speech recognition - 99 languages supported

speech-to-texttranscriptionmultilingual
HuggingFaceCommercial OK

Llama 3.1 70B

Meta

Popular
70Bllm38.5GB Q4

Excellent balance of capability and efficiency for local deployment

reasoningcodingagentic
OllamaHuggingFaceCommercial OK
MMLU:86%
HumanEval:80.5%

Stable Diffusion XL 1.0

Stability AI

Popular
6.6Bimage3.6GB Q4

Industry-standard image generation model with excellent quality and flexibility

image-generationtext-to-imagehigh-resolution
HuggingFaceCommercial OK

FLUX.1 [dev]

Black Forest Labs

Popular
12Bimage6.0GB Q4

State-of-the-art image generation with exceptional prompt following

image-generationtext-to-imagehigh-quality
HuggingFace

Llama 3.3 70B

Meta

Popular
70Bllm38.5GB Q4

Latest Llama with improved tool use and multilingual capabilities

reasoningcodingmultilingual
OllamaHuggingFaceCommercial OK
MMLU:86%
HumanEval:88.4%

Llama 3.1 405B

Meta

Popular
405Bllm222.8GB Q4

Meta's largest and most capable open model with state-of-the-art performance

reasoningcodingmultilingual
OllamaHuggingFaceCommercial OK
MMLU:88.6%
HumanEval:89%

Mistral 7B

Mistral AI

Popular
7Bllm3.9GB Q4

The original Mistral - set new standards for 7B models

efficient
OllamaHuggingFaceCommercial OK
MMLU:62.5%
HumanEval:29.3%

Gemini 1.5 Pro

Google

Popular
175Bllm96.3GB Q4

Google's flagship with 2M token context - can process entire codebases

long-contextmultimodalreasoning
Commercial OK
MMLU:85.9%
HumanEval:84.1%

GPT-4o Mini

OpenAI

Popular
8Bllm4.4GB Q4

Small, fast, and cheap GPT-4o for most tasks

fast-inferencemultimodalefficient
Commercial OK
MMLU:82%
HumanEval:87%

Qwen2.5 Coder 32B

Alibaba

Popular
32Bllm17.6GB Q4

State-of-the-art coding model - rivals GPT-4 on coding benchmarks

codingcode-completioncode-generation
OllamaHuggingFaceCommercial OK
HumanEval:92.7%

Stable Diffusion 1.5

Stability AI

Popular
0.86Bimage0.5GB Q4

Classic SD model with massive ecosystem of fine-tunes and LoRAs

image-generationlightweightfine-tuning
HuggingFaceCommercial OK

FLUX.1 [dev]

Black Forest Labs

Popular
12Bimage6.6GB Q4

State-of-the-art image generation with exceptional quality

image-generationhigh-qualitystate-of-the-art
HuggingFace

FLUX.1 [schnell]

Black Forest Labs

Popular
12Bimage6.0GB Q4

Fast FLUX with Apache license for commercial use

image-generationfast-inferencetext-to-image
HuggingFaceCommercial OK

DALL·E 3

OpenAI

Popular
12Bimage6.6GB Q4

OpenAI's latest image generation model

image-generationtext-to-imagecreative
Commercial OK

BGE Large EN v1.5

BAAI

Popular
0.335Bembedding0.2GB Q4

Best-in-class English embedding model for RAG applications

embeddingsemantic-searchrag
OllamaHuggingFaceCommercial OK

Whisper Large v3 Turbo

OpenAI

Popular
0.8Baudio0.4GB Q4

8x faster Whisper with minimal quality loss

speech-to-textfast-transcriptionmultilingual
HuggingFaceCommercial OK

ElevenLabs Multilingual v2

ElevenLabs

Popular
1Baudio0.6GB Q4

Industry-leading voice synthesis with cloning

text-to-speechvoice-cloningmultilingual
Commercial OK

Mixtral 8x7B

Mistral AI

Popular
46.7Bllm25.7GB Q4

Highly efficient MoE model - GPT-3.5 quality at fraction of compute

efficientmultilingual
OllamaHuggingFaceCommercial OK
MMLU:70.6%
HumanEval:40.2%

DeepSeek V3

DeepSeek

Popular
671Bllm369.1GB Q4

State-of-the-art MoE model rivaling GPT-4 at fraction of compute

reasoningcodingmath
OllamaHuggingFaceCommercial OK
MMLU:88.5%
HumanEval:91%

Gemini 2.0 Flash

Google

Popular
100Bllm55.0GB Q4

Google's latest multimodal model with native tool use and real-time streaming

multimodalreasoningagentic
Commercial OK
MMLU:85%

Qwen2.5 Coder 7B

Alibaba

Popular
7Bllm3.9GB Q4

Best coding model for consumer GPUs

codingcode-completion
OllamaHuggingFaceCommercial OK
HumanEval:78%

FLUX.1 [schnell]

Black Forest Labs

Popular
12Bimage6.6GB Q4

Fast FLUX variant - 4-step generation with Apache 2.0 license

image-generationfast4-step
HuggingFaceCommercial OK

BGE-M3

BAAI

Popular
0.568Bembedding0.3GB Q4

Multi-lingual, multi-functionality, multi-granularity embedding

embeddingmultilinguallong-context
OllamaHuggingFaceCommercial OK

All-MiniLM-L6-v2

Sentence Transformers

Popular
0.022Bembedding0.0GB Q4

Classic lightweight embedding - runs on CPU

embeddingtinycpu-friendly
OllamaHuggingFaceCommercial OK

Llama 3.2 11B Vision

Meta

Popular
11Bmultimodal6.1GB Q4

Efficient multimodal model for consumer hardware

visionmultimodal
OllamaHuggingFaceCommercial OK

Qwen 2.5 72B

Alibaba

Popular
72Bllm39.6GB Q4

Alibaba's flagship model - rivals GPT-4 on many benchmarks

reasoningcodingmath
OllamaHuggingFaceCommercial OK
MMLU:86.1%
HumanEval:86.6%

Qwen 2.5 7B

Alibaba

Popular
7Bllm3.9GB Q4

Best-in-class 7B model with 128K context

efficientlong-context
OllamaHuggingFaceCommercial OK
MMLU:74%
HumanEval:70%

Claude 3.5 Haiku

Anthropic

Popular
20Bllm11.0GB Q4

Fast and cost-effective Claude for high-volume applications

fast-inferencechatcoding
Commercial OK
MMLU:78%

Gemini 1.5 Flash

Google

Popular
30Bllm16.5GB Q4

Fast Gemini with 1M context for high-volume applications

fast-inferencemultimodallong-context
Commercial OK
MMLU:78.9%

SDXL Turbo

Stability AI

Popular
6.6Bimage3.6GB Q4

Distilled SDXL for near-real-time generation (1-4 steps)

image-generationfastreal-time
HuggingFaceCommercial OK

FLUX.1 [pro]

Black Forest Labs

Popular
12Bimage6.6GB Q4

Highest quality FLUX via API

image-generationhighest-qualitytext-to-image
Commercial OK

BGE Base EN v1.5

BAAI

Popular
0.109Bembedding0.1GB Q4

Efficient embedding model for resource-constrained environments

embeddingsemantic-search
OllamaHuggingFaceCommercial OK

Faster Whisper Large v3

SYSTRAN

Popular
1.55Baudio0.9GB Q4

CTranslate2-optimized Whisper - 4x faster inference

speech-to-textoptimizedfast
HuggingFaceCommercial OK

Llama 3.2 3B

Meta

3Bllm1.7GB Q4

Compact model for edge deployment and low-resource environments

lightweightedge
OllamaHuggingFaceCommercial OK

Mistral Nemo 12B

Mistral AI

12Bllm6.6GB Q4

Efficient 12B model with 128K context - great for consumer GPUs

long-contextreasoning
OllamaHuggingFaceCommercial OK
MMLU:68%

Gemma 2 9B

Google

9Bllm5.0GB Q4

Efficient 9B model with strong instruction following

efficientinstruction-following
OllamaHuggingFaceCommercial OK
MMLU:71.3%
HumanEval:40.2%

GPT-4 Turbo

OpenAI

175Bllm96.3GB Q4

GPT-4 with vision and 128K context

reasoningcodingvision
Commercial OK
MMLU:86.4%
HumanEval:87.1%

Qwen2.5 Coder 14B

Alibaba

14Bllm7.7GB Q4

Strong coding performance in efficient package

codingcode-completion
OllamaHuggingFaceCommercial OK
HumanEval:85%

Stable Diffusion 3.5 Large

Stability AI

8.1Bimage4.5GB Q4

Largest open SD3 model with best quality

image-generationhigh-qualitytext-rendering
HuggingFaceCommercial OK

FLUX.1 [pro]

Black Forest Labs

12Bimage6.6GB Q4

Highest quality FLUX - API access only

image-generationhighest-qualityapi-only
Commercial OK

ControlNet SDXL

Various

2.5Bimage1.4GB Q4

Controlled generation for SDXL - pose, depth, canny, etc.

image-generationcontrolled-generationpose
HuggingFaceCommercial OK

Imagen 3

Google

20Bimage11.0GB Q4

Google's highest quality image generation

image-generationphotorealismtext-to-image
Commercial OK

E5 Large v2

Microsoft

0.335Bembedding0.2GB Q4

Microsoft's high-quality embedding model

embeddingsemantic-search
HuggingFaceCommercial OK

Nomic Embed Text v1.5

Nomic AI

0.137Bembedding0.1GB Q4

Matryoshka embedding with variable dimension support

embeddinglong-contextmatryoshka
OllamaHuggingFaceCommercial OK

All-MPNet-Base-v2

Sentence Transformers

0.109Bembedding0.1GB Q4

Higher quality classic embedding

embeddinghigh-quality
HuggingFaceCommercial OK

XTTS v2

Coqui

0.5Baudio0.3GB Q4

Best open-source voice cloning with 17 languages

text-to-speechvoice-cloningmultilingual
HuggingFaceCommercial OK

SD 3.5 Large Turbo

Stability AI

8.1Bimage4.5GB Q4

Fast version of SD3.5 Large (4-step generation)

image-generationfasthigh-quality
HuggingFaceCommercial OK

Llama 3.2 90B Vision

Meta

90Bmultimodal49.5GB Q4

Large multimodal model with image understanding capabilities

visionreasoningmultimodal
OllamaHuggingFaceCommercial OK
MMLU:86%

Mixtral 8x22B

Mistral AI

176Bllm96.8GB Q4

Large MoE model with excellent efficiency per active parameter

reasoningcodingmultilingual
OllamaHuggingFaceCommercial OK
MMLU:77.8%
HumanEval:45.1%

Qwen 2.5 32B

Alibaba

32Bllm17.6GB Q4

Sweet spot between capability and efficiency

reasoningcoding
OllamaHuggingFaceCommercial OK
MMLU:83%
HumanEval:79%

Phi-3 Mini 3.8B

Microsoft

3.8Bllm2.1GB Q4

Remarkably capable 3.8B model for edge deployment

lightweightreasoningedge
OllamaHuggingFaceCommercial OK
MMLU:70%

Claude 3 Opus

Anthropic

200Bllm110.0GB Q4

Most capable Claude 3 model for complex tasks

reasoninganalysiswriting
Commercial OK
MMLU:86.8%
HumanEval:84.9%

o1

OpenAI

200Bllm110.0GB Q4

OpenAI's reasoning model - thinks before answering for complex problems

reasoningmathcoding
Commercial OK
MMLU:92.3%
HumanEval:94%

Qwen2-VL 7B

Alibaba

7Bmultimodal3.9GB Q4

Efficient vision-language model for consumer hardware

visionmultimodalefficient
OllamaHuggingFaceCommercial OK

Zephyr 7B Beta

Hugging Face

7Bllm3.9GB Q4

DPO-tuned Mistral with excellent chat performance

chatdpo-tunedhelpful
OllamaHuggingFaceCommercial OK

Codestral 22B

Mistral AI

22Bllm12.1GB Q4

Mistral's flagship code model

codingcode-generationmulti-language
OllamaHuggingFace
HumanEval:81.1%

DeepSeek Coder V2 236B

DeepSeek

236Bllm129.8GB Q4

Massive MoE coding model with excellent performance

codingcode-generationdebugging
OllamaHuggingFaceCommercial OK
HumanEval:90.2%

Stable Diffusion 3 Medium

Stability AI

2Bimage1.1GB Q4

Latest SD architecture with improved text rendering and composition

image-generationtext-to-imagetext-rendering
HuggingFaceCommercial OK

AnimateDiff v3

AnimateDiff

1.5Bvideo0.8GB Q4

Animation module for Stable Diffusion models

video-generationanimationmotion
HuggingFaceCommercial OK

Ideogram 2.0

Ideogram

10Bimage5.5GB Q4

Best-in-class text rendering in images

image-generationtext-renderingtypography
Commercial OK

BGE Small EN v1.5

BAAI

0.033Bembedding0.0GB Q4

Smallest BGE for edge deployment

embeddinglightweight
OllamaHuggingFaceCommercial OK

GTE-Qwen2 7B Instruct

Alibaba

7Bembedding3.9GB Q4

State-of-the-art embedding with 128K context

embeddinglong-contextmultilingual
HuggingFaceCommercial OK

Whisper Medium

OpenAI

0.77Baudio0.4GB Q4

Balanced speed and accuracy for most use cases

speech-to-textefficientmultilingual
HuggingFaceCommercial OK

MusicGen Large

Meta

3.3Baudio1.8GB Q4

Generate music from text descriptions

music-generationtext-to-musicmelody-conditioning
HuggingFace

Qwen 2.5 14B

Alibaba

14Bllm7.7GB Q4

Strong performance in compact size

reasoningcoding
OllamaHuggingFaceCommercial OK
MMLU:79%
HumanEval:75%

Gemma 2 27B

Google

27Bllm14.9GB Q4

Google's largest open model with strong reasoning

reasoninginstruction-following
OllamaHuggingFaceCommercial OK
MMLU:75.2%
HumanEval:51.8%

Qwen2-VL 72B

Alibaba

72Bmultimodal39.6GB Q4

State-of-the-art open vision-language model

visionmultimodalvideo-understanding
OllamaHuggingFaceCommercial OK

Hermes 3 8B

Nous Research

8Bllm4.4GB Q4

Efficient agentic model for consumer hardware

agenticfunction-callingefficient
OllamaHuggingFaceCommercial OK

OpenHermes 2.5 7B

Teknium

7Bllm3.9GB Q4

Popular fine-tune with great function calling

chatfunction-callingstructured-output
OllamaHuggingFaceCommercial OK

LLaVA 1.6 7B

LLaVA Team

7Bmultimodal3.9GB Q4

Compact vision-language model

visionmultimodallightweight
OllamaHuggingFaceCommercial OK

DeepSeek Coder V2 16B

DeepSeek

16Bllm8.8GB Q4

Efficient MoE coding model for consumer hardware

codingcode-generation
OllamaHuggingFaceCommercial OK
HumanEval:81%

Code Llama 7B

Meta

7Bllm3.9GB Q4

Compact Code Llama for edge deployment

codinginfilling
OllamaHuggingFaceCommercial OK
HumanEval:33.5%

Playground v2.5

Playground AI

2.6Bimage1.4GB Q4

Aesthetic-focused model optimized for pleasing images

image-generationaesthetichigh-quality
HuggingFaceCommercial OK

Playground v2.5

Playground

2.6Bimage1.4GB Q4

Aesthetic-focused model rivaling Midjourney

image-generationaesthetictext-to-image
HuggingFaceCommercial OK

E5 Base v2

Microsoft

0.109Bembedding0.1GB Q4

Efficient E5 variant

embedding
HuggingFaceCommercial OK

GTE Large EN v1.5

Alibaba

0.434Bembedding0.2GB Q4

High-quality English embedding with 8K context

embeddingsemantic-search
HuggingFaceCommercial OK

Bark

Suno

0.4Baudio0.2GB Q4

Generate speech, music, and sound effects from text

text-to-speechsound-effectsmusic
HuggingFaceCommercial OK

Mistral Large 2

Mistral AI

123Bllm67.7GB Q4

Mistral's flagship model with strong reasoning and function calling

reasoningcodingmultilingual
HuggingFaceCommercial OK
MMLU:84%
HumanEval:92%

Gemma 2 2B

Google

2Bllm1.1GB Q4

Smallest Gemma 2 for edge deployment

lightweightedge
OllamaHuggingFaceCommercial OK

DeepSeek V2.5

DeepSeek

236Bllm129.8GB Q4

Efficient MoE with excellent code and reasoning

reasoningcoding
OllamaHuggingFaceCommercial OK
MMLU:80%
HumanEval:80%

Claude 3 Sonnet

Anthropic

70Bllm38.5GB Q4

Balanced Claude 3 model for most use cases

reasoningcodingvision
Commercial OK
MMLU:79%
HumanEval:73%

o1-mini

OpenAI

30Bllm16.5GB Q4

Faster, cheaper reasoning model for coding and STEM

reasoningcodingmath
Commercial OK
MMLU:85.2%
HumanEval:92%

Nemotron 70B

NVIDIA

70Bllm38.5GB Q4

NVIDIA's instruction-tuned model optimized for helpfulness

reasoninginstruction-followinghelpfulness
OllamaHuggingFaceCommercial OK
MMLU:85%
HumanEval:73%

TinyLlama 1.1B

Zhang Peiyuan

1.1Bllm0.6GB Q4

Compact model trained on 3T tokens - surprisingly capable

tinyefficientedge
OllamaHuggingFaceCommercial OK

LLaVA 1.6 34B

LLaVA Team

34Bmultimodal18.7GB Q4

State-of-the-art open vision-language model

visionmultimodalimage-understanding
OllamaHuggingFaceCommercial OK

DeepSeek Coder 6.7B

DeepSeek

6.7Bllm3.7GB Q4

Efficient coding model for consumer GPUs

codingcode-completion
OllamaHuggingFaceCommercial OK
HumanEval:65%

CogVideoX-5B

THUDM

5Bvideo2.8GB Q4

State-of-the-art open video generation model

video-generationtext-to-video
HuggingFaceCommercial OK

E5-Mistral 7B Instruct

Microsoft

7Bembedding3.9GB Q4

LLM-based embedding with 32K context

embeddinglong-contextinstruction-following
HuggingFaceCommercial OK

Jina Embeddings v3

Jina AI

0.572Bembedding0.3GB Q4

Task-specific multilingual embeddings

embeddingmultilingualtask-specific
HuggingFace

Whisper Small

OpenAI

0.24Baudio0.1GB Q4

Efficient Whisper for edge deployment

speech-to-textlightweightmultilingual
HuggingFaceCommercial OK

Fish Speech 1.4

Fish Audio

0.5Baudio0.3GB Q4

High-quality zero-shot TTS with voice cloning

text-to-speechvoice-cloningzero-shot
HuggingFaceCommercial OK

SeamlessM4T v2 Large

Meta

2.3Baudio1.3GB Q4

Unified speech translation across 100+ languages

speech-translationspeech-to-texttext-to-speech
HuggingFace

Qwen 2.5 3B

Alibaba

3Bllm1.7GB Q4

Compact model for resource-constrained environments

lightweightedge
OllamaHuggingFaceCommercial OK

Phi-3 Small 7B

Microsoft

7Bllm3.9GB Q4

Efficient reasoning model with long context

reasoningefficient
OllamaHuggingFaceCommercial OK
MMLU:75%

Command R+

Cohere

104Bllm57.2GB Q4

Optimized for RAG and agentic workflows

ragagentictool-use
OllamaHuggingFace
MMLU:75.7%

Claude 3 Haiku

Anthropic

20Bllm11.0GB Q4

Fastest Claude 3 for simple tasks

fast-inferencechat
Commercial OK

Hermes 3 70B

Nous Research

70Bllm38.5GB Q4

Best open model for agentic tasks and function calling

agenticfunction-callingtool-use
OllamaHuggingFaceCommercial OK
MMLU:80%

Dolphin 2.9 8B

Cognitive Computations

8Bllm4.4GB Q4

Smaller uncensored Dolphin

uncensoredchatcreative
OllamaHuggingFaceCommercial OK

LLaVA 1.6 13B

LLaVA Team

13Bmultimodal7.2GB Q4

Efficient LLaVA for consumer GPUs

visionmultimodalefficient
OllamaHuggingFaceCommercial OK

Moondream 2

Vikhyat

1.9Bmultimodal1.0GB Q4

Tiny vision model - runs anywhere

visiontinyedge
OllamaHuggingFaceCommercial OK

Qwen2.5 Coder 3B

Alibaba

3Bllm1.7GB Q4

Compact coding model for resource-constrained environments

codingcode-completionlightweight
OllamaHuggingFaceCommercial OK
HumanEval:65%

DeepSeek Coder 33B

DeepSeek

33Bllm18.2GB Q4

Strong coding model with good debugging capability

codingcode-generationdebugging
OllamaHuggingFaceCommercial OK
HumanEval:79.3%

Code Llama 34B

Meta

34Bllm18.7GB Q4

Strong Code Llama with infilling support

codingcode-generationinfilling
OllamaHuggingFaceCommercial OK
HumanEval:53.7%

Mochi 1 Preview

Genmo

10Bvideo5.5GB Q4

High-quality open video generation with Apache license

video-generationtext-to-videohigh-quality
HuggingFaceCommercial OK

GTE-Qwen2 1.5B Instruct

Alibaba

1.5Bembedding0.8GB Q4

Efficient GTE with long context support

embeddinglong-contextefficient
HuggingFaceCommercial OK

MusicGen Medium

Meta

1.5Baudio0.8GB Q4

Balanced music generation model

music-generationtext-to-music
HuggingFace

Stable Audio Open

Stability AI

1.1Baudio0.6GB Q4

Generate variable-length audio up to 47 seconds

music-generationsound-effectsvariable-length
HuggingFaceCommercial OK

Llama 3.2 1B

Meta

1Bllm0.6GB Q4

Smallest Llama for mobile and embedded deployment

lightweightedgemobile
OllamaHuggingFaceCommercial OK

Phi-3 Medium 14B

Microsoft

14Bllm7.7GB Q4

Strong reasoning and math in compact form

reasoningmathcoding
OllamaHuggingFaceCommercial OK
MMLU:78%

Yi 1.5 34B

01.AI

34Bllm18.7GB Q4

Strong bilingual (English/Chinese) model

reasoningmultilingual
OllamaHuggingFaceCommercial OK
MMLU:76.8%

Command R

Cohere

35Bllm19.3GB Q4

Efficient RAG and tool-use model

ragagentictool-use
OllamaHuggingFaceCommercial OK

Gemini 1.5 Flash 8B

Google

8Bllm4.4GB Q4

Smallest Gemini for cost-sensitive applications

fast-inferencelightweightlong-context
Commercial OK

Grok 2

xAI

314Bllm172.7GB Q4

xAI's flagship model with real-time information access

reasoningcodingreal-time-info
Commercial OK
MMLU:87.5%
HumanEval:88%

OpenChat 3.5 7B

OpenChat

7Bllm3.9GB Q4

High-quality 7B chat model trained with C-RLFT

chatefficient
OllamaHuggingFaceCommercial OK
MMLU:64.3%

StarCoder2 15B

BigCode

15Bllm8.3GB Q4

Strong open code model trained on The Stack v2

code-generationcode-completionmulti-language
OllamaHuggingFaceCommercial OK
HumanEval:46.3%

Vicuna 13B

LMSYS

13Bllm7.2GB Q4

Popular chat model for consumer GPUs

chatinstruction-following
OllamaHuggingFaceCommercial OK

Dolphin 2.9 70B

Cognitive Computations

70Bllm38.5GB Q4

Uncensored model for research and creative use

uncensoredchatcreative
OllamaHuggingFaceCommercial OK

MiniCPM-V 2.6

OpenBMB

8Bmultimodal4.4GB Q4

Strong OCR and chart understanding

visionocrchart-understanding
OllamaHuggingFaceCommercial OK

Mathstral 7B

Mistral AI

7Bllm3.9GB Q4

Mistral's math-specialized model

mathreasoningscience
OllamaHuggingFaceCommercial OK

Code Llama 70B

Meta

70Bllm38.5GB Q4

Meta's largest coding model

codingcode-generationinfilling
OllamaHuggingFaceCommercial OK
HumanEval:67.8%

Code Llama 13B

Meta

13Bllm7.2GB Q4

Efficient Code Llama for consumer hardware

codinginfilling
OllamaHuggingFaceCommercial OK
HumanEval:42.7%

CodeGemma 7B

Google

7Bllm3.9GB Q4

Google's coding model with infilling support

codingcode-completioninfilling
OllamaHuggingFaceCommercial OK
HumanEval:52%

Stable Diffusion 2.1

Stability AI

0.86Bimage0.5GB Q4

SD 2.1 with improved quality at 768px

image-generation768-resolution
HuggingFaceCommercial OK

PixArt-Σ

PixArt-alpha

0.6Bimage0.3GB Q4

Efficient DiT model capable of 4K generation

image-generationefficient4k-capable
HuggingFaceCommercial OK

Kolors

Kwai

5Bimage2.8GB Q4

Strong Chinese-English image generation

image-generationchinese-textmultilingual
HuggingFaceCommercial OK

Voyage 3

Voyage AI

0.5Bembedding0.3GB Q4

Premium embedding API with best-in-class retrieval

embeddinglong-contextretrieval
Commercial OK

Parler TTS Large

Hugging Face

2.3Baudio1.3GB Q4

Natural-sounding TTS with style control via text descriptions

text-to-speechexpressivecontrollable
HuggingFaceCommercial OK

MeloTTS

MyShell

0.1Baudio0.1GB Q4

Fast multilingual TTS for edge deployment

text-to-speechmultilinguallightweight
HuggingFaceCommercial OK

SOLAR 10.7B

Upstage

10.7Bllm5.9GB Q4

Efficient 10.7B model with depth up-scaling

efficientinstruction-following
OllamaHuggingFaceCommercial OK
MMLU:66%

Vicuna 7B

LMSYS

7Bllm3.9GB Q4

Efficient chat model

chatlightweight
OllamaHuggingFaceCommercial OK

StableLM Zephyr 3B

Stability AI

3Bllm1.7GB Q4

Fast chat model for edge deployment

chatlightweightefficient
OllamaHuggingFaceCommercial OK

Neural Chat 7B

Intel

7Bllm3.9GB Q4

Intel-optimized chat model

chatintel-optimized
OllamaHuggingFaceCommercial OK

SmolLM 1.7B

Hugging Face

1.7Bllm0.9GB Q4

Small but capable model from HuggingFace

tinyefficientedge
OllamaHuggingFaceCommercial OK

Jamba 1.5 Large

AI21 Labs

398Bllm218.9GB Q4

Novel Mamba-Transformer hybrid with 256K context

long-contextefficienthybrid-architecture
HuggingFaceCommercial OK

StarCoder2 15B

BigCode

15Bllm8.3GB Q4

Multi-language coding model trained on The Stack v2

codingcode-completionmulti-language
OllamaHuggingFaceCommercial OK
HumanEval:46.3%

WizardCoder 33B

WizardLM

33Bllm18.2GB Q4

Strong instruction-following coding model

codinginstruction-following
OllamaHuggingFaceCommercial OK
HumanEval:73.2%

PixArt-Σ

PixArt

0.6Bimage0.3GB Q4

Efficient 4K image generation with small footprint

image-generationefficient4k-generation
HuggingFaceCommercial OK

Parler TTS Mini

Hugging Face

0.88Baudio0.5GB Q4

Efficient Parler TTS for edge deployment

text-to-speechefficient
HuggingFaceCommercial OK

MusicGen Small

Meta

0.3Baudio0.2GB Q4

Efficient music generation for consumer hardware

music-generationlightweight
HuggingFace

VoiceCraft

Meta

0.83Baudio0.5GB Q4

Edit speech with natural voice cloning

speech-editingvoice-cloningzero-shot
HuggingFace

Qwen 2.5 0.5B

Alibaba

0.5Bllm0.3GB Q4

Smallest Qwen for embedded and mobile

tinyedgemobile
OllamaHuggingFaceCommercial OK

Yi 1.5 9B

01.AI

9Bllm5.0GB Q4

Efficient bilingual model

efficientmultilingual
OllamaHuggingFaceCommercial OK

InternLM 2.5 20B

Shanghai AI Lab

20Bllm11.0GB Q4

Strong Chinese-English model from Shanghai AI Lab

reasoningmathcoding
OllamaHuggingFaceCommercial OK
MMLU:78%

Starling LM 7B

Berkeley

7Bllm3.9GB Q4

RLHF-tuned model with high MT-Bench scores

chathelpfulharmless
OllamaHuggingFaceCommercial OK

StarCoder2 7B

BigCode

7Bllm3.9GB Q4

Efficient code model for consumer hardware

code-generationcode-completion
OllamaHuggingFaceCommercial OK
HumanEval:35.4%

Vicuna 33B

LMSYS

33Bllm18.2GB Q4

Strong chat model from LMSYS

chatinstruction-following
OllamaHuggingFaceCommercial OK

Granite 34B Code

IBM

34Bllm18.7GB Q4

IBM's enterprise-grade code model

codingenterprise
OllamaHuggingFaceCommercial OK

Qwen2.5 Coder 1.5B

Alibaba

1.5Bllm0.8GB Q4

Smallest Qwen coder for embedded/mobile

codinglightweightedge
OllamaHuggingFaceCommercial OK
HumanEval:55%

StarCoder2 7B

BigCode

7Bllm3.9GB Q4

Efficient StarCoder for consumer hardware

codingcode-completion
OllamaHuggingFaceCommercial OK
HumanEval:35%

Kandinsky 3

Sber AI

3Bimage1.7GB Q4

Multilingual image generation with strong Russian support

image-generationmultilingual
HuggingFaceCommercial OK

HunyuanDiT

Tencent

1.5Bimage0.8GB Q4

Tencent's bilingual DiT model

image-generationchinese-textbilingual
HuggingFaceCommercial OK

Whisper Base

OpenAI

0.07Baudio0.0GB Q4

Smallest Whisper for real-time on CPU

speech-to-texttinyfast
HuggingFaceCommercial OK

Bark Small

Suno

0.1Baudio0.1GB Q4

Smaller Bark for faster generation

text-to-speechlightweight
HuggingFaceCommercial OK

AudioGen Medium

Meta

1.5Baudio0.8GB Q4

Generate sound effects and ambient audio from text

sound-generationtext-to-audiosound-effects
HuggingFace

MARS5 TTS

CAMB.AI

0.4Baudio0.2GB Q4

Novel TTS with fine-grained prosody control

text-to-speechvoice-cloningprosody-control
HuggingFaceCommercial OK

Orca 2 13B

Microsoft

13Bllm7.2GB Q4

Microsoft's reasoning-focused model

reasoningexplanation
OllamaHuggingFace
MMLU:60%

StableLM 2 12B

Stability AI

12Bllm6.6GB Q4

Stability AI's flagship chat model

chatmultilingual
OllamaHuggingFaceCommercial OK

Granite 8B Code

IBM

8Bllm4.4GB Q4

Efficient IBM code model

codingefficient
OllamaHuggingFaceCommercial OK

Jamba 1.5 Mini

AI21 Labs

52Bllm28.6GB Q4

Smaller Jamba with same 256K context

long-contextefficient
HuggingFaceCommercial OK

CodeGemma 2B

Google

2Bllm1.1GB Q4

Tiny CodeGemma for embedded systems

codinglightweightedge
OllamaHuggingFaceCommercial OK
HumanEval:30%

WizardCoder 15B

WizardLM

15Bllm8.3GB Q4

Efficient WizardCoder for consumer hardware

codinginstruction-following
OllamaHuggingFaceCommercial OK
HumanEval:57.3%

Yi 1.5 6B

01.AI

6Bllm3.3GB Q4

Compact bilingual model for edge

lightweightmultilingual
OllamaHuggingFaceCommercial OK

Grok 2 Mini

xAI

28Bllm15.4GB Q4

Fast Grok for simple tasks

fast-inferencechat
Commercial OK

InternLM 2.5 7B

Shanghai AI Lab

7Bllm3.9GB Q4

Efficient Chinese-English model

efficientmath
OllamaHuggingFaceCommercial OK

WizardLM 2 8x22B

Microsoft

176Bllm96.8GB Q4

Microsoft's MoE model for complex reasoning

reasoningcodingmath
HuggingFaceCommercial OK
MMLU:78%

StarCoder2 3B

BigCode

3Bllm1.7GB Q4

Smallest StarCoder2 for edge deployment

code-completionlightweight
OllamaHuggingFaceCommercial OK
HumanEval:31.7%

Orca 2 7B

Microsoft

7Bllm3.9GB Q4

Smaller Orca for edge deployment

reasoninglightweight
OllamaHuggingFace

StableLM 2 1.6B

Stability AI

1.6Bllm0.9GB Q4

Smallest StableLM for mobile/edge

chattinyedge
OllamaHuggingFaceCommercial OK

SmolLM 360M

Hugging Face

0.36Bllm0.2GB Q4

Tiny model for embedded systems

tinyedgemobile
OllamaHuggingFaceCommercial OK

Amazon Titan Text Express

Amazon

30Bllm16.5GB Q4

Amazon's general-purpose LLM via Bedrock

generalenterprise
Commercial OK

DeepSeek Coder 1.3B

DeepSeek

1.3Bllm0.7GB Q4

Tiny coding model for embedded systems

codinglightweightedge
OllamaHuggingFaceCommercial OK
HumanEval:45%

StarCoder2 3B

BigCode

3Bllm1.7GB Q4

Compact StarCoder for edge deployment

codinglightweight
OllamaHuggingFaceCommercial OK
HumanEval:28%

MPT 7B Instruct

MosaicML

7Bllm3.9GB Q4

Efficient model with 65K context support

instruction-followinglong-context
OllamaHuggingFaceCommercial OK

RWKV-6 World 7B

RWKV Foundation

7Bllm3.9GB Q4

Linear attention model with infinite context potential

long-contextefficientlinear-complexity
HuggingFaceCommercial OK

Falcon 180B

TII

180Bllm99.0GB Q4

TII's largest open model

multilingualreasoning
HuggingFaceCommercial OK
MMLU:70.4%

Baichuan 2 13B

Baichuan

13Bllm7.2GB Q4

Strong Chinese chat model

chinesemultilingual
OllamaHuggingFaceCommercial OK

MPT 30B Chat

MosaicML

30Bllm16.5GB Q4

MosaicML's chat model with 8K context

chatlong-context
OllamaHuggingFaceCommercial OK

BLOOM 176B

BigScience

176Bllm96.8GB Q4

Largest open multilingual model - 46 languages

multilingualresearch
HuggingFaceCommercial OK

Falcon 40B

TII

40Bllm22.0GB Q4

Strong open multilingual model

multilingual
OllamaHuggingFaceCommercial OK

Dolly v2 12B

Databricks

12Bllm6.6GB Q4

First commercially usable instruction-tuned model

instruction-followingcommercial
HuggingFaceCommercial OK

RedPajama INCITE 7B

Together

7Bllm3.9GB Q4

Open reproduction of LLaMA trained on RedPajama

chatinstruction-following
HuggingFaceCommercial OK

BLOOMZ 7B1

BigScience

7Bllm3.9GB Q4

Instruction-tuned BLOOM

multilingualinstruction-following
HuggingFaceCommercial OK

OpenLLaMA 13B

OpenLM

13Bllm7.2GB Q4

Fully open reproduction of LLaMA

generalopen-source
HuggingFaceCommercial OK

Pythia 12B

EleutherAI

12Bllm6.6GB Q4

Research model with full training checkpoints

researchinterpretability
HuggingFaceCommercial OK

OpenLLaMA 7B

OpenLM

7Bllm3.9GB Q4

Smaller OpenLLaMA variant

generalopen-source
HuggingFaceCommercial OK

Pythia 6.9B

EleutherAI

6.9Bllm3.8GB Q4

Smaller Pythia variant

researchinterpretability
HuggingFaceCommercial OK

Cerebras GPT 13B

Cerebras

13Bllm7.2GB Q4

Compute-optimal GPT trained by Cerebras

researchcompute-optimal
HuggingFaceCommercial OK
Showing 30 of 190 models. Refine your search to see more specific results.