Name: RunLocalAI local LLM benchmark dataset
Creator: RunLocalAI
License: https://creativecommons.org/licenses/by/4.0/

Selected coverage slice

22 measured or reviewed / 0 estimated / 0 wanted / 6 unstarted

78.6% of 28 selected cells measured or reviewed

Selected benchmark coverage by hardware and model. Measured, reviewed, estimated, and wanted cells are visually distinct.
Hardware / Model	Trendyol LLM Asure	Kumru 2B	Mistral Turkish v2	Turkcell LLM 7B v1	YTU Turkish Gemma	RefinedNeuro RN TR	RefinedNeuro RN TR	Malhajar Mistral 7	Llama 3.1 8B Instr	Qwen 2.5 Coder 14B	Llama 3.2 1B Instr	CodeGemma 7B	DeepSeek Coder V2	DeepSeek R1 Distil
NVIDIA GeForce RTX 3080 16GB	43Trendyol LLM Asure 12B on NVIDIA GeForce RTX 3080 16GB (Mobile): RunLocalAI measured 43.4 tok/s, high confidence	174Kumru 2B on NVIDIA GeForce RTX 3080 16GB (Mobile): RunLocalAI measured 174.2 tok/s, high confidence	107Mistral Turkish v2 (brooqs) on NVIDIA GeForce RTX 3080 16GB (Mobile): RunLocalAI measured 106.8 tok/s, high confidence	86Turkcell LLM 7B v1 on NVIDIA GeForce RTX 3080 16GB (Mobile): RunLocalAI measured 85.8 tok/s, high confidence	66YTU Turkish Gemma 9B v0.1 on NVIDIA GeForce RTX 3080 16GB (Mobile): RunLocalAI measured 66.0 tok/s, high confidence	80RefinedNeuro RN TR R1 on NVIDIA GeForce RTX 3080 16GB (Mobile): RunLocalAI measured 79.9 tok/s, high confidence	79RefinedNeuro RN TR R2 on NVIDIA GeForce RTX 3080 16GB (Mobile): RunLocalAI measured 79.3 tok/s, high confidence	87Malhajar Mistral 7B Turkish on NVIDIA GeForce RTX 3080 16GB (Mobile): RunLocalAI measured 87.3 tok/s, high confidence	Llama 3.1 8B Instruct on NVIDIA GeForce RTX 3080 16GB (Mobile): not started	Qwen 2.5 Coder 14B Instruct on NVIDIA GeForce RTX 3080 16GB (Mobile): not started	190Llama 3.2 1B Instruct on NVIDIA GeForce RTX 3080 16GB (Mobile): RunLocalAI measured 189.5 tok/s, high confidence	81CodeGemma 7B on NVIDIA GeForce RTX 3080 16GB (Mobile): RunLocalAI measured 80.6 tok/s, high confidence	152DeepSeek Coder V2 Lite (16B) on NVIDIA GeForce RTX 3080 16GB (Mobile): RunLocalAI measured 152.0 tok/s, high confidence	80DeepSeek R1 Distill Qwen 7B on NVIDIA GeForce RTX 3080 16GB (Mobile): RunLocalAI measured 80.3 tok/s, high confidence
NVIDIA GeForce RTX 5080	82Trendyol LLM Asure 12B on NVIDIA GeForce RTX 5080: RunLocalAI measured 82.0 tok/s, high confidence	444Kumru 2B on NVIDIA GeForce RTX 5080: RunLocalAI measured 443.7 tok/s, high confidence	161Mistral Turkish v2 (brooqs) on NVIDIA GeForce RTX 5080: RunLocalAI measured 161.1 tok/s, high confidence	145Turkcell LLM 7B v1 on NVIDIA GeForce RTX 5080: RunLocalAI measured 145.1 tok/s, high confidence	101YTU Turkish Gemma 9B v0.1 on NVIDIA GeForce RTX 5080: RunLocalAI measured 101.1 tok/s, high confidence	134RefinedNeuro RN TR R1 on NVIDIA GeForce RTX 5080: RunLocalAI measured 133.6 tok/s, high confidence	133RefinedNeuro RN TR R2 on NVIDIA GeForce RTX 5080: RunLocalAI measured 133.4 tok/s, high confidence	130Malhajar Mistral 7B Turkish on NVIDIA GeForce RTX 5080: RunLocalAI measured 130.4 tok/s, high confidence	136Llama 3.1 8B Instruct on NVIDIA GeForce RTX 5080: RunLocalAI measured 135.6 tok/s, high confidence	79Qwen 2.5 Coder 14B Instruct on NVIDIA GeForce RTX 5080: RunLocalAI measured 79.0 tok/s, high confidence	Llama 3.2 1B Instruct on NVIDIA GeForce RTX 5080: not started	CodeGemma 7B on NVIDIA GeForce RTX 5080: not started	DeepSeek Coder V2 Lite (16B) on NVIDIA GeForce RTX 5080: not started	DeepSeek R1 Distill Qwen 7B on NVIDIA GeForce RTX 5080: not started

RunLocalAI measuredReproducedVendor-publishedCommunity-reviewedLow/unverified measuredEstimatedCritical wantedWantedNot started

Fast measured/reviewed cells

Model	Hardware	Provenance	Quant	Ctx	Tokens / sec	TTFT	Date
Turkcell LLM 7B v1	NVIDIA GeForce RTX 3080 16GB (Mobile)	EditorialM	Q4_K_M	4K	85.8tok/s	96 ms	Jun 2, 26
Hermes 3 Llama 3.1 8B	NVIDIA GeForce RTX 3080 16GB (Mobile)	EditorialM	Q4_K_M	4K	81.5tok/s	357 ms	Jun 2, 26
Malhajar Mistral 7B Turkish	NVIDIA GeForce RTX 3080 16GB (Mobile)	EditorialM	Q4_K_M	4K	87.3tok/s	74 ms	Jun 2, 26
Llama 3.2 11B Vision Instruct	NVIDIA GeForce RTX 3080 16GB (Mobile)	EditorialM	Q4_K_M	4K	67.0tok/s	411 ms	Jun 2, 26
Mistral 7B Instruct v0.3	NVIDIA GeForce RTX 3080 16GB (Mobile)	EditorialM	Q4_K_M	4K	89.6tok/s	80 ms	Jun 2, 26
Mistral Nemo 12B Instruct	NVIDIA GeForce RTX 3080 16GB (Mobile)	EditorialM	Q4_K_M	4K	65.7tok/s	367 ms	Jun 2, 26
Phi-3.5 Mini Instruct	NVIDIA GeForce RTX 3080 16GB (Mobile)	EditorialM	Q4_K_M	4K	155.4tok/s	66 ms	Jun 2, 26
Phi-4 Reasoning 14B	NVIDIA GeForce RTX 3080 16GB (Mobile)	EditorialM	Q4_K_M	4K	40.4tok/s	226 ms	Jun 2, 26
Qwen 2.5 7B Instruct	NVIDIA GeForce RTX 3080 16GB (Mobile)	EditorialM	Q4_K_M	4K	80.4tok/s	335 ms	Jun 2, 26
Qwen 3 14B	NVIDIA GeForce RTX 3080 16GB (Mobile)	EditorialM	Q4_K_M	4K	38.3tok/s	310 ms	Jun 2, 26
Qwen 3 4B	NVIDIA GeForce RTX 3080 16GB (Mobile)	EditorialM	Q4_K_M	4K	103.7tok/s	303 ms	Jun 2, 26
RefinedNeuro RN TR R1	NVIDIA GeForce RTX 3080 16GB (Mobile)	EditorialM	Q4_K_M	4K	79.9tok/s	361 ms	Jun 2, 26
RefinedNeuro RN TR R2	NVIDIA GeForce RTX 3080 16GB (Mobile)	EditorialM	Q4_K_M	4K	79.3tok/s	366 ms	Jun 2, 26
Llama 3.2 1B Instruct	NVIDIA GeForce RTX 3080 16GB (Mobile)	EditorialM	Q4_K_M	4K	189.5tok/s	359 ms	Jun 2, 26
Kumru 2B	NVIDIA GeForce RTX 3080 16GB (Mobile)	EditorialM	Q4_K_M	4K	174.2tok/s	129 ms	Jun 2, 26
Trendyol LLM Asure 12B	NVIDIA GeForce RTX 3080 16GB (Mobile)	EditorialM	Q4_K_M	4K	43.4tok/s	391 ms	Jun 2, 26
YTU Turkish Gemma 9B v0.1	NVIDIA GeForce RTX 3080 16GB (Mobile)	EditorialM	Q4_K_M	4K	66.0tok/s	369 ms	Jun 2, 26
Mistral Turkish v2 (brooqs)	NVIDIA GeForce RTX 3080 16GB (Mobile)	EditorialM	Q4_K_M	4K	106.8tok/s	73 ms	Jun 2, 26
CodeGemma 7B	NVIDIA GeForce RTX 3080 16GB (Mobile)	EditorialM	Q4_K_M	4K	80.6tok/s	383 ms	Jun 2, 26
DeepSeek Coder V2 Lite (16B)	NVIDIA GeForce RTX 3080 16GB (Mobile)	EditorialM	Q4_K_M	4K	152.0tok/s	211 ms	Jun 2, 26
DeepSeek R1 Distill Qwen 7B	NVIDIA GeForce RTX 3080 16GB (Mobile)	EditorialM	Q4_K_M	4K	80.3tok/s	300 ms	Jun 2, 26
Gemma 2 9B Instruct	NVIDIA GeForce RTX 3080 16GB (Mobile)	EditorialM	Q4_K_M	4K	68.2tok/s	358 ms	Jun 2, 26
Gemma 3 12B	NVIDIA GeForce RTX 3080 16GB (Mobile)	EditorialM	Q4_K_M	4K	43.3tok/s	767 ms	Jun 2, 26
Gemma 3 1B	NVIDIA GeForce RTX 3080 16GB (Mobile)	EditorialM	Q4_K_M	4K	160.4tok/s	790 ms	Jun 2, 26
Gemma 3 4B	NVIDIA GeForce RTX 3080 16GB (Mobile)	EditorialM	Q4_K_M	4K	97.7tok/s	743 ms	Jun 2, 26
Gemma 4 E2B (Effective 2B)	NVIDIA GeForce RTX 3080 16GB (Mobile)	EditorialM	Q4_K_M	4K	99.1tok/s	792 ms	Jun 2, 26
Gemma 4 E4B (Effective 4B)	NVIDIA GeForce RTX 3080 16GB (Mobile)	EditorialM	Q4_K_M	4K	78.1tok/s	790 ms	Jun 2, 26
Trendyol LLM Asure 12B	NVIDIA GeForce RTX 5080	EditorialM	Q4_K_M	4K	82.0tok/s	136 ms	May 28, 26
Llama 3.1 8B Instruct	NVIDIA GeForce RTX 5080	EditorialM	Q4_K_M	4K	135.6tok/s	130 ms	May 28, 26
Qwen 2.5 Coder 14B Instruct	NVIDIA GeForce RTX 5080	EditorialM	Q4_K_M	4K	79.0tok/s	117 ms	May 28, 26
Kumru 2B	NVIDIA GeForce RTX 5080	EditorialM	Q4_K_M	2K	443.7tok/s	—	May 28, 26
Mistral Turkish v2 (brooqs)	NVIDIA GeForce RTX 5080	EditorialM	Q4_0	2K	161.1tok/s	—	May 28, 26
Turkcell LLM 7B v1	NVIDIA GeForce RTX 5080	EditorialM	Q4_K_M	2K	145.1tok/s	—	May 28, 26
YTU Turkish Gemma 9B v0.1	NVIDIA GeForce RTX 5080	EditorialM	Q4_K_M	2K	101.1tok/s	—	May 28, 26
Trendyol LLM Asure 12B	NVIDIA GeForce RTX 5080	EditorialM	unknown	2K	79.1tok/s	—	May 28, 26
RefinedNeuro RN TR R1	NVIDIA GeForce RTX 5080	EditorialM	Q4_K_M	2K	133.6tok/s	—	May 28, 26
RefinedNeuro RN TR R2	NVIDIA GeForce RTX 5080	EditorialM	Q4_K_M	2K	133.4tok/s	—	May 28, 26
Malhajar Mistral 7B Turkish	NVIDIA GeForce RTX 5080	EditorialM	Q5_K_M	2K	130.4tok/s	—	May 28, 26
Trendyol LLM Asure 12B	NVIDIA GeForce RTX 5080	EditorialM	Q4_K_M	8K	61.5tok/s	323 ms	May 27, 26

Local LLM benchmarks

22 measured or reviewed / 0 estimated / 0 wanted / 6 unstarted

Public evidence split

How strong is the visible provenance?

How fresh is the dataset, month by month?

Throughput leaderboard

Latest 39 runs