RUNLOCALAIv38
->Will it run?Best GPUCompareTroubleshootStartLearnPulseModelsHardwareToolsBench
Run check
RUNLOCALAI

Independently operated catalog for local-AI hardware and software. Hand-written verdicts. Source-cited claims. Reproducible commands when we have them.

OP·Fredoline Eruo
DIR
  • Models
  • Hardware
  • Tools
  • Benchmarks
TOOLS
  • Will it run?
  • Compare hardware
  • Cost vs cloud
  • Choose my GPU
  • Prompting kits
  • Quick answers
REF
  • All buyer guides
  • Learn local AI
  • Methodology
  • Glossary
  • Errors KB
  • Trust
EDITOR
  • About
  • Author
  • How we make money
  • Editorial policy
  • Contact
LEGAL
  • Privacy
  • Terms
  • Sitemap
MAIL · MONTHLY DIGEST
Get monthly local AI changes
Monthly recap. No spam.
DISCLOSURE

Some links on this site are affiliate links (Amazon Associates and other first-class retailers). When you buy through them, we earn a small commission at no extra cost to you. Affiliate links do not influence our verdicts — there are cards we rate highly that we don't have affiliate relationships with, and cards that sell well that we refuse to recommend. Read more →

© 2026 runlocalai.coIndependently operated
RUNLOCALAI · v38
  1. >
  2. Benchmarks
  3. /Browse

Benchmark results browser

Every benchmark in the corpus, filterable by source, scenario, and reproducibility. Newer rows show cold-start vs steady-state, P5/P95 CI, tokens-per-watt, and accuracy when those fields were captured; legacy rows are labeled as rigor pending.

Total:39
Operator-measured:39
Marked reproduced:0
Source:
AllOperator-measuredCommunityVendor-published
Scenario:
AllSingle-stream2 concurrent4 concurrent
Reproducibility:
AllMarked reproduced
HardwareModelQuantTok/sRigorSourceDate
NVIDIA GeForce RTX 3080 16GB (Mobile)Turkcell LLM 7B v1Q4_K_M85.8
cold85.6 tok/ssteady85.8 tok/sCI85.4–86.1scnSingle-streamn5
Measured here2026-06-02
NVIDIA GeForce RTX 3080 16GB (Mobile)RefinedNeuro RN TR R2Q4_K_M79.3
cold78.7 tok/ssteady79.3 tok/sCI78.9–79.5scnSingle-streamn5
Measured here2026-06-02
NVIDIA GeForce RTX 3080 16GB (Mobile)RefinedNeuro RN TR R1Q4_K_M79.9
cold79.1 tok/ssteady79.9 tok/sCI79.5–80.8scnSingle-streamn5
Measured here2026-06-02
NVIDIA GeForce RTX 3080 16GB (Mobile)Qwen 3 4BQ4_K_M103.7
cold103.1 tok/ssteady103.7 tok/sCI101.3–106.1scnSingle-streamn5
Measured here2026-06-02
NVIDIA GeForce RTX 3080 16GB (Mobile)Qwen 3 14BQ4_K_M38.3
cold38.8 tok/ssteady38.3 tok/sCI38.2–38.4scnSingle-streamn5
Measured here2026-06-02
NVIDIA GeForce RTX 3080 16GB (Mobile)Qwen 2.5 7B InstructQ4_K_M80.4
cold80.7 tok/ssteady80.4 tok/sCI79.0–81.7scnSingle-streamn5
Measured here2026-06-02
NVIDIA GeForce RTX 3080 16GB (Mobile)Phi-4 Reasoning 14BQ4_K_M40.4
cold41.0 tok/ssteady40.4 tok/sCI39.7–40.5scnSingle-streamn5
Measured here2026-06-02
NVIDIA GeForce RTX 3080 16GB (Mobile)Phi-3.5 Mini InstructQ4_K_M155.4
cold154.7 tok/ssteady155.4 tok/sCI152.4–157.3scnSingle-streamn5
Measured here2026-06-02
NVIDIA GeForce RTX 3080 16GB (Mobile)Mistral Nemo 12B InstructQ4_K_M65.7
cold66.1 tok/ssteady65.7 tok/sCI65.3–66.0scnSingle-streamn5
Measured here2026-06-02
NVIDIA GeForce RTX 3080 16GB (Mobile)Mistral 7B Instruct v0.3Q4_K_M89.6
cold90.2 tok/ssteady89.6 tok/sCI87.9–91.2scnSingle-streamn5
Measured here2026-06-02
NVIDIA GeForce RTX 3080 16GB (Mobile)Llama 3.2 11B Vision InstructQ4_K_M67.0
cold67.2 tok/ssteady67.0 tok/sCI67.0–67.1scnSingle-streamn5
Measured here2026-06-02
NVIDIA GeForce RTX 3080 16GB (Mobile)Malhajar Mistral 7B TurkishQ4_K_M87.3
cold83.0 tok/ssteady87.3 tok/sCI82.7–89.6scnSingle-streamn5
Measured here2026-06-02
NVIDIA GeForce RTX 3080 16GB (Mobile)Hermes 3 Llama 3.1 8BQ4_K_M81.5
cold82.2 tok/ssteady81.5 tok/sCI81.3–81.8scnSingle-streamn5
Measured here2026-06-02
NVIDIA GeForce RTX 3080 16GB (Mobile)Gemma 4 E4B (Effective 4B)Q4_K_M78.1
cold79.3 tok/ssteady78.1 tok/sCI77.9–78.4scnSingle-streamn5
Measured here2026-06-02
NVIDIA GeForce RTX 3080 16GB (Mobile)Gemma 4 E2B (Effective 2B)Q4_K_M99.1
cold98.5 tok/ssteady99.1 tok/sCI98.1–101.0scnSingle-streamn5
Measured here2026-06-02
NVIDIA GeForce RTX 3080 16GB (Mobile)Gemma 3 4BQ4_K_M97.7
cold96.4 tok/ssteady97.7 tok/sCI97.2–98.2scnSingle-streamn5
Measured here2026-06-02
NVIDIA GeForce RTX 3080 16GB (Mobile)Gemma 3 1BQ4_K_M160.4
cold156.4 tok/ssteady160.4 tok/sCI159.7–162.0scnSingle-streamn5
Measured here2026-06-02
NVIDIA GeForce RTX 3080 16GB (Mobile)Gemma 3 12BQ4_K_M43.3
cold43.6 tok/ssteady43.3 tok/sCI43.2–43.4scnSingle-streamn5
Measured here2026-06-02
NVIDIA GeForce RTX 3080 16GB (Mobile)Gemma 2 9B InstructQ4_K_M68.2
cold69.4 tok/ssteady68.2 tok/sCI67.9–69.1scnSingle-streamn5
Measured here2026-06-02
NVIDIA GeForce RTX 3080 16GB (Mobile)DeepSeek R1 Distill Qwen 7BQ4_K_M80.3
cold80.3 tok/ssteady80.3 tok/sCI79.4–81.6scnSingle-streamn5
Measured here2026-06-02
NVIDIA GeForce RTX 3080 16GB (Mobile)DeepSeek Coder V2 Lite (16B)Q4_K_M152.0
cold151.2 tok/ssteady152.0 tok/sCI149.9–152.7scnSingle-streamn5
Measured here2026-06-02
NVIDIA GeForce RTX 3080 16GB (Mobile)CodeGemma 7BQ4_K_M80.6
cold80.2 tok/ssteady80.6 tok/sCI79.2–81.2scnSingle-streamn5
Measured here2026-06-02
NVIDIA GeForce RTX 3080 16GB (Mobile)Mistral Turkish v2 (brooqs)Q4_K_M106.8
cold100.9 tok/ssteady106.8 tok/sCI105.7–107.9scnSingle-streamn5
Measured here2026-06-02
NVIDIA GeForce RTX 3080 16GB (Mobile)YTU Turkish Gemma 9B v0.1Q4_K_M66.0
cold66.6 tok/ssteady66.0 tok/sCI65.7–66.8scnSingle-streamn5
Measured here2026-06-02
NVIDIA GeForce RTX 3080 16GB (Mobile)Trendyol LLM Asure 12BQ4_K_M43.4
cold43.7 tok/ssteady43.4 tok/sCI43.1–43.4scnSingle-streamn5
Measured here2026-06-02
NVIDIA GeForce RTX 3080 16GB (Mobile)Kumru 2BQ4_K_M174.2
cold171.9 tok/ssteady174.2 tok/sCI171.7–175.6scnSingle-streamn5
Measured here2026-06-02
NVIDIA GeForce RTX 3080 16GB (Mobile)Llama 3.2 1B InstructQ4_K_M189.5
cold189.9 tok/ssteady189.5 tok/sCI186.8–190.9scnSingle-streamn5
Measured here2026-06-02
NVIDIA GeForce RTX 5080Malhajar Mistral 7B TurkishQ5_K_M130.4
steady130.4 tok/sCI129.6–130.8scnSingle-streamn5
Measured here2026-05-28
NVIDIA GeForce RTX 5080RefinedNeuro RN TR R2Q4_K_M133.4
steady133.4 tok/sCI132.8–133.6scnSingle-streamn5
Measured here2026-05-28
NVIDIA GeForce RTX 5080RefinedNeuro RN TR R1Q4_K_M133.6
steady133.6 tok/sCI133.1–134.0scnSingle-streamn5
Measured here2026-05-28
NVIDIA GeForce RTX 5080Trendyol LLM Asure 12Bunknown79.1
steady79.1 tok/sCI78.4–79.6scnSingle-streamn5
Measured here2026-05-28
NVIDIA GeForce RTX 5080YTU Turkish Gemma 9B v0.1Q4_K_M101.1
steady101.1 tok/sCI100.6–101.6scnSingle-streamn5
Measured here2026-05-28
NVIDIA GeForce RTX 5080Turkcell LLM 7B v1Q4_K_M145.1
steady145.1 tok/sCI144.2–146.2scnSingle-streamn5
Measured here2026-05-28
NVIDIA GeForce RTX 5080Mistral Turkish v2 (brooqs)Q4_0161.1
steady161.1 tok/sCI159.9–161.7scnSingle-streamn5
Measured here2026-05-28
NVIDIA GeForce RTX 5080Kumru 2BQ4_K_M443.7
steady443.7 tok/sCI399.3–452.7scnSingle-streamn5
Measured here2026-05-28
NVIDIA GeForce RTX 5080Qwen 2.5 Coder 14B InstructQ4_K_M79.0
cold77.4 tok/ssteady79.0 tok/sCI78.5–79.1scnSingle-streamn5
Measured here2026-05-28
NVIDIA GeForce RTX 5080Llama 3.1 8B InstructQ4_K_M135.6
cold136.5 tok/ssteady135.6 tok/sCI134.5–137.1scnSingle-streamn5
Measured here2026-05-28
NVIDIA GeForce RTX 5080Trendyol LLM Asure 12BQ4_K_M82.0
cold81.7 tok/ssteady82.0 tok/sCI81.7–82.3scnSingle-streamn5
Measured here2026-05-28
NVIDIA GeForce RTX 5080Trendyol LLM Asure 12BQ4_K_M61.5
cold61.6 tok/ssteady61.5 tok/sCI61.5–61.6scnSingle-streamn3
Measured here2026-05-27

Showing up to 200 rows, newest first. See /resources/benchmark-protocol for what the rigor pills mean and how to reproduce any row.