llama
8B parameters
Commercial OK
Reviewed May 2026

RefinedNeuro RN TR R2

RefinedNeuro RN TR R2 is an Apache-2.0 Llama-family 8B model distributed on Hugging Face and Ollama. It is measured alongside R1 to compare same-size RefinedNeuro variants on local RTX 5080 inference.

License: Apache-2.0·Context: 8,192 tokens

Overview

RefinedNeuro RN TR R2 is an Apache-2.0 Llama-family 8B model distributed on Hugging Face and Ollama. It is measured alongside R1 to compare same-size RefinedNeuro variants on local RTX 5080 inference.

Strengths

  • Apache-2.0 license
  • Llama-family runtime compatibility
  • Stable measured throughput across five runs

Weaknesses

  • Low public adoption signal at intake
  • Not tagged as a Turkish-specific model in the sweep
  • Needs quality evaluation before capability claims

Quantization variants

Each quantization trades model quality for file size and VRAM. Q4_K_M is the most popular starting point.

QuantizationFile sizeVRAM required
Q4_K_M4.9 GB7 GB

Get the model

Ollama

One-line install

ollama run RefinedNeuro/RN_TR_R2:latestRead our Ollama review →

HuggingFace

Original weights

huggingface.co/RefinedNeuro/RN_TR_R2

Source repository — direct quantization required.

Benchmarks

Real measurements on real hardware. Numbers ship with the runner version, quant, and date.

2 runs on record
HardwareProvenanceQuantCtxTokens / secVRAMTTFTDate
NVIDIA GeForce RTX 5080
EditorialM
Q4_K_M2K
133.4tok/s
May 28, 26
NVIDIA GeForce RTX 3080 16GB (Mobile)
EditorialM
Q4_K_M4K
79.3tok/s
366 msJun 2, 26

What to do next

Got this model running on real hardware? Share what you measured — the form arrives with the model pre-selected.

Hardware that runs this

Cards with enough VRAM for at least one quantization of RefinedNeuro RN TR R2.

Compare alternatives

Models worth comparing

Same parameter band, plus what's one tier above and below — so you can decide what actually fits your hardware.

Frequently asked

What's the minimum VRAM to run RefinedNeuro RN TR R2?

7GB of VRAM is enough to run RefinedNeuro RN TR R2 at the Q4_K_M quantization (file size 4.9 GB). Higher-quality quantizations need more.

Can I use RefinedNeuro RN TR R2 commercially?

Yes — RefinedNeuro RN TR R2 ships under the Apache-2.0, which permits commercial use. Always read the license text before deployment.

What's the context length of RefinedNeuro RN TR R2?

RefinedNeuro RN TR R2 supports a context window of 8,192 tokens (about 8K).

How do I install RefinedNeuro RN TR R2 with Ollama?

Run `ollama pull RefinedNeuro/RN_TR_R2:latest` to download, then `ollama run RefinedNeuro/RN_TR_R2:latest` to start a chat session. The default quantization is Q4_K_M.

Source: huggingface.co/RefinedNeuro/RN_TR_R2

Reviewed by RunLocalAI Editorial. See our editorial policy for how we research and verify model claims.

Related — keep moving

Before you buy

Verify RefinedNeuro RN TR R2 runs on your specific hardware before committing money.