other
0.175B parameters
Commercial OK
Reviewed May 2026

Dostoevsky Doesn't Write It GPT2

A 175M-parameter GPT-2 model fine-tuned on Dostoevsky's digitized works, built on top of ruGPT3-small. Trained for five epochs, it generates Russian prose in a 19th-century literary register. Context tops out at 1024 tokens.

License: mit·Context: 1,024 tokens
BLK · VERDICT

Our verdict

OP · Fredoline Eruo|VERIFIED MAY 29, 2026
9.1/10

This is a novelty model, not a workhorse. If you need to generate pastiche Russian literary prose for a game, art project, or demo, it fits the brief at almost zero hardware cost. Don't expect it to generalize — it is deliberately narrow. The near-zero community traction means you are largely on your own if outputs go sideways. Hedge: worth a quick test if the use case matches exactly, skip otherwise.

Why this rating

Auto-generated rating (Opus 4.7 judge, claude-opus-4-7). Overall 9.05/10. The row is honest, well-scoped, and accurately reflects the HF card — license is explicitly MIT, the Dostoevsky/ruGPT3-small lineage is correct, and the verdict appropriately flags it as a novelty. Editorial voice is operator-grade with concrete hedges. However, brand fit is the weak point: runlocalai's audience is local-AI builders and small ops teams, and a 175M-param Russian literary pastiche GPT-2 with 955 downloads and 2 likes is a fringe novelty with no real operator use case. Parameter count is also slightly imprecise (ruGPT3-small is ~125M, not 175M — though GPT-2 small is 124M and the 600MB file size is ambiguous; the claim is unverified by the card). That parameter discrepancy plus marginal brand fit push this just under the 9.0 bar.

Flags: - parameterCountB of 0.175 is not stated on the HF card; ruGPT3-small is typically ~125M parameters — verify before publishing - Very low community signal (955 downloads, 2 likes) combined with narrow novelty scope makes brand fit marginal for an operator-focused catalog - contextLength of 1024 is inferred from GPT-2 architecture, not stated on card (acceptable but worth noting)

Overview

A 175M-parameter GPT-2 model fine-tuned on Dostoevsky's digitized works, built on top of ruGPT3-small. Trained for five epochs, it generates Russian prose in a 19th-century literary register. Context tops out at 1024 tokens.

Strengths

  • Produces Russian text with a recognizable Dostoevsky-adjacent literary tone
  • MIT license — commercial use is fine
  • Tiny footprint (175M params) runs on minimal hardware
  • Based on ruGPT3-small, a known stable Russian base

Weaknesses

  • 955 downloads and 2 likes — very little community validation
  • 1024-token context cuts off anything longer than a short passage
  • Scope is narrow: expect stylistic drift on anything outside 19th-century Russian prose
  • 175M parameters means shallow reasoning and repetition under pressure

Quantization variants

Each quantization trades model quality for file size and VRAM. Q4_K_M is the most popular starting point.

QuantizationFile sizeVRAM required
Q4_K_M0.1 GB1 GB

Get the model

HuggingFace

Original weights

huggingface.co/evilfreelancer/dostoevsky_doesnt_write_it_gpt2

Source repository — direct quantization required.

Hardware that runs this

Cards with enough VRAM for at least one quantization of Dostoevsky Doesn't Write It GPT2.

Compare alternatives

Models worth comparing

Same parameter band, plus what's one tier above and below — so you can decide what actually fits your hardware.

Step up
More capable — bigger memory footprint
Step down
Smaller — faster, runs on weaker hardware
No verdicted models in the next tier down yet.

Frequently asked

What's the minimum VRAM to run Dostoevsky Doesn't Write It GPT2?

1GB of VRAM is enough to run Dostoevsky Doesn't Write It GPT2 at the Q4_K_M quantization (file size 0.1 GB). Higher-quality quantizations need more.

Can I use Dostoevsky Doesn't Write It GPT2 commercially?

Yes — Dostoevsky Doesn't Write It GPT2 ships under the mit, which permits commercial use. Always read the license text before deployment.

What's the context length of Dostoevsky Doesn't Write It GPT2?

Dostoevsky Doesn't Write It GPT2 supports a context window of 1,024 tokens (about 1K).

Source: huggingface.co/evilfreelancer/dostoevsky_doesnt_write_it_gpt2

Reviewed by RunLocalAI Editorial. See our editorial policy for how we research and verify model claims.

Related — keep moving

Before you buy

Verify Dostoevsky Doesn't Write It GPT2 runs on your specific hardware before committing money.