Llama 3.1 8B Instruct on NVIDIA GeForce RTX 4090 — local inference guide