Weights & Biases

Weights & Biases (W&B) is a cloud-based MLOps platform for tracking experiments, visualizing metrics, and managing model artifacts. Operators use it to log training runs (loss, accuracy, hyperparameters) and compare results across configurations. It integrates with Hugging Face Transformers, PyTorch, and other frameworks via a Python SDK. For local AI operators, W&B is relevant when fine-tuning models on consumer hardware—logging runs to a cloud dashboard helps track overfitting, learning rate schedules, and VRAM usage without manual note-taking. The free tier is limited but sufficient for individual projects.

An operator fine-tuning Llama 3.2 3B on an RTX 4090 can use wandb.init() in their training script to log training loss and GPU memory every 10 steps. The W&B dashboard shows a live plot of loss curves across runs, making it easy to spot when a learning rate is too high (loss spikes) or when VRAM is near capacity (memory line flattens). This replaces manual CSV logging and terminal monitoring.

In a typical fine-tuning workflow with Hugging Face Transformers, the operator adds import wandb and wandb.init(project='llama-finetune') before the training loop. During trainer.train(), metrics like train/loss and eval/accuracy are automatically logged to the W&B project page. After training, the operator can compare multiple runs in the dashboard to select the best checkpoint for quantization with llama.cpp.

Reviewed by Fredoline Eruo. See our editorial policy.

When it doesn't work

Practical example

Workflow example