Classical ML algorithms

UMAP

UMAP (Uniform Manifold Approximation and Projection) is a dimensionality reduction technique used to visualize high-dimensional data, such as embeddings from language models, in 2D or 3D. It preserves local structure better than t-SNE and is faster, making it practical for exploring clusters of model activations or text embeddings. Operators encounter UMAP when analyzing how a model internally represents concepts or when inspecting embedding spaces for retrieval-augmented generation (RAG) pipelines.

Deeper dive

UMAP constructs a high-dimensional graph representation of the data and then optimizes a low-dimensional layout to be as similar as possible. It balances local and global structure, often producing more meaningful visualizations than t-SNE. Key parameters include n_neighbors (controls local vs. global focus) and min_dist (controls how tightly points cluster). For operator use, UMAP is commonly applied to sentence embeddings from models like all-MiniLM-L6-v2 to visualize topic clusters or outlier queries. It runs on CPU or GPU; GPU acceleration via cuML can reduce runtime from minutes to seconds for large datasets.

Practical example

An operator running a RAG pipeline with 10,000 document chunks might use UMAP to visualize their embedding space. Using the all-MiniLM-L6-v2 model, embeddings are 384-dimensional. Applying UMAP with n_neighbors=15 and min_dist=0.1 reduces them to 2D. The resulting plot reveals clusters of related topics, helping the operator identify if certain queries fall into sparse regions where retrieval might fail.

Workflow example

In a Python notebook, the operator loads embeddings from a vector database, then runs import umap; reducer = umap.UMAP(n_neighbors=15, min_dist=0.1, n_components=2); embedding_2d = reducer.fit_transform(embeddings). They then plot with matplotlib or plotly. If using GPU, they might replace with from cuml import UMAP for faster execution. The visualization helps debug retrieval quality by showing whether query embeddings land near relevant document clusters.

Reviewed by Fredoline Eruo. See our editorial policy.

Buyer guides

When it doesn't work