Index types beyond B-tree

A B-tree is the right index for =, <, >, BETWEEN, and ORDER BY on a scalar column — which covers most queries. But B-trees index one comparable value per row, and some columns don't fit that shape: a jsonb blob, an array of tags, a full-text document, a huge append-only log. Postgres ships several other access methods, plus a few tricks that reshape any index. This lesson surveys them, each with a query it wins.

The seed is a docs table of five million rows: a tags array, a jsonb meta column, a lopsided status, mixed-case email, an owner_id, and a created_at inserted in time order. Big enough that each index's win shows up in the plan and in the timing. Take a look:

sql

SELECT id, owner_id, status, email, tags, meta, created_at FROM docs ORDER BY id LIMIT 5;

GIN: many values in one row

A B-tree can't answer "which docs have the tag gin" — the whole array is one opaque value to it. GIN (Generalized Inverted Index) solves the "many values per row" problem the way a book index does: it stores each element (each tag, each jsonb key, each lexeme) once and points back to the rows containing it.

Index the tags array and ask for containment:

sql

CREATE INDEX docs_tags_gin ON docs USING gin (tags);

sql

EXPLAIN ANALYZE SELECT id FROM docs WHERE tags @> ARRAY['gin'];

GIN: many values in one row

GiST: ranges, geometry, and nearest-neighbor

BRIN: tiny indexes for huge, ordered tables

Hash: equality only

Partial indexes: index only the rows you query

Expression indexes: index a computed value

Covering indexes: skip the heap with INCLUDE

Which index should I use?

What you learned