📚 Database Indexes

Indexes improve query performance by allowing databases to locate rows faster without scanning every record.

🧩 What is a Database Index?

An index is a data structure that helps locate rows quickly based on column values, similar to a book index.

Without an index → Full table scan (O(n))
With an index → Tree or hash lookup (O(log n) or O(1))
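
To make the scan-versus-lookup difference concrete, here is a minimal sketch using Python's built-in `sqlite3` module. The `users` table, the `idx_users_email` index name, and the `query_plan` helper are assumptions made up for this illustration. The exact wording of the plan text varies between SQLite versions, so treat the printed output as indicative only.

```python
import sqlite3

def query_plan(conn, sql, params=()):
    """Return SQLite's description of how it would execute `sql`."""
    rows = conn.execute("EXPLAIN QUERY PLAN " + sql, params).fetchall()
    # The fourth column of each row holds the human-readable plan detail.
    return " | ".join(row[3] for row in rows)

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE users (id INTEGER PRIMARY KEY, email TEXT, name TEXT)")
conn.executemany(
    "INSERT INTO users (email, name) VALUES (?, ?)",
    [(f"user{i}@example.com", f"User {i}") for i in range(1000)],
)

lookup = "SELECT * FROM users WHERE email = ?"
args = ("user500@example.com",)

# No index on email yet: the planner has to scan the table.
plan_before = query_plan(conn, lookup, args)

# After creating the index, the planner can search it instead.
conn.execute("CREATE INDEX idx_users_email ON users (email)")
plan_after = query_plan(conn, lookup, args)

print("before:", plan_before)
print("after: ", plan_after)
```

The first plan should describe a scan of `users`, and the second should mention `idx_users_email`.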

🧠 Common Index Data Structures

Type	Used By	Description	Time Complexity
B-Tree / B+ Tree	MySQL, Postgres	Balanced tree keeping keys sorted	O(log n)
Hash Index	Postgres, Redis	Hash table for exact matches	O(1) average
Bitmap Index	ClickHouse, Oracle	Bit arrays for low-cardinality data	Bit ops
Inverted Index	Elasticsearch	Term → document mapping	O(log n)
Geospatial Index	Spatial databases	Multi-dimensional data (2D/3D)	-
GiST / GIN / SP-GiST	PostgreSQL	Specialized trees for complex data	varies
Clustered Index	MySQL (InnoDB)	Data physically ordered by key	O(log n)
Non-Clustered Index	Most RDBMS	Separate index pointing to data	O(log n)

🔍 Index Types and Use Cases

1. B+ Tree Index

Use case: Range queries (>, <, BETWEEN), sorting, prefix match.
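Tech: Balanced tree; internal nodes hold only keys for navigation, while leaf nodes hold the data (or row pointers) and are linked in key order, so a range scan walks the leaves sequentially.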
Used by: MySQL, Postgres (default).
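
The range-query behaviour can be sketched without building a full tree. The example below is a simplification, not a real B+ tree: a sorted Python list stands in for the linked leaf level, and `bisect` plays the role of the descent from the root. The keys, row ids, and the `range_lookup` name are made up for illustration.

```python
from bisect import bisect_left, bisect_right

# (key, row_id) pairs kept sorted by key, standing in for the leaf level.
leaf_entries = sorted(
    [(42, "r7"), (17, "r2"), (88, "r9"), (23, "r4"), (61, "r1"), (35, "r3")]
)
keys = [key for key, _ in leaf_entries]

def range_lookup(low, high):
    """Return row ids whose key satisfies low <= key <= high, in key order."""
    start = bisect_left(keys, low)    # binary search to the first qualifying key
    end = bisect_right(keys, high)    # one past the last qualifying key
    # Everything in between is contiguous, like walking linked leaves.
    return [row_id for _, row_id in leaf_entries[start:end]]

print(range_lookup(20, 60))  # ['r4', 'r3', 'r7']
```

Because the keys are already ordered, the same structure serves `ORDER BY` and prefix matches: both reduce to finding a start position and reading forward.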

2. Hash Index

Use case: Exact match (=) lookups.
Tech: Hash table mapping key → row pointer.
Limitation: Not suitable for range queries.
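
A hash index can be approximated with a Python dictionary, which is itself a hash table. This is a minimal sketch: the `rows` data and the `find_by_email` helper are invented for the example, and list positions stand in for row pointers.

```python
from collections import defaultdict

rows = [
    {"id": 1, "email": "a@example.com"},
    {"id": 2, "email": "b@example.com"},
    {"id": 3, "email": "a@example.com"},
]

# key -> list of row positions (the "row pointers").
hash_index = defaultdict(list)
for position, row in enumerate(rows):
    hash_index[row["email"]].append(position)

def find_by_email(email):
    """Exact-match lookup: one hash probe, then follow the row pointers."""
    return [rows[pos] for pos in hash_index.get(email, [])]

print([r["id"] for r in find_by_email("a@example.com")])  # [1, 3]
```

The limitation is visible in the structure: the dictionary keeps no ordering between keys, so a condition such as `email > 'b'` would have to inspect every key.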

3. Bitmap Index

Use case: Low-cardinality columns like gender, status.
Tech: Bit vectors for each unique value; uses bitwise ops for filtering.
Used by: Analytics DBs like Oracle, ClickHouse.
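
The bit-vector idea can be sketched with plain Python integers, where bit `i` of a value's bitmap is set when row `i` holds that value. The column data and helper names below are assumptions for illustration. Real engines also compress their bitmaps, which this sketch ignores.

```python
from collections import defaultdict

def build_bitmap_index(values):
    """Map each distinct value to an int whose bit i is set when row i holds it."""
    bitmaps = defaultdict(int)
    for row_number, value in enumerate(values):
        bitmaps[value] |= 1 << row_number
    return bitmaps

def matching_rows(bitmap):
    """Expand a bitmap back into an ascending list of row numbers."""
    return [i for i in range(bitmap.bit_length()) if (bitmap >> i) & 1]

status = ["active", "inactive", "active", "active", "inactive"]
plan = ["free", "free", "pro", "free", "pro"]

status_idx = build_bitmap_index(status)
plan_idx = build_bitmap_index(plan)

# WHERE status = 'active' AND plan = 'free' becomes a single bitwise AND.
active_and_free = status_idx["active"] & plan_idx["free"]
print(matching_rows(active_and_free))  # [0, 3]

# WHERE status = 'inactive' OR plan = 'pro' becomes a bitwise OR.
inactive_or_pro = status_idx["inactive"] | plan_idx["pro"]
print(matching_rows(inactive_or_pro))  # [1, 2, 4]
```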

4. Inverted Index

Use case: Full-text search, tokenized matching.
Tech: Maps terms → list of document IDs containing them.
Used by: Elasticsearch, Solr.
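
The term → document mapping can be sketched in a few lines. This is a toy illustration only: the `documents`, the lowercase-and-split `tokenize` function, and the AND-only `search` function are assumptions. Search engines such as Elasticsearch add analysis steps (stemming, stop words) and relevance scoring that are not modelled here.

```python
import re
from collections import defaultdict

documents = {
    1: "Indexes speed up reads",
    2: "Hash indexes handle exact matches",
    3: "Full table scans read every row",
}

def tokenize(text):
    """Lowercase the text and split it into alphanumeric terms."""
    return re.findall(r"[a-z0-9]+", text.lower())

# term -> set of document ids containing that term (the posting list).
inverted = defaultdict(set)
for doc_id, text in documents.items():
    for term in tokenize(text):
        inverted[term].add(doc_id)

def search(query):
    """Return ids of documents containing every term in the query."""
    terms = tokenize(query)
    if not terms:
        return []
    postings = [inverted.get(term, set()) for term in terms]
    return sorted(set.intersection(*postings))

print(search("indexes"))       # [1, 2]
print(search("hash indexes"))  # [2]
```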

5. Geospatial Index

Use case: Spatial/geolocation queries.
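Tech: Partitions space so that nearby coordinates are grouped together, either into grid cells (GeoHashing, Quad Tree) or into bounding rectangles (R Tree).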
Types:
- GeoHashing
- Quad Tree
- R Tree
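
Of the three approaches listed, GeoHashing is the easiest to sketch. The encoder below follows the standard geohash scheme: it halves the longitude and latitude ranges alternately and emits one base-32 character per five bits. The `points` data and the `cell_index` dictionary are made up for illustration. A production system would also check neighbouring cells, because two nearby points can fall on opposite sides of a cell boundary.

```python
from collections import defaultdict

BASE32 = "0123456789bcdefghjkmnpqrstuvwxyz"

def geohash_encode(lat, lon, precision=6):
    """Encode a coordinate as a geohash string of `precision` characters."""
    lat_range = [-90.0, 90.0]
    lon_range = [-180.0, 180.0]
    chars = []
    bits = 0
    bit_count = 0
    use_lon = True  # bits alternate between axes, starting with longitude
    while len(chars) < precision:
        rng, value = (lon_range, lon) if use_lon else (lat_range, lat)
        mid = (rng[0] + rng[1]) / 2
        if value >= mid:
            bits = (bits << 1) | 1
            rng[0] = mid   # keep the upper half
        else:
            bits = bits << 1
            rng[1] = mid   # keep the lower half
        use_lon = not use_lon
        bit_count += 1
        if bit_count == 5:  # five bits make one base-32 character
            chars.append(BASE32[bits])
            bits = 0
            bit_count = 0
    return "".join(chars)

points = {
    "eiffel_tower": (48.8584, 2.2945),
    "louvre": (48.8606, 2.3376),
    "sydney_opera": (-33.8568, 151.2153),
}

# Index points by a short geohash prefix: one bucket per coarse grid cell.
cell_index = defaultdict(list)
for name, (lat, lon) in points.items():
    cell_index[geohash_encode(lat, lon, precision=3)].append(name)

print(dict(cell_index))
```

Points that share a cell share a key, so a "what is near here?" query starts as an ordinary prefix or equality lookup on the geohash string.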

6. GiST / GIN / SP-GiST (PostgreSQL)

Type	Use Case	Description
GiST	Ranges, geometric, fuzzy matching	Extensible balanced tree
GIN	Full-text search, JSONB keys	Multi-value indexing
SP-GiST	IP ranges, quadtrees	Space-partitioned indexing

7. Clustered vs Non-Clustered Index

Type	Description	Example
Clustered	Data physically ordered by index key (only one allowed).	Primary key in InnoDB
Non-Clustered	Separate structure pointing to data (many allowed).	Secondary indexes
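
The difference shows up in how many lookups a query needs. The sketch below is loosely modelled on how InnoDB secondary indexes store the primary key rather than a physical row address. All table contents and function names are hypothetical, and sorted Python lists stand in for the underlying trees.

```python
from bisect import bisect_left

# Clustered index: the rows themselves are stored in primary-key order.
clustered_rows = [
    (1, {"email": "dana@example.com", "name": "Dana"}),
    (2, {"email": "alex@example.com", "name": "Alex"}),
    (3, {"email": "cory@example.com", "name": "Cory"}),
]
primary_keys = [pk for pk, _ in clustered_rows]

# Non-clustered index: a separate sorted structure of (email, primary_key).
secondary_email = sorted((row["email"], pk) for pk, row in clustered_rows)
secondary_keys = [email for email, _ in secondary_email]

def get_by_pk(pk):
    """One lookup: the clustered index leads straight to the row."""
    i = bisect_left(primary_keys, pk)
    if i < len(primary_keys) and primary_keys[i] == pk:
        return clustered_rows[i][1]
    return None

def get_by_email(email):
    """Two lookups: secondary index -> primary key -> clustered index."""
    i = bisect_left(secondary_keys, email)
    if i == len(secondary_keys) or secondary_keys[i] != email:
        return None
    pk = secondary_email[i][1]  # first hop: find the primary key
    return get_by_pk(pk)        # second hop: fetch the actual row

print(get_by_email("alex@example.com"))
```

The second hop in `get_by_email` is the extra I/O cost usually attributed to non-clustered indexes.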

📊 Summary Table

Index Type	Use Case	Pros	Cons
B+ Tree	Range queries	Balanced, ordered	Slower than hash for exact match
Hash	Exact match	Fast lookup	No range, rehash overhead
Bitmap	Low-cardinality	Fast filters	Bad for frequent writes
Inverted	Full-text	Fast search	Large space
GiST/GIN/SP-GiST	Specialized	Versatile	Higher write cost
Clustered	Primary key	Fast range reads	Table reorder costly
Non-Clustered	Secondary	Multiple allowed	Extra I/O hop

💡 Key Takeaways

Indexes speed up reads but slow down writes, because every insert, update, or delete must also maintain each index on the table.
Choose index type based on query pattern:
- Equality → Hash
- Range → B+ Tree
- Analytics → Bitmap
- Full-text → Inverted
- Spatial → GeoHashing / Quad Tree / R Tree
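
The read/write trade-off in the first takeaway can be illustrated with a toy table that counts how many index structures each insert has to touch. `IndexedTable` and its fields are hypothetical names for this sketch. It models only the bookkeeping, not actual disk I/O.

```python
class IndexedTable:
    """A list of rows plus one dictionary-based index per indexed column."""

    def __init__(self, indexed_columns):
        self.rows = []
        self.indexes = {column: {} for column in indexed_columns}
        self.index_writes = 0  # how many index updates inserts have caused

    def insert(self, row):
        position = len(self.rows)
        self.rows.append(row)
        # Every index must be updated on every insert.
        for column, index in self.indexes.items():
            index.setdefault(row[column], []).append(position)
            self.index_writes += 1
        return position

unindexed = IndexedTable([])
indexed = IndexedTable(["email", "status"])

for i in range(3):
    row = {"email": f"user{i}@example.com", "status": "active"}
    unindexed.insert(row)
    indexed.insert(row)

print(unindexed.index_writes)  # 0
print(indexed.index_writes)    # 6
```

Each additional index adds one more structure to update per write, which is why indexing every column is rarely a good default.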
