AI Engineering

October 5, 2024

Vector Databases Explained: Choosing the Right One for Your Use Case

What Vector Databases Actually Do

A vector database stores and searches high-dimensional vectors — numerical representations of content (text, images, audio, code) produced by embedding models.

When you run a piece of text through an embedding model, you get back a vector: an array of typically 768 to 3,072 floating-point numbers. These numbers capture the semantic meaning of the text in a form that machines can compare mathematically.

The key operation: given a query vector, find the N stored vectors most similar to it. This is approximate nearest neighbor (ANN) search. Vector databases are optimized for this operation at scale — supporting millions to billions of vectors with sub-second query latency.

This is the infrastructure layer under RAG (retrieval-augmented generation), semantic search, recommendation systems, and any application that needs to find "things similar to this thing."

When You Need a Dedicated Vector Database

The first question is whether you need a dedicated vector database at all.

pgvector is a Postgres extension that adds vector storage and ANN search. If you're already using Postgres and your vector collection is under 1-10 million vectors, pgvector is often the right answer. You get:

No additional infrastructure to operate
Joins between vector results and your relational data (simple, no cross-database queries)
All the operational maturity of Postgres (backups, monitoring, access control)
Good enough query performance for most applications

The performance ceiling of pgvector is real: at tens of millions of vectors, query latency and indexing overhead become significant. But reaching that scale is a good problem to have, and you can migrate to a dedicated vector database when you need to.

Use a dedicated vector database when:

You have more than 10 million vectors and need sub-10ms query latency at scale
Your vector collection is the primary data (not a feature of a relational dataset)
You need features pgvector doesn't support: multi-tenancy at the vector level, advanced filtering, built-in hybrid search
You're running on infrastructure where Postgres isn't available

The Main Options Compared

Pinecone

The original managed vector database. Fully serverless — you don't manage any infrastructure.

Strengths: Easiest to get started, fully managed, auto-scaling, reliable at scale (many production deployments).

Weaknesses: Expensive at scale compared to self-hosted alternatives. No self-hosting option. Less flexibility in indexing configuration.

Best for: Teams that want to move fast without managing infrastructure, and where cost is secondary to speed and simplicity. Strong choice for the first production deployment.

Weaviate

Full-featured, open-source vector database with a self-hosting option and a managed cloud offering.

Strengths: Built-in hybrid search (vector + BM25 keyword search in a single query), GraphQL API, multi-modal support (vectors for text and images in the same collection), good documentation, active community.

Weaknesses: More complex to configure and operate than Pinecone. Self-hosted requires more operational knowledge.

Best for: Applications that need hybrid search (a strong feature for RAG), teams comfortable with self-hosting who want to avoid managed DB costs at scale, multi-modal applications.

Qdrant

Written in Rust for performance. Available as managed cloud and self-hosted (including Docker).

Strengths: Extremely fast query performance. Good filtering capabilities (filter on payload fields alongside vector search). Reasonable pricing on the managed offering. Rust implementation means low memory footprint and stable performance under load.

Weaknesses: Smaller community than Pinecone or Weaviate. Less mature enterprise features.

Best for: Performance-critical applications. Teams comfortable with self-hosting. Applications where cost efficiency at scale matters. Strong technical teams who want a high-performance self-hosted option.

Chroma

Developer-focused, open-source vector database. Designed for ease of use and getting started quickly.

Strengths: Easiest setup and usage of any option. Excellent for development and prototyping. Simple Python API that many developers find intuitive.

Weaknesses: Production readiness is limited compared to the others — Chroma is best for development and small production workloads. Less scale-tested than Pinecone, Weaviate, or Qdrant.

Best for: Early prototyping. Development environments. Learning vector database concepts before choosing a production system.

The Decision Framework

Step 1: Start with pgvector

If you're building a new application and expect under 1 million vectors initially, use pgvector. It's available in Supabase, Neon, and most managed Postgres services. Defer the vector database decision until you actually need it.

Step 2: When you need a dedicated vector database

Want fully managed, maximum simplicity, cost is secondary → Pinecone
Need hybrid search, want open-source, comfortable with configuration → Weaviate
Need maximum performance, want self-hosted, technical team → Qdrant
Prototyping or learning → Chroma

Step 3: Evaluate with your actual data

Benchmarks are useful starting points but your data is what matters. Before committing to a vector database in production, test it with a representative subset of your actual vectors and queries. Measure:

Query latency at your expected vector count
Index build time for your vector dimensions
Cost at your expected monthly query volume
How filtering behaves with your metadata patterns

Hybrid Search: Often the Right Answer

For RAG applications specifically, pure vector search often underperforms hybrid search — a combination of semantic vector search and traditional keyword (BM25) search.

Vector search finds semantically similar content but can miss exact term matches. Keyword search finds exact matches but misses semantic similarity. Hybrid search uses both and combines the results (typically via RRF — reciprocal rank fusion).

For user-facing search over technical documents (product documentation, legal documents, code), hybrid search typically outperforms pure vector search significantly. Weaviate has the best native hybrid search support; Pinecone added hybrid search but it requires managing a separate keyword index.

If your use case involves any kind of search that users interact with directly, evaluate hybrid search as your baseline rather than pure vector search.

Meet the author

Tequity Team