Is Qdrant, Weaviate, or Milvus faster for vector search?

In a 5M-vector test (768 dimensions, single 16-vCPU/64GB VM), Qdrant was fastest for pure similarity search at 8ms p95 and 2400 QPS, versus Weaviate at 12ms/1800 QPS and Milvus at 14ms/1200 QPS. Weaviate won on filtered queries (10ms p95, versus 15ms for Qdrant and 22ms for Milvus) and was the only one of the three with a hybrid vector+keyword result in this test (16ms p95).

Which vector database uses the least memory?

At 5M vectors (768 dimensions), Qdrant used 14GB of RAM, Weaviate used 18GB, and Milvus used 22GB, making Qdrant the most memory-efficient of the three.

When should I use SQLite FTS5 instead of a vector database?

Under roughly 10K documents, SQLite FTS5 often outperforms a vector DB because BM25 keyword matching handles most practical retrieval well, with sub-1ms latency, no server, and near-zero memory overhead. Switch to a vector DB above ~50K documents or when semantic similarity (not keyword matching) is required.

Is pgvector good enough, or do I need a dedicated vector database?

pgvector works well for teams that already run Postgres, handling up to roughly 1M vectors before performance degrades. For new projects or workloads above 1M vectors, a dedicated vector database performs better.

How much memory does a vector database need per million vectors?

At 768 dimensions, plan for roughly 3GB of memory per 1M vectors (assuming float32 storage, 768 values × 4 bytes ≈ 3KB per vector) and ~30GB for 10M vectors. Most production workloads fit comfortably on a single 32GB VM, while deployments above 100M vectors should plan for sharding.
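
The per-million figure can be reproduced from the raw vector size. The sketch below is a rough estimate only: it assumes float32 storage (4 bytes per dimension) and ignores index and payload overhead, and the function name `raw_vector_gb` is illustrative rather than part of any library.

```python
def raw_vector_gb(num_vectors: int, dims: int = 768, bytes_per_value: int = 4) -> float:
    """Raw storage for the vectors alone, in decimal GB (no index or payload overhead)."""
    return num_vectors * dims * bytes_per_value / 1e9


# Corpus sizes discussed in this article
for n in (1_000_000, 5_000_000, 10_000_000, 100_000_000):
    print(f"{n:>11,} vectors: {raw_vector_gb(n):.1f} GB")
```

Index structures add to this baseline and quantization can reduce it, so measured RAM for a given engine may land on either side of the estimate.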

Vector DB 2026 Selection

The vector DB space settled in 2026, with Qdrant, Weaviate, and Milvus dominating. This article tests all three on a 5M-vector workload and tells you when to use which, plus when to skip a vector DB entirely.

⚡ TL;DR

Qdrant: simplest setup, fastest single-node. Best for solo/small team RAG.

Weaviate: best hybrid search (vector + keyword + filters). Best for production with complex queries.

Milvus: best horizontal scaling. Best for billion-scale workloads.

Skip the vector DB under 10K docs: SQLite FTS5 often wins.

Test Setup

5M vectors, 768 dimensions (BGE-base embeddings)
Mix of similarity-only queries + filtered queries
Single VM: 16 vCPU, 64GB RAM, 1TB NVMe
100 concurrent clients
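
The article does not show its benchmark harness. As a minimal sketch of how p95 latency and throughput can be measured under concurrent clients, the code below uses a thread pool and a hypothetical `run_query` stand-in that only sleeps; swap in a real client call to test an actual database. The function names and the default query count are assumptions for illustration.

```python
import statistics
import time
from concurrent.futures import ThreadPoolExecutor


def run_query() -> None:
    """Hypothetical stand-in for one search request; replace with a real client call."""
    time.sleep(0.002)


def timed_query(_: int) -> float:
    """Run one query and return its latency in milliseconds."""
    start = time.perf_counter()
    run_query()
    return (time.perf_counter() - start) * 1000


def benchmark(total_queries: int = 500, clients: int = 100) -> dict:
    """Spread total_queries across `clients` worker threads; report p95 latency and QPS."""
    wall_start = time.perf_counter()
    with ThreadPoolExecutor(max_workers=clients) as pool:
        latencies = list(pool.map(timed_query, range(total_queries)))
    wall = time.perf_counter() - wall_start
    # quantiles(n=100) returns 99 cut points; index 94 is the 95th percentile
    p95 = statistics.quantiles(latencies, n=100)[94]
    return {"p95_ms": p95, "qps": total_queries / wall}


if __name__ == "__main__":
    print(benchmark())
```

Measuring latency per request and throughput over wall-clock time keeps the two numbers independent, which matters when reporting QPS at a latency ceiling such as p95 < 50ms.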

Results

Latency (p95, ms)

Workload	Qdrant	Weaviate	Milvus
Pure similarity (top 10)	8	12	14
Filtered similarity	15	10	22
Hybrid (vector + keyword)	N/A	16	N/A

Verdict: Qdrant is fastest for pure similarity. Weaviate is fastest on filtered queries and is the only one with a hybrid result in this test.

Throughput (queries/sec at p95 < 50ms)

Metric	Qdrant	Weaviate	Milvus
QPS	2400	1800	1200

Verdict: Qdrant is fastest on a single node. Milvus is built to scale out across nodes, which this single-VM test does not measure.

Memory at 5M vectors

Metric	Qdrant	Weaviate	Milvus
RAM used	14GB	18GB	22GB

Verdict: Qdrant most memory-efficient.

Setup time

Task	Qdrant	Weaviate	Milvus
Docker Compose	5 min	10 min	20 min
Production tuning	1-2 hrs	2-4 hrs	4-8 hrs

Verdict: Qdrant easiest. Milvus most complex.

When to Skip Vector DB Entirely

Under 10K documents, SQLite FTS5 often outperforms a vector DB, for these reasons:

BM25 keyword matching handles most practical retrieval well
Far simpler ops (one file, no server)
< 1ms query latency
No resident server process, so memory overhead is near zero

Try this first:

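import sqlite3

conn = sqlite3.connect("docs.db")
conn.execute("CREATE VIRTUAL TABLE IF NOT EXISTS docs USING fts5(title, content)")
conn.execute("INSERT INTO docs (title, content) VALUES (?, ?)", ("Intro", "vector search basics"))
conn.commit()
# MATCH runs the full-text query; ORDER BY rank sorts best BM25 match first
rows = conn.execute("SELECT title FROM docs WHERE docs MATCH ? ORDER BY rank", ("vector",)).fetchall()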

Above 50K documents, or when semantic similarity (not keyword matching) matters, switch to a vector DB.
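
To see what keyword retrieval covers before reaching for embeddings, here is a slightly fuller sketch that wraps FTS5 in a search function. It assumes the Python build ships SQLite with the FTS5 extension enabled (most do); the sample documents and the `search` helper are made up for illustration.

```python
import sqlite3

conn = sqlite3.connect(":memory:")  # in-memory so the sketch leaves no file behind
conn.execute("CREATE VIRTUAL TABLE docs USING fts5(title, content)")
conn.executemany(
    "INSERT INTO docs (title, content) VALUES (?, ?)",
    [
        ("Qdrant notes", "single node vector search with low latency"),
        ("Weaviate notes", "hybrid search combining vector and keyword filters"),
        ("Baking bread", "flour water salt and yeast"),
    ],
)


def search(query: str, limit: int = 5) -> list[str]:
    """Return titles of matching docs, best BM25 match first."""
    rows = conn.execute(
        # bm25() returns lower (more negative) scores for better matches
        "SELECT title FROM docs WHERE docs MATCH ? ORDER BY bm25(docs) LIMIT ?",
        (query, limit),
    )
    return [title for (title,) in rows]


print(search("vector"))              # both database notes match
print(search("hybrid AND keyword"))  # boolean operators narrow the match
```

A query such as "yeast" finds the bread document, but a query such as "sourdough" finds nothing, because FTS5 matches tokens rather than meaning. That gap is the signal that semantic search is worth its extra cost.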

Choosing Between the Three

Single-node, simple RAG, small team → Qdrant
Need hybrid search (vector + keyword + filters) → Weaviate
Multi-node, billion+ vectors → Milvus
Already have Postgres → pgvector (up to ~1M vectors)
< 10K docs → SQLite FTS5
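
The rules above can be written down as a small decision function. This is only a sketch of the article's rules of thumb: the function name, the parameters, the order in which the rules are checked, and the simplification that one document equals one vector are all assumptions.

```python
def pick_store(
    num_docs: int,
    needs_hybrid: bool = False,
    has_postgres: bool = False,
    multi_node: bool = False,
) -> str:
    """Map the article's rules of thumb to a recommendation (thresholds are approximate)."""
    if num_docs < 10_000:
        return "SQLite FTS5"  # small corpus: keyword search is often enough
    if multi_node or num_docs >= 1_000_000_000:
        return "Milvus"  # horizontal scaling, billion-scale workloads
    if needs_hybrid:
        return "Weaviate"  # vector + keyword + filters
    if has_postgres and num_docs <= 1_000_000:
        return "pgvector"  # reuse the existing Postgres up to ~1M vectors
    return "Qdrant"  # single-node default


print(pick_store(5_000))
print(pick_store(500_000, has_postgres=True))
print(pick_store(5_000_000, needs_hybrid=True))
```

The corpus-size check comes first because it is the cheapest to answer. Teams with different priorities may reasonably reorder the remaining rules.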

Conclusion

All three vector DBs are production-ready in 2026. Pick by workload: Qdrant for simplicity, Weaviate for hybrid search, Milvus for billion-scale. Skip them entirely for small corpora, where SQLite FTS5 wins on simplicity and is often sufficient.

The real lesson: most teams over-engineer their retrieval layer. Start with the simplest thing that works, and upgrade when you measure a real ceiling. A vector DB justifies its complexity only once the corpus outgrows keyword search (roughly 50K documents in this article's guidance) or the queries need semantic matching.
