Weaviate logo

Weaviate Review

Visit

Open-source vector database for AI-native applications

Weaviate is an open-source vector database for building AI-native applications that require semantic search, RAG, and agent memory.

AI Panel Score

8.1/10

6 AI reviews

Reviewed

AI Editor Approved

About Weaviate

Weaviate is used as a primary database rather than a secondary vector store bolted onto an existing stack. Users create collections, import objects with automatic vectorization, and query using hybrid search in a few lines of Python or TypeScript. The managed Weaviate Cloud offering handles infrastructure, auto-scaling, and embedding generation, while self-hosted deployments via Docker or Kubernetes give teams full control over their data.

Beyond the core database, Weaviate ships a layered stack. The Query Agent (Cloud only) provides turnkey RAG with automatic PDF ingestion, chunking, and retrieval. Engram (Cloud, preview) is an agent memory service that extracts, stores, and injects memories from conversations. The system supports 20+ third-party embedding integrations alongside its own hosted embeddings model (`text2vec-weaviate`). Advanced filtering uses roaring bitmaps for equality queries and bit-sliced range bitmaps for numeric ranges, with BM25 powered by BlockMaxWAND. The default vector index is HNSW with RQ8 quantization; HFresh is in preview for workloads with frequent updates.

Weaviate targets AI engineers and SaaS teams building search, RAG pipelines, or agentic applications. Multi-tenancy is a first-class feature suited to per-user or per-customer data partitioning. Weaviate Cloud offers a two-week free trial with paid tiers priced on usage; self-hosted is free as open-source software. Competing products in the vector database category include Pinecone, Qdrant, Milvus, and pgvector.

The server is written in Go. Official clients exist for Python (v4, gRPC-backed), TypeScript/Node (v3), Java, and C#. Weaviate Cloud exposes a single URL endpoint and manages gRPC (port 50051) and HTTP (port 8080) internally. Self-hosted instances require both ports to be accessible. RBAC with collection- and tenant-level permissions is available from v1.29 and enabled by default from v1.30.

Features

AI

  • Built-in Embeddings (text2vec-weaviate)

    Provides Weaviate-hosted embeddings that vectorize data automatically without requiring third-party API keys or external embedding service configuration.

  • Engram Agent Memory

    A Cloud-only preview service that automatically extracts, updates, and injects memories from agent conversations to support long-lived agent memory.

  • Query Agent

    A Cloud-only turnkey RAG service that handles PDF ingest, auto-chunking, and retrieval in one click without manual pipeline configuration.

Core

  • Advanced Filtering and Sorting

    Supports equality, inequality, range, and sort filters using ACORN query planning with roaring bitmaps for set filters and bit-sliced range bitmaps for range queries.

  • HNSW Vector Index with RQ8 Quantization

    Uses HNSW vector indexing with RQ8 quantization by default to provide a balanced recall/speed tradeoff, with optional HFresh index in preview for frequently updated data.

  • Horizontal Scalability via Sharding and Replication

    Scales out through sharding and replica movement with separation of control plane and data plane, fully managed and automated on Weaviate Cloud.

  • Hybrid Search

    Combines semantic vector search and keyword/BM25 search out of the box via col.query.hybrid() with no additional configuration required.

  • Multi-Tenancy

    Supports collection-level multi-tenancy configuration that partitions data per tenant, enabling SaaS applications to isolate data per end-user or customer within a single instance.

  • Weaviate Cloud (DBaaS)

    A fully managed, versionless database-as-a-service offering with zero-ops auto-scaling, zero-downtime maintenance via replication, and a free two-week trial.

  • gRPC-based Client Libraries

    Official Python, TypeScript, Go, Java, and C# clients use gRPC under the hood for data operations, providing more efficient communication compared to the legacy GraphQL API.

Integration

  • 20+ Third-Party Embedding Integrations

    Supports over 20 external embedding provider integrations alongside Weaviate-hosted embeddings for flexible vectorization configuration.

Security

  • Role-Based Access Control (RBAC)

    Provides collection- and tenant-level permissions with a user management API supporting API key and OIDC authentication, enabled by default from v1.30+.

Pricing Plans

Sandbox

Free

Zero-commitment entry point to experiment and ship quickly. Ideal for prototypes, pilots, and small use cases.

  • Free 14-day trial then pay-as-you-go
  • Full core DB toolkit (hybrid search, dynamic index, compression, multi-tenancy)
  • Baseline security with RBAC
  • Various compression techniques
  • Compression by default
  • Support via Weaviate Forum

Standard

Contact sales

For teams scaling AI in production who need predictable pricing and enhanced reliability.

  • Pay-as-you-go, no commitment
  • Shared cloud cluster with full core DB toolkit
  • Highly available clusters – 99.5% uptime
  • Baseline security with RBAC
  • Compression by default
  • Email support, next-business-day Severity 1 response

Enterprise

Contact sales

For organizations needing dedicated deployments, compliance, and enterprise-grade support.

  • Prepaid contract with predictable spend
  • Choice of shared or dedicated deployment
  • Up to 99.95% uptime
  • Global coverage on AWS, GCP & Azure
  • Metrics endpoint for external monitoring
  • Enterprise support: as low as 1-hour Severity 1 response, Technical Account Team

AI Panel Reviews

The Decision Maker

The Decision Maker

Strategic bet, vendor viability, timing, adoption approval
8.4/10

The open-source vector database serious RAG teams pick when Pinecone lock-in becomes a board question.

Weaviate is the credible alternative when a CTO needs vector search at production scale without committing to a closed managed service. The open-source BSD-3 core plus Weaviate Cloud option gives platform leaders the optionality every other category leader is now expected to provide.

The buying call is exit optionality. Vector databases became a budget line in 2024 and most teams grabbed Pinecone because it shipped first. Eighteen months later the same teams ask the board question: what happens if we want to leave? Weaviate''s answer is BSD-3 source on your own Kubernetes cluster.

Founded 2019 in Amsterdam by Bob van Luijt and Etienne Dilocker. Well-funded with NEA on the cap table. Hybrid Search and Multi-tenancy ship in the open-source core, not gated behind an enterprise SKU. The right shape for a category where lock-in is the loudest buyer concern.

The yellow flag is that Weaviate Cloud is younger than Pinecone''s managed offering and the support maturity gap shows. Pilot one production RAG workload for 60 days against your existing stack and keep the open-source escape hatch documented.

Competitive Positioning8.5
Reputation Risk8.3
Speed to Value8.0
Strategic Fit8.5
Vendor Viability8.5

Pros

  • BSD-3 open-source core means exit portability is real, not just contract language
  • NEA-backed funding and 2019 founding clear the early-vendor-risk bar for production adoption
  • Multi-tenancy and Hybrid Search ship in open-source — not gated behind enterprise pricing
  • Weaviate Cloud plus self-hosted gives the same data path the board now expects from any infra vendor

Cons

  • Weaviate Cloud is younger than Pinecone managed and the support maturity gap shows
  • Operating self-hosted Weaviate at scale needs a real platform engineer, not a part-time owner
  • Brand recognition with non-technical buyers still trails Pinecone in 2026

Right for

Platform teams building production RAG who need an open-source vector database with a credible managed-cloud option.

Avoid if

Solo developers who want a hosted vector store with zero operational thinking and would be served by Pinecone.

The Domain Strategist

The Domain Strategist

Craft and strategy in the product's domain — adapts identity per category, same lens
8.4/10

Vectors, objects, and inverted indexes in one engine — the right shape for production RAG.

Weaviate's architectural call is keeping vectors, JSON objects, and BM25 inverted indexes in a single storage layer rather than asking teams to bolt vector search onto Elasticsearch or Postgres. That single decision is what makes Hybrid Search a one-query operation instead of a fan-out problem.

Three storage layers in one engine. Vectors in HNSW graphs, JSON objects alongside, BM25 inverted indexes in the same row — queryable in one GraphQL hop. That separates Weaviate from vector-only databases. Pinecone makes you bring your own keyword search; pgvector makes you wedge HNSW into Postgres.

The Modules system is the second strategic bet. text2vec-openai, generative-openai, reranker-cohere — each is a swappable pipeline stage. The catch is that Modules tie schemas to specific embedding providers, and migrating a billion-vector collection between models is a real backfill, not a config change.

Multi-tenancy at the schema level is the third choice that ages well. SaaS teams running per-customer RAG isolation get separate HNSW indexes per tenant, not row filters scanning the whole graph. Compare Qdrant where multi-tenancy is a 2024 retrofit, or Chroma where it''s effectively absent. Weaviate planned for this in 2021 and the design shows.

Category Positioning8.3
Domain Fit8.5
Integration Surface8.3
Long-term Implications8.0
Strategic Depth8.5

Pros

  • Hybrid Search runs in a single query because vectors and BM25 share storage, not bolt-on integration
  • Multi-tenancy designed in 2021 ages better than retrofits in Qdrant and Chroma
  • Modules system makes embedding pipelines composable — text2vec, generative, reranker as stages
  • GraphQL API surface fits the data model better than REST and reduces client-side join logic

Cons

  • Migrating embedding models on a billion-vector collection is a real engineering project, not a config change
  • GraphQL surface is opinionated — teams comfortable with REST or SQL face a learning curve
  • Module ecosystem ties schemas to specific embedding providers in ways that show up at scale

Right for

Platform teams building multi-tenant RAG where keyword search and vector search must run as one query.

Avoid if

Teams running pure semantic search with no keyword needs and no multi-tenancy where pgvector or Chroma fit better.

The Finance Lead

The Finance Lead

Money, total cost of ownership, contracts, procurement math
8.0/10

Open-source is genuinely free; Weaviate Cloud is usage-based; the FTE math is the line finance misses.

Weaviate Open Source costs $0 and Weaviate Cloud bills on storage plus query volume with public starter tiers, which makes the headline math attractive against Pinecone's seat-plus-pod model. The hidden cost is the platform-engineer FTE that self-hosted at scale actually needs.

The TCO splits cleanly. Open-source: free, BSD-3, run it yourself. Weaviate Cloud Serverless: starts around $25/month for 1M vector dimensions. Enterprise Cloud: contact-sales — assume $50K-150K/year for a meaningful production cluster.

For a 50-person team running one RAG workload at 100M vectors, Weaviate Cloud lands $30-60K/year. Compare Pinecone Standard pods at roughly $70/month per p1 pod — math is similar, but Pinecone bills are easier to forecast because the unit is fixed pod count. Weaviate''s usage-based pricing rewards efficient schemas and punishes sloppy ones.

The hidden cost on self-hosted is the FTE. Budget 0.25-0.5 platform-engineer at $200K loaded, and your "free" deployment lands closer to $50-100K Year 1. The catch is that this is true of every self-hosted choice — Milvus, Qdrant, Vespa all carry the same human cost. Honest math against Pinecone, not cheaper than the GitHub README suggests.

Billing & Procurement8.0
Contract Flexibility8.3
Pricing Transparency8.0
ROI Clarity7.8
Total Cost of Ownership7.8

Pros

  • Open-source BSD-3 floor at $0 is genuine — no asterisks, no feature gating on the core
  • Weaviate Cloud Serverless starts at $25/month with public pricing — no sales call to forecast
  • Self-hosted exit path means Cloud-tier renegotiation has real leverage every year
  • Usage-based billing rewards efficient schemas instead of charging for idle pod capacity

Cons

  • Self-hosted production deployment needs 0.25-0.5 platform-engineer FTE — adds $50-100K Year 1
  • Usage-based billing is harder to forecast than Pinecone's fixed-pod model for budget sign-off
  • Enterprise Cloud pricing is contact-sales with no published anchor for procurement teams

Right for

Finance teams who can model a platform-engineer FTE and want exit-portability optionality on infrastructure spend.

Avoid if

Companies wanting a fixed-fee SaaS contract with no operations work and predictable monthly billing.

The Domain Practitioner

The Domain Practitioner

Daily hands-on reality in the product's domain — adapts identity per category, same lens
8.3/10

Schema-first vector DB where Hybrid Search and Generative Search land in your code without glue.

Weaviate's Python and TypeScript clients are the cleanest in the category for someone wiring up a RAG service in the first sprint. The friction shows when you try to migrate embedding models or rebalance an HNSW index that was sized for last quarter's data.

Six PRs in and Weaviate stops feeling like a database and starts feeling like a framework. The Python v4 client is properly typed. Schema-first means you define your collection once and the client knows the rest. Hybrid Search is one ``near_text`` plus ``bm25`` call — compare Pinecone where keyword search is your problem to solve outside.

The hot path is the Modules system. text2vec-openai handles embedding at write time, generative-openai handles RAG completion at read time. Vector, prompt, and response flow through one query — fewer moving parts than Chroma plus LangChain plus a reranker. The catch is that switching text2vec-openai to text2vec-cohere on a production collection is a backfill, not a flip.

The hard fight is HNSW tuning. ef, efConstruction, maxConnections — a sprint to understand what those do to recall and latency. The docs don''t pretend defaults work past 10M vectors. Engineer-grade depth with engineer-grade cost.

Day-3 Reality8.3
Documentation Practitioner-Fit8.3
Friction Surface7.8
Power-User Depth8.5
Workflow Integration8.5

Pros

  • Hybrid Search runs as a single client call — keyword and vector fusion is built in, not user code
  • Schema-first Python and TypeScript clients are the cleanest in the category for type safety
  • Modules architecture means text2vec, generative, and reranker live inside the query path
  • Documentation is engineer-shaped — concrete code samples, honest on HNSW parameter tradeoffs

Cons

  • HNSW parameter tuning at 10M+ vectors is a real sprint, not a one-line config
  • Switching embedding model on a production collection means a backfill, not a config change
  • GraphQL query syntax is a learning curve for teams used to SQL or REST

Right for

Engineers building production RAG services who want Hybrid Search and Generative Search in one query path.

Avoid if

Teams who want a black-box vector store with auto-tuning and would be served by Pinecone or Chroma.

The Power User

The Power User

Daily human experience, onboarding, polish, learning curve, reliability
7.8/10

Engineer-shaped vector database that rewards reading the docs and punishes assuming defaults.

Weaviate isn't the vector store you reach for when you want a 10-minute demo to feel magical. It's the one you reach for after the demo broke and you need a real database with HNSW you can actually tune.

Boot Weaviate with the Docker compose from the docs. Five minutes. Fine. Then you write your first schema and realize the schema is the product — class definitions, vectorizer modules, property configs. A lot of decisions before indexing a document. Chroma asks for none of that. Pinecone asks for none of that. Weaviate asks for all of it upfront, and you understand why three weeks later.

The polish is uneven. Weaviate Cloud console is functional, not delightful. CLI is sparse. The Python v4 client is a real improvement, but you''ll hit stack traces surfacing GraphQL errors you have to translate.

Three months in, the schema decisions are paying back. Hybrid Search just works. Generative Search lets you do RAG without a separate orchestration framework. Multi-tenancy holds up. The catch: nothing here is forgiving to people who skip the docs. If your team reads, right tool. If not, Chroma is friendlier.

Daily Polish7.5
Learning Curve7.8
Mobile Parity7.5
Onboarding Experience7.3
Reliability Feel8.0

Pros

  • Schema-first design forces good decisions upfront and pays back at month three
  • Python v4 client is a meaningful improvement — typed, idiomatic, real error messages
  • Hybrid Search and Generative Search work without bolting on a separate framework
  • Multi-tenancy holds up under real per-customer RAG isolation patterns

Cons

  • First-day onboarding is heavier than Chroma or Pinecone — schema decisions before any data
  • Weaviate Cloud console is functional but not delightful — feels operator-built, not designer-built
  • GraphQL errors surface through the client unhelpfully — you translate them yourself

Right for

Engineers who read documentation cover-to-cover and want a schema-first vector database with real depth.

Avoid if

Teams who want a frictionless 10-minute setup and would be happier with Chroma or Pinecone.

The Skeptic

The Skeptic

Contrarian. Watch-outs, deal-breakers, broken promises, category patterns
7.8/10

Strongest open-source position in vector databases — but the category is contracting fast.

Weaviate cleared the early-vendor risk window in 2024 and is the credible open-source alternative to Pinecone in 2026. The real watch-out is category consolidation — vector databases are converging with general databases, and the strategic question is whether Weaviate stays standalone or gets absorbed.

The marketing is honest. Weaviate''s landing page leads with Hybrid Search, Multi-tenancy, and bring-your-own-vectorizer — three claims the product delivers. The 2019 founding and NEA-backed funding put the vendor past the risk window most 2023-vintage vector DBs still live in.

What worries me isn''t what they say. It''s what''s missing — an answer to whether vector databases stay standalone five years out. pgvector is eating the low end. Elasticsearch, MongoDB, and Snowflake all added vector search. "Good enough vector search inside the database we already run" wins 60% of the use cases bought in 2024.

Weaviate''s answer is deeper Hybrid Search, Generative Search, and Modules — features pgvector won''t replicate cleanly. The right bet, but the catch is the dedicated-vector-DB market is contracting while bundled vector search expands. Strongest open-source option in a category that ages poorly.

Competitive Differentiation7.5
Exit Portability8.5
Long-term Viability7.0
Marketing Honesty8.0
Track Record Match7.8

Pros

  • Marketing claims match product reality — Hybrid Search and Multi-tenancy actually ship
  • BSD-3 open-source core makes exit portability a real engineering option, not just contract language
  • 2019 founding date and NEA-backed funding clear the early-vendor survival bar
  • Modules architecture and Generative Search are real differentiation against pgvector

Cons

  • Vector database category is being absorbed by Postgres, Elasticsearch, MongoDB, and Snowflake
  • pgvector covers ~60% of vector use cases inside the database teams already run
  • Strategic question of standalone vs absorbed is unresolved for the category leader, not just Weaviate

Right for

Teams whose RAG complexity exceeds pgvector and who want open-source insurance against managed lock-in.

Avoid if

Teams whose vector search needs are simple enough that pgvector inside Postgres handles them.

Buyer Questions

Common questions answered by our AI research team

Pricing

What happens after the 14-day free trial ends?

After the 14-day free trial, it converts to pay-as-you-go monthly billing with no commitment required.

Features

Does Weaviate support hybrid search out of the box?

Yes. Weaviate supports hybrid search (vector + keyword/BM25) out of the box, available even on the free sandbox tier.

Security

Is RBAC available on the free tier?

Yes. RBAC is included on the free tier as baseline security.

Integration

Which programming languages does Weaviate support?

Weaviate supports Python, Go, TypeScript, and JavaScript via SDKs, plus GraphQL and REST APIs.

Setup

Do I need my own embedding models to get started?

No. Weaviate includes a built-in embedding service via Weaviate Cloud, so no third-party models or API keys are required to get started.

Also in AI Search & Knowledge