What is a Vector Database? - Technical Definition

What is a Vector Database?

A vector database stores content such as text, images, or product records as numeric embeddings and answers the question “is this similar in meaning?” instead of only “does this exact keyword match?” That makes it useful for finding related documents, equivalent intent, and near matches that classic keyword search can miss.

How It Works

Content is first converted into a high-dimensional vector by an embedding model. When a user searches, the query is embedded with the same model, and the database retrieves the nearest vectors. Similarity is commonly calculated with cosine similarity, dot product, or Euclidean distance.
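As a rough illustration of the three similarity measures and the nearest-vector lookup, here is a minimal sketch in plain Python. The three-dimensional vectors and document names are made up for the example; real embeddings come from a model and typically have hundreds or thousands of dimensions. The brute-force scan is also a simplification: production vector databases generally rely on approximate nearest-neighbor indexes rather than comparing the query against every stored vector.

```python
import math

def dot(a, b):
    """Dot product of two equal-length vectors."""
    return sum(x * y for x, y in zip(a, b))

def cosine_similarity(a, b):
    """Cosine of the angle between a and b (1.0 means same direction)."""
    norm_a = math.sqrt(dot(a, a))
    norm_b = math.sqrt(dot(b, b))
    if norm_a == 0 or norm_b == 0:
        raise ValueError("cosine similarity is undefined for a zero vector")
    return dot(a, b) / (norm_a * norm_b)

def euclidean_distance(a, b):
    """Straight-line distance between a and b (0.0 means identical)."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

def nearest(query, index, k=2):
    """Return the ids of the k stored vectors most similar to the query.

    Brute-force scan, for illustration only.
    """
    scored = [(cosine_similarity(query, vec), doc_id) for doc_id, vec in index.items()]
    scored.sort(reverse=True)  # highest similarity first
    return [doc_id for _, doc_id in scored[:k]]

# Hypothetical toy "embeddings"; not produced by any real model.
index = {
    "refund-policy": [0.9, 0.1, 0.0],
    "shipping-delays": [0.1, 0.9, 0.2],
    "office-hours": [0.0, 0.2, 0.9],
}
query = [0.2, 0.8, 0.1]

print(nearest(query, index))  # ['shipping-delays', 'refund-policy']
```

Note that cosine similarity and dot product are "higher is closer", while Euclidean distance is "lower is closer", so the sort order has to match the chosen measure.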

Common options include Pinecone, Weaviate, Qdrant, Milvus, and pgvector on PostgreSQL. Embedding quality depends on the model that produces the vectors. In LLM-based RAG (retrieval-augmented generation) systems, the retrieved document chunks are passed to the LLM as context for generating the answer. Teams already using PostgreSQL can often add pgvector before adopting a separate service.
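The RAG step of handing retrieved chunks to the model can be sketched as simple prompt assembly. The function name build_prompt, the max_chars budget, and the prompt wording below are all hypothetical choices for illustration; the sketch stops before any actual LLM call, and real systems usually budget by tokens rather than characters.

```python
def build_prompt(question, chunks, max_chars=2000):
    """Combine retrieved chunks and the user question into one prompt.

    chunks is assumed to be ordered best match first, so the most
    relevant text is kept if the character budget runs out.
    """
    context_parts = []
    used = 0
    for i, chunk in enumerate(chunks, start=1):
        entry = f"[{i}] {chunk}"
        if used + len(entry) > max_chars:
            break  # stop once the next chunk would exceed the budget
        context_parts.append(entry)
        used += len(entry)
    context = "\n".join(context_parts)
    return (
        "Answer the question using only the context below.\n\n"
        f"Context:\n{context}\n\n"
        f"Question: {question}"
    )

# Hypothetical retrieved chunks.
retrieved = [
    "Deliveries more than 5 days late incur a 2% fee per week.",
    "The supplier must notify the buyer of any expected delay.",
]
print(build_prompt("What is the late delivery penalty?", retrieved))
```

Numbering the chunks makes it easier to ask the model to cite which passage supported its answer.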

Business Use

Vector databases are used for document search, support bots, product recommendations, candidate-CV matching, and internal knowledge base search. For example, a company assistant can match “late delivery penalty” to contract clauses that use different wording but describe the same issue.

The main design risks are stale indexes, weak access control, embedding cost, and poor retrieval quality. Sensitive documents need permission checks at search time, not only when the data is originally indexed.
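A search-time permission check can be sketched as a filter over the retrieved hits. The Hit class, its allowed_groups field, and the group names are assumptions made up for this example; they do not correspond to any particular product's API.

```python
from dataclasses import dataclass

@dataclass(frozen=True)
class Hit:
    """One search result plus the access metadata stored alongside it."""
    doc_id: str
    score: float
    allowed_groups: frozenset

def filter_by_permission(hits, user_groups):
    """Keep only hits that share at least one group with the user.

    This runs at query time, so a change in the user's groups takes
    effect on the next search, provided each document's stored
    allowed_groups metadata is kept current.
    """
    user_groups = set(user_groups)
    return [h for h in hits if h.allowed_groups & user_groups]

# Hypothetical results, already ranked by similarity score.
hits = [
    Hit("contract-2023", 0.91, frozenset({"legal"})),
    Hit("public-faq", 0.84, frozenset({"everyone"})),
    Hit("salary-bands", 0.80, frozenset({"hr"})),
]

visible = filter_by_permission(hits, {"everyone", "legal"})
print([h.doc_id for h in visible])  # ['contract-2023', 'public-faq']
```

Filtering after retrieval can leave fewer results than requested. Where the database supports metadata filters inside the query itself, applying the permission condition there is usually the better design.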
