JaguarDB
The Most Scalable Vector Database                
Home Technology Product Document Download



Vector database sharding

Multimodal search

JaguarDB quantization

JaguarDB Vector API

Best Vector databases



JaguarDB in Docker

Setup JaguarDB with tar package

Setup JaguarDB on multiple nodes

Vector index sharing

How zeromove works

Video introduction



API Reference

JaguarDB is not just a distributed vector database; it is a comprehensive solution that goes beyond vector data management. While it excels at handling vector data, it also seamlessly processes non-vector data within a fully integrated framework.

JaguarDB stores high-dimensional vectors with the state-of-the-art HNSW graph index store. HNSW, short for Hierarchical Navigable Small World, is a data structure and algorithm used for approximate nearest neighbor search in high-dimensional spaces. It is designed to efficiently find data points that are close to a given query point in a high-dimensional space, without exhaustively searching through all data points.


HNSW Graphs

HNSW creates a hierarchical structure of data points that forms a graph. Each level of the hierarchy is a different graph that represents the data points at different levels of detail. HNSW maintains a "small world" property, which means that even though the graph is not fully connected like a traditional graph, it is still possible to navigate from one node to another through a relatively small number of edges. HNSW constructs the hierarchical graph in a way that ensures data points are connected to nearby points, enabling efficient traversal of the graph to find approximate nearest neighbors.


Vector Management

create store TABLE (
    key: …KEY…,
    value: VECCOL vector( dimension, 'DIST_INPUT_QUANTIZATION'),
        …other_fields…
)


The symbol "VECCOL" designates the name of the vector column, while "dimension" denotes the count of components within a vector. Standard dimensions often include values like 768, 1024, 1536, etc. The string "DIST_INPUT_QUANTIZATION" is a vector definition that serves to specify the nature of the distance, input data type, and level of quantization employed in the vector storage and search of similarity between vectors.


Distance Metrics

Distance metric measures the distance between a vector and another. JaguarDB supports a variaty of distance types including: Euclidean, Cosine, Hamming, Manhatten, Jeccard, InnerProduct, Minkoski, and Chebyshev.


Input Format

The input type in JaguarDB refers to the expected data format in the input vectors. There are two input types: fraction and whole. JaguarDB excels not only in managing vector embeddings but also in handling a diverse range of feature vectors. These vectors can include various types and forms, whether they are normalized or unnormalized, presented in fractional or full original formats. This versatility underscores the capability of JauarDB to accommodate a wide array of data formats.


Quantization

There are three quantization levels in JaguarDB: byte, short, and float. The process of quantizing input vectors yields efficient memory utilization within the system. While storing a float number demands 4 bytes, employing fewer bytes for storing vector components can yield substantial memory savings. When components are stored as signed integers, memory savings can reach 50%, while utilizing only a single byte for vector components can result in an impressive 75% reduction in memory usage. This approach is termed "short quantization level" for the utilization of signed integers and "byte quantization level" for the use of a single byte. The quantization of input vectors aligns with the level specified by the user during vector creation, optimizing memory consumption while maintaining data integrity.


Add Vectors

JaguarDB can integrate all application and vector data, facilitating streamlined data management for real-world scenarios. It enables the incorporation of vector data alongside other pertinent information related to business objects, allowing for comprehensive and cohesive data representation.

insert into STORE ( …, VECCOL, …) values (…, 'VECTOR_STRING', … ))



For a comprehensive description of the Jaguar API, please visit the following link:

Jaguar User Manual








JaguarDB

JaguarDB offers comprehensive support for vector database in artificial intelligence, along with instantly scalable datalake storage for raw media files and robust similarity search capabilities. This facilitates efficient handling of large datasets and enhances AI applications that require rapid data retrieval and similarity comparisons. JaguarDB, with integrated features, provides a seamless solution for managing and analyzing complex data in AI-driven environments.



Products

AI VectorDB
AI Datalake
Time Series
Geospatial
JaguarDB
Client Drivers

Resources

Cloud Admin Manual
Developer Guide
Configuration Help
Frequent Questions
ZeroMove Demo
Video Introduction

Social

  LinkedIn
 
Youtube
 
Contact Us