Database Backends
Amgix supports multiple database backends, allowing you to choose based on your performance, cost, and infrastructure requirements. You can use your existing database (PostgreSQL, MariaDB) or a specialized vector database (Qdrant) for maximum performance.
Qdrant: The Specialized Vector Database
Qdrant is a high-performance vector similarity search engine that serves as the default backend for Amgix.
- Optimized for Vectors: Purpose-built for efficient vector storage and similarity search.
- High Performance: Best choice for applications requiring high query throughput and low latency.
SQL Databases: Hybrid Search on Your Existing Infrastructure
Amgix supports hybrid search on traditional SQL databases, offering several advantages:
- Cost Efficiency: Use databases you already operate and manage.
- Operational Simplicity: Consolidate data storage within your existing relational database infrastructure.
- Data Compliance & Governance: Keep your data within your established compliance boundaries and security policies.
- Sparse Vector Support: Amgix implements sparse vector search directly within SQL, complementing dense vectors where supported.
MariaDB
MariaDB is an open-source relational database that Amgix supports for hybrid search.
Notes: MariaDB 11.7.1+ supports dense vector columns, limited to one dense vector per collection. Sparse vectors (model-based and lexical) are fully supported.
PostgreSQL
PostgreSQL is an open-source object-relational database that Amgix supports for hybrid search.
Notes: Requires pgvector extension for dense vector support. Sparse vectors (model-based and lexical) are fully supported.
Choosing Your Backend
The best backend for your Amgix deployment depends on your specific requirements:
-
Choose Qdrant if:
- You need maximum performance for vector search.
- You want a dedicated vector database.
- You're comfortable managing a separate database instance.
-
Choose a SQL Database (MariaDB, PostgreSQL) if:
- You want to use your existing database infrastructure.
- Cost efficiency and operational simplicity are priorities.
- You prefer to keep data within a relational database.
- Your primary data is already in SQL.
- Note on Scale: SQL backends are recommended for modest data volumes. Dedicated vector databases like Qdrant typically offer better scalability for large datasets.
Amgix provides a consistent API across all backends.