Releases: ZeusDB/zeusdb-vector-database
Version 0.4.1
Added
- Comprehensive test suite for overwrite behavior verification (`test_overwrite_fix.py`)
- Product Quantization (PQ) overwrite testing across all storage modes (`test_pq_overwrite_comprehensive.py`)
- Enhanced logging and storage analysis for overwrite operations
- Training state cleanup during document removal operations
Changed
- Overwrite operations now use two-phase process (remove then add) to prevent duplicates
- `remove_point()` method now delegates to internal `remove_point_internal()` for better code reuse
- Enhanced `add()` method with comprehensive PQ support and storage mode awareness
- Improved error handling and logging throughout overwrite operations
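The two-phase overwrite can be sketched with a minimal in-memory model. This is a conceptual illustration only, not the actual Rust implementation; `TinyIndex` and its methods are hypothetical stand-ins that mirror the remove-then-add sequence described above:

```python
class TinyIndex:
    """Conceptual model of the overwrite fix: remove first, then add."""

    def __init__(self):
        self.vectors = {}   # id -> vector
        self.entries = []   # append-only log, mimicking graph insertions

    def remove_point(self, doc_id):
        """Phase 1: drop any existing state for this id."""
        self.vectors.pop(doc_id, None)
        self.entries = [e for e in self.entries if e[0] != doc_id]

    def add(self, doc_id, vector, overwrite=False):
        """Phase 2: insert fresh state; overwrite triggers phase 1 first."""
        if doc_id in self.vectors:
            if not overwrite:
                raise ValueError(f"id {doc_id!r} already exists")
            self.remove_point(doc_id)  # two-phase: remove, then add
        self.vectors[doc_id] = vector
        self.entries.append((doc_id, vector))

idx = TinyIndex()
idx.add("doc1", [0.1, 0.2])
idx.add("doc1", [0.3, 0.4], overwrite=True)
print(len(idx.entries))  # 1 -- a single entry survives, no duplicates
```

Skipping the remove phase is exactly the legacy behavior being removed: the append-only log would accumulate a stale entry per overwrite.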
Fixed
- Critical: Fixed duplicate document bug where `overwrite=True` created multiple entries instead of replacing existing ones
- Memory leak from accumulated duplicate vectors in HNSW graph during overwrites
- Product Quantization codes and training state not properly cleaned up during document removal
- Vector count inconsistencies when removing documents during overwrite operations
Removed
- Legacy overwrite behavior that created duplicates instead of proper replacements
Version 0.4.0
Added
- Enterprise-grade structured logging with Python+Rust coordination
- Smart environment detection (production/development/testing/jupyter/CI)
- Automatic logging configuration with graceful fallbacks
- JSON and human-readable log formats with configurable targets (console/file)
- File logging with daily rotation and intelligent path handling
- Performance timing instrumentation on all hot paths (add, search, training)
- Comprehensive error context logging with field standardization
- Cross-platform logging support (Windows, macOS, Linux)
- Environment variable configuration for all logging aspects
- Production-ready observability for operations teams
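The smart environment detection above can be sketched as a cascade of common runtime signals. The variable names and priority order here are illustrative assumptions, not ZeusDB's actual detection logic:

```python
import os
import sys

def detect_environment(env=None):
    """Guess the runtime environment from common signals.

    The env-var names checked below (CI, PYTEST_CURRENT_TEST, APP_ENV)
    are illustrative; the real detection in zeusdb may differ.
    """
    env = os.environ if env is None else env
    if env.get("CI"):                              # most CI systems set CI=true
        return "ci"
    if env.get("PYTEST_CURRENT_TEST"):             # set while pytest is running
        return "testing"
    if env.get("APP_ENV", "").lower() == "production":
        return "production"
    if "ipykernel" in sys.modules:                 # notebook kernel loaded
        return "jupyter"
    return "development"

print(detect_environment({"CI": "true"}))  # ci
```

A detector like this lets logging default to JSON output in production/CI and human-readable output in development, without any user configuration.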
Changed
- Replaced debug println! statements with structured tracing throughout codebase
- Enhanced error handling with rich logging context instead of panic conditions
- Improved vector addition pipeline with detailed operation tracking
- Updated quantization training process with progress logging and timing metrics
- Modernized persistence operations with comprehensive save/load logging
Fixed
- Eliminated potential panic conditions in distance space validation
- Improved error propagation with proper logging context
- Enhanced thread safety in concurrent logging scenarios
- Resolved cross-platform path handling inconsistencies
Version 0.3.0
Added
- `save()` method to `HNSWIndex` for persisting index state to disk via Python and Rust.
- New `persistence.rs` module implementing index save/load logic, including manifest and file structure generation.
- PyO3 bindings for persistence-related methods, exposing them to Python.
- Internal unit tests for the `save` function to ensure correct file output and manifest validation.
- HNSW graph structure persistence via native hnsw-rs `file_dump()` integration
- Enhanced save workflow with Phase 2 graph serialization support
- Comprehensive Phase 2 integration test suite for full persistence validation
- Complete component loading infrastructure with helper functions for all ZeusDB file types
- load() method to VectorDatabase class for loading saved indexes from disk
- Comprehensive component validation and data consistency checking in load workflow
- Python API integration for load_index function with proper PyO3 bindings
- End-to-end test suite for component loading validation and error handling
- Complete HNSW graph loading functionality using NoData pattern from hnsw-rs
- anndists dependency for NoDist distance type compatibility
- Phase 2 graph structure loading with validation and error handling
- Full persistence roundtrip capability: save and load HNSW graph structures
- Empty index handling with conditional graph file creation for zero-vector scenarios
- Training state preservation with ID collection tracking during persistence
- Storage mode awareness in persistence (quantized_only vs quantized_with_raw handling)
- PQ centroids and codes serialization for complete quantization state preservation
- Compression statistics and memory usage reporting in manifest files
- Directory size calculation and file inventory tracking in manifest generation
- rebuilding_from_persistence flag to prevent training ID contamination during reconstruction
- Smart reconstruction approach using existing add() logic instead of complex graph deserialization
- Thread-safe data access patterns during save operations with proper lock management
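The manifest bookkeeping described above (file inventory, sizes, directory totals) can be sketched in a few lines. This is a pure-Python illustration with an invented schema; the field names and `manifest.json` filename are assumptions, not the actual ZeusDB manifest format:

```python
import json
import os
import tempfile

def write_manifest(index_dir, extra=None):
    """Sketch of manifest generation: record each file's name and size,
    plus the directory total. Schema is illustrative only."""
    files = {}
    for name in sorted(os.listdir(index_dir)):
        path = os.path.join(index_dir, name)
        if os.path.isfile(path) and name != "manifest.json":
            files[name] = os.path.getsize(path)
    manifest = {"files": files, "total_bytes": sum(files.values())}
    manifest.update(extra or {})  # e.g. version, compression stats
    with open(os.path.join(index_dir, "manifest.json"), "w") as f:
        json.dump(manifest, f, indent=2)
    return manifest

with tempfile.TemporaryDirectory() as d:
    with open(os.path.join(d, "index.hnsw.graph"), "wb") as f:
        f.write(b"\x00" * 128)
    m = write_manifest(d, {"version": "0.3.0"})
    print(m["total_bytes"])  # 128
```

On load, the same inventory doubles as a validation checklist: every listed file must exist with the recorded size before reconstruction begins.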
Changed
- Refactored `hnsw_index.rs` to integrate persistence logic and support serialization.
- Updated `lib.rs` to register the persistence module and ensure all new methods are exposed to Python.
- Enhanced error handling and docstrings for persistence operations.
- Modified HNSW initialization to use fixed max_layer=16 for hnsw-rs dump compatibility
- Updated manifest generation to include HNSW graph files (.hnsw.graph) and exclude data files (.hnsw.data)
- Enhanced save_manifest() with graph file tracking and size calculation
- Replaced placeholder load_index() with complete component loading implementation
- Enhanced lib.rs module exports to include load_index function for Python access
- Updated persistence.rs with comprehensive file loading and validation infrastructure
- Extended persistence.rs with complete HNSW graph loading using HnswIo and ReloadOptions
- Updated test suite to recognize and validate HNSW graph loading success
- Enhanced quantization config validation to include training state and storage mode persistence
- Modified PQ implementation to support set_trained() for persistence restoration
- Updated index reconstruction to use "Simple Reconstruction" pattern for reliability
- Refactored training threshold calculation to be self-healing during load operations
- Enhanced error collection and reporting throughout persistence workflow
Fixed
- Improved reliability of index serialization and file output.
- Addressed edge cases in directory creation and file writing during persistence.
- Resolved critical "nb_layer != NB_MAX_LAYER" error preventing HNSW graph dumps
- Fixed layer count compatibility issue between ZeusDB and hnsw-rs library requirements
- Enabled successful HNSW graph structure serialization for graph files
- Resolved Python binding compilation error for load_index function export
- Fixed missing #[pyfunction] annotation preventing Python module integration
- Established proper API consistency between save and load methods
- Resolved anndists dependency issues for NoDist import compatibility
- Fixed HNSW graph loading import paths for hnsw-rs v0.3.0+ compatibility
- Resolved training ID loss during graph reconstruction by adding persistence rebuild flag
- Fixed PQ training state restoration ensuring loaded instances are properly marked as trained
- Corrected training progress calculation inconsistencies between save/load cycles
- Addressed quantization state contamination during index reconstruction
- Resolved thread safety issues in concurrent data access during persistence operations
- Fixed storage mode detection and raw vector preservation based on configuration
- Prevented training ID re-collection during persistence rebuild operations
Version 0.2.1
Added
- Storage mode configuration for product quantization: New storage_mode parameter in quantization config allows users to choose between:
  - `"quantized_only"` (default): Maximum memory efficiency by discarding raw vectors after quantization
  - `"quantized_with_raw"`: Keep both quantized codes and raw vectors for exact reconstruction
- Case-insensitive storage mode validation: Accepts variations like "Quantized_Only", "QUANTIZED_WITH_RAW"
- Automatic memory usage warnings: Users are warned when `quantized_with_raw` mode will use significantly more memory
- Enhanced subvector divisor suggestions: `_suggest_subvector_divisors()` now returns `list[int]` for programmatic use
- StorageMode enum: Rust backend support for `quantized_only` and `quantized_with_raw` storage modes with JSON serialization
- Storage mode parsing: Complete quantization config parsing in HNSWIndex constructor with proper error handling
- Intelligent vector retrieval: `get_records()` method now prioritizes raw vectors over PQ reconstruction when available
- Enhanced statistics: `get_stats()` now reports storage mode, memory usage breakdown, and storage strategy information
- Memory usage tracking: Real-time memory usage calculations for both raw vectors and quantized codes
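The case-insensitive validation with sorted mode suggestions can be sketched as follows; the exact error wording is an illustrative assumption, not the library's actual message:

```python
VALID_STORAGE_MODES = {"quantized_only", "quantized_with_raw"}

def normalize_storage_mode(mode):
    """Sketch of case-insensitive storage-mode validation.

    Accepts variations like "Quantized_Only" and "QUANTIZED_WITH_RAW";
    rejects anything else with a sorted list of valid modes.
    """
    normalized = str(mode).strip().lower()
    if normalized not in VALID_STORAGE_MODES:
        valid = ", ".join(sorted(VALID_STORAGE_MODES))
        raise ValueError(f"Invalid storage_mode {mode!r}. Valid modes: {valid}")
    return normalized

print(normalize_storage_mode("Quantized_Only"))      # quantized_only
print(normalize_storage_mode("QUANTIZED_WITH_RAW"))  # quantized_with_raw
```

Normalizing before the config crosses into the Rust backend is what lets the StorageMode enum stay strict on the Rust side.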
Changed
- Quantization config validation: Now includes comprehensive validation and normalization of all parameters
- Error messages: Improved clarity for storage mode validation with sorted mode suggestions
- Defensive programming: Added final safety checks to ensure complete configuration before passing to Rust backend
- QuantizationConfig struct: Now includes `storage_mode` field with backward-compatible defaults
- add_quantized_vector logic: Respects storage mode configuration to conditionally store raw vectors
- get_stats output: Enhanced with storage strategy descriptions ("memory_optimized" vs "quality_optimized")
- Vector storage behavior: `quantized_only` mode stops storing raw vectors after PQ training for maximum memory efficiency
Fixed
- Configuration completeness: All quantization parameters now have guaranteed defaults to prevent missing key errors
- None value handling: Python config cleaning now properly removes `None` values before passing to Rust backend
- Constructor parameter validation: Improved error handling for missing or invalid quantization parameters
- Memory statistics accuracy: Corrected memory usage calculations based on actual storage mode behavior
Version 0.2.0
Added
- Product Quantization (PQ) Support
  - Quantized vector storage with configurable compression ratios (4x-256x)
  - Automatic training pipeline with intelligent threshold detection
  - 3-path storage architecture for optimal memory usage:
    - Path A: Raw storage (no quantization)
    - Path B: Raw storage + ID collection (pre-training)
    - Path C: Quantized storage (post-training)
- Quantized Search API
  - Unified search interface supports both raw and quantized vectors transparently.
- Automatic fallback to raw search if quantization is not yet trained.
- Quantization-aware batch addition for efficient ingestion at scale.
- Detailed quantization diagnostics via `get_quantization_info()` (e.g., codebook stats, compression ratio, memory footprint).
- Debug logging macro (`ZEUSDB_DEBUG`) for controlled diagnostic output in Rust backend.
- Thread safety diagnostics in `get_stats()` (e.g., `"thread_safety": "RwLock+Mutex"`).
- Improved test coverage for quantized and raw modes, including edge cases and error handling.
- Asymmetric Distance Computation (ADC) for fast quantized search
- Memory-efficient k-means clustering for codebook generation
- Configurable quantization parameters:
  - `subvectors`: Number of vector subspaces (divisor of dimension)
  - `bits`: Bits per quantized code (1-8)
  - `training_size`: Vectors needed for training (minimum 1000)
  - `max_training_vectors`: Maximum vectors used for training
- Enhanced Vector Database API
  - Quantization configuration support in `create()` method
  - Training progress monitoring with `get_training_progress()`
  - Storage mode detection with `get_storage_mode()`
  - Quantization status methods:
    - `has_quantization()`: Check if quantization is configured
    - `can_use_quantization()`: Check if PQ model is trained
    - `is_quantized()`: Check if index is using quantized storage
  - Quantization info retrieval with `get_quantization_info()`
  - Training readiness check with `is_training_ready()`
  - Training vectors needed with `training_vectors_needed()`
Performance Monitoring
- Compression ratio calculation and reporting
- Memory usage estimation for raw vs compressed storage
- Training time measurement and optimization
- Search performance metrics for quantized vs raw modes
- Detailed statistics in 'get_stats()' method
-
Input Handling
-
Enhanced dictionary input parsing with comprehensive error handling
-
Flexible metadata support for various Python object types
-
Automatic type detection and conversion for metadata
-
Graceful handling of None values and edge cases
-
Comprehensive input validation with descriptive error messages
-
Performance Optimizations
-
Batch processing for large-scale vector additions
-
Optimized memory allocation during training and storage
-
Efficient vector reconstruction from quantized codes
-
Fast ADC search implementation with SIMD optimizations
-
Automatic performance scaling post-training (up to 8x faster additions)
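The advertised 4x-256x compression range follows directly from the quantization parameters: a float32 vector of dimension `dim` occupies `4 * dim` bytes, while its PQ representation stores one `bits`-bit code per subvector. A quick sketch of the arithmetic:

```python
def pq_compression_ratio(dim, subvectors, bits):
    """Compression ratio of PQ codes vs. raw float32 storage for one vector."""
    assert dim % subvectors == 0, "subvectors must divide dim"
    assert 1 <= bits <= 8, "bits per code must be in 1-8"
    raw_bytes = dim * 4                 # float32 = 4 bytes per component
    code_bytes = subvectors * bits / 8  # one code per subspace
    return raw_bytes / code_bytes

# 128-dim vectors, 8 subvectors, 8 bits per code -> 64x compression
print(pq_compression_ratio(128, 8, 8))    # 64.0
# the extremes of the documented range:
print(pq_compression_ratio(128, 128, 8))  # 4.0
print(pq_compression_ratio(128, 16, 1))   # 256.0
```

Fewer subvectors or fewer bits mean smaller codes (higher compression) but coarser reconstruction, which is the trade-off the `subvectors` and `bits` parameters expose.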
Changed
- Vector Addition Behavior
  - Automatic training trigger when threshold is reached during vector addition
  - Dynamic storage mode switching from raw to quantized seamlessly
  - Enhanced error reporting with detailed failure information in `AddResult`
  - Improved batch processing with better memory management
- Search Performance
  - Adaptive search strategy based on storage mode (raw vs quantized)
  - Optimized distance calculations for quantized vectors
  - Enhanced result quality with proper score normalization
- Index Architecture
  - 3-path storage system replaces simple raw storage
  - Intelligent memory management with automatic cleanup
  - Robust state transitions between storage modes
  - Enhanced concurrency handling with proper lock management
- Statistics and Monitoring
  - Extended statistics including quantization metrics
  - Real-time progress tracking during training operations
  - Enhanced memory usage reporting with compression analysis
  - Detailed timing information for performance optimization
- Default search parameters tuned for quantized and L1/L2 spaces (e.g., higher default `ef_search` for L1/L2).
- Improved error messages for quantization-related failures and configuration issues.
- Consistent handling of vector normalization (cosine) vs. raw (L1/L2) in all input/output paths.
Fixed
- Memory Management
  - Fixed temporary value lifetime issues in PyO3 integration
  - Resolved borrow checker conflicts in quantization pipeline
  - Corrected memory leaks during large-scale operations
  - Fixed reference counting for Python object handling
- Vector Processing
  - Fixed input format parsing for edge cases and invalid data
  - Resolved metadata conversion issues for complex Python objects
  - Corrected vector dimension validation with proper error messages
  - Fixed batch processing memory allocation issues
- Performance Issues
  - Optimized training memory usage to prevent out-of-memory errors
  - Fixed search performance degradation in large indexes
  - Resolved training stability issues with improved k-means initialization
  - Corrected distance calculation accuracy in quantized mode
- Error Handling
  - Enhanced validation for quantization configuration parameters
  - Improved error propagation from Rust to Python
  - Fixed panic conditions in edge cases
  - Better handling of invalid input combinations
- Fixed rare edge case where quantization training could stall with duplicate vectors.
- Resolved non-deterministic search results in small datasets with L1/L2 metrics by tuning search parameters.
- Fixed debug output leaking to production logs (now controlled by environment variable).
Removed
- Removed legacy single-path storage logic (now fully 3-path).
- Deprecated or removed any old quantization/test hooks that are no longer needed.
Version 0.1.2
Added
- Intelligent Batch Search: Automatic batch processing for multiple query vectors
- Transparent optimization: users get performance gains without API changes
- Smart strategy selection: sequential processing for ≤5 queries, parallel for 6+ queries
- Multiple input format support:
  - `List[List[f32]]` - Native Python lists of vectors
  - NumPy 2D arrays `(N, dims)` - Automatic batch detection
  - NumPy 1D arrays `(dims,)` - Single vector fallback
  - `List[f32]` - Traditional single vector (unchanged)
- Added comprehensive batch search test suite
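The strategy selection above can be sketched as a simple threshold dispatch. The threshold constant restates the numbers from the notes; the function name and the idea of returning a strategy label are illustrative, since the real dispatch lives in the Rust backend:

```python
PARALLEL_THRESHOLD = 6  # from the notes: sequential for <=5 queries, parallel for 6+

def choose_search_strategy(queries):
    """Sketch of batch-search strategy selection.

    Small batches skip parallel dispatch because thread coordination
    overhead would outweigh the gains.
    """
    return "parallel" if len(queries) >= PARALLEL_THRESHOLD else "sequential"

print(choose_search_strategy([[0.0] * 4] * 5))  # sequential
print(choose_search_strategy([[0.0] * 4] * 6))  # parallel
```

Because the dispatch is internal, callers pass any of the supported input formats to the same `search()` call and get the faster path automatically.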
Changed
- Optimized GIL release patterns for better concurrent performance
- Reduced lock contention through intelligent batching strategies
Version 0.1.1
Added
- Parallel batch insertion using `rayon` for large datasets (`insert_batch`).
- GIL-optimized `add_batch_parallel_gil_optimized()` path for inserts ≥ 50 items.
- Thread-safe locking using `RwLock` and `Mutex` for all core maps (`vectors`, `id_map`, etc.).
- `benchmark_concurrent_reads()` and `benchmark_raw_concurrent_performance()` for performance diagnostics.
- `get_performance_info()` for runtime introspection of bottlenecks and recommendations.
- Added `normalize_vector()` helper function to match Rust implementation behavior
- Added `assert_vectors_close()` utility for normalized vector comparison with tolerance
- Added additional tests for parallel batch processing validation, thread safety verification, and performance benchmarking.
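Pure-Python equivalents of the two test helpers named above might look like this; the exact signatures are assumptions, but the behavior (L2 normalization plus tolerance-based comparison) is what the notes describe:

```python
import math

def normalize_vector(v):
    """L2-normalize a vector, mirroring cosine-space behavior."""
    norm = math.sqrt(sum(x * x for x in v))
    return [x / norm for x in v] if norm > 0.0 else list(v)

def assert_vectors_close(a, b, tol=1e-6):
    """Compare two vectors after normalization, within a tolerance.

    This is why exact floating-point equality checks fail in cosine
    space: stored vectors are normalized copies of the inputs.
    """
    na, nb = normalize_vector(a), normalize_vector(b)
    assert len(na) == len(nb), "dimension mismatch"
    assert all(abs(x - y) <= tol for x, y in zip(na, nb)), "vectors differ"

# scaled copies normalize to the same direction
assert_vectors_close([3.0, 4.0], [0.6, 0.8])
print(normalize_vector([3.0, 4.0]))  # [0.6, 0.8]
```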
Changed
- `add()` now selects between sequential and parallel batch paths based on batch size.
- `search()` releases the Python GIL and performs fast concurrent metadata filtering and conversion.
- All internal maps (`vectors`, `metadata`, etc.) are now thread-safe for concurrent reads.
- Cosine vector normalization is now always applied consistently across all input formats.
Fixed
- Prevented deadlocks and data races by isolating all shared state behind locks.
- Ensured proper ID overwrite handling across HNSW and reverse mappings with lock safety.
- Fixed HNSW test suite to properly account for cosine-space vector normalization: exact floating-point comparisons were replaced with normalized vector assertions. The HNSW implementation was working correctly from the start; the tests were actually validating that cosine normalization was properly implemented.
- Fixed comprehensive search test expectations for HNSW approximation behavior
Removed
- Legacy single-threaded insertion behavior (now delegated via `add_batch_*` paths).
Version 0.1.0
Added
- Generic `create()` method for extensible vector index creation
- Registry-based architecture supporting multiple index types
- Case-insensitive index type matching: `create("HNSW")` or `create("hnsw")`
- Comprehensive parameter defaults with Rust backend validation
- Self-updating error messages showing all available index types
- `available_index_types()` class method for programmatic type discovery
- Future-ready architecture for IVF, LSH, Annoy, and Flat index types
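The registry-based architecture can be sketched in miniature. This is a conceptual model, not the zeusdb source: the registry contents and the `dict` stand-in constructor are invented for illustration, while the method names (`create`, `available_index_types`) come from the notes above:

```python
class VectorDatabase:
    """Conceptual sketch of the registry-based create() architecture."""

    _registry = {"hnsw": dict}  # index type -> constructor (stub for illustration)

    @classmethod
    def available_index_types(cls):
        """Programmatic type discovery, as described in the notes."""
        return sorted(cls._registry)

    def create(self, index_type, **params):
        key = index_type.lower()  # case-insensitive matching
        if key not in self._registry:
            # self-updating error message: lists whatever is registered
            raise ValueError(
                f"Unknown index type {index_type!r}. "
                f"Available: {', '.join(self.available_index_types())}")
        return self._registry[key](**params)

db = VectorDatabase()
idx = db.create("HNSW", dim=768)  # case-insensitive
print(VectorDatabase.available_index_types())  # ['hnsw']
```

Registering a new entry (e.g. a future IVF or Flat constructor) extends both `create()` and the error message with no other code changes, which is the point of the design.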
Changed
- ⚠️ Breaking Change: Replaced index-specific factory methods with generic `create()`
  - Migration: `VectorDatabase().create_index_hnsw(dim=768)` → `VectorDatabase().create("hnsw", dim=768)`
- All HNSW parameters now default to best-practice values; `dim` is the only commonly customized field. Settings like `m`, `ef_construction`, `expected_size`, and `space` already have good defaults, so users typically leave them alone. `dim` must match the shape of the data, so it is usually set explicitly.
- Improved error messages with dynamic type listing
Fixed
- Updated all internal testing files to use the new `.create()` API
Removed
- Index-specific factory methods (replaced by unified `create()` interface)
Version 0.0.9
Added
- `search()` is a more accurate and industry-standard term for vector similarity retrieval.
Changed
- ⚠️ Breaking Change: Renamed `HNSWIndex.query()` → `HNSWIndex.search()` to better reflect its role as a k-nearest neighbor (KNN) similarity search method.
- Updated all internal references, tests, and examples to reflect the new `.search()` method name.
Removed
- All usages of `.query()` must be replaced with `.search()`.
Version 0.0.8
Added
- Metadata filtering support for HNSW vector indexes
  - Filters can be applied during `query()` using Python dictionaries
  - Supported operators:
    - Basic equality: `"field": value`
    - Comparison: `{"gt": val}`, `{"gte": val}`, `{"lt": val}`, `{"lte": val}`
    - String ops: `{"contains": "x"}`, `{"startswith": "x"}`, `{"endswith": "x"}`
    - Array ops: `{"in": [a, b, c]}`
  - Filters can be combined across fields using AND logic
  - Supports `None` for null value matching
- serde and serde_json dependencies:
  - Enables typed serialization and deserialization of metadata
  - Powers the new metadata filtering and storage system using `serde_json::Value`
- Comprehensive test suite for metadata filtering:
  - Covers string, numeric, boolean, array, and null filters
  - Includes multi-condition queries and invalid filter error handling
  - Validates type fidelity in round-trip metadata storage and retrieval
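The operator semantics listed above can be sketched as a pure-Python matcher. This is an illustration of the filter language, not the Rust implementation; the function name is invented:

```python
def matches_filter(metadata, filters):
    """Sketch of the filter operators: conditions across fields AND together."""
    OPS = {
        "gt": lambda v, a: v > a, "gte": lambda v, a: v >= a,
        "lt": lambda v, a: v < a, "lte": lambda v, a: v <= a,
        "contains": lambda v, a: a in v,
        "startswith": lambda v, a: v.startswith(a),
        "endswith": lambda v, a: v.endswith(a),
        "in": lambda v, a: v in a,
    }
    for field, cond in filters.items():
        value = metadata.get(field)
        if isinstance(cond, dict):          # operator form, e.g. {"gte": 2020}
            for op, arg in cond.items():
                if op not in OPS:
                    raise ValueError(f"unknown operator: {op}")
                if value is None or not OPS[op](value, arg):
                    return False
        elif value != cond:                 # basic equality; None matches null
            return False
    return True

doc = {"author": "alice", "year": 2023, "tag": None}
print(matches_filter(doc, {"year": {"gte": 2020}, "author": {"startswith": "a"}}))  # True
print(matches_filter(doc, {"tag": None}))  # True
```

Multiple operators inside one field's dictionary also AND together, so `{"year": {"gte": 2020, "lt": 2024}}` expresses a range.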
Changed
- Vector metadata is now stored as `HashMap<String, Value>` for flexible typing
Fixed
- Improved type extraction and conversion between Python and Rust for metadata fields