CLAUDE.md

This file provides guidance to Claude Code (claude.ai/code) when working with code in this repository.

Project Overview

This is a Python proof of concept project for accessing and analyzing Type 1 diabetes data from a DIY Loop system stored in MongoDB. The primary focus is CGM (Continuous Glucose Monitor) data analysis to identify patterns and trends for optimizing diabetes management.

Analysis Goals

Primary Focus: CGM Data Pattern Analysis

Time-scale analysis: past weeks, months (not viewing all 243K+ readings at once)
Temporal patterns: specific times of day, days of week
Trend identification for diabetes management optimization
Eventually: correlation with treatment settings for better glucose control

Analysis Workflow:

Exploratory analysis in marimo notebooks
Pattern discovery and statistical analysis
Treatment optimization insights (future goal)
Potential migration to other tools based on findings

Development Standards

This project follows Python best practices for professional coding with emphasis on:

Code Quality:

Type hints for all functions and methods
Comprehensive docstrings following Google/NumPy style
Clear variable and function names
Modular design with single responsibility principle
Error handling with informative messages

Reproducibility:

Pinned dependencies with exact versions
Environment configuration via .env files
Consistent data processing pipelines
Deterministic analysis workflows

Documentation:

All public functions must have detailed docstrings
Inline comments for complex logic
README with complete setup instructions
Analysis methodology documented in docs/
Code examples and usage patterns

Testing & Validation:

Input validation for all public methods
Data quality checks and cleaning
Error handling for database connections
Comprehensive testing of core functionality

Development Setup

This project uses uv for dependency management and follows a modern Python package structure with src layout. Key commands:

uv sync - Install dependencies
uv pip install -e . - Install package in editable mode (recommended for development)
uv run python -m src.sweetiepy.module - Run Python modules directly
uv add <package> - Add new dependencies

Package Installation: For development and usage, install the package in editable mode:

uv pip install -e .

This allows you to import modules naturally:

from sweetiepy.data.cgm import CGMDataAccess
from sweetiepy.connection.mongodb import MongoDBConnection

Project Structure

src/loopy/
├── connection/    # Database connectivity
│   └── mongodb.py
├── data/          # Data access modules
│   ├── cgm.py     # CGM/glucose data queries and analysis
│   ├── pump.py    # Pump/treatment data (bolus, basal, carbs)
│   └── merged.py  # CGM + pump settings time-synchronized analysis
└── utils/         # Utilities and debugging
    └── debug.py
docs/              # Documentation
dev/               # Development and analysis scripts
├── exploratory/   # Exploratory analysis notebooks
└── reports/       # Analysis reports
tests/             # Test modules

Key Dependencies

Current dependencies for MongoDB diabetes data analysis:

pymongo - MongoDB Python driver
pandas - Data manipulation and analysis (with PyArrow backend for performance)
pyarrow - High-performance backend for pandas
matplotlib and plotly - Data visualization
marimo - Interactive notebook environment for exploratory data analysis
python-dateutil - Date/time parsing utilities
python-dotenv - Environment variable management

Configuration:

import pandas as pd
pd.options.mode.dtype_backend = "pyarrow"  # Enable PyArrow backend

Module Imports:

from src.sweetiepy.connection.mongodb import MongoDBConnection
from src.sweetiepy.data.cgm import CGMDataAccess
from src.sweetiepy.data.pump import PumpDataAccess
from src.sweetiepy.data.merged import MergedDataAccess
from src.sweetiepy.utils.debug import debug_connection_info

Exploratory Analysis:

marimo notebooks for interactive data exploration and visualization
Run with: uv run marimo edit dev/exploratory/analysis.py
Ideal for pattern discovery and temporal analysis of CGM data

Data Architecture

The MongoDB database contains DIY Loop system data with these expected collections:

CGM readings (glucose values, timestamps)
Insulin pump data (basal rates, bolus doses, timestamps)
Loop system decisions and predictions

Development Stages

Stage 1: Database Connection

Create connection module with MongoDB URI from environment variables
Test basic connection and authentication
List available databases and collections
Verify connection can be established and closed properly

Stage 2: CGM Data Access

Connect to CGM/blood glucose readings collection
Explore collection schema and document structure
Implement basic query to retrieve recent CGM readings
Test data retrieval and verify data format

Stage 3: Time-Range Queries

Implement date/time filtering for CGM data
Add functions to query specific time periods (last 24h, week, custom range)
Test with various time ranges and validate results

Stage 4: Data Processing & DataFrame Integration

Convert MongoDB documents to pandas DataFrames with PyArrow backend
Implement efficient data cleaning and validation
Handle timestamp conversions and timezone management
Prepare data for time-series analysis (weeks/months focus)

Stage 5: Pattern Analysis & Temporal Insights

Time-of-day pattern analysis (hourly glucose trends)
Day-of-week pattern identification
Weekly and monthly trend analysis
Statistical summaries for specific time periods
Data preparation for marimo notebook exploration

Stage 6: Treatment Correlation Analysis ✅ (COMPLETED in v0.2.0)

MergedDataAccess module - synchronizes CGM readings with active pump settings
Time-based settings lookup - basal rates, carb ratios, ISF active at each CGM timestamp
Correlation analysis - how different settings affect glucose outcomes
Pattern identification - time-of-day and settings effectiveness analysis
Treatment context - recent insulin/carb events for comprehensive analysis

Stage 7: Advanced Analytics (Future)

Machine learning models for settings optimization
Predictive glucose modeling based on settings and treatments
Automated settings recommendations
Integration with other diabetes management tools

Analysis Patterns

CGM-Focused Time-Series Analysis:

Glucose trends over weeks/months (not full dataset)
Temporal patterns: hourly, daily, weekly cycles
Statistical analysis for specific time periods
Pattern discovery for treatment optimization
Efficient data processing for exploratory analysis in marimo notebooks

CGM + Pump Settings Correlation Analysis (NEW in v0.2.0):

Time-synchronized data - each CGM reading paired with active pump settings
Settings effectiveness - analyze glucose outcomes by basal rate, carb ratio, ISF
Temporal analysis - how settings perform at different times of day
Treatment context - include recent insulin/carb events for comprehensive view
Correlation studies - statistical relationships between settings and glucose control
Pattern identification - discover optimal settings for different conditions

Key Analysis Workflows:

# Basic merged analysis
with MergedDataAccess() as merged:
    df = merged.get_merged_cgm_and_settings(days=7)
    analysis = merged.analyze_settings_correlation(df)

# Treatment context analysis  
df_with_context = merged.get_merged_with_recent_treatments(days=3, lookback_hours=4)

# Time patterns with settings
hourly_patterns = df.groupby(['hour_of_day', 'active_basal'])['sgv'].mean()

See docs/analysis_patterns.md for detailed analysis methodology and approaches.

Key Module Commands

Test Database Connection:

uv run python -m src.sweetiepy.connection.mongodb

Debug Connection Issues:

uv run python -m src.sweetiepy.utils.debug

Test CGM Data Access:

uv run python -m src.sweetiepy.data.cgm

Test Merged Data (CGM + Settings):

uv run python -m src.sweetiepy.data.merged

Usage Example (3 months of data):

uv run python dev/usage_example.py

Merged Data Analysis Example:

uv run python dev/merged_data_example.py

Package Publishing

Build and Publish to PyPI:

# Update version in pyproject.toml first
uv run python -m build
uv run python publish.py

See notes/BUILD_AND_PUBLISH.md for detailed publishing instructions.

Security Considerations

Store MongoDB connection strings in environment variables
Never commit database credentials to version control
Use read-only database connections when possible

Active Technologies

Python 3.12+ (per constitution requires-python) + pymongo (MongoDB driver), python-dotenv (env config) (001-foundation)
MongoDB (Nightscout/Loop database - external, not managed by this package) (001-foundation)
Python 3.12+ (per constitution requires-python) + pymongo (MongoDB driver), pandas + pyarrow (DataFrame output), python-dotenv (env config) (002-data-retrieval)
MongoDB (Nightscout/Loop database - external, read-only access) (002-data-retrieval)

Recent Changes

001-foundation: Added Python 3.12+ (per constitution requires-python) + pymongo (MongoDB driver), python-dotenv (env config)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

CLAUDE.md

Project Overview

Analysis Goals

Development Standards

Development Setup

Project Structure

Key Dependencies

Data Architecture

Development Stages

Stage 1: Database Connection

Stage 2: CGM Data Access

Stage 3: Time-Range Queries

Stage 4: Data Processing & DataFrame Integration

Stage 5: Pattern Analysis & Temporal Insights

Stage 6: Treatment Correlation Analysis ✅ (COMPLETED in v0.2.0)

Stage 7: Advanced Analytics (Future)

Analysis Patterns

Key Module Commands

Package Publishing

Security Considerations

Active Technologies

Recent Changes

FilesExpand file tree

CLAUDE.md

Latest commit

History

CLAUDE.md

File metadata and controls

CLAUDE.md

Project Overview

Analysis Goals

Development Standards

Development Setup

Project Structure

Key Dependencies

Data Architecture

Development Stages

Stage 1: Database Connection

Stage 2: CGM Data Access

Stage 3: Time-Range Queries

Stage 4: Data Processing & DataFrame Integration

Stage 5: Pattern Analysis & Temporal Insights

Stage 6: Treatment Correlation Analysis ✅ (COMPLETED in v0.2.0)

Stage 7: Advanced Analytics (Future)

Analysis Patterns

Key Module Commands

Package Publishing

Security Considerations

Active Technologies

Recent Changes