Testing Guidelines

Overview

CRUDkit uses a comprehensive testing strategy to ensure code quality and reliability. Our testing stack includes:

Vitest: Fast unit testing framework
React Testing Library: Component testing utilities
Coverage Reports: Track test coverage metrics
CI/CD Integration: Automated testing on every push

Testing Stack

Core Dependencies

vitest: Test runner and assertion library
@testing-library/react: React component testing
@testing-library/jest-dom: Custom DOM matchers
@vitest/ui: Interactive test UI
@vitest/coverage-v8: Coverage reporting
jsdom: Browser environment simulation

Running Tests

Docker Commands (MANDATORY)

⚠️ IMPORTANT: This project REQUIRES Docker for all development and testing.

# Run all tests once
docker compose exec scripthammer pnpm test

# Run tests in watch mode
docker compose exec scripthammer pnpm test:watch

# Run tests with UI
docker compose exec scripthammer pnpm test:ui

# Generate coverage report
docker compose exec scripthammer pnpm test:coverage

NOTE: Local pnpm/npm commands are NOT supported. All testing MUST use Docker.

Writing Tests

Component Testing

Components should be tested for:

Rendering: Component renders without errors
Props: Props are handled correctly
User Interactions: Click, type, focus events work
States: Different states display correctly
Accessibility: ARIA attributes and roles are present

Example Component Test

import { describe, it, expect, vi } from 'vitest';
import { render, screen, fireEvent } from '@testing-library/react';
import Button from './Button';

describe('Button Component', () => {
  it('renders with children text', () => {
    render(<Button>Click me</Button>);
    expect(screen.getByRole('button')).toHaveTextContent('Click me');
  });

  it('handles click events', () => {
    const handleClick = vi.fn();
    render(<Button onClick={handleClick}>Click me</Button>);

    fireEvent.click(screen.getByRole('button'));
    expect(handleClick).toHaveBeenCalledTimes(1);
  });

  it('can be disabled', () => {
    render(<Button disabled>Disabled</Button>);
    expect(screen.getByRole('button')).toBeDisabled();
  });
});

Best Practices

Use Testing Library Queries: Prefer queries that reflect how users interact
- Good: getByRole, getByLabelText, getByPlaceholderText
- Avoid: getByTestId (unless necessary)

Test User Behavior: Focus on what users see and do

// Good: Test user-visible behavior
expect(screen.getByRole('button')).toHaveTextContent('Submit');

// Avoid: Test implementation details
expect(component.state.isSubmitting).toBe(true);

Keep Tests Isolated: Each test should be independent

describe('Component', () => {
  // Reset mocks after each test
  afterEach(() => {
    vi.clearAllMocks();
  });
});

Use Descriptive Names: Test names should explain what's being tested

// Good
it('displays error message when email is invalid');

// Avoid
it('test email validation');

Test Structure

File Organization

src/
├── components/
│   ├── subatomic/
│   │   ├── Button/
│   │   │   ├── Button.tsx
│   │   │   ├── Button.test.tsx  # Component test
│   │   │   └── Button.stories.tsx
│   │   └── Input/
│   │       ├── Input.tsx
│   │       └── Input.test.tsx
│   └── atomic/
│       └── Card/
│           ├── Card.tsx
│           └── Card.test.tsx
├── utils/
│   ├── theme.ts
│   └── theme.test.ts  # Utility test
└── test/
    └── setup.ts  # Test configuration

Test Configuration

The test environment is configured in vitest.config.ts:

export default defineConfig({
  plugins: [react()],
  test: {
    environment: 'jsdom',
    globals: true,
    setupFiles: './src/test/setup.ts',
    coverage: {
      provider: 'v8',
      thresholds: {
        statements: 10,
        branches: 10,
        functions: 10,
        lines: 10,
      },
    },
  },
});

Coverage Requirements

Current Thresholds

Statements: 0.5%
Branches: 0.5%
Functions: 0.5%
Lines: 0.5%

These thresholds will increase as the project matures:

Sprint 2: 0.5% (current - minimal baseline)
Sprint 3: 10%
Sprint 4: 25%
Sprint 5: 50%
Sprint 6: 75%

Viewing Coverage

# Generate coverage report (inside Docker)
docker compose exec scripthammer pnpm test:coverage

# Coverage report is generated in /coverage directory
# View it from your host machine:
open coverage/index.html

Known Issues

Colorblind Mode Tests (10 failures as of 2025-09-14)

The following tests are currently failing due to test implementation issues, not functionality bugs:

ColorblindToggle Component (6 failures):

Dropdown not rendering in test environment
Tests looking for "Color Vision Settings" text that may be rendered differently
Focus management tests failing due to dropdown behavior

useColorblindMode Hook (3 failures):

localStorage persistence tests expecting different state updates
Pattern class toggle tests not detecting DOM changes correctly

ColorblindFilters Component (1 failure):

Parent element assertion failing in render test

These failures do not affect the actual functionality of the colorblind assistance feature, which works correctly in the application. The issues are related to test setup and expectations.

Test Suite Architecture

Vitest Exclusions (vitest.config.ts)

The main vitest suite excludes 105 tests across three categories:

TDD Placeholders (47 tests) - Features Not Yet Implemented

These tests intentionally fail with expect(true).toBe(false) until features are built:

tests/contract/email-notifications.test.ts (17 tests) - Email Edge Function
tests/contract/stripe-webhook.test.ts (14 tests) - Stripe webhook handling
tests/contract/paypal-webhook.test.ts (15 tests) - PayPal webhook handling
tests/contract/profile/delete-account.contract.test.ts (1 test) - Cascade delete

Real Tests Excluded for Rate Limits (52 tests)

Fully implemented but excluded to avoid hitting shared Supabase instance:

tests/integration/auth/oauth-flow.test.ts (10 tests)
tests/integration/auth/password-reset-flow.test.ts (7 tests)
tests/integration/auth/sign-in-flow.test.ts (7 tests)
tests/integration/auth/sign-up-flow.test.ts (6 tests)
tests/integration/auth/protected-routes.test.ts (10 tests)
tests/contract/auth/*.test.ts (7 tests)
tests/integration/messaging/database-setup.test.ts (14 tests)

Future work: Re-enable when dedicated test Supabase instance available.

Canvas API Tests (6 tests) - Covered by E2E

Need real browser Canvas API, covered by Playwright E2E tests:

src/lib/avatar/__tests__/image-processing.test.ts (6 tests)
src/lib/avatar/__tests__/validation.test.ts (7 tests)
tests/integration/avatar/upload-flow.integration.test.ts (5 tests)

Test Count Summary

Suite	Tests	Notes
Vitest (main)	~2,300	Runs by default
Excluded (TDD)	47	Waiting for feature implementation
Excluded (rate limits)	52	Re-enable with dedicated test DB
Excluded (Canvas)	6	Covered by E2E
E2E (Playwright)	54 files	Run separately

Running Excluded Tests Manually

# Run integration auth tests (requires dedicated Supabase)
docker compose exec scripthammer pnpm test tests/integration/auth/ -- --run

# Run contract tests (some will fail - TDD placeholders)
docker compose exec scripthammer pnpm test tests/contract/ -- --run

CI/CD Integration

Tests run automatically on:

Every push to main or develop
Every pull request

The CI pipeline (/.github/workflows/ci.yml) runs:

Linting
Type checking
Unit tests
Coverage check
Build verification

Pre-commit Hooks

Husky runs tests on staged files before commit. Note that git hooks run on your host machine, but all testing commands are executed inside Docker:

# .husky/pre-commit
docker compose exec -T scripthammer pnpm lint-staged

Lint-staged configuration:

JS/TS files: ESLint + related tests
CSS/MD/JSON: Prettier formatting

Debugging Tests

Interactive Mode

# Open Vitest UI for debugging (inside Docker)
docker compose exec scripthammer pnpm test:ui

# Access the UI at http://localhost:51204 (or the port shown in terminal)

VSCode Integration

Install the Vitest extension for:

Run tests from editor
Debug with breakpoints
See inline coverage

Common Issues

Module not found: Check import paths and aliases
DOM not available: Ensure jsdom environment is set
Async issues: Use waitFor for async operations
React hooks errors: Wrap in renderHook from testing library

Testing Checklist

Before committing:

E2E Testing with Playwright

Overview

End-to-end tests use Playwright to test complete user workflows in real browsers. E2E tests are local-only (not run in CI) due to requiring authenticated sessions.

Test Users Setup

E2E tests require multiple test users for multi-user scenarios (messaging, connections, group chats).

⚠️ CRITICAL: Email Domain Requirement

Supabase validates email domains and rejects @example.com emails. You MUST use a real email domain. Gmail plus aliases work perfectly:

# REQUIRED in .env for dynamically generated test emails
TEST_EMAIL_DOMAIN=yourname+e2e@gmail.com

Without TEST_EMAIL_DOMAIN, E2E tests will fail with "Email address is invalid" errors.

Test Users (use Gmail plus aliases):

User	Email Example	Purpose
Primary	yourname+test-a@gmail.com	Runs E2E tests
Secondary	yourname+test-b@gmail.com	Multi-user tests (connections, messaging)
Tertiary	yourname+test-c@gmail.com	Group chat tests (3+ members)
Admin	admin@scripthammer.com	Welcome messages

Setup (one-time):

# 1. Create test users in Supabase
docker compose exec scripthammer pnpm exec tsx scripts/seed-test-users.ts

# 2. Create connections between users
docker compose exec scripthammer pnpm exec tsx scripts/seed-connections.ts

Environment Variables (in .env):

# REQUIRED - Dynamic test email generation
TEST_EMAIL_DOMAIN=yourname+e2e@gmail.com

# Pre-seeded test users (use Gmail plus aliases)
TEST_USER_PRIMARY_EMAIL=yourname+test-a@gmail.com
TEST_USER_PRIMARY_PASSWORD=<secure-password>
TEST_USER_SECONDARY_EMAIL=yourname+test-b@gmail.com
TEST_USER_SECONDARY_PASSWORD=<secure-password>
TEST_USER_TERTIARY_EMAIL=yourname+test-c@gmail.com
TEST_USER_TERTIARY_PASSWORD=<secure-password>

Running E2E Tests

# Run all E2E tests (starts dev server automatically)
docker compose exec scripthammer pnpm exec playwright test

# Run specific test file
docker compose exec scripthammer pnpm exec playwright test tests/e2e/messaging/

# Run with existing dev server (faster)
SKIP_WEBSERVER=true docker compose exec -e SKIP_WEBSERVER=true scripthammer pnpm exec playwright test

# Run specific browser
docker compose exec scripthammer pnpm exec playwright test --project=chromium

# Run with UI (headed mode)
docker compose exec scripthammer pnpm exec playwright test --ui

# Generate HTML report
docker compose exec scripthammer pnpm exec playwright show-report

Test Structure

tests/e2e/
├── messaging/
│   ├── complete-user-workflow.spec.ts    # Full messaging flow
│   ├── encrypted-messaging.spec.ts       # E2E encryption tests
│   ├── friend-requests.spec.ts           # Connection flows
│   ├── group-chat-multiuser.spec.ts      # Group chat with 3+ users
│   └── ...
└── auth/
    └── ...

Writing E2E Tests

import { test, expect } from '@playwright/test';
import {
  dismissCookieBanner,
  performSignIn,
  waitForAuthenticatedState,
  handleReAuthModal,
} from './utils/test-user-factory';

test.describe('Feature Name', () => {
  test('should do something', async ({ page }) => {
    // Sign in using helper (handles errors, waits for auth hydration)
    await page.goto('/sign-in');
    const result = await performSignIn(
      page,
      process.env.TEST_USER_PRIMARY_EMAIL!,
      process.env.TEST_USER_PRIMARY_PASSWORD!
    );
    if (!result.success) {
      throw new Error(`Sign-in failed: ${result.error}`);
    }

    // Test actions...
    await expect(page.locator('...')).toBeVisible();
  });
});

E2E Test Helpers (test-user-factory.ts)

The tests/e2e/utils/test-user-factory.ts file provides essential helpers:

Helper	Purpose
`performSignIn(page, email, password)`	Sign in with error detection, returns `{ success, error? }`
`waitForAuthenticatedState(page)`	Wait for AuthContext to hydrate (Sign Out button visible)
`dismissCookieBanner(page)`	Dismiss GDPR cookie consent banner if present
`handleReAuthModal(page, password)`	Handle ReAuthModal for messaging password unlock
`getAdminClient()`	Get Supabase admin client for test data setup
`getUserByEmail(email)`	Look up user by email via admin client
`getDisplayNameByEmail(email)`	Get/set display_name for user search tests

Example: Messaging tests with beforeAll setup

import { test, expect } from '@playwright/test';
import {
  dismissCookieBanner,
  handleReAuthModal,
  waitForAuthenticatedState,
  getAdminClient,
  getUserByEmail,
} from '../utils/test-user-factory';

let setupSucceeded = false;
let setupError = '';

// Create connection AND conversation before all tests
test.beforeAll(async () => {
  const adminClient = getAdminClient();
  if (!adminClient) {
    setupError = 'Admin client unavailable';
    return;
  }

  const userA = await getUserByEmail(TEST_USER_1.email);
  const userB = await getUserByEmail(TEST_USER_2.email);

  // Create connection
  await adminClient.from('user_connections').upsert({
    requester_id: userA.id,
    addressee_id: userB.id,
    status: 'accepted',
  });

  // Create conversation with CANONICAL UUID ordering
  const [p1, p2] =
    userA.id < userB.id ? [userA.id, userB.id] : [userB.id, userA.id];

  await adminClient.from('conversations').upsert({
    participant_1_id: p1,
    participant_2_id: p2,
  });

  setupSucceeded = true;
});

test.beforeEach(async ({ page }, testInfo) => {
  if (!setupSucceeded) {
    testInfo.skip(true, `Setup failed: ${setupError}`);
  }
});

Key Pattern: Canonical UUID Ordering

The conversations table requires participant_1_id < participant_2_id (lexicographically). Always sort IDs before inserting:

const [p1, p2] =
  userA.id < userB.id ? [userA.id, userB.id] : [userB.id, userA.id];

GDPR Consent in Payment Tests

Payment tests require GDPR consent before Stripe loads:

async function handlePaymentConsent(page: Page) {
  // Clear localStorage to ensure fresh consent state
  await page.evaluate(() => {
    localStorage.removeItem('payment_consent');
  });

  const acceptButton = page.getByRole('button', { name: /Accept & Continue/i });
  if (await acceptButton.isVisible().catch(() => false)) {
    await acceptButton.click();
    // Wait for Step 2 (payment form) to appear
    await expect(page.getByRole('heading', { name: /Step 2/i })).toBeVisible({
      timeout: 5000,
    });
  }
}

// Usage in test
await page.goto('/payment-demo');
await dismissCookieBanner(page);
await handlePaymentConsent(page);
// Now payment form is ready

Common E2E Test Failures and Solutions

Symptom	Root Cause	Solution
"Sign In" visible in navbar after sign-in	Auth not hydrated	Use `performSignIn()` + `waitForAuthenticatedState()`
"Conversation not found"	Missing `conversations` record	Add `beforeAll` with connection AND conversation creation
"No users found" in search	`display_name` is NULL	Use `getDisplayNameByEmail()` which sets it
Payment buttons disabled	GDPR consent not given	Call `handlePaymentConsent()` before payment tests
ReAuthModal blocks messaging	Encryption keys need unlock	Call `handleReAuthModal(page, password)`
Cookie banner overlays form	Banner not dismissed	Call `dismissCookieBanner(page)` after navigation

Supabase Rate Limiting in E2E Tests

Supabase enforces rate limits that can cause E2E test failures. Understanding these limits is critical for reliable testing.

How Supabase Rate Limiting Works

Type	Scope	Default	Configurable
API Rate Limits	Per IP	30-360/hour	Yes (via Management API)
Account Lockout	Per Email	5 failed attempts	No (built-in security)
Email Sending	Per Project	4/hour (free tier)	Yes (custom SMTP)

Key Insight: Login rate limiting is IP-based, not email-based. All tests running from the same CI runner share the same IP and rate limit bucket.

Symptoms of Rate Limiting

"Too many failed attempts. Your account has been temporarily locked. Please try again in 15 minutes."

This error indicates the account lockout feature triggered after 5+ failed login attempts.

Solutions for Rate Limiting Tests

1. Run rate limiting tests in SERIAL mode:

// tests/e2e/auth/rate-limiting.spec.ts
test.describe.configure({ mode: 'serial' });

test.describe('Rate Limiting', () => {
  let sharedEmail: string;

  test.beforeAll(() => {
    // Generate ONE email for all tests
    sharedEmail = generateTestEmail('ratelimit');
  });

  test('1. triggers rate limit', async ({ page }) => {
    // Only THIS test triggers rate limiting
    for (let i = 0; i < 6; i++) {
      /* failed attempts */
    }
  });

  test('2. verifies lockout message', async ({ page }) => {
    // Verifies the ALREADY triggered rate limit
    // Only 1 attempt, not 6
  });
});

2. Increase Supabase rate limits via Management API:

# Check current limits
docker compose exec scripthammer node -e "
const token = process.env.SUPABASE_ACCESS_TOKEN;
const ref = 'your-project-ref';
fetch(\`https://api.supabase.com/v1/projects/\${ref}/config/auth\`)
  .then(r => r.json())
  .then(d => console.log({
    rate_limit_verify: d.rate_limit_verify,
    rate_limit_token_refresh: d.rate_limit_token_refresh
  }));
"

# Increase limits for testing
docker compose exec scripthammer node -e "
fetch(\`https://api.supabase.com/v1/projects/\${ref}/config/auth\`, {
  method: 'PATCH',
  headers: { 'Authorization': \`Bearer \${token}\`, 'Content-Type': 'application/json' },
  body: JSON.stringify({ rate_limit_verify: 360 })
}).then(r => r.json()).then(console.log);
"

3. Required environment variables:

# .env - for Management API access
SUPABASE_ACCESS_TOKEN=sbp_xxx...
SUPABASE_PROJECT_REF=your-project-ref

Anti-Patterns to Avoid

// ❌ BAD: Each test independently triggers rate limiting
test.describe('Rate Limiting', () => {
  test('test 1', async () => {
    for (i = 0; i < 6; i++) {
      /* attempt */
    }
  });
  test('test 2', async () => {
    for (i = 0; i < 6; i++) {
      /* attempt */
    }
  });
  test('test 3', async () => {
    for (i = 0; i < 6; i++) {
      /* attempt */
    }
  });
  // Result: 18+ attempts = IP blocked, tests 2 and 3 fail
});

// ❌ BAD: Assuming unique emails bypass rate limits
const email1 = generateEmail('test1');
const email2 = generateEmail('test2');
// This doesn't help - rate limiting is IP-based, not email-based

Playwright Project Ordering (Implemented Solution)

This project uses Playwright project dependencies to ensure rate-limiting tests run before sign-up tests can consume the IP quota:

playwright.config.ts project execution order:
┌──────────────────┐
│  rate-limiting   │  ← Runs FIRST (clean IP state)
└────────┬─────────┘
         │ depends on
┌────────▼─────────┐
│   brute-force    │  ← Runs second
└────────┬─────────┘
         │ depends on
┌────────▼─────────┐
│     signup       │  ← Runs LAST (can consume quota)
└──────────────────┘

┌──────────────────┐
│    chromium      │  ← Runs in parallel (excludes above tests)
│    firefox       │
│    webkit        │
│    Mobile-*      │
└──────────────────┘

Key files:

playwright.config.ts - Defines project dependencies via dependencies: ['rate-limiting']
tests/e2e/utils/rate-limit-admin.ts - Clears rate_limit_attempts table before tests
.github/workflows/e2e.yml - Runs ordered projects first, then parallel tests

How it works:

Rate-limiting tests use test.beforeAll() to clear the custom rate_limit_attempts table
Playwright enforces project execution order via dependencies
The chromium project uses testIgnore to exclude rate-limiting/brute-force/signup tests
CI workflow runs ordered projects first, then remaining tests

For Forked Projects

If you're experiencing rate limiting issues in a fork:

The project ordering is already configured - tests should work out of the box
Increase rate limits via Management API if still seeing issues (see above)
Verify SUPABASE_SERVICE_ROLE_KEY is set - needed for rate limit table cleanup
Consider skipping rate limiting tests in CI if they test Supabase behavior, not your code

Accessibility Testing (axe-core)

E2E accessibility tests use axe-playwright to run axe-core automated accessibility checks.

Theme-Dependent Violations

DaisyUI provides 32 themes with varying color schemes. Not all themes meet WCAG AA contrast ratios. To handle this, we exclude theme-dependent rules from automated checks:

// tests/e2e/tests/accessibility.spec.ts
const axeOptions = {
  axeOptions: {
    rules: {
      'color-contrast': { enabled: false }, // Theme-dependent
      'landmark-unique': { enabled: false }, // Multiple nav elements acceptable
    },
  },
};

test('homepage passes automated accessibility checks', async ({ page }) => {
  await injectAxe(page);
  await checkA11y(page, undefined, {
    detailedReport: true,
    ...axeOptions,
  });
});

Excluded Rules:

Rule	Impact	Reason
`color-contrast`	Serious	DaisyUI themes have varying contrast - theme choice
`landmark-unique`	Moderate	GlobalNav + footer navigation is acceptable pattern

Advisory Color Contrast Test

A separate test logs contrast warnings without failing:

test('color contrast meets WCAG standards', async ({ page }) => {
  // Runs axe-core color-contrast check and logs warnings
  // Does NOT fail - this is informational only
  const contrastViolations = results.violations.filter(
    (v) => v.id === 'color-contrast'
  );

  if (contrastViolations.length > 0) {
    console.warn(
      `[Advisory] ${contrastViolations[0].nodes.length} color contrast issues found. ` +
        `Consider testing with a high-contrast theme.`
    );
  }
});

Avatar Upload Accessibility Tests

Avatar upload a11y tests require authentication. Ensure cookie banner is dismissed before sign-in:

test.beforeEach(async ({ page }) => {
  await page.goto('/sign-in');
  await page.waitForLoadState('networkidle');
  await dismissCookieBanner(page); // BEFORE performSignIn

  const result = await performSignIn(page, testEmail, testPassword);
  if (!result.success) {
    throw new Error(`Sign-in failed: ${result.error}`);
  }

  await page.goto('/account');
  await dismissCookieBanner(page);
});

Debugging E2E Tests

# Run with trace on failure
docker compose exec scripthammer pnpm exec playwright test --trace on

# View trace
docker compose exec scripthammer pnpm exec playwright show-trace test-results/*/trace.zip

# Run in debug mode (step through)
docker compose exec scripthammer pnpm exec playwright test --debug

Resources

Last Updated: Auth persistence fixes, Accessibility testing patterns (Dec 27, 2025)

FilesExpand file tree

TESTING.md

Latest commit

History

TESTING.md

File metadata and controls

Testing Guidelines

Overview

Testing Stack

Core Dependencies

Running Tests

Docker Commands (MANDATORY)

Writing Tests

Component Testing

Example Component Test

Best Practices

Test Structure

File Organization

Test Configuration

Coverage Requirements

Current Thresholds

Viewing Coverage

Known Issues

Colorblind Mode Tests (10 failures as of 2025-09-14)

Test Suite Architecture

Vitest Exclusions (vitest.config.ts)

TDD Placeholders (47 tests) - Features Not Yet Implemented

Real Tests Excluded for Rate Limits (52 tests)

Canvas API Tests (6 tests) - Covered by E2E

Test Count Summary

Running Excluded Tests Manually

CI/CD Integration

Pre-commit Hooks

Debugging Tests

Interactive Mode

VSCode Integration

Common Issues

Testing Checklist

E2E Testing with Playwright

Overview

Test Users Setup

Running E2E Tests

Test Structure

Writing E2E Tests

E2E Test Helpers (test-user-factory.ts)

GDPR Consent in Payment Tests

Common E2E Test Failures and Solutions

Supabase Rate Limiting in E2E Tests

How Supabase Rate Limiting Works

Symptoms of Rate Limiting

Solutions for Rate Limiting Tests

Anti-Patterns to Avoid

Playwright Project Ordering (Implemented Solution)

For Forked Projects

Accessibility Testing (axe-core)

Theme-Dependent Violations

Advisory Color Contrast Test

Avatar Upload Accessibility Tests

Debugging E2E Tests

Resources