Form Playground Subconscious

A highly configurable form-filling test environment for browser automation and LLM form-filling evaluation.

Overview

This project provides a dynamic form system where forms are defined in JSON configuration files. It supports 28+ input types, validation, multipage forms, and most importantly - ground truth comparison for evaluating LLM form-filling accuracy.

Features

Configuration-Driven Forms: Define forms in JSON/YAML instead of hardcoded React components
28+ Input Types: Text, email, phone, date, time, file upload, selectors, sliders, currency, credit card, and more
AI-Powered Evaluation: Uses GPT-4o-mini as a judge to evaluate dynamic fields (text, textarea, address) with semantic similarity scoring
Intelligent Scoring System:
- Fixed Field Score: Average accuracy for deterministic fields (email, date, select, checkbox, etc.)
- Dynamic Field Score: Average AI-evaluated score for non-deterministic fields (text, textarea, address)
- Overall Accuracy: Weighted average across all fields (0-100%)
Database Integration: Automatically stores all evaluation results in PostgreSQL (Neon) with detailed field-by-field analysis
Ground Truth Comparison: Compare LLM submissions against expected values with detailed accuracy reports
Multipage Forms: Support for multi-step forms with state persistence
Input to LLM: Each form includes context information for LLM form-filling
Real-time Validation: Field-level and form-level validation
Multiple Form Layouts: Single column, two column, split screen, wizard style, and website-style layouts
Industry Examples: Realistic forms for various industries (job applications, patient registration, payment forms, etc.)

Getting Started

Prerequisites

Node.js & npm installed - install with nvm

Installation

# Step 1: Clone the repository
git clone <YOUR_GIT_URL>

# Step 2: Navigate to the project directory
cd form-playground-subconcious

# Step 3: Install dependencies
npm install

# Step 4: Set up environment variables
# Create a .env file in the root directory with:
# DATABASE_URL=your_neon_postgresql_connection_string
# OPENAI_API_KEY=your_openai_api_key
# Note: Use OPENAI_API_KEY (not VITE_OPENAI_API_KEY) to keep the key secure on the server side

# Step 5: Start the development servers
# Option 1: Run both frontend and API server together
npm run dev:all

# Option 2: Run them separately (in different terminals)
# Terminal 1: Frontend (Vite)
npm run dev

# Terminal 2: API Server (Express)
npm run dev:api

The application will be available at http://localhost:8080 The API server will run on http://localhost:3001

Evaluation System

How It Works

Form Submission: When a form is submitted, the system compares submitted values against ground truth
Field Classification: Fields are automatically classified as:
- Fixed/Deterministic: Fields with exact matching (email, date, select, checkbox, etc.)
- Dynamic/Non-Deterministic: Fields requiring semantic evaluation (text, textarea, address)
AI Evaluation: Dynamic fields are evaluated one-by-one using GPT-4o-mini:
- Each field receives a similarity score (0-1)
- AI provides one-line feedback explaining the score
- Evaluation happens sequentially with progress indication
Score Calculation:
- Fixed fields: 1.0 for exact match, 0.0 for mismatch
- Dynamic fields: AI-generated score (0-1) based on semantic similarity
- Overall accuracy: Sum of all field scores ÷ total number of fields
Database Storage: All evaluation data is automatically saved to PostgreSQL:
- Form metadata (title, description, type, layout)
- Field-by-field evaluation with scores and feedback
- Separate columns for fixed field score, dynamic field score, and overall accuracy
- Timestamped records for tracking evaluation history

Evaluation Report

After form submission, users see a detailed evaluation report showing:

Fixed Fields Average Score (percentage)
Dynamic Fields Average Score (percentage)
Individual field results with:
- Expected vs. submitted values
- Score (for dynamic fields)
- AI feedback (for dynamic fields)
Overall Accuracy (bottom of page)

Project Structure

manual_config.json - Manually created form configurations
llm_generated_config.json - AI-generated form configurations
src/components/DynamicForm.tsx - Core form renderer with evaluation logic
src/components/form-fields/ - Individual field components
src/components/form-layouts/ - Form layout components
src/utils/form-comparison.ts - Ground truth comparison logic
src/utils/llm-evaluator.ts - AI-powered field evaluation
src/utils/api.ts - Database API integration
src/pages/ - Page components (Index, FormPage, MultipageFormPage, FormCompletePage)
api/save-evaluation.ts - Vercel serverless function for database operations
server.js - Local Express server for development

Form Configuration

Forms are defined in config_form.json with the following structure:

{
  "form-id": {
    "id": "form-id",
    "title": "Form Title",
    "description": "Form description",
    "type": "single-page" | "multipage",
    "inputToLLM": "Context information for LLM",
    "groundTruth": {
      "fieldId": "expected value"
    },
    "trainingTasks": [
      {
        "id": "task_1",
        "instruction": "Natural language instruction for the form filling task",
        "masked": false,
        "maskedFields": []
      }
    ],
    "pages": [...]
  }
}

Configuration Files

manual_config.json: Contains manually crafted form definitions.
llm_generated_config.json: Contains form definitions generated by LLMs. This file includes additional fields like trainingTasks for fine-tuning or evaluating models on specific form-filling instructions.

Distribution Analysis

To analyze the distribution of form components, layouts, and field types across your configuration files, you can use the included Python script.

# Run the distribution check script
python check_distribution.py

This script will read both manual_config.json and llm_generated_config.json and output detailed statistics about:

Form types and layouts
Field types and their frequency
Date picker and range styles
Required vs. optional fields

Technologies

Vite - Build tool and dev server
React - UI framework
TypeScript - Type safety
shadcn-ui - UI component library
Tailwind CSS - Styling
React Router DOM - Routing
date-fns - Date manipulation
OpenAI GPT-4o-mini - AI-powered field evaluation
PostgreSQL (Neon) - Database for storing evaluation results
Express.js - Local development API server
Vercel Serverless Functions - Production API endpoints

Database Setup

The application uses PostgreSQL (Neon) to store evaluation results. See DATABASE_SETUP.md for:

Database schema structure (refer to database_schema.sql for the raw SQL definition)
Environment variable configuration
Table structure and field descriptions

Form Generation Scripts

The project includes scripts to generate synthetic forms and training tasks using OpenAI's GPT-4o.

1. Generate New Forms (`generate_pages.py`)

Generates completely new form configurations (JSON) with realistic fields, layouts, and industries.

# Generate 1 batch (5 forms) by default
python generate_pages.py

# Generate multiple batches (e.g., 4 batches = 20 forms)
python generate_pages.py 4

Output: Appends new forms to public/llm_generated_config.json.
Features:
- Round-robin selection of layouts and industries.
- Realistic "ground truth" data generation.
- First-person inputToLLM context generation.

2. Generate Synthetic Tasks (`generate_synthetic_task.py`)

Adds trainingTasks to existing forms in manual_config.json and llm_generated_config.json.

python generate_synthetic_task.py

Purpose: Creates 5 variations of natural language instructions for each form.
Masking: Randomly masks (omits) certain fields in ~10% of tasks to train models on partial information.
Output: Updates the config files in place with a trainingTasks array for each form.

Important: Make sure to set DATABASE_URL in your Vercel environment variables for production deployment.

Building for Production

npm run build

The built files will be in the dist directory.

Deployment

See DEPLOYMENT.md for detailed Vercel deployment instructions, including:

Environment variable setup
Database configuration
Serverless function deployment

License

Private project

Name		Name	Last commit message	Last commit date
Latest commit History 29 Commits
api		api
public		public
src		src
.gitignore		.gitignore
README.md		README.md
bun.lockb		bun.lockb
check_distribution.py		check_distribution.py
components.json		components.json
database_schema.sql		database_schema.sql
eslint.config.js		eslint.config.js
generate_pages.py		generate_pages.py
generate_synthetic_task.py		generate_synthetic_task.py
index.html		index.html
package-lock.json		package-lock.json
package.json		package.json
postcss.config.js		postcss.config.js
requirements.txt		requirements.txt
server.js		server.js
tailwind.config.ts		tailwind.config.ts
tsconfig.app.json		tsconfig.app.json
tsconfig.json		tsconfig.json
tsconfig.node.json		tsconfig.node.json
vercel.json		vercel.json
vite.config.ts		vite.config.ts

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Form Playground Subconscious

Overview

Features

Getting Started

Prerequisites

Installation

Evaluation System

How It Works

Evaluation Report

Project Structure

Form Configuration

Configuration Files

Distribution Analysis

Technologies

Database Setup

Form Generation Scripts

1. Generate New Forms (`generate_pages.py`)

2. Generate Synthetic Tasks (`generate_synthetic_task.py`)

Building for Production

Deployment

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Form Playground Subconscious

Overview

Features

Getting Started

Prerequisites

Installation

Evaluation System

How It Works

Evaluation Report

Project Structure

Form Configuration

Configuration Files

Distribution Analysis

Technologies

Database Setup

Form Generation Scripts

1. Generate New Forms (generate_pages.py)

2. Generate Synthetic Tasks (generate_synthetic_task.py)

Building for Production

Deployment

License

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

1. Generate New Forms (`generate_pages.py`)

2. Generate Synthetic Tasks (`generate_synthetic_task.py`)

Packages