This project investigates how different levels of AI execution autonomy (Manual, Assistance, Execution) affect system performance and intervention behavior in a controlled task environment.
- How does AI autonomy impact task completion time and error rates?
- How does autonomy influence user intervention behavior?
- Experimental setup with three autonomy conditions
- Interaction logging via a custom-built system
- Data analysis in R
- Non-parametric statistical tests:
- Kruskal-Wallis test
- Pairwise Wilcoxon test
- Metrics:
- Completion Time
- Error Rate
- Intervention Behavior
- R (data analysis)
- Quarto (report generation)
- Supabase (logging backend)
- Next.js (prototype interface)
The full analysis can be reproduced by rendering the Quarto document:
quarto render ai-execution-autonomy-analysis.qmd