feat: 플레이테스트 로그 분석 도구 + SUS + 행동 기반 이해도 측정 by SilverSupplier · Pull Request #126 · sw1029/magic

SilverSupplier · 2026-06-12T16:11:36Z

요약

HCI 평가에서 실험 설계를 보강하는 3가지를 추가합니다.

1. `scripts/analyze-playtest-attempts.py` (표준 라이브러리만 사용)

attempts.csv에서 프로토콜 지표를 자동 산출:

RQ1 학습 곡선 — 세션 내 성공률 이동 평균, 전/후반 비교 ("설명 없이 학습 가능한가"의 직접 증거)
RQ2 힌트 전후 비교 — 첫 힌트 노출 전후 + 조건부 성공률 (baseline 부재를 보완하는 준-인과 분석)
RQ3 family 공정성 — family별 첫 시도 성공률 + 혼동 행렬 CSV
공통: 실패 유형 분포, assist 도달률, 막힘 지점, 첫 시전 시간
--sus로 SUS 표준 채점 (홀짝 보정 ×2.5)

합성 데이터로 end-to-end 검증 완료.

2. RESEARCH_PROTOCOL — SUS-10 한국어 문항 추가

커스텀 5문항만으로는 외부 비교 기준이 없어, 표준 척도(기준선 68)를 사후 설문에 추가.

3. RESEARCH_PROTOCOL — 피드백 이해도의 행동 측정

"다음에 뭘 바꿀 건가요?" 발화와 다음 시도의 실제 변화를 대조 코딩 — 자기보고+행동 이중 측정.

FINAL_REPORT §5.2 지표 표도 동기화.

scripts/analyze-playtest-attempts.py auto-derives the protocol metrics from attempts.csv: per-session learning curve (RQ1), before/after-hint success rates (RQ2), per-family first-attempt success and confusion matrix (RQ3), failure type distribution, assist reach, stuck points, and SUS scoring. Stdlib only; verified against synthetic data. RESEARCH_PROTOCOL gains the SUS-10 questionnaire (Korean), a behavioral feedback-comprehension coding procedure (stated fix vs actual next attempt), and analysis steps. FINAL_REPORT metrics table updated to match.

SilverSupplier · 2026-06-13T16:01:13Z

Superseded by #129, which includes this commit and the consolidated final integration.

SilverSupplier requested a review from sw1029 as a code owner June 12, 2026 16:11

SilverSupplier closed this Jun 13, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: 플레이테스트 로그 분석 도구 + SUS + 행동 기반 이해도 측정#126

feat: 플레이테스트 로그 분석 도구 + SUS + 행동 기반 이해도 측정#126
SilverSupplier wants to merge 1 commit into
mainfrom
codex/hci-experiment-tooling

SilverSupplier commented Jun 12, 2026

Uh oh!

SilverSupplier commented Jun 13, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

SilverSupplier commented Jun 12, 2026

요약

1. scripts/analyze-playtest-attempts.py (표준 라이브러리만 사용)

2. RESEARCH_PROTOCOL — SUS-10 한국어 문항 추가

3. RESEARCH_PROTOCOL — 피드백 이해도의 행동 측정

Uh oh!

SilverSupplier commented Jun 13, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

1. `scripts/analyze-playtest-attempts.py` (표준 라이브러리만 사용)