arianaazarbal/recontextualization

Recontextualization

This repository contains experimental code for Recontextualization: Mitigating Specification Gaming without Modifying the Specification. Each subdirectory is a self-contained module for one experimental setting:

  • evaluation-metric-gaming/ - Mitigating General Evaluation Gaming
  • test-case-hacking/ - Preventing Test Case Hacking in Code Generation
  • deception-evasion-honesty/ - Preventing Learned Evasion of a Lie Detector
  • sycophantic-post-training/ - Mitigating Emergence of Sycophancy in Post-training

Please refer to the individual README files in each subdirectory for specific setup and execution instructions.
