Unlearning or Concealment? A Critical Analysis and Evaluation Metrics for Unlearning in Diffusion Models
Aakash Sen Sharma, Niladri Sarkar, Vikram Chundawat, Ankur A Mali, Murari Mandal
We expose a significant vulnerability in diffusion model unlearning methods: an attacker can reverse the supposed erasure of concepts at inference time. Our approach leverages a novel Partial Diffusion Attack that operates across all layers of the model, recovering forgotten concepts in an unsupervised, data-free manner. Our work currently focuses on unlearning methods applied to Stable Diffusion 1.4; further research is needed to generalize these findings to other models and versions.
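The core idea behind a partial diffusion attack can be sketched with a toy example: instead of starting the reverse process from pure noise (step T), the attacker forward-noises an image of the supposedly erased concept only up to an intermediate step t*, then lets the "unlearned" model finish denoising from there. The snippet below is a minimal, self-contained illustration of that idea, not the authors' implementation; the schedule values, `t_start`, and the stand-in `denoise_step` are all illustrative assumptions (a real attack would call the unlearned U-Net's reverse step).

```python
import numpy as np

T = 1000                              # total diffusion steps (assumed)
betas = np.linspace(1e-4, 0.02, T)    # standard linear noise schedule
alphas_cum = np.cumprod(1.0 - betas)  # cumulative product, \bar{alpha}_t

def q_sample(x0, t, rng):
    """Forward-noise x0 to step t: x_t = sqrt(abar_t)*x0 + sqrt(1-abar_t)*eps."""
    abar = alphas_cum[t]
    eps = rng.standard_normal(x0.shape)
    return np.sqrt(abar) * x0 + np.sqrt(1.0 - abar) * eps

def partial_diffusion_attack(x0, t_start, denoise_step, rng):
    """Start the reverse process from an intermediate step t_start, not T.

    If unlearning mainly altered the high-noise (large-t) steps, the
    low-noise steps can still reconstruct the forgotten concept.
    """
    x = q_sample(x0, t_start, rng)
    for t in range(t_start, -1, -1):
        x = denoise_step(x, t)        # the (unlearned) model's reverse step
    return x

# Dummy "model": a reverse step that shrinks toward data it memorised;
# it stands in for the real U-Net purely to make this sketch runnable.
rng = np.random.default_rng(0)
x0 = rng.standard_normal((4, 4))      # stand-in "concept" image
recovered = partial_diffusion_attack(
    x0, t_start=300,
    denoise_step=lambda x, t: 0.99 * x + 0.01 * x0,
    rng=rng)
print(recovered.shape)
```

Because the attack injects only partial noise, most of the concept's structure survives the forward process, which is what makes concealment-style unlearning recoverable.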
To set up your Python environment:
python3 -m venv environ
source ./environ/bin/activate
cd diffusers
pip install .

This code is shared for educational purposes only and is not intended for any harmful or malicious generation, such as creating misleading information, harmful content, or impersonating others.
Our work is based on a diffusers fork by @bghira.
If you find this useful for your research, please cite the following:
@misc{sharma2024unlearningconcealmentcriticalanalysis,
  title={Unlearning or Concealment? A Critical Analysis and Evaluation Metrics for Unlearning in Diffusion Models},
  author={Aakash Sen Sharma and Niladri Sarkar and Vikram Chundawat and Ankur A Mali and Murari Mandal},
  year={2024},
  eprint={2409.05668},
  archivePrefix={arXiv},
  primaryClass={cs.LG},
  url={https://arxiv.org/abs/2409.05668}
}