CISB-dataset

A dataset of Compiler-Introduced-Security-bugs (CISB) with reproduction materials. These CISBs are manually collected from the GCC/Clang bugzilla and Linux kernel through an empirical study.

See our paper (to appear) for more information on the CISB taxonomy and collection methodology.

Our data is stored in CISB-dataset/dataset. More details here.

Reproduction Material

We provide the following reproduction materials:

test code for all the reproducted CISB;
an automatic tool to test whether one CISB is triggered with pre-defined oracles.

More details here

Aritifact setup

We provide a Dockerfile that automates the setup process for our artifact. With this Dockerfile, users can easily download the dataset and evaluation materials, as well as install all the necessary software requirements in one step.

For running one of the mitigation evaluation experiments that requires SPEC CPU 2006, it is recommended to mount the host directory containing SPEC CPU 2006 to a specific directory (/cisb_docker/CISB-dataset/spec/cpu2006) in the Docker container. Here are the instructions to build and run a Docker container with this:

Make sure you have Docker installed on your system.
Download the SPEC CPU 2006 benchmark and extract it to a directory on your host machine.
Navigate to the directory where you have the Dockerfile and run the following command to build the Docker image:

cd path/to/Dockerfile
docker build -t cisb_docker .

Once the build is complete, run the following command to start a container:

docker run -itd -v /path/to/cpu2006:/cisb_docker/CISB-dataset/spec/cpu2006 --privileged cisb_docker

As an alternative, you can also place SPEC CPU 2006 anywhere you like within the Docker container. In that case, you will need to set the environment variable before running the experiment in the container.

export SEPC_CPU_2006_PATH='path/to/cpu2006'

Aritifact experiments

All of our experiments can be done through a script.

E1: CISB statistics

Execute the Python script to obtain the statistics of CISBs in our dataset. The result should be in line with the data in Figure 2 and Figure 3 of the paper.

python3 statistic.py -e cisb-statistics

E2: Evaulation of mitigations

Review a list of bugs where the prevention performed by programmers failed. This list can be obtained by executing a script. The expected result is those CISBs exist.

python3 statistic.py -e human-mitigation

Run a script to obtain statistics on the effectiveness of compiler mitigations. The output results should be in line with the data shown in Table 6 of the paper. We also provide a guide to measure the effectiveness of each strategy separately.

python3 statistic.py -e mitigation-effectiveness

Run two script to measure the overhead of different compiler prevention strategies using the SPEC CPU 2006 benchmark. First, run the script to lauch all the SPEC CPU 2006 tests. It takes 62 hours to finish all the tests. You might need to set up your SPEC CPU 2006 before that.

# python3 spec/config/test_all.py

Second, run a script to obtain the statistics of the overhead of tested mitigations

# python3 statistic.py -e mitigation-overhead

The output results should be in line with the data shown in Table 6 of the paper. We also provide a guide to measure the overhead of each strategy separately.

E3: Target bugs of automatic prevention works

Execute the script to obtain the statistics of CISBs that can theoretically be prevented by automatic prevention works. The result should be in line with the data in Figure 7 of the paper.

python3 statistic.py -e target-cisb

Check the lists of CISBs we summarized and shown in the script. These bugs should be within the scope of the corresponding prevention work.

Name		Name	Last commit message	Last commit date
Latest commit History 135 Commits
compiler_strategies		compiler_strategies
dataset		dataset
env		env
reproduction_material		reproduction_material
spec		spec
.gitignore		.gitignore
README.md		README.md
check-compiler.py		check-compiler.py
check-key.py		check-key.py
effectiveness_evaluation.py		effectiveness_evaluation.py
requirements.txt		requirements.txt
statistic.py		statistic.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

CISB-dataset

Reproduction Material

Aritifact setup

Aritifact experiments

E1: CISB statistics

E2: Evaulation of mitigations

E3: Target bugs of automatic prevention works

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

CISB-dataset

Reproduction Material

Aritifact setup

Aritifact experiments

E1: CISB statistics

E2: Evaulation of mitigations

E3: Target bugs of automatic prevention works

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages