Skip to content

DanyelleAngelo/gcx-experiments

Repository files navigation

GCX Algorithm Benchmark Suite

This benchmark suite evaluates the GCX and GC* compression algorithm by comparing it against GCIS and RePair algorithms. The tests measure:

  • Compression performance (time and ratio);
  • Decompression performance (time);
  • Substring extraction time;
  • Memory usage across all operations; and
  • Grammar analysis for GCX and GC* .

⚙️ Dependencies

This benchmark uses the following external tools:

  • GCX - Our target compression algorithm
  • GCIS - Grammar compression reference
  • RePair - Classical compression algorithm
  • ShapeSlp - Required to perform extraction of grammar generated by RePair
  • malloc-count - Required to account for memory usage
  • sdls - Required for reference algorithms

Usage

./measure_compression.sh

Output Structure

Results are organized in:

report/YYYY-MM-DD/
├── base-file-gcx-encoding.csv      # Compression and decompression metrics
├── base-file-gcx-extract.csv       # Substring extraction times
├── base-file-gcx-grammar.csv       # Grammar  reports
└── graphs/                         # Visualizations (pt or en)
│   └──pt                         
│   └──en 
└── errors                          # error report (if errors occurred)
└── extract                         # extraction intervals

Test files are downloaded from URLs specified in files_paths/ directory.

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published