This benchmark suite evaluates the GCX and GC* compression algorithm by comparing it against GCIS and RePair algorithms. The tests measure:
- Compression performance (time and ratio);
- Decompression performance (time);
- Substring extraction time;
- Memory usage across all operations; and
- Grammar analysis for GCX and GC* .
This benchmark uses the following external tools:
- GCX - Our target compression algorithm
- GCIS - Grammar compression reference
- RePair - Classical compression algorithm
- ShapeSlp - Required to perform extraction of grammar generated by RePair
- malloc-count - Required to account for memory usage
- sdls - Required for reference algorithms
./measure_compression.shResults are organized in:
report/YYYY-MM-DD/
├── base-file-gcx-encoding.csv # Compression and decompression metrics
├── base-file-gcx-extract.csv # Substring extraction times
├── base-file-gcx-grammar.csv # Grammar reports
└── graphs/ # Visualizations (pt or en)
│ └──pt
│ └──en
└── errors # error report (if errors occurred)
└── extract # extraction intervals
Test files are downloaded from URLs specified in files_paths/ directory.