Skip to content

danielschafi/Cob2Java-Bench

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

15 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Cob2Java-Bench

Benchmark for cobol to java conversion. Can be used to test conversions to other programming languages other than Java too.

Adding New Cobol files to the benchmark dataset

  1. Add the files to src_data

    • Check if they are compilable with gnucobol compiler before adding them
  2. run utils/filter_short_files.py to remove all files from the directory that are shorter than a threshold number of lines

    python utils/filter_short_files.py src_data short_files 60
    
  3. Rewrite file paths in the source code to be in the same dir e.g. data/experiment2/data.dat -> data.dat

Config

Set the Model you want to use in the env. you can use pretty much any model that is available on huggingface There is also a class prepared for use with the anthropic api. You will need to get an api key and credits from anthropic though and set it in the .env or in the terminal as an environment variable. And exchange the LLM class with the one from llm_anthropic everywhere

Data

(src_data)

  • From codetransocean/niche dataset
  • From theStack dataset on huggingface

The compilable cobol files get put into this folder

About

Benchmark for COBOL to java conversion. Tests performance of converters with equivalence tests and code quality metrics.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors