Hi
It seems the pretrained checkpoint reaches an execution accuracy of 98.4 on the Spider test set. Is this expected? It is significantly higher than the 70.0 reported in Table 1.
The full result is:
             easy    medium  hard    extra   all
count        470     857     463     357     2147
===================== EXECUTION ACCURACY =====================
execution    0.996   0.974   0.983   0.994   0.984
The result is the same for both the CLLM and the pretrained LLM.