Dear RozDavid,
hello, I am reproducing your work, but I found that when doing text_representation_train and train_model, the result of running according to your code is 10-20 mIoU points worse than the result in your paper. I'm wondering if there is related data preprocessing that you didn't put on GitHub, or the training script lacks some parameters mentioned in the ablation experiment? Can you please confirm it for me, thank you very much!