Hello,
I am trying to understand the connection between DeD and lifegate. How are the results of training with DeD tested on the lifegate environment?
I was looking at the provided scripts to see how the lifegate environment uses the VD and VR obtained from training the offline RL method, but I can't clearly understand how it works. Any explanation would be greatly appreciated. Thank you so much.