Reward functions are often hard or impossible to design. If you're working on RL without a predefined reward function (RLHF, unsupervised, exploration etc.), consider submitting to the RLBrew workshop! Deadline May 3rd.
Reward functions are often hard or impossible to design. If you're working on RL without a predefined reward function (RLHF, unsupervised, exploration etc.), consider submitting to the RLBrew workshop! Deadline May 3rd.
2
16
69
19K
21
Download Image
@JoeyHejna Hey @JoeyHejna, your work seems very interesting. Would you be looking for any research intern by any chance ?