planning isn’t free. agents should plan only when reward > compute + instability overhead. good read w/Crafter on test time compute. (arxiv.org/pdf/2509.03581)
0
0
0
65
0
Download Image