In 2025, PufferAI will be running RL experiments and ablations at a scale previously reserved only for Google. Our objective is to make RL 10x easier to get working on new problems by the end of the year. We are still experimenting with where we will share results, but for now, they will be here.
Quick test demo from a Puffer Pong sweep