RL for massive LLMs faces update delays; optimizations like checkpoint engines reduce this to seconds.