The Parallel-R1 framework uses reinforcement learning to teach models how to explore multiple reasoning paths at once, ...
Researchers have recently developed a novel reinforcement learning framework named Parallel-R1, which successfully enhances the inference capabilities of large language models (LLMs), achieving an ...
In this competition, the advanced version of “Gemini 2.5 Deep Seek” participated remotely online and solved 10 out of 12 ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results