Distributed RL System for LLM Reasoning
-
Updated
Jul 18, 2025 - Python
Distributed RL System for LLM Reasoning
MMaDA - Open-Sourced Multimodal Large Diffusion Language Models
[NeurIPS 2024 Spotlight] Buffer of Thoughts: Thought-Augmented Reasoning with Large Language Models
✨✨Latest Papers and Benchmarks in Reasoning with Foundation Models
Awesome RL-based LLM Reasoning
历年ICLR论文和开源项目合集,包含ICLR2021、ICLR2022、ICLR2023、ICLR2024、ICLR2025.
An awesome repository & A comprehensive survey on interpretability of LLM attention heads.
Official code for "Iterative Self-Incentivization Empowers Large Language Models as Agentic Searchers"
Ling is a MoE LLM provided and open-sourced by InclusionAI.
[ICLR 2025] SuperCorrect: Advancing Small LLM Reasoning with Thought Template Distillation and Self-Correction
official code repo of CVPR 2025 paper PhyT2V: LLM-Guided Iterative Self-Refinement for Physics-Grounded Text-to-Video Generation
[arxiv: 2505.02156] Adaptive Thinking via Mode Policy Optimization for Social Language Agents
🔥🔥🔥Latest Papers, Codes on Uncertainty-based RL
[AAAI 2025] ORQA is a new QA benchmark designed to assess the reasoning capabilities of LLMs in a specialized technical domain of Operations Research. The benchmark evaluates whether LLMs can emulate the knowledge and reasoning skills of OR experts when presented with complex optimization modeling tasks.
The official repository of "SmartAgent: Chain-of-User-Thought for Embodied Personalized Agent in Cyber World".
[ACL'2025 Findings] Official repo for "HumanEval Pro and MBPP Pro: Evaluating Large Language Models on Self-invoking Code Generation Task"
Code Prompting Elicits Conditional Reasoning Abilities in Text+Code LLMs. EMNLP 2024
[EMNLP 2024] A Peek into Token Bias: Large Language Models Are Not Yet Genuine Reasoners
[KDD 2025] Rewarding Graph Reasoning Process makes LLMs more Generalized Reasoners
Add a description, image, and links to the llm-reasoning topic page so that developers can more easily learn about it.
To associate your repository with the llm-reasoning topic, visit your repo's landing page and select "manage topics."