-
Provable Benefits of In-Tool Learning for Large Language Models
Paper • 2508.20755 • Published • 11 -
SimpleTIR: End-to-End Reinforcement Learning for Multi-Turn Tool-Integrated Reasoning
Paper • 2509.02479 • Published • 83 -
How Can Input Reformulation Improve Tool Usage Accuracy in a Complex Dynamic Environment? A Study on τ-bench
Paper • 2508.20931 • Published • 15 -
THOR: Tool-Integrated Hierarchical Optimization via RL for Mathematical Reasoning
Paper • 2509.13761 • Published • 14
Sayambhu Sen
Testerpce
AI & ML interests
None yet
Recent Activity
updated
a collection
about 19 hours ago
Diffusion
updated
a collection
about 20 hours ago
Video understanding
updated
a collection
about 20 hours ago
Reasoning