-
UniVG-R1: Reasoning Guided Universal Visual Grounding with Reinforcement Learning
Paper • 2505.14231 • Published • 52 -
Skywork-R1V3 Technical Report
Paper • 2507.06167 • Published • 70 -
Scaling Laws for Optimal Data Mixtures
Paper • 2507.09404 • Published • 35 -
Reasoning or Memorization? Unreliable Results of Reinforcement Learning Due to Data Contamination
Paper • 2507.10532 • Published • 88
PaceWang
PaceWang
·
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
about 1 month ago
GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable
Reinforcement Learning
updated
a collection
about 1 month ago
daily_paper
updated
a collection
2 months ago
daily_paper
Organizations
None yet