arxiv:2407.08348
Jujie He
leafzs
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
17 days ago
Harnessing Uncertainty: Entropy-Modulated Policy Gradients for
Long-Horizon LLM Agents
updated
a model
about 1 month ago
Skywork/Skywork-o1-Open-Llama-3.1-8B
updated
a model
about 1 month ago
Skywork/Skywork-o1-Open-PRM-Qwen-2.5-7B