CORE: Contrastive Reflection Enables Rapid Improvements in Reasoning Paper • 2605.28742 • Published 19 days ago • 4
Reinforcement Learning from Rich Feedback with Distributional DAgger Paper • 2606.05152 • Published 12 days ago • 3
Entropy as a Structural Prior: How a Log-Barrier on DiT Belief Space Drives Musical Diversity and Development Paper • 2606.07207 • Published 10 days ago • 4
Bayesian-Agent: Posterior-Guided Skill Evolution for LLM Agent Harnesses Paper • 2606.08348 • Published 9 days ago • 14
FlashMemory-DeepSeek-V4: Lightning Index Ultra-Long Context via Lookahead Sparse Attention Paper • 2606.09079 • Published 7 days ago • 61
UnpredictaBench: A Benchmark for Evaluating Distributional Randomness in LLMs Paper • 2606.06622 • Published 11 days ago • 20
LLM Explainability with Counterfactual Chains and Causal Graphs Paper • 2606.05972 • Published 11 days ago • 16
Echo-Memory: A Controlled Study of Memory in Action World Models Paper • 2606.09803 • Published 7 days ago • 32
When Tools Fail: Benchmarking Dynamic Replanning and Anomaly Recovery in LLM Agents Paper • 2606.05806 • Published 11 days ago • 22
Human Psychometric Questionnaires Mischaracterize LLM Behavior Paper • 2509.10078 • Published 17 days ago • 35
Direct 3D-Aware Object Insertion via Decomposed Visual Proxies Paper • 2606.06601 • Published 11 days ago • 26
AnchorWorld: Embodied Egocentric World Simulation with View-based Evolution Customization Paper • 2606.07326 • Published 10 days ago • 29
SoCRATES: Towards Reliable Automated Evaluation of Proactive LLM Mediation across Domains and Socio-cognitive Variations Paper • 2606.05563 • Published 11 days ago • 50
Your UnEmbedding Matrix is Secretly a Feature Lens for Text Embeddings Paper • 2606.07502 • Published 10 days ago • 91
When Gradients Collide: Failure Modes of Multi-Objective Prompt Optimization for LLM Judges Paper • 2605.26046 • Published 21 days ago • 3