Paper-Conference

HugAgent: Benchmarking LLMs for Simulation of Individualized Human Reasoning
Simulating human reasoning in open-ended tasks has been a long-standing aspiration in AI and cognitive science. While large language …
Assessing Adaptive World Models in Machines with Novel Games
Human intelligence exhibits a remarkable capacity for rapid adaptation and effective problem-solving in novel and unfamiliar contexts. …
Simulating Society Requires Simulating Thought
Simulating society with large language models (LLMs), we argue, requires more than generating plausible behavior; it demands …