Claas Beger

I am currently a Graduate Fellow at Santa Fe Institute, doing research on multimodal reasoning under Professor Melanie Mitchell. Prior to that I finished my Master’s in Computer Science at Cornell University, where I worked with Kevin Ellis, Kilian Weinberger and Saikat Dutta. My research interest centers broadly around Human-like Artificial Intelligence. For this purpose, I think it is the most promising direction to look towards Natural Intelligence, both with regard to the brain and psychology/cognition.

news

Apr 06, 2026	“Bongards at the Boundary of Perception and Reasoning” accepted as full paper at CogSci 2026. “Cognitive Science-Inspired Evaluation of Large Language Models” accepted as Symposium at CogSci 2026.
Apr 06, 2026	OmniCode is accepted to ACL 2026 Findings!
Mar 28, 2026	I was awarded a fellowship by Princeton University’s Natural and Artificial Minds (NAM) Initiative
Feb 03, 2026	New paper on using VLMs for visual concept hypothesis formation on Bongard problems out on arxiv! arXiv:2602.03038
Feb 03, 2026	New paper on a diverse evaluation benchmark for Code Generation Agents out on arXiv arXiv:2602.02262

selected publications

A Neuroscience-Inspired Dual-Process Model of Compositional Generalization

Alex Noviello^*, Claas Beger^*, Jacob Groner, Kevin Ellis, and Weinan Sun

2025

TLDR: An architecture of schema learning and iterative application can resolve arbitrarily deep compositional statements.

Poster
Memento: Note-Taking for Your Future Self

Chao Wan, Albert Gong, Mihir Mishra, Carl-Leander Henneking, Claas Beger, and Kilian Q. Weinberger

2025

TLDR: Decomposing multi-hop questions into single-step prolog definition improves performance on various long-context question datasets.
CoCoNUT: Structural Code Understanding does not fall out of a tree

Claas Beger, and Saikat Dutta

2025

TLDR: Various language models struggle with dry-execution of simple and advanced code structures (Recursion, Concurrency OOP)
Decoding Human Preferences in Alignment: An Improved Approach to Inverse Constitutional AI

Carl-Leander Henneking^*, and Claas Beger^*

2025

TLDR: Using improved clustering and a more diverse embedding approach our technique can more accurately compress preference datasets into human-readable constitutions

Poster
Citegeist: Automated Generation of Related Work Analysis on the arXiv Corpus

Claas Beger^*, and Carl-Leander Henneking^*

2025

TLDR: Through a multi-step retrieval and summarization pipeline with three definable properties Citegeist can synthesize related work for a given scientific paper
Do AI Models Perform Human-like Abstract Reasoning Across Modalities?

Claas Beger, Ryan Yi, Shuhao Fu, Arseny Moskvichev, Sarah W. Tsai, Sivasankaran Rajamanickam, and Melanie Mitchell

2025

TLDR: Vision-Language models have good performance on abstract reasoning tasks, but do not utilize the intended human-core knowledge priors.