Autonomous Evaluation of LLMs for Truth Maintenance and Reasoning Tasks

Rushang Karia, Daniel Bramblett, Daksh Dobhal, Siddharth Srivastava

Published in International Conference on Learning Representations, 2025

This paper evaluates large language models on truth maintenance on translating formal language and reasoning tasks.