Autonomous Evaluation of LLMs for Truth Maintenance and Reasoning Tasks
Rushang Karia, Daniel Bramblett, Daksh Dobhal, Siddharth Srivastava
Published in International Conference on Learning Representations, 2025
This paper evaluates large language models on truth maintenance on translating formal language and reasoning tasks.
