Briefing: Beyond Scalars: Evaluating and Understanding LLM Reasoning via Geometric Progress and Stability
Strategic angle: Introducing TRACED, a framework for assessing reasoning quality in LLMs beyond scalar probabilities.
Browse the full archive, newest first.