Han-Yu Zhang, Cheng-Ao Li, Xiang Ao, Qing He. SCALaR: A Unified Latent Reasoning Framework for Large Language Models with Multi-Layer Aggregation and Strategic Compression [J]. Journal of Computer Science and Technology. DOI: 10.1007/s11390-026-6298-2

SCALaR: A Unified Latent Reasoning Framework for Large Language Models with Multi-Layer Aggregation and Strategic Compression

Abstract: Large language models have demonstrated remarkable reasoning capabilities when guided by explicit chain-of-thought. However, generating long reasoning traces incurs substantial computational cost due to linguistic and stylistic redundancy. This inefficiency limits the scalability and practical deployment of such methods. To address this issue, recent work has explored latent reasoning, in which models perform reasoning implicitly within internal representations instead of generating explicit intermediate tokens. While promising, existing latent reasoning approaches face key challenges, including representational limitations arising from reliance on final-layer hidden states and optimization difficulties caused by sparse supervision. In this work, we propose SCALaR, a unified framework for latent reasoning that improves reasoning effectiveness through data compression, representation construction, and training strategy. SCALaR compresses training data by retaining the most informative elements, constructs enriched latent representations via multi-layer aggregation, injects calibrated Gaussian noise during training to encourage latent-space exploration, and adopts an alternating compression strategy to balance intermediate supervision and answer-oriented exploration. Extensive experiments on mathematical reasoning and question-answering benchmarks show that SCALaR consistently outperforms existing latent reasoning baselines. Additional analyses examine the individual roles of each component, providing insights into how they contribute to effective latent reasoning.
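Two of the abstract's components, multi-layer aggregation of hidden states and calibrated Gaussian noise injection, can be illustrated with a minimal sketch. This is an assumption-laden toy, not the paper's implementation: the aggregation is shown as a simple weighted average over layers, and "calibrated" noise is interpreted as noise whose scale tracks the latent's own standard deviation; the function names and the `scale` parameter are hypothetical.

```python
import numpy as np

def aggregate_layers(hidden_states, weights=None):
    """Combine per-layer hidden states into one latent vector via a
    weighted average (one plausible form of multi-layer aggregation;
    the paper's exact scheme is not specified here)."""
    H = np.stack(hidden_states)                      # (num_layers, d)
    if weights is None:
        weights = np.full(len(hidden_states), 1.0 / len(hidden_states))
    return weights @ H                               # (d,)

def inject_noise(latent, scale=0.1, rng=None):
    """Add Gaussian noise whose std is proportional to the latent's
    own std, a simple way to 'calibrate' exploration noise."""
    rng = rng if rng is not None else np.random.default_rng(0)
    sigma = scale * latent.std()
    return latent + rng.normal(0.0, sigma, size=latent.shape)

# Toy hidden states from a 4-layer model with hidden size d = 8.
rng = np.random.default_rng(42)
layers = [rng.normal(size=8) for _ in range(4)]

z = aggregate_layers(layers)          # enriched latent representation
z_noisy = inject_noise(z, scale=0.1, rng=rng)
print(z.shape)                        # (8,)
```

During training, `z_noisy` rather than `z` would feed the next reasoning step, so the model sees perturbed latents and is pushed to explore the latent space around its current trajectory.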