A Survey of Static and Temporal Explainable Methods and Their Applications in Knowledge Tracing

  • Abstract:
    Background and Significance: As artificial intelligence, and deep learning in particular, achieves breakthroughs across many domains, growing model complexity and nonlinearity have made the "black box" problem increasingly prominent: humans struggle to understand how these models reach their decisions. In high-stakes scenarios such as autonomous driving, model opacity can have serious consequences, so explainability techniques are urgently needed to improve transparency and user trust. Although researchers have proposed a variety of explanation methods (e.g., feature-importance analysis and counterfactual generation), existing surveys largely overlook the fundamental differences between static and temporal models in both their explanation requirements and their technical approaches. Temporal models are widely used in critical tasks such as electronic health records and clinical early warning, where explainability is essential to system safety; moreover, in temporal models features change over time, which poses new challenges for generating explanations. To this end, this paper systematically summarizes explanation methods for static and temporal models, constructs a new hierarchical taxonomy built around the comprehensibility of explanations (its structure is shown in Figure 1), and evaluates the applicability and challenges of general-purpose methods in knowledge tracing (KT), a representative temporal task, so as to promote the deployment of explainable AI in real-world settings.
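    To make the static-versus-temporal contrast above concrete, the sketch below compares occlusion-style feature importance for a static model with a time-step variant for a temporal model: masking a whole time step asks how much that moment in the history mattered, a question with no static counterpart. This is a minimal illustration only; the model object, its predict() signature, and the zero baseline are hypothetical placeholders, not the API of any method surveyed here.

    import numpy as np

    def static_feature_importance(model, x, baseline=0.0):
        # Static occlusion: replace one feature at a time with a baseline
        # value and record how much the prediction shifts.
        base_pred = model.predict(x[None, :])[0]
        scores = np.zeros(x.shape[0])
        for j in range(x.shape[0]):
            x_perturbed = x.copy()
            x_perturbed[j] = baseline
            scores[j] = abs(base_pred - model.predict(x_perturbed[None, :])[0])
        return scores

    def temporal_step_importance(model, x_seq, baseline=0.0):
        # Temporal occlusion: x_seq has shape (T, d). Masking an entire
        # time step measures the influence of that point in the history,
        # which is precisely what a static attribution cannot express.
        base_pred = model.predict(x_seq[None, :, :])[0]
        scores = np.zeros(x_seq.shape[0])
        for t in range(x_seq.shape[0]):
            seq_perturbed = x_seq.copy()
            seq_perturbed[t, :] = baseline
            scores[t] = abs(base_pred - model.predict(seq_perturbed[None, :, :])[0])
        return scores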
    Main Content: This paper systematically reviews the development of explainable AI (XAI) for static and temporal models, with a focus on applications in knowledge tracing (KT). Early research concentrated on explanation methods for static models, which fall into three broad categories: feature-importance analysis, concept-based explanations, and example-based explanations. Explanation methods for temporal models, by contrast, must account for temporal dependencies and reveal dynamic decision mechanisms. In the KT setting, explanation methods need to incorporate educational theory (e.g., item response theory, IRT) to analyze student ability and question difficulty, but they face core challenges: static methods ignore temporal correlations and therefore produce distorted explanations, temporal methods struggle to capture long-range dependencies and knowledge structure, and evaluation lacks unified standards. Although current research has improved model transparency, it is still limited by insufficient fusion of multimodal data and missing integration of educational semantics, and it urgently needs to break through the "black-box evaluation" bottleneck (see Figure 2, which illustrates how static methods neglect historical interactions).
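    As a pointer to how educational theory enters the picture, the following sketch shows the one-parameter (Rasch) IRT model that interpretable KT approaches often build on: the probability of a correct answer is a logistic function of the gap between a student's ability and the question's difficulty. The parameter names and example values are illustrative assumptions, not taken from any specific method in the survey.

    import math

    def rasch_correct_probability(ability: float, difficulty: float) -> float:
        # One-parameter (Rasch) IRT: P(correct) = sigmoid(ability - difficulty).
        # Ability above difficulty gives a probability above 0.5, and vice versa.
        return 1.0 / (1.0 + math.exp(-(ability - difficulty)))

    # Illustrative values: a student slightly above the question's difficulty.
    theta, b = 1.2, 0.8
    print(f"P(correct) = {rasch_correct_probability(theta, b):.3f}")  # about 0.599

    Because both parameters have a direct educational reading (student ability, question difficulty), predictions expressed this way carry their own explanation, which is one reason IRT-style structure keeps reappearing in interpretable KT models.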
    Conclusions and Outlook: By surveying the design, evaluation, and application of explanation methods for static and temporal models, this paper proposes a hierarchical taxonomy based on comprehensibility, which helps unify the research framework for explanation methods across domains. For knowledge tracing in particular, the paper analyzes in detail the shortcomings of current explanation methods. Future research could focus on the following directions: (1) Incorporating structured information (such as dependencies among knowledge concepts and the structure of question difficulty) into the explanation process, which can enrich the semantics of model explanations and make them easier to understand. (2) Evaluation standards and dataset construction: systematic explainability metrics and annotated real-world data are still generally lacking, so standardized evaluation benchmarks and task-driven datasets are urgently needed to support assessments that combine subjective and objective measures of explanation quality. (3) Personalized explanations for educational practice: future explanation systems should better serve students, teachers, and other educational stakeholders by providing finer-grained, semantically explicit explanations of personalized learning paths and feedback, bringing explanation methods into real teaching environments.


    Abstract: Deep learning has found widespread application across diverse domains owing to its exceptional performance. Nevertheless, the lack of transparency in deep learning models' decision-making processes undermines their usability, especially in critical contexts. While researchers have made noteworthy advances in explaining these models, they have frequently overlooked the differences between static and temporal models during explanation generation. In temporal models, features change over time, posing new challenges for explanation generation. Although extensive research has been dedicated to surmounting these hurdles, a survey summarizing these contributions has been absent. To bridge this gap, this paper summarizes existing methods and their contributions for both static and temporal models, highlighting the disparities between them. Additionally, we propose a novel classification approach based on the comprehensibility of explanations, showing that explanation methods vary in how understandable they are to users. Finally, to assess the limits of existing methods' explanation capabilities, we take knowledge tracing as a case study and analyze how explanation methods have evolved in this temporal-modeling setting.

