We use cookies to improve your experience with our site.
黄锴, 张丽清. 半监督稀疏多线性判别分析[J]. 计算机科学技术学报, 2014, 29(6): 1058-1071. DOI: 10.1007/s11390-014-1490-1
引用本文: 黄锴, 张丽清. 半监督稀疏多线性判别分析[J]. 计算机科学技术学报, 2014, 29(6): 1058-1071. DOI: 10.1007/s11390-014-1490-1
Kai Huang, Li-Qing Zhang. Semisupervised Sparse Multilinear Discriminant Analysis[J]. Journal of Computer Science and Technology, 2014, 29(6): 1058-1071. DOI: 10.1007/s11390-014-1490-1
Citation: Kai Huang, Li-Qing Zhang. Semisupervised Sparse Multilinear Discriminant Analysis[J]. Journal of Computer Science and Technology, 2014, 29(6): 1058-1071. DOI: 10.1007/s11390-014-1490-1

半监督稀疏多线性判别分析

Semisupervised Sparse Multilinear Discriminant Analysis

  • 摘要: 当把普通向量空间算法来处理以张量数据为输入的问题是会遇到很多的问题.我们必须要克服小样本问题和过拟合问题,同时在将张量数据进行向量展开的时候,原始张量数据中的结构信息会丢失.因此,相对来说以向量数据为直接输入的方法将更为适合.对于心电数据来说还有另外一个问题需要克服,心电数据的人工诊断昂贵而耗时,所以获取到带有标签信息的数据非常困难.然而,当原始数据中有效的特征非常稀疏时,我们提出了半监督稀疏多线性判别分析.这种方法考虑了有标签和无标签数据的分布信息,同时也考虑了采用标签推演方法得到的标签信息.实际,我们采用了从远程诊断系统中获得的12导联心电信号,我们对原始信号应用了短时傅里叶变换来获取3阶张量.实验结果显示了心电信号的稀疏特点,而且我们的方法可以有效抽取出稀疏而有效的特征来取得好的分类效果.

     

    Abstract: Various problems are encountered when adopting ordinary vector space algorithms for high-order tensor data input. Namely, one must overcome the Small Sample Size (SSS) and overfitting problems. In addition, the structural information of the original tensor signal is lost during the vectorization process. Therefore, comparable methods using a direct tensor input are more appropriate. In the case of electrocardiograms (ECGs), another problem must be overcome; the manual diagnosis of ECG data is expensive and time consuming, rendering it difficult to acquire data with diagnosis labels. However, when effective features for classification in the original data are very sparse, we propose a semisupervised sparse multilinear discriminant analysis (SSSMDA) method. This method uses the distribution of both the labeled and the unlabeled data together with labels discovered through a label propagation algorithm. In practice, we use 12-lead ECGs collected from a remote diagnosis system and apply a short-time-fourier transformation (STFT) to obtain third-order tensors. The experimental results highlight the sparsity of the ECG data and the ability of our method to extract sparse and effective features that can be used for classification.

     

/

返回文章
返回