A Novel Web Video Event Mining Framework with the Integration of Correlation and Co-Occurrence Information

Cheng-De Zhang; Xiao Wu; Mei-Ling Shyu; Qiang Peng

doi:10.1007/s11390-013-1377-6

Cheng-De Zhang, Xiao Wu, Mei-Ling Shyu, Qiang Peng. A Novel Web Video Event Mining Framework with the Integration of Correlation and Co-Occurrence Information[J]. Journal of Computer Science and Technology, 2013, 28(5): 788-796. DOI: 10.1007/s11390-013-1377-6

Citation:

A Novel Web Video Event Mining Framework with the Integration of Correlation and Co-Occurrence Information

Abstract

Abstract

The massive web videos prompt an imperative demand on effciently grasping the major events. However, the distinct characteristics of web videos, such as the limited number of features, the noisy text information, and the unavoidable error in near-duplicate keyframes (NDKs) detection, make web video event mining a challenging task. In this paper, we propose a novel four-stage framework to improve the performance of web video event mining. Data preprocessing is the first stage. Multiple Correspondence Analysis (MCA) is then applied to explore the correlation between terms and classes, targeting for bridging the gap between NDKs and high-level semantic concepts. Next, co-occurrence information is used to detect the similarity between NDKs and classes using the NDK-within-video information. Finally, both of them are integrated for web video event mining through negative NDK pruning and positive NDK enhancement. Moreover, both NDKs and terms with relatively low frequencies are treated as useful information in our experiments. Experimental results on large-scale web videos from YouTube demonstrate that the proposed framework outperforms several existing mining methods and obtains good results for web video event mining.

FullText(HTML)

References (34)

Relative Articles

Supplements (0)

Cited By

A Novel Web Video Event Mining Framework with the Integration of Correlation and Co-Occurrence Information

Abstract

Catalog

Export File

Citation

Format

Content