We use cookies to improve your experience with our site.

关于如何更好地理解App的功能

Towards Better Understanding of App Functions

  • 摘要: App正在移动平台与Web平台上吸引着越来越多的关注。由于当前App市场的自组织特性, app的描述书写地并不正式, 其还包含了众多噪音词和句子。因此, 对于大多数app, 它们的功能未被很好地文档化, 所以也不易于被app搜索引擎获取。本文通过识别app描述中信息最丰富词汇的方法研究了推断一个app真实功能这一问题。为了以一种适当的方式来运用和集成app语料库的多样化信息, 我们提出了一个概率主题模型来发现app语料库中隐含的数据结构。此主题模型的结果进一步可用于识别一个app的功能和其信息最丰富的词汇。我们分别在从Google Play和Windows Phone Store所爬取的真实数据集上进行大量实验, 并验证所提出方法的有效性。

     

    Abstract: Apps are attracting more and more attention from both mobile and web platforms. Due to the self-organized nature of the current app marketplaces, the descriptions of apps are not formally written and contain a lot of noisy words and sentences. Thus, for most of the apps, the functions of them are not well documented and thus cannot be captured by app search engines easily. In this paper, we study the problem of inferring the real functions of an app by identifying the most informative words in its description. In order to utilize and integrate the diverse information of the app corpus in a proper way, we propose a probabilistic topic model to discover the latent data structure of the app corpus. The outputs of the topic model are further used to identify the function of an app and its most informative words. We verify the effectiveness of the proposed methods through extensive experiments on two real app datasets crawled from Google Play and Windows Phone Store, respectively.

     

/

返回文章
返回