We use cookies to improve your experience with our site.

基于概率图模型自动生成学术海报

Learning to Generate Posters of Scientific Papers by Probabilistic Graphical Models

  • 摘要: 学术工作者通常利用学术海报来总结展示他们的学术论文的内容,这是因为学术海报能够简洁流畅的表达学术论文的中心思想。然而,学术海报不仅要求良好的可读性和较强的信息表达力,同时还应当是比较美观的,这就使得设计制作学术海报这一工作复杂且耗时。本文中,我们第一次研究利用概率图模型来自动生成学术海报这一挑战性的工作。具体来说,给定需要展示的内容,我们的方案能够自动地从已有数据中学习推断出海报各个面板的大小、形状,能够推断面板的排版以及各个面板内的内容布置。在我们的方案中,最大化后验概率(Maximum a posterior)估计方法被用来引入并建模一些已有的设计原则;而为了串联起面板内部元素排版和整个海报的面板排版,我们提出了一种递归划分海报页面的方案。为了学习以及验证我们的模型,我们收集并公开了NJU-Fudan论文海报数据集,这一数据集包含了详细标注的学术论文海报对。质化和量化的实验结果说明了我们方法的有效性。

     

    Abstract: Researchers often summarize their work in the form of scientific posters. Posters provide a coherent and efficient way to convey core ideas expressed in scientific papers. Generating a good scientific poster, however, is a complex and time-consuming cognitive task, since such posters need to be readable, informative, and visually aesthetic. In this paper, for the first time, we study the challenging problem of learning to generate posters from scientific papers. To this end, a data-driven framework, which utilizes graphical models, is proposed. Specifically, given content to display, the key elements of a good poster, including attributes of each panel and arrangements of graphical elements, are learned and inferred from data. During the inference stage, the maximum a posterior (MAP) estimation framework is employed to incorporate some design principles. In order to bridge the gap between panel attributes and the composition within each panel, we also propose a recursive page splitting algorithm to generate the panel layout for a poster. To learn and validate our model, we collect and release a new benchmark dataset, called NJU-Fudan Paper-Poster dataset, which consists of scientific papers and corresponding posters with exhaustively labelled panels and attributes. Qualitative and quantitative results indicate the effectiveness of our approach.

     

/

返回文章
返回