用于食品识别和分析的深度卷积神经网络多任务学习
Multi-Task Learning for Food Identification and Analysis with Deep Convolutional Neural Networks
-
摘要: 在这篇论文中, 我们提出了一种利用深层卷积神经网络从菜品图像中同时识别菜品类别, 菜品中的原料和菜品的烹调方法的多任务系统。我们建立了一个有360种菜品的数据集, 其中每种菜品至少有500张图像。为了减少从互联网上收集的数据中的噪声, 建立数据集时数据集中的异常数据被用基于深层卷积神经网络特征训练得到的单类别SVM检测出来并清除掉。我们同时训练了一个菜品识别器, 一个烹调方法识别器和一个多标签的菜品原料检测器, 它们共享深层网络结构中较低的几层。我们提出的系统比基于手工选取特征训练的传统方法识别准确率更高, 且可以被用于没有被包括在训练数据集中的新菜品种类, 为用户提供可供参考的菜品原料和烹调方法信息。Abstract: In this paper, we proposed a multi-task system that can identify dish types, food ingredients, and cooking methods from food images with deep convolutional neural networks. We built up a dataset of 360 classes of different foods with at least 500 images for each class. To reduce the noises of the data, which was collected from the Internet, outlier images were detected and eliminated through a one-class SVM trained with deep convolutional features. We simultaneously trained a dish identifier, a cooking method recognizer, and a multi-label ingredient detector. They share a few low-level layers in the deep network architecture. The proposed framework shows higher accuracy than traditional method with handcrafted features, and the cooking method recognizer and ingredient detector can be applied to dishes which are not included in the training dataset to provide reference information for users.