- The paper introduces a novel CNN algorithm that achieves top-1 accuracies of 76.3% on the UEC-100 dataset and 77.4% on Food-101.
- It employs technical innovations such as additional 1x1 convolutional layers and transfer learning to optimize feature extraction and computational efficiency.
- The research demonstrates the potential of integrating deep learning into mobile and cloud systems for more reliable, automated dietary assessments.
Analysis of "DeepFood: Deep Learning-based Food Image Recognition for Computer-aided Dietary Assessment"
The paper presents a research effort focused on improving the accuracy of dietary assessment through deep learning-based food image recognition. It addresses the well-documented shortcomings of traditional dietary self-reporting, which often suffers from bias and inaccuracy. Leveraging Convolutional Neural Networks (CNNs), the authors propose an approach that automatically recognizes food items and estimates their portion sizes from captured images.
Key Contributions
The central contribution of this research is a new CNN-based algorithm optimized for food image recognition. The method is designed to improve upon existing mobile and cloud-based dietary assessment systems, which traditionally depend on manual user input and description; by reducing that dependence, it minimizes human error and enhances precision.
1. Dataset Analysis and Experimental Validation:
The approach was rigorously tested on two real-world datasets: UEC-256 and Food-101. The UEC-256 dataset, which covers a diverse range of Asian cuisines, provided a robust testbed for evaluating accuracy improvements. The 22-layer CNN architecture, built from Inception modules inspired by GoogLeNet, demonstrated substantial accuracy gains over traditional methods; on the related UEC-100 dataset, it reached a notable top-1 accuracy of 76.3%, surpassing previous methods by a considerable margin.
Similarly, the Food-101 dataset, which consists largely of Western dishes, further validated the method's generalizability across dietary cultures, with a top-1 accuracy of 77.4%. This indicates the proposed model's ability to adapt to varied food image datasets while maintaining high accuracy.
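Both figures are top-k accuracies: a prediction counts as correct when the true label appears among the model's k highest-scoring classes (k = 1 here; top-5 results are discussed below). As a point of reference rather than the paper's own evaluation code, a minimal PyTorch helper for this metric might look as follows:

```python
import torch

def topk_accuracy(logits, labels, k=5):
    """Fraction of samples whose true label is among the k highest-scoring classes."""
    topk = logits.topk(k, dim=1).indices              # (N, k) predicted class ids
    hits = (topk == labels.unsqueeze(1)).any(dim=1)   # per-sample hit/miss
    return hits.float().mean().item()

# Illustration with random scores over 101 classes (a Food-101-sized output).
logits = torch.randn(32, 101)
labels = torch.randint(0, 101, (32,))
print(topk_accuracy(logits, labels, k=1), topk_accuracy(logits, labels, k=5))
```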
2. Technical Innovations:
The paper highlights several technical optimizations within the CNN framework, most notably additional 1x1 convolutional layers in the Inception modules that add depth while reducing the channel dimensionality of intermediate feature maps. These optimizations are crucial for computational efficiency and feature extraction quality, enabling accurate food recognition under constrained computational resources; a sketch of such a module follows.
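To make the role of the 1x1 layers concrete, here is a minimal PyTorch sketch of a generic Inception-style block. It is a reconstruction for illustration, not the paper's exact architecture; the channel sizes in the usage example are borrowed from GoogLeNet's first Inception block:

```python
import torch
import torch.nn as nn

class InceptionModule(nn.Module):
    """Inception-style block with parallel 1x1, 3x3, 5x5, and pooling branches.

    The 1x1 "reduction" convolutions ahead of the 3x3 and 5x5 branches shrink
    the channel count, cutting the cost of the larger convolutions while
    adding depth and nonlinearity.
    """

    def __init__(self, in_ch, ch1x1, ch3x3_red, ch3x3, ch5x5_red, ch5x5, pool_proj):
        super().__init__()
        self.branch1 = nn.Sequential(
            nn.Conv2d(in_ch, ch1x1, kernel_size=1), nn.ReLU(inplace=True))
        self.branch2 = nn.Sequential(
            nn.Conv2d(in_ch, ch3x3_red, kernel_size=1), nn.ReLU(inplace=True),
            nn.Conv2d(ch3x3_red, ch3x3, kernel_size=3, padding=1), nn.ReLU(inplace=True))
        self.branch3 = nn.Sequential(
            nn.Conv2d(in_ch, ch5x5_red, kernel_size=1), nn.ReLU(inplace=True),
            nn.Conv2d(ch5x5_red, ch5x5, kernel_size=5, padding=2), nn.ReLU(inplace=True))
        self.branch4 = nn.Sequential(
            nn.MaxPool2d(kernel_size=3, stride=1, padding=1),
            nn.Conv2d(in_ch, pool_proj, kernel_size=1), nn.ReLU(inplace=True))

    def forward(self, x):
        # Concatenate all branch outputs along the channel dimension.
        return torch.cat([b(x) for b in
                          (self.branch1, self.branch2, self.branch3, self.branch4)], dim=1)

# Usage with the channel sizes of GoogLeNet's inception (3a) block:
block = InceptionModule(192, 64, 96, 128, 16, 32, 32)
out = block(torch.randn(1, 192, 28, 28))  # -> shape (1, 256, 28, 28)
```

Stacking blocks of this kind is what yields a 22-layer network whose parameter count stays small enough for resource-constrained deployment.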
3. Use of Pre-trained Models:
The method leverages models pre-trained on large-scale datasets such as ImageNet and fine-tunes them on food-specific data. This domain-specific fine-tuning significantly improves classification performance, showcasing the value of transfer learning for specialized food image recognition tasks.
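A minimal sketch of this fine-tuning recipe, assuming PyTorch with torchvision >= 0.13; the 256-class head (sized for UEC-256) and the optimizer settings are illustrative choices, not the paper's exact configuration:

```python
import torch
import torch.nn as nn
from torchvision import models

NUM_CLASSES = 256  # e.g., UEC-256; an illustrative assumption

# Load a GoogLeNet backbone with ImageNet weights (torchvision >= 0.13 API).
model = models.googlenet(weights=models.GoogLeNet_Weights.IMAGENET1K_V1)

# Swap the 1000-way ImageNet classifier for a food-category head.
model.fc = nn.Linear(model.fc.in_features, NUM_CLASSES)

# Fine-tune the whole network with a small learning rate so the pre-trained
# features are adapted to food images rather than overwritten.
optimizer = torch.optim.SGD(model.parameters(), lr=1e-3, momentum=0.9)
criterion = nn.CrossEntropyLoss()

def finetune_step(images, labels):
    """One optimization step on a batch of labeled food images."""
    optimizer.zero_grad()
    loss = criterion(model(images), labels)
    loss.backward()
    optimizer.step()
    return loss.item()
```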
Numerical Results
A standout aspect of the paper is its empirical strength. The proposed methodology matched, and often exceeded, previously reported benchmarks. For instance:
- On UEC-256, top-5 accuracy reached 81.5%, and adding bounding-box preprocessing to isolate the food region raised it further to 87.2% (a crop-then-classify sketch follows this list).
- On Food-101, the fine-tuned CNN outperformed its non-fine-tuned counterpart, confirming the benefit of domain-specific learning for classification outcomes.
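The bounding-box preprocessing amounts to cropping the annotated food region before classification. A hedged sketch, assuming tensor images and a (left, top, width, height) box format; the datasets' actual annotation conventions may differ:

```python
import torch
from torchvision.transforms import functional as F

def classify_with_bbox(model, image, bbox, input_size=224):
    """Crop to the annotated food region, then classify the crop.

    Discarding background clutter before inference is the preprocessing
    step credited with the accuracy gain. `model` should be in eval mode.
    """
    left, top, w, h = bbox
    crop = F.crop(image, top, left, h, w)            # keep only the food region
    crop = F.resize(crop, [input_size, input_size])  # match the CNN input size
    with torch.no_grad():
        return model(crop.unsqueeze(0)).argmax(dim=1)
```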
Implications and Future Directions
The research carries both practical and theoretical weight. Practically, integrating such accurate dietary assessment tools into mobile and cloud systems could significantly improve personal health management by providing more reliable dietary data. Theoretically, it advances the fields of food computing and image recognition, opening avenues for future research in automated, precise dietary analysis.
Looking forward, the next steps include deploying the algorithm in real-world applications, with tighter mobile-device integration and cloud-based computation. Future research may also extend the model to estimate nutritional content and to detect composite or previously unseen food items, broadening the applicability of such systems in global health initiatives.
In conclusion, this paper sets a significant precedent for how deep learning and CNNs can be harnessed within the domain of dietary assessment, offering a robust alternative to systems centered on manual input. The results underscore the potential of machine learning to transcend traditional boundaries in health-related data collection and analysis.