2000 character limit reached
Deep Cooking: Predicting Relative Food Ingredient Amounts from Images (1910.00100v1)
Published 26 Sep 2019 in cs.LG and stat.ML
Abstract: In this paper, we study the novel problem of not only predicting ingredients from a food image, but also predicting the relative amounts of the detected ingredients. We propose two prediction-based models using deep learning that output sparse and dense predictions, coupled with important semi-automatic multi-database integrative data pre-processing, to solve the problem. Experiments on a dataset of recipes collected from the Internet show the models generate encouraging experimental results.
- Menu-match: Restaurant-specific food logging from images. In Applications of Computer Vision (WACV), 2015 IEEE Winter Conference on. IEEE, 844–851.
- Food-101–mining discriminative components with random forests. In European Conference on Computer Vision. Springer, 446–461.
- Jingjing Chen and Chong-Wah Ngo. 2016. Deep-based ingredient recognition for cooking recipe retrieval. In Proceedings of the 2016 ACM on Multimedia Conference. ACM, 32–41.
- Deep Understanding of Cooking Procedure for Cross-modal Recipe Retrieval. In 2018 ACM Multimedia Conference on Multimedia Conference. ACM, 1020–1028.
- ChineseFoodNet: A large-scale Image Dataset for Chinese Food Recognition. arXiv preprint arXiv:1705.02743 (2017).
- Food Recognition: A New Dataset, Experiments, and Results. IEEE J. Biomedical and Health Informatics 21, 3 (2017), 588–598.
- Two-view 3d reconstruction for food volume estimation. IEEE transactions on multimedia 19, 5 (2017), 1090–1099.
- Image-Based Estimation of Real Food Size for Accurate Food Calorie Estimation. In 2019 IEEE Conference on Multimedia Information Processing and Retrieval (MIPR). IEEE, 274–279.
- Single-view food portion estimation: learning image-to-energy mappings using generative adversarial networks. In 2018 25th IEEE International Conference on Image Processing (ICIP). IEEE, 251–255.
- Deep residual learning for image recognition. In Proceedings of the IEEE conference on computer vision and pattern recognition. 770–778.
- Yanchao Liang and Jianhua Li. 2017. Computer vision-based food calorie estimation: dataset, method, and experiment. arXiv preprint arXiv:1705.07632 (2017).
- Recipe1M+: A Dataset for Learning Cross-Modal Embeddings for Cooking Recipes and Food Images. IEEE transactions on pattern analysis and machine intelligence (2019).
- Im2Calories: towards an automated mobile vision food diary. In Proceedings of the IEEE International Conference on Computer Vision. 1233–1241.
- Simon Mezgec and Barbara Koroušić Seljak. 2017. NutriNet: a deep learning food and drink image recognition system for dietary assessment. Nutrients 9, 7 (2017), 657.
- Distributed representations of words and phrases and their compositionality. In Advances in neural information processing systems. 3111–3119.
- Inverse cooking: Recipe generation from food images. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 10453–10462.
- Learning cross-modal embeddings for cooking recipes and food images. In Proceedings of the IEEE conference on computer vision and pattern recognition. 3020–3028.
- Amnon Shashua and Nassir Navab. 1994. Relative affine structure: Theory and application to 3D reconstruction from perspective views. In CVPR, Vol. 94. 483–489.
- Food/non-food image classification and food categorization using pre-trained googlenet model. In Proceedings of the 2nd International Workshop on Multimedia Assisted Dietary Management. ACM, 3–11.
- Learning Cross-Modal Embeddings with Adversarial Networks for Cooking Recipes and Food Images. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 11572–11581.
- Recipe recognition with large multimodal food dataset. In Multimedia & Expo Workshops (ICMEW), 2015 IEEE International Conference on. IEEE, 1–6.
- Multi-view Model Contour Matching Based Food Volume Estimation. In International Conference on Applied Human Factors and Ergonomics. Springer, 85–93.