Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
194 tokens/sec
GPT-4o
7 tokens/sec
Gemini 2.5 Pro Pro
46 tokens/sec
o3 Pro
4 tokens/sec
GPT-4.1 Pro
38 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Deep Cooking: Predicting Relative Food Ingredient Amounts from Images (1910.00100v1)

Published 26 Sep 2019 in cs.LG and stat.ML

Abstract: In this paper, we study the novel problem of not only predicting ingredients from a food image, but also predicting the relative amounts of the detected ingredients. We propose two prediction-based models using deep learning that output sparse and dense predictions, coupled with important semi-automatic multi-database integrative data pre-processing, to solve the problem. Experiments on a dataset of recipes collected from the Internet show the models generate encouraging experimental results.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (22)
  1. Menu-match: Restaurant-specific food logging from images. In Applications of Computer Vision (WACV), 2015 IEEE Winter Conference on. IEEE, 844–851.
  2. Food-101–mining discriminative components with random forests. In European Conference on Computer Vision. Springer, 446–461.
  3. Jingjing Chen and Chong-Wah Ngo. 2016. Deep-based ingredient recognition for cooking recipe retrieval. In Proceedings of the 2016 ACM on Multimedia Conference. ACM, 32–41.
  4. Deep Understanding of Cooking Procedure for Cross-modal Recipe Retrieval. In 2018 ACM Multimedia Conference on Multimedia Conference. ACM, 1020–1028.
  5. ChineseFoodNet: A large-scale Image Dataset for Chinese Food Recognition. arXiv preprint arXiv:1705.02743 (2017).
  6. Food Recognition: A New Dataset, Experiments, and Results. IEEE J. Biomedical and Health Informatics 21, 3 (2017), 588–598.
  7. Two-view 3d reconstruction for food volume estimation. IEEE transactions on multimedia 19, 5 (2017), 1090–1099.
  8. Image-Based Estimation of Real Food Size for Accurate Food Calorie Estimation. In 2019 IEEE Conference on Multimedia Information Processing and Retrieval (MIPR). IEEE, 274–279.
  9. Single-view food portion estimation: learning image-to-energy mappings using generative adversarial networks. In 2018 25th IEEE International Conference on Image Processing (ICIP). IEEE, 251–255.
  10. Deep residual learning for image recognition. In Proceedings of the IEEE conference on computer vision and pattern recognition. 770–778.
  11. Yanchao Liang and Jianhua Li. 2017. Computer vision-based food calorie estimation: dataset, method, and experiment. arXiv preprint arXiv:1705.07632 (2017).
  12. Recipe1M+: A Dataset for Learning Cross-Modal Embeddings for Cooking Recipes and Food Images. IEEE transactions on pattern analysis and machine intelligence (2019).
  13. Im2Calories: towards an automated mobile vision food diary. In Proceedings of the IEEE International Conference on Computer Vision. 1233–1241.
  14. Simon Mezgec and Barbara Koroušić Seljak. 2017. NutriNet: a deep learning food and drink image recognition system for dietary assessment. Nutrients 9, 7 (2017), 657.
  15. Distributed representations of words and phrases and their compositionality. In Advances in neural information processing systems. 3111–3119.
  16. Inverse cooking: Recipe generation from food images. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 10453–10462.
  17. Learning cross-modal embeddings for cooking recipes and food images. In Proceedings of the IEEE conference on computer vision and pattern recognition. 3020–3028.
  18. Amnon Shashua and Nassir Navab. 1994. Relative affine structure: Theory and application to 3D reconstruction from perspective views. In CVPR, Vol. 94. 483–489.
  19. Food/non-food image classification and food categorization using pre-trained googlenet model. In Proceedings of the 2nd International Workshop on Multimedia Assisted Dietary Management. ACM, 3–11.
  20. Learning Cross-Modal Embeddings with Adversarial Networks for Cooking Recipes and Food Images. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 11572–11581.
  21. Recipe recognition with large multimodal food dataset. In Multimedia & Expo Workshops (ICMEW), 2015 IEEE International Conference on. IEEE, 1–6.
  22. Multi-view Model Contour Matching Based Food Volume Estimation. In International Conference on Applied Human Factors and Ergonomics. Springer, 85–93.
Citations (15)

Summary

We haven't generated a summary for this paper yet.