Learning to Classify New Foods Incrementally Via Compressed Exemplars (2404.07507v1)
Abstract: Food image classification systems play a crucial role in health monitoring and diet tracking through image-based dietary assessment techniques. However, existing food recognition systems rely on static datasets characterized by a pre-defined fixed number of food classes. This contrasts drastically with the reality of food consumption, which features constantly changing data. Therefore, food image classification systems should adapt to and manage data that continuously evolves. This is where continual learning plays an important role. A challenge in continual learning is catastrophic forgetting, where ML models tend to discard old knowledge upon learning new information. While memory-replay algorithms have shown promise in mitigating this problem by storing old data as exemplars, they are hampered by the limited capacity of memory buffers, leading to an imbalance between new and previously learned data. To address this, our work explores the use of neural image compression to extend buffer size and enhance data diversity. We introduced the concept of continuously learning a neural compression model to adaptively improve the quality of compressed data and optimize the bitrates per pixel (bpp) to store more exemplars. Our extensive experiments, including evaluations on food-specific datasets including Food-101 and VFN-74, as well as the general dataset ImageNet-100, demonstrate improvements in classification accuracy. This progress is pivotal in advancing more realistic food recognition systems that are capable of adapting to continually evolving data. Moreover, the principles and methodologies we've developed hold promise for broader applications, extending their benefits to other domains of continual machine learning systems.
- End-to-end optimized image compression. International Conference on Learning Representations, 2017.
- Variational image compression with a scale hyperprior. International Conference on Learning Representations, 2018.
- Food-101–mining discriminative components with random forests. European Conference on Computer Vision, pages 446–461, 2014.
- Rfc2083: Png (portable network graphics) specification version 1.0. Network Working Group, pages 1–102, 1997.
- End-to-end incremental learning. European Conference on Computer Vision, page 241–257, 2018.
- Riemannian walk for incremental learning: Understanding forgetting and intransigence. European Conference on Computer Vision, 2018.
- Autoaugment: Learning augmentation policies from data. IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2018.
- Imagenet: A large-scale hierarchical image database. IEEE Conference on Computer Vision and Pattern Recognition, pages 248–255, 2009.
- Podnet: Pooled outputs distillation for small-tasks incremental learning. European Conference on Computer Vision, page 86–102, 2020.
- Towards backward-compatible continual learning of image compression. IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024.
- Food image recognition using very deep convolutional networks. International Workshop on Multimedia Assisted Dietary Management, pages 41–49, 2016.
- Online continual learning for visual food classification. Proceedings of the IEEE/CVF International Conference on Computer Vision Workshops, pages 2337–2346, 2021.
- Online continual learning via candidates voting. Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, pages 3154–3163, 2022a.
- Exemplar-free online continual learning. IEEE International Conference on Image Processing, pages 541–545, 2022b.
- Gradient reweighting: Towards imbalanced class-incremental learning. arXiv preprint arXiv:2402.18528, 2024.
- Incremental learning in online scenario. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pages 13926–13935, 2020.
- Long-tailed food classification. Nutrients, 15(12):2751, 2023a.
- Long-tailed continual learning for visual food recognition. arXiv preprint arXiv:2307.00183, 2023b.
- Deep residual learning for image recognition. IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 770–778, 2016.
- Learning a unified classifier incrementally via rebalancing. IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 831–839, 2019.
- Are visual recognition models robust to image compression? Neural Compression Workshop, International Conference on Machine Learning, 2023.
- Adam: A method for stochastic optimization. International Conference on Learning Representations, 2015.
- Deepfood: Deep learning-based food image recognition for computer-aided dietary assessment. International Conference on Smart Homes and Health Telematics, pages 37–48, 2016.
- Mnemonics training: Multi-class incremental learning without forgetting. IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 12245–12254, 2020.
- RMM: reinforced memory management for class-incremental learning. Advances in Neural Information Processing Systems, pages 3478–3490, 2021.
- Class-incremental exemplar compression for class-incremental learning. IEEE Conference on Computer Vision and Pattern Recognition, 2023.
- Improving dietary assessment via integrated hierarchy food classification. IEEE International Workshop on Multimedia Signal Processing, pages 1–6, 2021a.
- Visual aware hierarchy based food recognition. International conference on pattern recognition, pages 571–598, 2021b.
- The stability-plasticity dilemma: Investigating the continuum from catastrophic forgetting to age-limited learning effects. Frontiers in psychology, 4:504, 2013.
- Mixed deep learning and natural language processing method for fake-food image recognition and standardization to help automated dietary assessment. Public health nutrition, 22(7):1193–1202, 2019.
- Joint autoregressive and hierarchical priors for learned image compression. Advances in Neural Information Processing Systems, 31:10794–10803, 2018.
- Pytorch: An imperative style, high-performance deep learning library. International Conference on Neural Information Processing Systems, 2019.
- Efficient continual learning in neural networks with embedding regularization. Neurocomputing, 397:139–148, 2020.
- Online class-incremental learning for real-world food image classification. Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, pages 8195–8204, 2024.
- icarl: Incremental classifier and representation learning. IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 5533–5542, 2017.
- Continual learning with deep generative replay. Advances in neural information processing systems, 30, 2017.
- The jpeg 2000 still image compression standard. IEEE Signal Processing Magazine, 18(5):36–58, 2001.
- Lossy image compression with compressive autoencoders. International Conference on Learning Representations, 2017.
- G. Wallace. The jpeg still picture compression standard. IEEE Transactions on Consumer Electronics, 38(1):xviii–xxxiv, 1992.
- Foster: Feature boosting and compression for class-incremental learning. European Conference on Computer Vision, 2022a.
- Memory replay with data compression for continual learning. International Conference on Learning Representations, 2022b.
- M. Welling. Herding dynamical weights to learn. International Conference on Machine Learning, page 1121–1128, 2009.
- Incremental learning using conditional adversarial networks. IEEE/CVF International Conference on Computer Vision, pages 6618–6627, 2019.
- Video enhancement with task-oriented flow. International Journal of Computer Vision, 127(8):1106–1125, 2019.
- Der: Dynamically expandable representation for class incremental learning. 2021.
- Probing image compression for class-incremental learning. Picture Coding Symposium, 2024.
- Improving inference for neural image compression. Advances in Neural Information Processing Systems, 33:573–584, 2020.
- Maintaining discrimination and fairness in class incremental learning. IEEE/CVF conference on computer vision and pattern recognition, pages 13208–13217, 2020.
- Learning Deep Features for Discriminative Localization. IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2016.
- Deep class-incremental learning: A survey. arXiv preprint arXiv:2302.03648, 2023.
- Justin Yang (7 papers)
- Zhihao Duan (38 papers)
- Jiangpeng He (41 papers)
- Fengqing Zhu (77 papers)