
SciFlow: Empowering Lightweight Optical Flow Models with Self-Cleaning Iterations (2404.08135v1)

Published 11 Apr 2024 in cs.CV

Abstract: Optical flow estimation is crucial to a variety of vision tasks. Despite substantial recent advancements, achieving real-time on-device optical flow estimation remains a complex challenge. First, an optical flow model must be sufficiently lightweight to meet computation and memory constraints to ensure real-time performance on devices. Second, the necessity for real-time on-device operation imposes constraints that weaken the model's capacity to adequately handle ambiguities in flow estimation, thereby intensifying the difficulty of preserving flow accuracy. This paper introduces two synergistic techniques, Self-Cleaning Iteration (SCI) and Regression Focal Loss (RFL), designed to enhance the capabilities of optical flow models, with a focus on addressing optical flow regression ambiguities. These techniques prove particularly effective in mitigating error propagation, a prevalent issue in optical flow models that employ iterative refinement. Notably, these techniques add negligible to zero overhead in model parameters and inference latency, thereby preserving real-time on-device efficiency. The effectiveness of our proposed SCI and RFL techniques, collectively referred to as SciFlow for brevity, is demonstrated across two distinct lightweight optical flow model architectures in our experiments. Remarkably, SciFlow enables substantial reduction in error metrics (EPE and Fl-all) over the baseline models by up to 6.3% and 10.5% for in-domain scenarios and by up to 6.2% and 13.5% for cross-domain scenarios on the Sintel and KITTI 2015 datasets, respectively.
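The two error metrics named in the abstract can be illustrated with a minimal sketch. It assumes the standard definitions: EPE is the mean Euclidean distance between predicted and ground-truth flow vectors, and Fl-all follows the KITTI 2015 convention, where a pixel is an outlier if its error exceeds both 3 px and 5% of the ground-truth magnitude. The function names, the list-of-tuples input format, and the toy numbers are hypothetical and not taken from the paper. The `relative_reduction` helper further assumes that the reported percentages are relative reductions against the baseline, which the abstract does not state explicitly.

```python
import math


def flow_metrics(pred, gt, abs_thresh=3.0, rel_thresh=0.05):
    """Return (EPE, Fl-all in percent) for two equal-length lists of (u, v) flow vectors.

    Thresholds default to the KITTI 2015 outlier convention (assumed here).
    """
    if len(pred) != len(gt) or not gt:
        raise ValueError("pred and gt must be non-empty and the same length")
    total_err = 0.0
    outliers = 0
    for (pu, pv), (gu, gv) in zip(pred, gt):
        err = math.hypot(pu - gu, pv - gv)  # per-pixel endpoint error
        mag = math.hypot(gu, gv)            # ground-truth flow magnitude
        total_err += err
        # Outlier only if the error is large both absolutely and relatively.
        if err > abs_thresh and err > rel_thresh * mag:
            outliers += 1
    n = len(gt)
    return total_err / n, 100.0 * outliers / n


def relative_reduction(baseline, improved):
    """Percentage reduction of an error metric relative to the baseline (assumed reading)."""
    return 100.0 * (baseline - improved) / baseline


# Toy data, purely illustrative: four pixels, one of which is badly estimated.
pred = [(0.0, 0.0), (3.0, 4.0), (10.0, 0.0), (1.0, 1.0)]
gt = [(0.0, 0.0), (0.0, 0.0), (10.0, 0.0), (1.0, 1.0)]
epe, fl_all = flow_metrics(pred, gt)
print(epe, fl_all)
```

In practice both metrics are computed over dense flow fields and only over pixels with valid ground truth; the sketch omits validity masks for brevity.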

