
Context-Aware Iteration Policy Network for Efficient Optical Flow Estimation (2312.07180v4)

Published 12 Dec 2023 in cs.CV

Abstract: Existing recurrent optical flow estimation networks are computationally expensive because they use a fixed, large number of iterations to update the flow field for every sample. An efficient network should skip iterations when the flow improvement is limited. In this paper, we develop a Context-Aware Iteration Policy Network for efficient optical flow estimation, which determines the optimal number of iterations per sample. The policy network achieves this by learning contextual information that lets it recognize when flow improvement is bottlenecked or minimal. First, we use an iteration embedding and a historical hidden cell, both of which carry information from previous iterations, to convey how the flow has changed so far. Second, we use an incremental loss so that the policy network implicitly perceives the magnitude of the optical flow improvement in the subsequent iteration. Furthermore, the computational complexity of our dynamic network is controllable, allowing a single trained model to satisfy various resource preferences. Our policy network can be easily integrated into state-of-the-art optical flow networks. Extensive experiments show that our method maintains performance while reducing FLOPs by about 40% on the Sintel dataset and about 20% on the KITTI dataset.
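The idea of a per-sample iteration count can be illustrated with a minimal Python sketch. It is not the paper's method: the learned policy network, iteration embedding, historical hidden cell, and incremental loss are all replaced by a hand-written threshold rule, and every name here (`refine_with_early_exit`, `toy_update`, `make_threshold_policy`, `TARGET`) is hypothetical. The sketch only shows the control flow of stopping a recurrent refinement early when the update becomes small, with a single knob that trades accuracy for compute.

```python
from typing import Callable, List, Tuple


def refine_with_early_exit(
    flow: List[float],
    update_step: Callable[[List[float]], List[float]],
    should_continue: Callable[[int, float], bool],
    max_iters: int = 12,
) -> Tuple[List[float], int]:
    """Iteratively refine `flow`, stopping once the policy says to stop.

    Returns the refined flow and the number of iterations actually used.
    """
    iters_used = 0
    for i in range(max_iters):
        delta = update_step(flow)
        flow = [f + d for f, d in zip(flow, delta)]
        iters_used += 1
        # Mean absolute update: a crude proxy for "how much did the flow change".
        mean_change = sum(abs(d) for d in delta) / len(delta)
        if not should_continue(i, mean_change):
            break
    return flow, iters_used


# Hypothetical toy problem: the "true" flow the updates converge towards.
TARGET = [4.0, -2.0]


def toy_update(flow: List[float]) -> List[float]:
    """Stand-in for a recurrent update block: close half of the remaining gap."""
    return [0.5 * (t - f) for t, f in zip(TARGET, flow)]


def make_threshold_policy(threshold: float) -> Callable[[int, float], bool]:
    """Assumed hand-written policy (NOT a learned network): continue while
    the last update was at least `threshold`. A larger threshold means
    fewer iterations, which is the compute/accuracy knob in this sketch."""
    def policy(iteration: int, mean_change: float) -> bool:
        return mean_change >= threshold
    return policy


# An "easy" sample starts near the target, a "hard" one starts far away.
easy_flow, easy_iters = refine_with_early_exit(
    [3.98, -1.98], toy_update, make_threshold_policy(0.05)
)
hard_flow, hard_iters = refine_with_early_exit(
    [0.0, 0.0], toy_update, make_threshold_policy(0.05)
)
print(f"easy sample: {easy_iters} iterations, hard sample: {hard_iters} iterations")
```

In this toy setting the easy sample exits much earlier than the hard one, which is the behavior the abstract describes at a high level. In the paper, the stopping decision comes from a trained policy network rather than a fixed threshold.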

