Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
117 tokens/sec
GPT-4o
8 tokens/sec
Gemini 2.5 Pro Pro
47 tokens/sec
o3 Pro
5 tokens/sec
GPT-4.1 Pro
38 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

VisIRNet: Deep Image Alignment for UAV-taken Visible and Infrared Image Pairs (2402.09635v1)

Published 15 Feb 2024 in cs.CV

Abstract: This paper proposes a deep learning based solution for multi-modal image alignment regarding UAV-taken images. Many recently proposed state-of-the-art alignment techniques rely on using Lucas-Kanade (LK) based solutions for a successful alignment. However, we show that we can achieve state of the art results without using LK-based methods. Our approach carefully utilizes a two-branch based convolutional neural network (CNN) based on feature embedding blocks. We propose two variants of our approach, where in the first variant (ModelA), we directly predict the new coordinates of only the four corners of the image to be aligned; and in the second one (ModelB), we predict the homography matrix directly. Applying alignment on the image corners forces algorithm to match only those four corners as opposed to computing and matching many (key)points, since the latter may cause many outliers, yielding less accurate alignment. We test our proposed approach on four aerial datasets and obtain state of the art results, when compared to the existing recent deep LK-based architectures.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (54)
  1. Medical image registration: Classification, applications and issues. Journal of Postgraduate Medical Institute, 32:300–3007, 12 2018.
  2. S. Baker and I. Matthews. Lucas-kanade 20 years on: A unifying framework. International journal of computer vision, 56(3):221–255, 2004.
  3. D. Barath and Z. Kukelova. Relative pose from sift features, 2022.
  4. Surf: Speeded up robust features. Computer vision and image understanding, 110(3):346–359, 2006.
  5. An automatic image registration for applications in remote sensing. IEEE Transactions on Geoscience and Remote Sensing, 43(9):2127–2137, 2005.
  6. An automatic image registration for applications in remote sensing. Geoscience and Remote Sensing, IEEE Transactions on, 43:2127 – 2137, 10 2005.
  7. S. Bhowmick. The RGB rendering of visible wavelength lights (2019 02 28 14 47 31 UTC). 01 2017.
  8. Performing calibration of transmittance by single rgb-led within the visible spectrum. Sensors, 20(12), 2020.
  9. Clkn: Cascaded lucas-kanade networks for image alignment. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), July 2017.
  10. Real-time registration in image stitching under the microscope. In 2018 IEEE 9th International Conference on Software Engineering and Service Science (ICSESS), pages 907–911, 2018.
  11. Extending hyper-spectral imaging from vis to nir spectral regions: A novel scanner for the in-depth analysis of polychrome surfaces. Proc SPIE, 8790:09–, 05 2013.
  12. Visible and infrared imaging spectroscopy of paintings and improved reflectography. Heritage Science, 4, 03 2016.
  13. Interpretable Multi-Modal Image Registration Network Based on Disentangled Convolutional Sparse Coding. IEEE Transactions on Image Processing, 32:1078–1091, Jan. 2023.
  14. Deep image homography estimation. CoRR, abs/1606.03798, 2016.
  15. Implementation of the lucas-kanade image registration algorithm on a gpu for 3d computational platform stabilisation. pages 83–90, 06 2010.
  16. Random sample consensus: A paradigm for model fitting with applications to image analysis and automated cartography. Communications of the ACM, 24(6):381–395, 1981.
  17. G. Fox. The brewing industry and the opportunities for real-time quality analysis using infrared spectroscopy. Applied Sciences, 10, 01 2020.
  18. R. Gade and T. B. Moeslund. Thermal cameras and applications: a survey. Machine Vision and Applications, 25(1):245–262, Jan 2014.
  19. Digital image processing. 2008.
  20. A. A. Goshtasby. Image Registration - Principles, Tools and Methods. Advances in Computer Vision and Pattern Recognition. Springer, 2012.
  21. C. Harris and M. Stephens. A combined corner and edge detector. In Proceedings of the 4th Alvey Vision Conference, pages 147–151, 1988.
  22. R. Hartley and A. Zisserman. Multiple view geometry in computer vision. Cambridge university press, 2003.
  23. Medical image registration. Physics in Medicine and Biology, 46(3):R1, mar 2001.
  24. Image registration among uav image sequence and google satellite image under quality mismatch. In 2012 12th International Conference on ITS Telecommunications, pages 311–315, 2012.
  25. Spatial transformer networks. CoRR, abs/1506.02025, 2015.
  26. Effect of low-pass filters as a shi-tomasi corner detector’s window functions. 07 2018.
  27. E. J. Kirkland. Bilinear Interpolation, pages 261–263. Springer US, Boston, MA, 2010.
  28. Deep homography estimation for dynamic scenes. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), June 2020.
  29. Deep global feature-based template matching for fast multi-modal image registration. In 2021 IEEE International Geoscience and Remote Sensing Symposium IGARSS, pages 5457–5460. IEEE, 2021.
  30. J. P. Lewis. Fast normalized cross-correlation. Vision interface, 10(1):120–123, 1995.
  31. Y. F. LI Fuyu. Summarization of sift-based remote sensing image registration techniques. Remote Sensing for Natural Resources, 28(2):14, 2016.
  32. Remote sensing image registration approach based on a retrofitted sift algorithm and lissajous-curve trajectories. Opt. Express, 18(2):513–522, Jan 2010.
  33. Microsoft COCO: common objects in context. CoRR, abs/1405.0312, 2014.
  34. D. G. Lowe. Object recognition from local scale-invariant features. In Proceedings of the seventh IEEE international conference on computer vision, volume 2, pages 1150–1157. Ieee, 1999.
  35. D. G. Lowe. Distinctive image features from scale-invariant keypoints. International journal of computer vision, 60(2):91–110, 2004.
  36. Infrared and visible image homography estimation using multiscale generative adversarial network. Electronics, 12(4), 2023.
  37. Creating rgb images from hyperspectral images using a color matching function. In IGARSS 2020 - 2020 IEEE International Geoscience and Remote Sensing Symposium, pages 2045–2048, 2020.
  38. Image registration for sequence of visual images captured by uav. In 2009 IEEE Symposium on Computational Intelligence for Multimedia Signal and Vision Processing, pages 91–97, 2009.
  39. Remote sensing image processing based on modified fuzzy algorithm. In R. Silhavy, editor, Artificial Intelligence and Bioinspired Computational Methods, pages 563–572, Cham, 2020. Springer International Publishing.
  40. P. Monasse. Extraction of the Level Lines of a Bilinear Image. Image Processing On Line, 9:205–219, 2019. https://doi.org/10.5201/ipol.2019.269.
  41. Linear intensity-based image registration. International Journal of Advanced Computer Science and Applications, 9:211–217, 01 2018.
  42. Threshold accepting approach for image registration. UACEE International Journal of Computer Science and its Applications, 2, 2012.
  43. S. Ozer. Similarity domains machine for scale-invariant and sparse shape modeling. IEEE Transactions on Image Processing, 28(2):534–545, 2018.
  44. S. Özer. Feature matching with similarity domains network. In 2020 28th Signal Processing and Communications Applications Conference (SIU), pages 1–4. IEEE, 2020.
  45. Siamesefuse: A computationally efficient and a not-so-deep network to fuse visible and infrared images. Pattern Recognition, 129:108712, 2022.
  46. Offloading deep learning powered vision tasks from uav to 5g edge server with denoising. IEEE Transactions on Vehicular Technology, 2023.
  47. M. A. Özkanoğlu and S. Ozer. Infragan: A gan architecture to transfer visible images to infrared domain. Pattern Recognition Letters, 155:69–76, 2022.
  48. A comparative analysis of ransac techniques leading to adaptive real-time random sample consensus. volume 5303, pages 500–513, 10 2008.
  49. L. Ray. 2-d and 3-d image registration for medical, remote sensing, and industrial applications. Journal of Electronic Imaging, 14:9901–, 07 2005.
  50. S. Razakarivony and F. Jurie. Vehicle detection in aerial imagery : A small target detection benchmark. Journal of Visual Communication and Image Representation, 34:187–203, 2016.
  51. J. A. Rice. Mathematical Statistics and Data Analysis. Duxbury Press, Belmont, CA, 3rd edition, 2007.
  52. Feature based image registration using heuristic nearest neighbour search. In 2018 22nd International Computer Science and Engineering Conference (ICSEC), pages 1–3, 2018.
  53. Image quality assessment: from error visibility to structural similarity. IEEE transactions on image processing, 13(4):600–612, 2004.
  54. Deep lucas-kanade homography for multimodal image alignment. CoRR, abs/2104.11693, 2021.
Citations (1)

Summary

We haven't generated a summary for this paper yet.