A Self-Supervised Method for Body Part Segmentation and Keypoint Detection of Rat Images (2405.04650v1)
Abstract: Recognition of individual components and keypoint detection supported by instance segmentation is crucial to analyze the behavior of agents on the scene. Such systems could be used for surveillance, self-driving cars, and also for medical research, where behavior analysis of laboratory animals is used to confirm the aftereffects of a given medicine. A method capable of solving the aforementioned tasks usually requires a large amount of high-quality hand-annotated data, which takes time and money to produce. In this paper, we propose a method that alleviates the need for manual labeling of laboratory rats. To do so, first, we generate initial annotations with a computer vision-based approach, then through extensive augmentation, we train a deep neural network on the generated data. The final system is capable of instance segmentation, keypoint detection, and body part segmentation even when the objects are heavily occluded.
- Carion, Nicolas, Francisco Massa, Gabriel Synnaeve, Nicolas Usunier, Alexander Kirillov, and Sergey Zagoruyko. “End-to-End Object Detection with Transformers.” arXiv preprint arXiv:2005.12872 (2020).
- He, Kaiming, Georgia Gkioxari, Piotr Dollár, and Ross Girshick. “Mask r-cnn.” In Proceedings of the IEEE international conference on computervision (2017): 2961–2969.
- Varga, Viktor and András Lőrincz. “Reducing human efforts in video segmentation annotation with reinforcement learning.” Neurocomputing 405 (2020): 247-258.
- Khoreva, Anna, Rodrigo Benenson, Eddy Ilg, Thomas Brox, and Bernt Schiele. “Lucid data dreaming for video object segmentation.” International journal of computer vision (2018): 1–23.
- Perazzi, Federico, Anna Khoreva, Rodrigo Benenson, Bernt Schiele, and Alexander Sorkine-Hornung. “Learning video object segmentation from static images.” In Proceedings of the IEEE conference on computer vision and pattern recognition (2017): 2663–2672.
- Lin, Tsung-Yi, Michael Maire, Serge Belongie, James Hays, Pietro Perona, Deva Ramanan, Piotr Dollár, and C. Lawrence Zitnick. “Microsoft coco: Common objects in context.” In European conference on computer vision (2014): 740-755.
- Zhang, T. Y., and Ching Y. Suen. “A fast parallel algorithm for thinning digital patterns.” Communications of the ACM 27, no. 3 (1984): 236-239.
- Neubert, Peer, and Peter Protzel. “Compact watershed and preemptive slic: On improving trade-offs of superpixel segmentation algorithms.” In 2014 22nd International Conference on Pattern Recognition, IEEE (2014): 996-1001.
- Hanson, Frank Blair, and Florence Heys. “Correlations of Body Weight, Body Length, and Tail Length in Normal and Alcoholic Albino Rats.” Genetics 9, no. 4 (1924): 368.
- Bookstein, Fred L. “Principal warps: Thin-plate splines and the decomposition of deformations.” IEEE Transactions on pattern analysis and machine intelligence 11, no. 6 (1989): 567-585.
- He, Kaiming, Xiangyu Zhang, Shaoqing Ren, and Jian Sun. “Deep residual learning for image recognition.” In Proceedings of the IEEE conference on computer vision and pattern recognition (2016): 770-778.
- Lin, Tsung-Yi, Piotr Dollár, Ross Girshick, Kaiming He, Bharath Hariharan, and Serge Belongie. “Feature pyramid networks for object detection.” In Proceedings of the IEEE conference on computer vision and pattern recognition (2017): 2117-2125.
- Kass, Michael, Andrew Witkin, and Demetri Terzopoulos. “Snakes: Active contour models.” International journal of computer vision 1, no. 4 (1988): 321-331.
- Bergmann, Philipp, Tim Meinhardt, and Laura Leal-Taixe. “Tracking without bells and whistles.” In Proceedings of the IEEE international conference on computer vision (2019): 941-951.
- Sun, Deqing, Xiaodong Yang, Ming-Yu Liu, and Jan Kautz. “Pwc-net: Cnns for optical flow using pyramid, warping, and cost volume.” In Proceedings of the IEEE conference on computer vision and pattern recognition (2018): 8934–8943.