Real-Time Idling Vehicles Detection using Combined Audio-Visual Deep Learning (2305.14579v3)

Published 23 May 2023 in cs.CV

Abstract: Combustion vehicle emissions contribute to poor air quality and release greenhouse gases into the atmosphere, and vehicle pollution has been associated with numerous adverse health effects. Roadways with extensive waiting or passenger drop-off, such as school and hospital drop-off zones, can have a high incidence and density of idling vehicles, which can produce micro-climates of increased vehicle pollution. Detecting idling vehicles can therefore help in monitoring and responding to unnecessary idling, and the detection can be integrated into real-time or off-line systems to address the resulting pollution. In this paper we present a real-time, dynamic vehicle idling detection algorithm, on which the proposed idle detection and notification system relies. The method uses a multi-sensor, audio-visual, machine-learning workflow to detect vehicles under three conditions: moving, static with the engine on, and static with the engine off. A visual vehicle motion detector is built in the first stage, and a contrastive-learning-based latent space is then trained to classify the engine sound of static vehicles. We test our system in real time at a hospital drop-off point in Salt Lake City. The in-situ dataset was collected and annotated, and it includes vehicles of varying models and types. The experiments show that the method can detect an engine switching on or off instantly and achieves 71.02 average precision (AP) for idle detections and 91.06 for engine-off detections.
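
The two-stage decision described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function name `classify_vehicle`, the `engine_on_prob` score, and the 0.5 threshold are hypothetical stand-ins for the outputs of the visual motion detector and the audio classifier, which the abstract does not specify.

```python
from enum import Enum


class VehicleState(Enum):
    """The three conditions named in the abstract."""
    MOVING = "moving"
    IDLING = "static, engine on"
    ENGINE_OFF = "static, engine off"


def classify_vehicle(is_moving: bool, engine_on_prob: float,
                     threshold: float = 0.5) -> VehicleState:
    """Combine a visual motion flag with an audio engine-sound score.

    is_moving:      hypothetical output of the first-stage visual motion detector.
    engine_on_prob: hypothetical score in [0, 1] from the audio classifier.
    threshold:      assumed decision threshold, not taken from the paper.
    """
    # Stage 1: a moving vehicle is never idling, so audio is not consulted.
    if is_moving:
        return VehicleState.MOVING
    # Stage 2: for a static vehicle, the engine sound decides the state.
    if engine_on_prob >= threshold:
        return VehicleState.IDLING
    return VehicleState.ENGINE_OFF


if __name__ == "__main__":
    print(classify_vehicle(is_moving=False, engine_on_prob=0.8).value)
```

In this sketch the audio score is only used once the vehicle is judged static, which mirrors the ordering in the abstract (motion detection first, engine sound classification second).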
