Using Combined Audio-Visual Deep Learning
Xiwen Li 1 , Tristalee Mangin 2 , Surojit Saha 1 , Mohammed Rehman 1 , Evan Blanchard 2 __, __
Dillon Tang 2 , Henry Poppe 2 , Ouk Choi 3 , Kerry Kelly 2 , Ross Whitaker 1
1 Scientific Computing and Imaging Institute, University of Utah
2 Department of Chemical Engineering, University of Utah
3 Incheon National University
Health Concern: vulnerable populations
Commercial Waste: fleet idling
__Global Warming: greenhouse gas __
Shorten sentences Roadways such as schools and hospital drop-off zones, where idling vehicles congregate, produce increased air pollution to such as long-haul idling trucks at depot/delivery centers, causes wasted fuel and engine wear
__Similar application: Dynamic speed display __
Motivated by effectiveness of dynamic speed sign
We design an intelligent transportation system (ITS) to show dynamic messaging
Reduce info
The trained vehicle motion detector is capable of localizing vehicle motion on unseen data
Fix groundtruth motion bounding box and only estimate audio classification
We observed false positives caused by loud noise makers and parallel moving vehicles
Smoothing prediction: window majority voting
We deploy for 14 days at a hospital*
Monitor Accuracy:
Has No Idle Cases (Accuracy) | Has Idle Cases (Accuracy) | |
---|---|---|
08.02.23 | 94.9% | 68.0% |
08.01.23 | 96.1% | 77.0% |
08.04.23 | 98.1% | 91.0% |
08.07.23 | 98.2% | 77.8% |
08.11.23 | 97.1% | 77.1% |
08.14.23 | 97.2% | 80.9% |
* LDS Hospital: 8th Avenue, C St E, Salt Lake City, UT 84143, USA
We built an algorithm and ITS to detect idling vehicles
We will test it at more locations to generalize the system