Spatio-Temporal Adaptation with Dilated Neighbourhood Attention for Accident Anticipation
Published in IEEE International Conference on Image Processing (ICIP), 2024
This study uses Parameter-Efficient Transfer Learning (PEFTL) and Dilated Neighborhood Attention (DNA) to adapt pretrained CLIP-ViT for traffic accident anticipation. By utilizing novel Spatial and Temporal Adapters with cross-attention, the model captures long-range dependencies more effectively, achieving state-of-the-art earliness and accuracy on the DAD and CCD datasets.
Recommended citation: P. Patera, Y. -T. Chen and W. -H. Fang, "Spatio-Temporal Adaptation With Dilated Neighbourhood Attention For Accident Anticipation," 2024 IEEE International Conference on Image Processing (ICIP), Abu Dhabi, United Arab Emirates, 2024, pp. 2452-2458, doi: 10.1109/ICIP51287.2024.10647316.
Download Paper
