Spotting football events using two-stream convolutional neural network and dilated recurrent neural network

Mahaseni, Behzad and Faizal, Erma Rahayu Mohd and Raj, Ram Gopal (2021) Spotting football events using two-stream convolutional neural network and dilated recurrent neural network. IEEE Access, 9. pp. 61929-61942. ISSN 2169-3536, DOI https://doi.org/10.1109/ACCESS.2021.3074831.

Full text not available from this repository.

Abstract

This paper addresses the problem of event detection and localization in long football (soccer) videos. Our key idea is that understanding the long-range dependencies between video frames is imperative for accurate event localization in long football videos. Additionally, proper event detection is not likely for fast movements in football videos without considering mid-range and short-range correlations between neighboring video frames. We argue that event spotting can be considerably improved by considering short-range to long-range frame dependencies in a unified architecture. To model long-range and mid-range dependencies, we propose to use the dilated recurrent neural network (DilatedRNN) with long short-term memory (LSTM) units, grounded on two-stream convolutional neural network (Two-stream CNN) features. While two-stream CNN extracts local spatiotemporal features necessary for fine-level details, the DilatedRNN makes the information obtained from distant frames available for the classifier and spotting algorithms. Evaluating our event spotting algorithm on the largest publicly available benchmark football dataset -SoccerNet- shows an accuracy improvement of 0.8% - 13.6% compared to state of the art, and up to 30.1% accuracy gain in comparison to the baselines. We also investigate the contribution of each neural network component in spotting accuracy through an extensive ablation study.

Item Type: Article
Funders: University of Malaya (UM) Research Grant (IIRG012C-2019)
Uncontrolled Keywords: Sports; Videos; Games; Event detection; Correlation; Feature extraction; Spatiotemporal phenomena; Deep learning; Dilated RNNs; Sport videos event detection; Two-stream CNNs
Subjects: Q Science > QA Mathematics > QA75 Electronic computers. Computer science
T Technology > TA Engineering (General). Civil engineering (General)
Divisions: Faculty of Computer Science & Information Technology > Department of Artificial Intelligence
Depositing User: Ms Zaharah Ramly
Date Deposited: 10 Jun 2022 07:47
Last Modified: 10 Jun 2022 07:47
URI: http://eprints.um.edu.my/id/eprint/27600

Actions (login required)

View Item View Item