mSODANet: A network for multi-scale object detection in aerial images using hierarchical dilated convolutions

Chalavadi, V. and Ch, Sobhan Babu and C, Krishna Mohan et al. (2022) mSODANet: A network for multi-scale object detection in aerial images using hierarchical dilated convolutions. Pattern Recognition, 126. pp. 1-10. ISSN 0031-3203

1-s2.0-S0031320322000292-main.pdf - Published Version
Restricted to Registered users only


Abstract

Object detection in aerial images is one of the most common tasks across a wide range of computer vision applications. However, it is challenging for the following reasons: (a) pixel occupancy varies among objects of different scales, (b) the distribution of objects in aerial images is not uniform, (c) the appearance of an object varies with viewpoint and illumination conditions, and (d) the number of objects, even of the same type, varies across images. To address these issues, we propose a novel network for multi-scale object detection in aerial images using hierarchical dilated convolutions, called mSODANet. In particular, we probe a hierarchical dilated network that uses parallel dilated convolutions to learn the contextual information of different types of objects at multiple scales and multiple fields of view. The introduced hierarchical dilated network captures the visual information of aerial images more effectively and enhances the detection capability of the model. Further, extensive experiments conducted on three challenging publicly available datasets, i.e., VisDrone2019, DOTA (OBB & HBB), and NWPU VHR-10, demonstrate the effectiveness of the proposed mSODANet, which achieves state-of-the-art performance on all three datasets. © 2022 Elsevier Ltd
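
The abstract does not give the layer configuration, so the following is only a generic sketch of what "parallel dilated convolutions" means, not the mSODANet architecture. It assumes a 1D signal and a shared 3-tap kernel for simplicity; the function names, the kernel, and the dilation rates (1, 2, 3) are all hypothetical choices made for illustration. A kernel of size k with dilation d spans k + (k - 1)(d - 1) input samples, so running several rates side by side gives branches with different fields of view over the same input.

```python
def effective_kernel_size(kernel_size, dilation):
    """Number of input samples spanned by one dilated kernel."""
    return kernel_size + (kernel_size - 1) * (dilation - 1)


def dilated_conv1d(signal, kernel, dilation):
    """'Valid' 1D cross-correlation with kernel taps spaced `dilation` apart."""
    span = effective_kernel_size(len(kernel), dilation)
    out = []
    for start in range(len(signal) - span + 1):
        out.append(sum(w * signal[start + i * dilation]
                       for i, w in enumerate(kernel)))
    return out


def parallel_dilated_branches(signal, kernel, dilations):
    """Apply the same kernel at several dilation rates (illustrative only)."""
    return {d: dilated_conv1d(signal, kernel, d) for d in dilations}


# Unit impulse at index 4; each branch sees it through a different span.
signal = [0, 0, 0, 0, 1, 0, 0, 0, 0]
kernel = [1, 1, 1]  # hypothetical weights, not learned
branches = parallel_dilated_branches(signal, kernel, dilations=(1, 2, 3))

for d, response in branches.items():
    print(f"dilation={d} span={effective_kernel_size(len(kernel), d)} "
          f"response={response}")
```

With the same three weights, the branches span 3, 5, and 7 samples respectively, which is the sense in which dilation widens the field of view without adding parameters. How the paper arranges such branches hierarchically is described in the full text, not in this record.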

IITH Creators:
Ch, Sobhan Babu (ORCiD: UNSPECIFIED)
C, Krishna Mohan (ORCiD: https://orcid.org/0000-0002-7316-0836)
Item Type: Article
Uncontrolled Keywords: Aerial images, Contextual features, Dilated convolutions, Multi-scale object detection
Subjects: Computer science
Divisions: Department of Computer Science & Engineering
Depositing User: LibTrainee 2021
Date Deposited: 25 Jun 2022 12:00
Last Modified: 27 Jun 2022 04:47
URI: http://raiithold.iith.ac.in/id/eprint/9398
Publisher URL: https://doi.org/10.1016/j.patcog.2022.108548
OA policy: https://v2.sherpa.ac.uk/id/publication/4665
