Deep learned compact binary descriptor with a lightweight network-in-network architecture for visual description

Bandara, Ravimal and Ranathunga, Lochandaka and Abdullah, Nor Aniza (2021) Deep learned compact binary descriptor with a lightweight network-in-network architecture for visual description. Visual Computer, 37 (2). pp. 275-290. ISSN 0178-2789, DOI https://doi.org/10.1007/s00371-020-01798-5.

Full text not available from this repository.

Abstract

Binary descriptors have been widely used for real-time image retrieval and correspondence matching. However, most of the learned descriptors are obtained using a large deep neural network (DNN) with several million parameters, and the learned binary codes are generally not invariant to many geometrical variances which is crucial for accurate correspondence matching. To address this problem, we proposed a new learning approach using a lightweight DNN architecture via a stack of multiple multilayer perceptrons based on the network in network (NIN) architecture, and a restricted Boltzmann machine (RBM). The latter is used for mapping the features to binary codes, and carry out the geometrically invariant correspondence matching task. Our experimental results on several benchmark datasets (e.g., Brown, Oxford, Paris, INRIA Holidays, RomePatches, HPatches, and CIFAR-10) show that the proposed approach produces the learned binary descriptor that outperforms other baseline self-supervised binary descriptors in terms of correspondence matching despite the smaller size of its DNN. Most importantly, the proposed approach does not freeze the features that are obtained while pre-training the NIN model. Instead, it fine-tunes the features while learning the features needed for binary mapping through the RBM. Additionally, its lightweight architecture makes it suitable for resource-constrained devices.

Item Type: Article
Funders: Senate Research Council, University of Moratuwa, Sri Lanka [SRC-16-1], National Research Council, Sri Lanka [12-017]
Uncontrolled Keywords: Binary descriptor; Network-in-network; Restricted Boltzmann machine; Correspondence matching; Lightweight deep neural network
Subjects: Q Science > QA Mathematics > QA75 Electronic computers. Computer science
Divisions: Faculty of Computer Science & Information Technology > Department of Computer System & Technology
Depositing User: Ms Zaharah Ramly
Date Deposited: 14 Apr 2022 01:31
Last Modified: 14 Apr 2022 01:31
URI: http://eprints.um.edu.my/id/eprint/26950

Actions (login required)

View Item View Item