Learned and hand-crafted feature fusion in unit ball for 3D object classification

Sameera Ramasinghe, The Australian National University
Salman Khan, The Australian National University
Nick Barnes, The Australian National University

Abstract

Convolution is an effective technique that can be used to obtain abstract feature representations using hierarchical layers in deep networks. However, performing convolution in non-Euclidean topological spaces such as the unit ball (B3) is still an under-explored problem. In this paper, we propose a light-weight experimental architecture for 3D object classification, that operates in B3. The proposed network utilizes both hand-crafted and learned features, and uses capsules in the penultimate layer to disentangle 3D shape features through pose and view equivariance. It simultaneously maintains an intrinsic co-ordinate frame, where mutual relationships between object parts are preserved. Furthermore, we show that the optimal view angles for extracting patterns from 3D objects depend on its shape and achieve compelling results with a relatively shallow network, compared to the state-of-the-art.