AF: An Association-based Fusion Method for Multi-Modal Classification

Authors: Xinyan Liang, Yuhua Qian, Qian Guo, Honghong Cheng, Jiye Liang


Multi-modal classifification (MMC) aims to integrate the complementary information from different modalities to improve classifification performance. Existing MMC methods can be grouped into two categories: traditional methods and deep learning-based methods. The traditional methods often implement fusion in a low-level original space. Besides, they mostly focus on the inter-modal fusion and neglect the intra-modal fusion. Thus, the representation capacity of fused features induced by them is insuffificient. The deep learning-based methods implement the fusion in a high-level feature space where the associations among features are considered, while the whole process is implicit and the fused space lacks interpretability. Based on these observations, we propose a novel interpretative association-based fusion method for MMC, named AF. In AF, both the association information and the high-order information extracted from feature space are simultaneously encoded into a new feature space to help to train an MMC model in an explicit manner. Moreover, AF is a general fusion framework, and most existing MMC methods can be embedded into it to improve their performance. Finally, the effectiveness and the generality of AF are validated on 22 datasets, four typically traditional MMC methods adopting best modality, early, late and model fusion strategies and a deep learning-based MMC method.

Keywords: Multi-modal fusion, association-based fusion, interpretative fusion, multi-modal classification, association representation.


Sat Jan 01 00:00:00 CST 2022