ABSTRACT
Multi-label classification is the challenging task of predicting the presence and absence of multiple targets, involving representation learning and label correlation modeling. We propose a novel framework for multi-label classification, Multivariate Probit Variational AutoEncoder (MPVAE), that effectively learns latent embedding spaces as well as label correlations. MPVAE learns and aligns two probabilistic embedding spaces for labels and features respectively. The decoder of MPVAE takes in the samples from the embedding spaces and models the joint distribution of output targets under a Multivariate Probit model by learning a shared covariance matrix. We show that MPVAE outperforms the existing state-of-the-art methods on important computational sustainability applications as well as on other application domains, using public real-world datasets1. MPVAE is further shown to remain robust under noisy settings. Lastly, we demonstrate the interpretability of the learned covariance by a case study on a bird observation dataset.
- Raed Alazaidah, Fadi Thabtah, and Qasem Al-Radaideh. A multi-label classification approach based on correlations among labels. International Journal of Advanced Computer Science and Applications, 2015.Google Scholar
- Fernando Benites and Elena Sapozhnikova. Haram: a hierarchical aram neural network for large-scale text classification. In 2015 IEEE International Conference on Data Mining Workshop (ICDMW), pages 847-854. IEEE, 2015.Google Scholar
- Kush Bhatia, Himanshu Jain, Purushottam Kar, Manik Varma, and Prateek Jain. Sparse local embeddings for extreme multi-label classification. In Advances in neural information processing systems, 2015.Google Scholar
- Wei Bi and James T Kwok. Multilabel classification with label correlations and missing labels. In Proceedings of the Twenty-Eighth AAAI Conference on Artificial Intelligence, pages 1680-1686, 2014.Google Scholar
- Matthew R Boutell, Jiebo Luo, Xipeng Shen, and Christopher M Brown. Learning multilabel scene classification. Pattern recognition, 37(9):1757- 1771, 2004.Google Scholar
- James A Carton, Gennady A Chepurin, and Ligang Chen. Soda3: A new ocean climate reanalysis. Journal of Climate, 31(17):6967-6983, 2018.Google Scholar
- Yao-Nan Chen and Hsuan-Tien Lin. Feature-aware label space dimension reduction for multilabel classification. In Advances in Neural Information Processing Systems, pages 1529-1537, 2012.Google Scholar
- Di Chen, Yexiang Xue, Daniel Fink, Shuo Chen, and Carla P Gomes. Deep multi-species embedding. In Proceedings of the 26th International Joint Conference on Artificial Intelligence, pages 3639-3646, 2017.Google Scholar
- Di Chen, Yexiang Xue, and Carla Gomes. End-to-end learning for the deep multivariate probit model. In International Conference on Machine Learning, pages 932-941, 2018.Google Scholar
- Chen Chen, Haobo Wang, Weiwei Liu, Xingyuan Zhao, Tianlei Hu, and Gang Chen. Twostage label embedding via neural factorization machine for multi-label classification. In Proceedings of the AAAI Conference on Artificial Intelligence, volume 33, pages 3304- 3311, 2019.Google Scholar
- Zhao-Min Chen, Xiu-Shen Wei, Peng Wang, and Yanwen Guo. Multi-label image recognition with graph convolutional networks. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pages 5177-5186, 2019.Google Scholar
- Tsung-Hsien Chiang, Hung-Yi Lo, and Shou-De Lin. A ranking-based knn approach for multilabel classification. In Asian Conference on Machine Learning, pages 81-96, 2012.Google Scholar
- Hong-Min Chu, Chih-Kuan Yeh, and Yu-Chiang Frank Wang. Deep generative models for weakly-supervised multi-label classification. In Proceedings of the European Conference on Computer Vision, 2018.Google Scholar
- Tat-Seng Chua, Jinhui Tang, Richang Hong, Haojie Li, Zhiping Luo, and Yantao Zheng. Nuswide: a real-world web image database from national university of singapore. In Proceedings of the ACM international conference on image and video retrieval, pages 1-9, 2009.Google Scholar
- Junyoung Chung, Kyle Kastner, Laurent Dinh, Kratarth Goel, Aaron C Courville, and Yoshua Bengio. A recurrent latent variable model for sequential data. In Advances in neural information processing systems, pages 2980-2988, 2015.Google Scholar
- Daniel M Evans, Judy P Che-Castaldo, Deborah Crouse, Frank W Davis, Rebecca Epanchin-Niell, Curtis H Flather, R Kipp Frohlich, Dale D Goble, Ya-Wei Li, and Timothy D Male. Species recovery in the united states: increasing the effectiveness of the endangered species act. Issues in Ecology, 2017.Google Scholar
- Carla Gomes, Thomas Dietterich, Christopher Barrett, Jon Conrad, Bistra Dilkina, Stefano Ermon, Fei Fang, Andrew Farnsworth, Alan Fern, Xiaoli Fern, et al. Computational sustainability: Computing for a better world and a sustainable future. Communications of the ACM, 62(9):56-65, 2019.Google Scholar
- Irina Higgins, Loic Matthey, Arka Pal, Christopher Burgess, Xavier Glorot, Matthew Botvinick, Shakir Mohamed, and Alexander Lerchner. β-vae: Learning basic visual concepts with a constrained variational framework. The Eighth International Conference on Learning Representations, 2(5):6, 2017.Google Scholar
- Mark J. Huiskes and Michael S. Lew. The mir flickr retrieval evaluation. In MIR '08: Proceedings of the 2008 ACM International Conference on Multimedia Information Retrieval, New York, NY, USA, 2008. ACM.Google Scholar
- Ioannis Katakis, Grigorios Tsoumakas, and Ioannis Vlahavas. Multilabel text classification for automated tag suggestion. Discovery Challenge in Joint European Conference on Machine Learning and Knowledge Discovery in Databases (ECML-PKDD), page 75, 2008.Google Scholar
- Diederik P Kingma and Jimmy Ba. Adam: A method for stochastic optimization. International conference on learning representation, 2015.Google Scholar
- Michael Kuhn, Ivica Letunic, Lars Juhl Jensen, and Peer Bork. The sider database of drugs and side effects. Nucleic acids research, 44(D1):D1075- D1079, 2015.Google Scholar
- Michael Kuhn, Ivica Letunic, Lars Juhl Jensen, and Peer Bork. The sider database of drugs and side effects. Nucleic acids research, 44(D1):D1075- D1079, 2016.Google Scholar
- Jack Lanchantin, Arshdeep Sekhon, and Yanjun Qi. Neural message passing for multi-label classification. Joint European Conference on Machine Learning and Knowledge Discovery in Databases, 2019.Google Scholar
- James W. Morley, Rebecca L. Selden, Robert J. Latour, Thomas L. Frölicher, Richard J. Seagraves, and Malin L. Pinsky. Projecting shifts in thermal habitat for 686 species on the north american continental shelf. PLOS ONE, 13(5):1-28, 05 2018.Google Scholar
- M Arthur Munson, Kevin Webb, Daniel Sheldon, Daniel Fink, Wesley M Hochachka, Marshall Iliff, Mirek Riedewald, Daria Sorokina, Brian Sullivan, Christopher Wood, et al. The ebird reference dataset. Cornell Lab of Ornithology and National Audubon Society, 2011.Google Scholar
- Kenta Nakai and Minoru Kanehisa. A knowledge base for predicting protein localization sites in eukaryotic cells. Genomics, 14(4):897-911, 1992.Google Scholar
- Jesse Read, Bernhard Pfahringer, Geoff Holmes, and Eibe Frank. Classifier chains for multilabel classification. In Joint European Conference on Machine Learning and Knowledge Discovery in Databases, pages 254-269. Springer, 2009.Google Scholar
- Grigorios Tsoumakas, Ioannis Katakis, and Ioannis Vlahavas. Effective and efficient multilabel classification in domains with large number of labels. In Proceedings of ECML-PKDD 2008 Workshop on Mining Multidimensional Data, pages 53-59, 2008.Google Scholar
- Sjoerd van Steenkiste, Francesco Locatello, Jürgen Schmidhuber, and Olivier Bachem. Are disentangled representations helpful for abstract visual reasoning? In Advances in Neural Information Processing Systems, pages 14222-14235, 2019.Google Scholar
- Jiang Wang, Yi Yang, Junhua Mao, Zhiheng Huang, Chang Huang, and Wei Xu. Cnn-rnn: A unified framework for multi-label image classification. In Proceedings of the IEEE conference on computer vision and pattern recognition, pages 2285-2294, 2016.Google Scholar
- Baoyuan Wu, Fan Jia, Wei Liu, Bernard Ghanem, and Siwei Lyu. Multi-label learning with missing labels using mixed dependency graphs. International Journal of Computer Vision, 126(8):875-896, 2018.Google Scholar
- Chih-Kuan Yeh, Wei-Chieh Wu, Wei-Jen Ko, and Yu-Chiang Frank Wang. Learning deep latent space for multi-label classification. In Thirty-First AAAI Conference on Artificial Intelligence, 2017.Google Scholar
- Hsiang-Fu Yu, Prateek Jain, Purushottam Kar, and Inderjit Dhillon. Large-scale multi-label learning with missing labels. In International conference on machine learning, pages 593-601, 2014.Google Scholar
- Yu Zhang and Dit-Yan Yeung. Multilabel relationship learning. ACM Transactions on Knowledge Discovery from Data (TKDD), 7(2):7, 2013.Google Scholar
- Min-Ling Zhang and Zhi-Hua Zhou. Ml-knn: A lazy learning approach to multilabel learning. Pattern recognition, 40(7):2038-2048, 2007.Google Scholar
- Min-Ling Zhang and Zhi-Hua Zhou. A review on multi-label learning algorithms. IEEE transactions on knowledge and data engineering, 26(8):1819-1837, 2013.Google Scholar
- Min-Ling Zhang, Yu-Kun Li, Xu-Ying Liu, and Xin Geng. Binary relevance for multilabel learning: an overview. Frontiers of Computer Science, 12(2):191-202, 2018.Google Scholar
Index Terms
(auto-classified)Disentangled variational autoencoder based multi-label classification with covariance-aware multivariate probit model
Comments