Deep multi-modal learning,a rapidly growing field with a wide range of practical applications,aims to effectively utilize and integrate information from multiple sources,known as modalities.Despite its impressive empi...Deep multi-modal learning,a rapidly growing field with a wide range of practical applications,aims to effectively utilize and integrate information from multiple sources,known as modalities.Despite its impressive empirical performance,the theoretical foundations of deep multi-modal learning have yet to be fully explored.In this paper,we will undertake a comprehensive survey of recent developments in multi-modal learning theories,focusing on the fundamental properties that govern this field.Our goal is to provide a thorough collection of current theoretical tools for analyzing multi-modal learning,to clarify their implications for practitioners,and to suggest future directions for the establishment of a solid theoretical foundation for deep multi-modal learning.展开更多
Artificial intelligence (AI) is almo st everywhere due to the rapid development of modern technology and popularity of intelligent devices.While control theory and machine learning techniques as two enabling technolog...Artificial intelligence (AI) is almo st everywhere due to the rapid development of modern technology and popularity of intelligent devices.While control theory and machine learning techniques as two enabling technologies have shown enormous power in their own right,a rapprochement of them is required to handle nonlinearity,uncertainty and scalability induced by high complexity of modern systems,huge quantity of real-time data,and large scale of agent networks.Journal of Automation and Intelligence (JAI) aims to provide a platform for researchers and practitioners from both academia and industry to exchange their ideas and present new developments across multiple disciplines relevant to automation and artificial intelligence with particular attention to machine learning.展开更多
基金Supported by Technology and Innovation Major Project of the Ministry of Science and Technology of China(2020AAA0108400, 2020AAA0108403)Tsinghua Precision Medicine Foundation(10001020109)。
文摘Deep multi-modal learning,a rapidly growing field with a wide range of practical applications,aims to effectively utilize and integrate information from multiple sources,known as modalities.Despite its impressive empirical performance,the theoretical foundations of deep multi-modal learning have yet to be fully explored.In this paper,we will undertake a comprehensive survey of recent developments in multi-modal learning theories,focusing on the fundamental properties that govern this field.Our goal is to provide a thorough collection of current theoretical tools for analyzing multi-modal learning,to clarify their implications for practitioners,and to suggest future directions for the establishment of a solid theoretical foundation for deep multi-modal learning.
文摘Artificial intelligence (AI) is almo st everywhere due to the rapid development of modern technology and popularity of intelligent devices.While control theory and machine learning techniques as two enabling technologies have shown enormous power in their own right,a rapprochement of them is required to handle nonlinearity,uncertainty and scalability induced by high complexity of modern systems,huge quantity of real-time data,and large scale of agent networks.Journal of Automation and Intelligence (JAI) aims to provide a platform for researchers and practitioners from both academia and industry to exchange their ideas and present new developments across multiple disciplines relevant to automation and artificial intelligence with particular attention to machine learning.