Transformers, the dominant architecture for natural language processing, have also recently attracted much attention from computational visual media researchers due to their capacity for long-range representation and high performance. Transformers are sequence-to-sequence models, which use a self-attention mechanism rather than the RNN sequential structure. Thus, such models can be trained in parallel and can represent global information. This study comprehensively surveys recent visual transformer works. We categorize them according to task scenario: backbone design, high-level vision, low-level vision and generation, and multimodal learning. Their key ideas are also analyzed. Differing from previous surveys, we mainly focus on visual transformer methods in low-level vision and generation. The latest works on backbone design are also reviewed in detail. For ease of understanding, we precisely describe the main contributions of the latest works in the form of tables. As well as giving quantitative comparisons, we also present image results for low-level vision and generation tasks. Computational costs and source code links for various important works are also given in this survey to assist further development.
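The self-attention mechanism mentioned in the abstract above can be illustrated with a minimal sketch of single-head scaled dot-product attention. This is not code from the survey; the function names (`softmax`, `matmul`, `self_attention`) and the projection matrices `w_q`, `w_k`, `w_v` are hypothetical names chosen for illustration, and plain Python lists are used instead of a tensor library to keep the example self-contained.

```python
import math


def softmax(row):
    """Numerically stable softmax over one list of scores."""
    m = max(row)
    exps = [math.exp(x - m) for x in row]
    total = sum(exps)
    return [e / total for e in exps]


def matmul(a, b):
    """Multiply two matrices given as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)]
            for row in a]


def self_attention(x, w_q, w_k, w_v):
    """Single-head scaled dot-product self-attention (illustrative sketch).

    x has one row per token; w_q, w_k, w_v are assumed projection matrices.
    Returns the attended outputs and the attention weights.
    """
    q = matmul(x, w_q)
    k = matmul(x, w_k)
    v = matmul(x, w_v)
    d_k = len(k[0])
    k_t = [list(col) for col in zip(*k)]
    # Every token scores every other token in one matrix product, so there
    # is no step-by-step recurrence as in an RNN.
    scores = matmul(q, k_t)
    weights = [softmax([s / math.sqrt(d_k) for s in row]) for row in scores]
    return matmul(weights, v), weights
```

Because each output row is a weighted sum over all token values, every position can draw on the whole sequence, and all rows can be computed independently of one another.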
Line drawings, as a concise form, can be recognized by infants and even chimpanzees. Recently, how the visual system processes line drawings has attracted increasing attention from psychology, cognitive science and computer science. Neuroscientific studies have revealed that line drawings elicit neural activity similar to that elicited by color photographs, which offers insight into how to process big media data efficiently. In this paper, we present a comprehensive survey of line drawing studies, including the cognitive mechanisms of visual perception, computational models in computer vision and intelligent processing in diverse media applications. Major debates, challenges and solutions that have been addressed over the years are discussed. Finally, some of the remaining challenges in line drawing studies are outlined.
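Based on mixed programming in Visual C# and MATLAB, a software development scheme for computing the filtration performance of fibrous filter media is proposed, which reconstructs the three-dimensional structure of fibrous filter media and computes their filtration performance. First, two-dimensional images of the internal microstructure of the filter medium are obtained by scanning electron microscope (SEM) imaging, and the geometric parameters of the medium are extracted. The three-dimensional structure of the medium is then reconstructed from parameters such as fiber radius and orientation, and the pressure drop and efficiency of the medium are calculated using classical empirical formulas. The software was developed with Visual Studio 2010 as the development platform and Visual C# as the development language, calling MATLAB programs through DLL files. Compared with computing filter media performance by numerical simulation or experiment, computing it with this software is simple and convenient.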
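The abstract above does not say which empirical formulas the software uses. As a sketch of what the pressure-drop step might look like, the example below assumes the Davies correlation for fibrous media, one commonly cited empirical formula; this choice, the function name `davies_pressure_drop`, and all parameter names are assumptions made for illustration, not details taken from the paper.

```python
def davies_pressure_drop(viscosity, face_velocity, thickness,
                         fiber_diameter, solidity):
    """Pressure drop (Pa) across a fibrous filter, Davies correlation.

    ASSUMPTION: the source only says "classical empirical formulas";
    Davies' correlation is used here purely as an example.

    viscosity      -- dynamic viscosity of the gas, Pa*s
    face_velocity  -- face velocity of the flow, m/s
    thickness      -- filter thickness, m
    fiber_diameter -- fiber diameter (twice the fiber radius), m
    solidity       -- fiber volume fraction (packing density), dimensionless
    """
    # Davies' dimensionless drag term: 64 * a^1.5 * (1 + 56 * a^3)
    drag = 64.0 * solidity ** 1.5 * (1.0 + 56.0 * solidity ** 3)
    return viscosity * face_velocity * thickness * drag / fiber_diameter ** 2


# Illustrative inputs only (air at room temperature, 10 micrometre fibers).
dp = davies_pressure_drop(viscosity=1.81e-5, face_velocity=0.1,
                          thickness=1e-3, fiber_diameter=10e-6,
                          solidity=0.05)
```

In a pipeline like the one described, the fiber diameter and solidity would come from the geometric parameters extracted from the SEM images rather than being typed in by hand.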
Funding: Supported by the National Key R&D Program of China under Grant No. 2020AAA0106200 and the National Natural Science Foundation of China under Grant Nos. 61832016 and U20B2070.