60 articles found
Psychophysics of wearable haptic/tactile perception in a multisensory context
1
Authors: Xiao LEI, Tingwei ZHANG, Kun CHEN, Jue ZHANG, Yue TIAN, Fang FANG, Lihan CHEN. Virtual Reality & Intelligent Hardware, 2019, No. 2, pp. 185-200 (16 pages)
The Multisensory Lab at Peking University has carried out basic studies in multisensory space and time processing, intersensory binding, and haptic/tactile perception. We exploited a typical paradigm of multisensory illusion, the temporal ventriloquist effect, and applied it to a wide range of multisensory interactions (mainly focused on temporal processing). In this work, we summarize how tactile stimuli were used to compose tactile cues and tactile apparent motion that interface with other sensory stimuli (visual and auditory) to examine the underlying perceptual organization in a multisensory context. Moreover, we introduce two examples of wearable haptic/tactile perception from our lab, using two customized tactile devices, and discuss potential applications in this field.
Keywords: Multisensory; Ventriloquism effect; Wearable haptics; Perceptual organization
Consensus of high-order dynamic multi-agent systems with switching topology and time-varying delays (cited by: 12)
2
Authors: Fangcui JIANG, Long WANG, Guangming XIE. 控制理论与应用(英文版) [EI], 2010, No. 1, pp. 52-60 (9 pages)
This paper studies the consensus problems for a group of agents with switching topology and time-varying communication delays, where the dynamics of each agent is modeled as a high-order integrator. A linear distributed consensus protocol is proposed, which depends only on the agent's own information and its neighbors' partial information. By introducing a decomposition of the state vector and performing a state space transformation, the closed-loop dynamics of the multi-agent system is converted into two decoupled subsystems. Based on the decoupled subsystems, some sufficient conditions for the convergence to consensus are established, which provide upper bounds on the admissible communication delays. The explicit expression of the consensus state is also derived. Moreover, the results on consensus seeking for the group of high-order agents are extended to a network of agents whose dynamics are modeled as a completely controllable linear time-invariant system. It is proved that the convergence to consensus of this network is equivalent to that of the group of high-order agents. Finally, some numerical examples are given to demonstrate the effectiveness of the main results.
Keywords: Consensus problems; Distributed control; Multi-agent systems; Switching topology; Time-varying delays; Lyapunov-Krasovskii approach
SADDLE-POINT BASED SEPARATION OF TOUCHED OBJECTS IN 2-D IMAGE (cited by: 5)
3
Authors: Chen Ken, Larry E. Banta, Jiang Gangyi. Journal of Electronics (China), 2006, No. 3, pp. 452-456 (5 pages)
In many image analysis and processing problems, discriminating the size and shape of each individual object in an aggregate pile projected in an image is an important practice. It is relatively easy to distinguish these features among objects that are already separated from each other. The problem is more complex and challenging if the objects touch and/or overlap. This letter presents an algorithm that can be used to separate the touches and overlaps among the objects within a 2-D image. The approach first converts the gray-scale image to its corresponding binary image and then to a 3-D topographic one using erosion operations. A template (or mask) is engineered to search the topographic surface for the saddle point, from which the segmenting orientation is determined, followed by the desired separating operation. The algorithm is tested on a real image, and the result is satisfactory and encouraging.
Keywords: Image processing; Segmentation; Objects separation; Morphological processing; Touch and overlap; Aggregates images
The history, hotspots, and trends of electrocardiogram (cited by: 4)
4
Authors: Xiang-Lin YANG, Guo-Zhen LIU, Yun-Hai TONG, Hong YAN, Zhi XU, Qi CHEN, Xiang LIU, Hong-Hao ZHANG, Hong-Bo WANG, Shao-Hua TAN. Journal of Geriatric Cardiology [SCIE, CAS, CSCD], 2015, No. 4, pp. 448-456 (9 pages)
The electrocardiogram (ECG) has broad applications in clinical diagnosis and prognosis of cardiovascular disease. Many researchers have contributed to its progressive development. To commemorate those pioneers, and to better study and promote the use of ECG, we reviewed and present here a systematic introduction to the history, hotspots, and trends of ECG. In the historical part, information including the invention, improvement, and extensive applications of ECG, such as in long QT syndrome (LQTS), angina, and myocardial infarction (MI), is presented chronologically. New technologies and applications from the 1990s are also introduced. In the second part, we use bibliometric analysis to identify the hotspots in ECG-related research. By using total citations and year-specific total citations as our main criteria, four key hotspots in ECG-related research were identified from 11 articles: atrial fibrillation, LQTS, angina and MI, and heart rate variability. Recent studies in those four areas are also reported. In the final part, we discuss the future trends concerning ECG-related research. The authors believe that improvement of ECG instrumentation, big data mining for ECG, and the accuracy of diagnosis and application will be areas of continuous concern.
Keywords: Electrocardiogram; History; Hotspots; Review; Trends
Interaction between auditory and motor systems in speech perception (cited by: 2)
5
Authors: Zhe-Meng Wu, Ming-Li Chen, Xi-Hong Wu, Liang Li. Neuroscience Bulletin [SCIE, CAS, CSCD], 2014, No. 3, pp. 490-496 (7 pages)
According to the Motor Theory of speech perception, the interaction between the auditory and motor systems plays an essential role in speech perception. Since the Motor Theory was proposed, it has received remarkable attention in the field. However, each of the three hypotheses of the theory still needs further verification. In this review, we focus on how the auditory-motor anatomical and functional associations play a role in speech perception, and discuss why previous studies could not reach an agreement, particularly on whether the motor system's involvement in speech perception is task-load dependent. Finally, we suggest that the function of the auditory-motor link is particularly useful for speech perception under adverse listening conditions and that the further revised Motor Theory is a potential solution to the "cocktail-party" problem.
Keywords: auditory-motor interaction; Motor Theory of speech perception; motor cortex; "cocktail-party" problem
Digital Autofocusing Method Based on Contourlet Transform (cited by: 1)
6
Authors: JIANG Gang-yi, YI Wen-juan, YU Mei, YANG Ming. Optoelectronics Letters [EI], 2007, No. 5, pp. 381-384 (4 pages)
This paper discusses autofocusing based on the contourlet transform and proposes an autofocusing method for images with much information in certain directions. The experimental results show that the proposed method can focus accurately and that its sensitivity ratio is higher than that of other autofocusing methods based on conventional image processing.
Keywords: image processing; autofocus; digital technology; signal transmission
Deep learning model improves radiologists' performance in detection and classification of breast lesions (cited by: 1)
7
Authors: Yingshi Sun, Yuhong Qu, Dong Wang, Yi Li, Lin Ye, Jingbo Du, Bing Xu, Baoqing Li, Xiaoting Li, Kexin Zhang, Yanjie Shi, Ruijia Sun, Yichuan Wang, Rong Long, Dengbo Chen, Haijiao Li, Liwei Wang, Min Cao. Chinese Journal of Cancer Research [SCIE, CAS, CSCD], 2021, No. 6, pp. 682-693 (12 pages)
Objective: Computer-aided diagnosis using deep learning algorithms has been initially applied in the field of mammography, but there is no large-scale clinical application. Methods: This study proposed to develop and verify an artificial intelligence model based on mammography. Firstly, mammograms retrospectively collected from six centers were randomized to a training dataset and a validation dataset for establishing the model. Secondly, the model was tested by comparing 12 radiologists' performance with and without it. Finally, prospectively enrolled women with mammograms from six centers were diagnosed by radiologists with the model. The detection and diagnostic capabilities were evaluated using the free-response receiver operating characteristic (FROC) curve and ROC curve. Results: The sensitivity of the model for detecting lesions after matching was 0.908 at a false positive rate of 0.25 in unilateral images. The area under the ROC curve (AUC) for distinguishing benign lesions from malignant lesions was 0.855 [95% confidence interval (95% CI): 0.830, 0.880]. The performance of 12 radiologists with the model was higher than that of radiologists alone (AUC: 0.852 vs. 0.805, P=0.005). The mean reading time with the model was shorter than that of reading alone (80.18 s vs. 62.28 s, P=0.032). In prospective application, the sensitivity of detection reached 0.887 at a false positive rate of 0.25; the AUC of radiologists with the model was 0.983 (95% CI: 0.978, 0.988), with sensitivity, specificity, positive predictive value (PPV), and negative predictive value (NPV) of 94.36%, 98.07%, 87.76%, and 99.09%, respectively. Conclusions: The artificial intelligence model exhibits high accuracy for detecting and diagnosing breast lesions, improves diagnostic accuracy and saves time.
Keywords: Breast cancer; Mammography; deep learning; artificial intelligence
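For readers unfamiliar with the four diagnostic measures quoted in the abstract above, the following minimal sketch shows how sensitivity, specificity, PPV, and NPV are conventionally computed from confusion-matrix counts. The function name `diagnostic_metrics` and the counts in the usage line are hypothetical illustrations; they are not data from the study.

```python
def diagnostic_metrics(tp, fp, tn, fn):
    """Standard diagnostic measures from confusion-matrix counts.

    tp/fp/tn/fn are true positives, false positives, true negatives and
    false negatives. All four denominators are assumed to be non-zero.
    """
    return {
        "sensitivity": tp / (tp + fn),  # fraction of diseased cases detected
        "specificity": tn / (tn + fp),  # fraction of healthy cases cleared
        "ppv": tp / (tp + fp),          # positive calls that are correct
        "npv": tn / (tn + fn),          # negative calls that are correct
    }


# Hypothetical counts, chosen only to illustrate the arithmetic.
example = diagnostic_metrics(tp=90, fp=20, tn=80, fn=10)
```

PPV and NPV depend on how common the disease is in the tested population, whereas sensitivity and specificity do not, which is why all four are usually reported together.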
Sequential Bag-of-Words model for human action classification (cited by: 1)
8
Authors: Hong Liu, Hao Tang, Wei Xiao, ZiYi Guo, Lu Tian, Yuan Gao. CAAI Transactions on Intelligence Technology, 2016, No. 2, pp. 125-136 (12 pages)
Recently, approaches utilizing spatial-temporal features to form Bag-of-Words (BoWs) models have achieved great success due to their simplicity and effectiveness. But they still have difficulties when distinguishing between actions with high inter-ambiguity. The main reason is that they describe actions by an orderless bag of features and ignore the spatial and temporal structure information of visual words. In order to improve classification performance, we present a novel approach called sequential Bag-of-Words. It captures temporal sequential structure by segmenting the entire action into sub-actions. Meanwhile, we pay more attention to the distinguishing parts of an action by classifying sub-actions separately; the sub-action results are then used to vote for the final result. Extensive experiments are conducted on challenging datasets and real scenes to evaluate our method. Concretely, we compare our results to some state-of-the-art classification approaches and confirm the advantages of our approach in distinguishing similar actions. Results show that our approach is robust and outperforms most existing BoWs-based classification approaches, especially on complex datasets with interactive activities, cluttered backgrounds and inter-class action ambiguities.
Keywords: Action classification; Sequential Bag-of-Words; STIP; Probability
Audio-visual keyword transformer for unconstrained sentence-level keyword spotting
9
Authors: Yidi Li, Jiale Ren, Yawei Wang, Guoquan Wang, Xia Li, Hong Liu. CAAI Transactions on Intelligence Technology [SCIE, EI], 2024, No. 1, pp. 142-152 (11 pages)
As one of the most effective methods to improve the accuracy and robustness of speech tasks, the audio-visual fusion approach has recently been introduced into the field of Keyword Spotting (KWS). However, existing audio-visual keyword spotting models are limited to detecting isolated words, while keyword spotting for unconstrained speech is still a challenging problem. To this end, an Audio-Visual Keyword Transformer (AVKT) network is proposed to spot keywords in unconstrained video clips. The authors present a transformer classifier with learnable CLS tokens to extract distinctive keyword features from the variable-length audio and visual inputs. The outputs of the audio and visual branches are combined in a decision fusion module. Just as humans can easily notice whether a keyword appears in a sentence, the AVKT network can detect whether a video clip with a spoken sentence contains a pre-specified keyword. Moreover, the position of the keyword is localised in the attention map without additional position labels. Experimental results on the LRS2-KWS dataset and our newly collected PKU-KWS dataset show that the accuracy of AVKT exceeded 99% in clean scenes and 85% in extremely noisy conditions. The code is available at https://github.com/jialeren/AVKT.
Keywords: artificial intelligence; multimodal approaches; natural language processing; neural network; speech processing
Research Advance in Swarm Robotics (cited by: 12)
10
Authors: TAN Ying, ZHENG Zhongyang. Defence Technology(防务技术) [SCIE, EI, CAS], 2013, No. 1, pp. 31-63 (33 pages)
The research progress of swarm robotics is reviewed in detail. Swarm robotics, inspired by nature, is a combination of swarm intelligence and robotics, and shows great potential in several aspects. First, the cooperation of natural swarms and swarm intelligence is briefly introduced, and the special features of swarm robotics are summarized in comparison with a single robot and other multi-individual systems. Then the modeling methods for swarm robotics are described, together with a list of several widely used swarm robotics entity projects and simulation platforms. Finally, as the main part of this paper, current research on swarm robotic algorithms is presented in detail, including cooperative control mechanisms in swarm robotics for flocking, navigating and searching applications.
Keywords: artificial intelligence; swarm robotics; cooperative control; modeling; simulation; swarm intelligence
Zero-shot Fine-grained Classification by Deep Feature Learning with Semantics (cited by: 8)
11
Authors: Ao-Xue Li, Ke-Xin Zhang, Li-Wei Wang. International Journal of Automation and Computing [EI, CSCD], 2019, No. 5, pp. 563-574 (12 pages)
Fine-grained image classification, which aims to distinguish images with subtle distinctions, is a challenging task for two main reasons: lack of sufficient training data for every class and difficulty in learning discriminative features for representation. In this paper, to address the two issues, we propose a two-phase framework for recognizing images from unseen fine-grained classes, i.e., zero-shot fine-grained classification. In the first phase, feature learning, we fine-tune deep convolutional neural networks using the hierarchical semantic structure among fine-grained classes to extract discriminative deep visual features. Meanwhile, a domain adaptation structure is introduced into the deep convolutional neural networks to avoid domain shift from training data to test data. In the second phase, label inference, a semantic directed graph is constructed over attributes of fine-grained classes. Based on this graph, we develop a label propagation algorithm to infer the labels of images in the unseen classes. Experimental results on two benchmark datasets demonstrate that our model outperforms the state-of-the-art zero-shot learning models. In addition, the features obtained by our feature learning model also yield significant gains when they are used by other zero-shot learning models, which shows the flexibility of our model in zero-shot fine-grained classification.
Keywords: fine-grained image classification; zero-shot learning; deep feature learning; domain adaptation; semantic graph
TDD-net: a tiny defect detection network for printed circuit boards (cited by: 118)
12
Authors: Runwei Ding, Linhui Dai, Guangpeng Li, Hong Liu. CAAI Transactions on Intelligence Technology, 2019, No. 2, pp. 110-116 (7 pages)
Tiny defect detection (TDD), which aims to perform quality control of printed circuit boards (PCBs), is a basic and essential task in the production of most electronic products. Though significant progress has been made in PCB defect detection, traditional methods still struggle to cope with complex and diverse PCBs. To deal with these problems, this article proposes a tiny defect detection network (TDD-Net) to improve performance for PCB defect detection. In this method, the inherent multi-scale and pyramidal hierarchies of deep convolutional networks are exploited to construct feature pyramids. Compared with existing approaches, the TDD-Net has three novel changes. First, reasonable anchors are designed by using k-means clustering. Second, TDD-Net strengthens the relationship of feature maps from different levels and benefits from low-level structural information, which is suitable for tiny defect detection. Finally, considering the small and imbalanced dataset, online hard example mining is adopted in the whole training phase in order to improve the quality of region-of-interest (ROI) proposals and make more effective use of data information. Quantitative results on the PCB defect dataset show that the proposed method has better portability and can achieve 98.90% mAP, which outperforms the state of the art. The code will be publicly available.
Keywords: TDD; PCBs; TDD-Net
Head-related transfer function–reserved time-frequency masking for robust binaural sound source localization (cited by: 2)
13
Authors: Hong Liu, Peipei Yuan, Bing Yang, Ge Yang, Yang Chen. CAAI Transactions on Intelligence Technology [SCIE, EI], 2022, No. 1, pp. 26-33 (8 pages)
Various time-frequency (T-F) masks are being applied to sound source localization tasks. Moreover, deep learning has dramatically advanced T-F mask estimation. However, existing masks are usually designed for speech separation tasks and are suitable only for single-channel signals. A novel complex-valued T-F mask is proposed that reserves the head-related transfer function (HRTF), customized for binaural sound source localization. In addition, because the convolutional neural network that is exploited to estimate the proposed mask takes binaural spectral information as the input and output, accurate binaural cues can be preserved. Compared with conventional T-F masks that emphasize single speech source–dominated T-F units, HRTF-reserved masks eliminate the speech component while keeping the direct propagation path. Thus, the estimated HRTF is capable of extracting more reliable localization features for the final direction of arrival estimation. Hence, binaural sound source localization guided by the proposed T-F mask is robust under noisy and reverberant acoustic environments. The experimental results demonstrate that the new T-F mask is superior to conventional T-F masks and leads to better sound source localization performance in adverse environments.
Keywords: estimation. SOUND FUNCTION
Enhancing direct-path relative transfer function using deep neural network for robust sound source localization (cited by: 2)
14
Authors: Bing Yang, Runwei Ding, Yutong Ban, Xiaofei Li, Hong Liu. CAAI Transactions on Intelligence Technology [SCIE, EI], 2022, No. 3, pp. 446-454 (9 pages)
This article proposes a deep neural network (DNN)-based direct-path relative transfer function (DP-RTF) enhancement method for robust direction of arrival (DOA) estimation in noisy and reverberant environments. The DP-RTF refers to the ratio between the direct-path acoustic transfer functions of the two microphone channels. First, the complex-valued DP-RTF is decomposed into the inter-channel intensity difference and sinusoidal functions of the inter-channel phase difference in the time-frequency domain. Then, the decomposed DP-RTF features from a series of temporal context frames are utilized to train a DNN model, which maps the DP-RTF features contaminated by noise and reverberation to the clean ones, and meanwhile provides a time-frequency (TF) weight to indicate the reliability of the mapping. The DP-RTF enhancement network can help to enhance the DP-RTF against noise and reverberation. Finally, the DOA of a sound source can be estimated by integrating the weighted matching between the enhanced DP-RTF features and the DP-RTF templates. Experimental results on simulated data show the superiority of the proposed DP-RTF enhancement network for estimating the DOA of the sound source in environments with various levels of noise and reverberation.
Keywords: network SOUND TRANSFER
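The decomposition step in the abstract above can be illustrated with a minimal sketch. It assumes that the intensity difference is expressed as the magnitude of the complex ratio in decibels and that the "sinusoidal functions" are the sine and cosine of the phase; the abstract does not give the exact definitions, so both choices, and the function name `decompose_dp_rtf`, are assumptions rather than the authors' implementation.

```python
import cmath
import math


def decompose_dp_rtf(rtf):
    """Split one complex DP-RTF value (one time-frequency bin) into
    real-valued features.

    Assumed feature definitions (not taken from the paper):
      - level difference in dB: 20 * log10(|rtf|)
      - sine and cosine of the inter-channel phase difference
    The input is assumed to be non-zero.
    """
    level_db = 20.0 * math.log10(abs(rtf))
    phase = cmath.phase(rtf)  # phase difference in radians, in (-pi, pi]
    return level_db, math.sin(phase), math.cos(phase)


# Hypothetical bin: channel 2 has twice the amplitude and leads by 90 degrees.
features = decompose_dp_rtf(2j)
```

Encoding the phase as a sine/cosine pair avoids the jump at ±π that a raw angle would have, which makes the features smoother targets for a regression network.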
Binaural sound source localization based on weighted template matching (cited by: 2)
15
Authors: Hong Liu, Yongheng Sun, Ge Yang, Yang Chen. CAAI Transactions on Intelligence Technology [EI], 2021, No. 2, pp. 214-223 (10 pages)
In robot binaural sound source localization (SSL), locating the direction of the sound source accurately in the shortest time is important. This concerns the algorithm complexity, but even more the shortest duration of the required signal. A novel binaural SSL method based on feature and frequency weighting is proposed. More specifically, in the training stage, the direction-related interaural cross-correlation function (CCF) and interaural intensity difference (IID) in each frequency band are calculated under noiseless conditions and are used as the templates. In the testing stage, the cosine similarities between the CCF and IID of the test signal and the templates are first calculated for all features and frequency bands. Then, the direction likelihood is obtained by weighting the similarities. Finally, the direction with maximum likelihood is taken as the direction of the sound source. Experiments were carried out on CIPIC dataset subject 003 with different noises from the NOISEX-92 dataset and demonstrated that the method can accurately locate the sound source with a short signal duration.
Keywords: TEMPLATE SSL ROBOT
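The testing stage described in the abstract above (cosine similarity per feature and band, weighted sum, then arg-max over directions) can be sketched as follows. This is a toy illustration under stated assumptions: the data layout, the names `cosine` and `localize`, the feature vectors, and the weights are all hypothetical, and the paper's actual weighting scheme is not reproduced here.

```python
import math


def cosine(u, v):
    """Cosine similarity of two equal-length vectors (0.0 if either is zero)."""
    dot = sum(a * b for a, b in zip(u, v))
    nu = math.sqrt(sum(a * a for a in u))
    nv = math.sqrt(sum(b * b for b in v))
    return dot / (nu * nv) if nu > 0 and nv > 0 else 0.0


def localize(test, templates, weights):
    """Return (best_direction, scores).

    test:      {(feature, band): vector} for the observed signal
    templates: {direction: {(feature, band): vector}} from noiseless training
    weights:   {(feature, band): float}, assumed given (hypothetical values)
    """
    scores = {
        direction: sum(weights[k] * cosine(test[k], tmpl[k]) for k in test)
        for direction, tmpl in templates.items()
    }
    return max(scores, key=scores.get), scores


# Toy templates for two directions (degrees), one frequency band each.
templates = {
    0: {("ccf", 0): [0.1, 1.0, 0.1], ("iid", 0): [1.0, 1.0]},
    45: {("ccf", 0): [0.1, 0.3, 1.0], ("iid", 0): [2.0, 0.5]},
}
weights = {("ccf", 0): 0.6, ("iid", 0): 0.4}
test_signal = {("ccf", 0): [0.2, 0.4, 0.9], ("iid", 0): [1.8, 0.6]}

best, scores = localize(test_signal, templates, weights)
```

In this toy case the test features are a perturbed copy of the 45-degree template, so that direction receives the highest weighted score.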
Immune based computer virus detection approaches (cited by: 1)
16
Authors: TAN Ying, ZHANG Pengtao. 智能系统学报 [CSCD, PKU Core], 2013, No. 1, pp. 80-94 (15 pages)
The computer virus is considered one of the most serious threats to the security of computer systems worldwide. The rapid development of evasion techniques used in viruses has made signature-based computer virus detection techniques ineffective. Many novel computer virus detection approaches have been proposed to cope with this ineffectiveness; they fall mainly into three categories: static, dynamic and heuristic techniques. Owing to the natural similarities between the biological immune system (BIS) and the computer security system (CSS), the artificial immune system (AIS) was developed as a new prototype in the anti-virus research community. The immune mechanisms in the BIS provide opportunities to construct computer virus detection models that are robust and adaptive, with the ability to detect unseen viruses. In this paper, a variety of classic computer virus detection approaches are introduced and reviewed against the background of computer virus history. Next, a variety of immune based computer virus detection approaches are discussed in detail. Promising experimental results suggest that the immune based approaches are able to detect new variants and unseen viruses at lower false positive rates, which has paved a new way for anti-virus research.
Keywords: data mining; computer technology; state of development; artificial intelligence
A Distributed 2D-to-3D Video Conversion System (cited by: 1)
17
Authors: 张哲斌, 张吉安, 张学西, 王亦洲, 高文. China Communications [SCIE, CSCD], 2013, No. 5, pp. 30-38 (9 pages)
2D-to-3D video conversion is a feasible way to generate 3D programs for the current 3DTV industry. However, for large-scale 3D video production, current systems are no longer adequate in terms of the time and labor required for conversion. In this paper, we introduce a distributed 2D-to-3D video conversion system that includes a 2D-to-3D video conversion module, an architecture for parallel computation on the cloud, and 3D video coding. The system enables multiple users to cooperate and complete their conversion tasks simultaneously, so that conversion efficiency is greatly improved. In the experiments, we evaluate the system based on criteria related to both time consumption and video coding performance.
Keywords: 3D video; 2D-to-3D conversion; distributed system
Flow-based SLAM: From geometry computation to learning (cited by: 1)
18
Authors: Zike YAN, Hongbin ZHA. Virtual Reality & Intelligent Hardware, 2019, No. 5, pp. 435-460 (26 pages)
Simultaneous localization and mapping (SLAM) has attracted considerable research interest from the robotics and computer-vision communities for more than 30 years. With steady and progressive efforts being made, modern SLAM systems allow robust and online applications in real-world scenes. We examined the evolution of this powerful perception tool in detail and noticed that the insights concerning incremental computation and temporal guidance are persistently retained. Herein, we denote this temporal continuity as a flow basis and present for the first time a survey that specifically focuses on the flow-based nature, ranging from geometric computation to the emerging learning techniques. We start by reviewing two essential stages for geometric computation, presenting the de facto standard pipeline and problem formulation, along with the utilization of temporal cues. The recently emerging techniques are then summarized, covering a wide range of areas, such as learning techniques, sensor fusion, and continuous-time trajectory modeling. This survey aims to draw attention to how robust SLAM systems benefit from a continuously observing nature, as well as to topics worthy of further investigation for better utilizing temporal cues.
Keywords: Simultaneous localization and mapping; Visual odometry; Deep learning; Flow basis; Sensor fusion; Augmented reality
Scene-adaptive hierarchical data association and depth-invariant part-based appearance model for indoor multiple objects tracking (cited by: 1)
19
Authors: Hong Liu, Can Wang, Yuan Gao. CAAI Transactions on Intelligence Technology, 2016, No. 3, pp. 210-224 (15 pages)
Indoor multi-tracking is more challenging than outdoor tasks due to frequent occlusion, view-truncation, severe scale change and pose variation, which may bring considerable unreliability and ambiguity to target representation and data association. Discriminative and reliable target representation is therefore vital for accurate data association in multi-tracking. Previous works typically combine many features to increase the discriminative power, but this is prone to error accumulation and unnecessary computational cost, which may instead increase ambiguity. Moreover, the reliability of the same feature may vary considerably across scenes, especially for the now widespread network cameras installed in varied and complex indoor scenes, so previous fixed feature selection schemes cannot meet general requirements. To properly handle these problems, first, we propose a scene-adaptive hierarchical data association scheme, which adaptively selects features with higher reliability for target representation in the applied scene, and gradually combines features up to the minimum needed to discriminate ambiguous targets; second, a novel depth-invariant part-based appearance model using RGB-D data is proposed, which makes the appearance model robust to scale change, partial occlusion and view-truncation. The introduction of RGB-D data increases the diversity of features, which provides more types of features for feature selection in data association and enhances the final multi-tracking performance.
We validate our method from several aspects, including the scene-adaptive feature selection scheme, the hierarchical data association scheme and the RGB-D based appearance modeling scheme, in various indoor scenes, which demonstrates its effectiveness and efficiency in improving multi-tracking performance.
Keywords: Multiple objects tracking; Scene-adaptive; Data association; Appearance model; RGB-D data
Correction
20
Authors: Ao-Xue Li, Ke-Xin Zhang, Li-Wei Wang. International Journal of Automation and Computing [EI, CSCD], 2021, No. 6, p. 1045 (1 page)
Correction to: Zero-shot Fine-grained Classification by Deep Feature Learning with Semantics. DOI: 10.1007/s11633-019-1177-8. Authors: Ao-Xue Li, Ke-Xin Zhang, Li-Wei Wang. The article Zero-shot Fine-grained Classification by Deep Feature Learning with Semantics, written by Ao-Xue Li, Ke-Xin Zhang and Li-Wei Wang, was originally published in vol. 16, no. 5 of International Journal of Automation and Computing without Open Access. After publication, the authors decided to opt for Open Choice and to make the article an Open Access publication.
Keywords: OPEN DOI ACCESS MAKE