With the increasing of the elderly population and the growing hearth care cost, the role of service robots in aiding the disabled and the elderly is becoming important. Many researchers in the world have paid much att...With the increasing of the elderly population and the growing hearth care cost, the role of service robots in aiding the disabled and the elderly is becoming important. Many researchers in the world have paid much attention to heaRthcare robots and rehabilitation robots. To get natural and harmonious communication between the user and a service robot, the information perception/feedback ability, and interaction ability for service robots become more important in many key issues.展开更多
With the increasing importance of multimodal data in emotional expression on social media,mainstream methods for sentiment analysis have shifted from unimodal to multimodal approaches.However,the challenges of extract...With the increasing importance of multimodal data in emotional expression on social media,mainstream methods for sentiment analysis have shifted from unimodal to multimodal approaches.However,the challenges of extracting high-quality emotional features and achieving effective interaction between different modalities remain two major obstacles in multimodal sentiment analysis.To address these challenges,this paper proposes a Text-Gated Interaction Network with Inter-Sample Commonality Perception(TGICP).Specifically,we utilize a Inter-sample Commonality Perception(ICP)module to extract common features from similar samples within the same modality,and use these common features to enhance the original features of each modality,thereby obtaining a richer and more complete multimodal sentiment representation.Subsequently,in the cross-modal interaction stage,we design a Text-Gated Interaction(TGI)module,which is text-driven.By calculating the mutual information difference between the text modality and nonverbal modalities,the TGI module dynamically adjusts the influence of emotional information from the text modality on nonverbal modalities.This helps to reduce modality information asymmetry while enabling full cross-modal interaction.Experimental results show that the proposed model achieves outstanding performance on both the CMU-MOSI and CMU-MOSEI baseline multimodal sentiment analysis datasets,validating its effectiveness in emotion recognition tasks.展开更多
The interaction of arbitrarily distributed penny-shaped cracks in three-dimensional solids is analyzed in this paper. Using oblate spheroidal coordinates and displacement functions, an analytic method is devel- oped i...The interaction of arbitrarily distributed penny-shaped cracks in three-dimensional solids is analyzed in this paper. Using oblate spheroidal coordinates and displacement functions, an analytic method is devel- oped in which the opening and the sliding displacements on each crack surface are taken as the basic unknown functions. The basic unknown functions can be expanded in series of Legendre polynomials with unknown coefficients. Based on superposition technique, a set of governing equations for the unknown coefficients are formulated from the traction free conditions on each crack surface. The boundary collocation procedure and the average method for crack-surface tractions are used for solving the governing equations. The solution can be obtained for quite closely located cracks. Numerical examples are given for several crack problems. By comparing the present results with other existing results, one can conclude that the present method provides a direct and efficient approach to deal with three-dimensional solids containing multiple cracks.展开更多
We numerically investigate the formation and interaction of a parabolic-shaped pulse pair in a passively mode-locked Yb-doped fiber laser. Based on a lumped model, the parabolic-shaped pulse pair is obtained by contro...We numerically investigate the formation and interaction of a parabolic-shaped pulse pair in a passively mode-locked Yb-doped fiber laser. Based on a lumped model, the parabolic-shaped pulse pair is obtained by controlling the intercavity average dispersion and gain saturation energy, Moreover, pulse repulsive and attractive motion are also achieved with different pulse separations. Simulation results show that the phase shift plays an important role in pulse interaction, and the interaction is determined by the inter-cavity average dispersion and gain saturation energy, i.e., the strength of the interaction is proportional to the gain saturation energy, a stronger gain saturation energy will result in a higher interaction intensity. On the contrary, the increase of the inter-cavity dispersion will counterbalance some interaction force. The results also show that the interaction of a parabolic-shaped pulse pair has a larger interaction distance compared to conventional solitons.展开更多
Medical stents have made significant strides in development,however,creating a single manufacturing material that combines size adjustability,robust strength,and degradability remains a major challenge.Here,we develop...Medical stents have made significant strides in development,however,creating a single manufacturing material that combines size adjustability,robust strength,and degradability remains a major challenge.Here,we developed an elastomer designed for stent fabrication,featuring excellent thermo-responsive shape memory and fast self-healing.This elastomer is produced through supramolecular inter-actions between liquid crystal moieties,which exhibit strong orientation,and a polymer backbone.These supramolecular interactions provide the elastomer with remarkable mechanical strength(10.46 MPa).Interestingly,the elastomer shows excellent mesocrystalline stability and cyclability,thanks to multiple non-covalent bonds,allowing the crosslinked liquid crystalline phase to maintain integrity at temper-atures up to 285℃.Impressively,the elastomer can respond to stress and temperature changes,fully reverting to its original shape in just 25.7±0.94 s.When configured as a helical stent,its macroscopic dimensions can be adjusted to mimic the size of blood vessels in vitro.The stent exhibits rapid responsiveness at 37℃,achieving complete self-expansion within 10 s.Furthermore,it demonstrates excellent degradability,with a weight loss of only 2.75%±0.31%after 70 d.This innovation paves the way for new possibilities in the use of medical stents,particularly for the long-term treatment of coronary heart disease.展开更多
The response of tunnels subjected to seismic loading is a complex mechanism and depends not only on the seismic nature but also on tunnel structure and surrounding soil properties.The individual behavior of circular,r...The response of tunnels subjected to seismic loading is a complex mechanism and depends not only on the seismic nature but also on tunnel structure and surrounding soil properties.The individual behavior of circular,rectangular,and sub-rectangular tunnels subjected to seismic loadings has already been studied in the literature.In the present research,two case scenarios of circular,rectangular tunnels and four sub-rectangular shaped tunnels,with similar cross-section areas,were adopted to perform a comprehensive numerical investigation.The purpose of the study was to determine the mechanical behavior of tunnels of different shapes,depending upon seismic conditions.Analyses were performed by considering the influence of soil-lining interaction,soil parameters,and lining thickness,as well as lining rigidity.Computations were performed for no-slip and full-slip conditions.The results indicate that the tunnel shape design is of great importance when regarding the mechanical behavior of the surrounding soil.This concerns no-slip as well as full-slip soil-lining interaction,especially when the lining is subjected to seismic loading.Moreover,it is shown that changes in incremental bending moments for circular,rectangular and sub-rectangular tunnels that depend upon the soil-lining interaction conditions differ significantly.展开更多
The classic multi-mode input shapers(MMISs)are valid to decrease multi-mode residual vibration of manipulators or robots simultaneously.But these input shapers cannot suppress more residual vibration with a quick resp...The classic multi-mode input shapers(MMISs)are valid to decrease multi-mode residual vibration of manipulators or robots simultaneously.But these input shapers cannot suppress more residual vibration with a quick response time when the frequency bandwidth of each mode vibration is very different.The methodologies and various types of multi-mode classic and hybrid input shaping control schemes with positive impulses were introduced in this paper.Six types of two-mode hybrid input shapers with positive impulses of a 3 degree of freedom robot were established.The ability and robustness of these two-mode hybrid input shapers to suppress residual vibration were analyzed by vibration response curve and sensitivity curve via numerical simulation.The response time of the zero vibration-zero vibration and derivative(ZV-ZVD)input shaper is the fastest,but the robustness is the least.The robustness of the zero vibration and derivative-extra insensitive(ZVD-EI)input shaper is the best,while the response time is the longest.According to the frequency bandwidth at each mode and required system response time,the most appropriate multi-mode hybrid input shaper(MMHIS)can be selected in order to improve response time as much as possible under the condition of suppressing more residual vibration.展开更多
In geometry processing,symmetry research benefits from global geo-metric features of complete shapes,but the shape of an object captured in real-world applications is often incomplete due to the limited sensor resoluti...In geometry processing,symmetry research benefits from global geo-metric features of complete shapes,but the shape of an object captured in real-world applications is often incomplete due to the limited sensor resolution,single viewpoint,and occlusion.Different from the existing works predicting symmetry from the complete shape,we propose a learning approach for symmetry predic-tion based on a single RGB-D image.Instead of directly predicting the symmetry from incomplete shapes,our method consists of two modules,i.e.,the multi-mod-al feature fusion module and the detection-by-reconstruction module.Firstly,we build a channel-transformer network(CTN)to extract cross-fusion features from the RGB-D as the multi-modal feature fusion module,which helps us aggregate features from the color and the depth separately.Then,our self-reconstruction net-work based on a 3D variational auto-encoder(3D-VAE)takes the global geo-metric features as input,followed by a prediction symmetry network to detect the symmetry.Our experiments are conducted on three public datasets:ShapeNet,YCB,and ScanNet,we demonstrate that our method can produce reliable and accurate results.展开更多
A pneumatic launcher is theoretically investigated to study its natural transverse vibration in water. Considering the mass effect of the sealing cover, the launcher is simplified as a uniform cantilever beam with a t...A pneumatic launcher is theoretically investigated to study its natural transverse vibration in water. Considering the mass effect of the sealing cover, the launcher is simplified as a uniform cantilever beam with a top point mass. By introducing the boundary and continuity conditions into the motion equation, the natural frequency equation and the mode shape function are derived. An iterative calculation method for added mass is also presented using the velocity potential function to account for the mass effect of the fluid on the launcher. The first 2 order natural frequencies and mode shapes are discussed in external flow fields and both external and internal flow fields. The results show good agreement with both natural frequencies and mode shapes between the theoretical analysis and the FEM studies. Also, the added mass is found to decrease with the increase of the mode shape orders of the launcher. And because of the larger added mass in both the external and internal flow fields than that in only the external flow field, the corresponding natural frequencies of the former are relatively smaller.展开更多
At present,most experimental teaching systems lack guidance of an operator,and thus users often do not know what to do during an experiment.The user load is therefore increased,and the learning efficiency of the stude...At present,most experimental teaching systems lack guidance of an operator,and thus users often do not know what to do during an experiment.The user load is therefore increased,and the learning efficiency of the students is decreased.To solve the problem of insufficient system interactivity and guidance,an experimental navigation system based on multi-mode fusion is proposed in this paper.The system first obtains user information by sensing the hardware devices,intelligently perceives the user intention and progress of the experiment according to the information acquired,and finally carries out a multi-modal intelligent navigation process for users.As an innovative aspect of this study,an intelligent multi-mode navigation system is used to guide users in conducting experiments,thereby reducing the user load and enabling the users to effectively complete their experiments.The results prove that this system can guide users in completing their experiments,and can effectively reduce the user load during the interaction process and improve the efficiency.展开更多
Indoor scene semantic segmentation is essential for enabling robots to understand and interact with their environments effectively.However,numerous challenges remain unresolved,particularly in single-robot systems,whi...Indoor scene semantic segmentation is essential for enabling robots to understand and interact with their environments effectively.However,numerous challenges remain unresolved,particularly in single-robot systems,which often struggle with the complexity and variability of indoor scenes.To address these limitations,we introduce a novel multi-robot collaborative framework based on multiplex interactive learning(MPIL)in which each robot specialises in a distinct visual task within a unified multitask architecture.During training,the framework employs task-specific decoders and cross-task feature sharing to enhance collaborative optimisation.At inference time,robots operate independently with optimised models,enabling scalable,asynchronous and efficient deployment in real-world scenarios.Specifically,MPIL employs specially designed modules that integrate RGB and depth data,refine feature representations and facilitate the simultaneous execution of multiple tasks,such as instance segmentation,scene classification and semantic segmentation.By leveraging these modules,distinct agents within multi-robot systems can effectively handle specialised tasks,thereby enhancing the overall system's flexibility and adaptability.This collaborative effort maximises the strengths of each robot,resulting in a more comprehensive understanding of environments.Extensive experiments on two public benchmark datasets demonstrate MPIL's competitive performance compared to state-of-the-art approaches,highlighting the effectiveness and robustness of our multi-robot system in complex indoor environments.展开更多
Human life is invaluable,and timely efforts are crucial to rescue individuals trapped under debris following an earthquake.To identify opportunities for improving post-earthquake search and rescue(SAR)robotics,we get ...Human life is invaluable,and timely efforts are crucial to rescue individuals trapped under debris following an earthquake.To identify opportunities for improving post-earthquake search and rescue(SAR)robotics,we get insights through four different sources:(i)A literature review of disaster robotics and victim psychology,(ii)A public survey on earthquake awareness and their view of rescue robots,(iii)Semi-structured interviews with first responders,and(iv)Responses from GenAI chatbots which were prompted to assume the role of expert rescuers.The triangulated analysis show that there are challenges in mobility,autonomy,communication,situational awareness,and human-robot collaboration.The public respondents showed high acceptance of robot-assisted rescue and prioritised survivor detection,sensing,and communication as essential functionalities of robots.The experts expressed limitations in current equipment,the need for improved victim localisation,and interest in XR-based training and robot-assisted debris handling.The GenAI chatbots highlighted structural risk assessment,multi-sensor fusion,and supervised autonomy.Therefore,this study identifies critical robot features,outlines multi-modal interaction requirements,and highlights gaps in current SAR practice.These findings offer robot design directions for developing effective,trustworthy SAR robots,which can be integrated to future response disaster-workflows.展开更多
Hand gesture recognition is a popular topic in computer vision and makes human-computer interaction more flexible and convenient.The representation of hand gestures is critical for recognition.In this paper,we propose...Hand gesture recognition is a popular topic in computer vision and makes human-computer interaction more flexible and convenient.The representation of hand gestures is critical for recognition.In this paper,we propose a new method to measure the similarity between hand gestures and exploit it for hand gesture recognition.The depth maps of hand gestures captured via the Kinect sensors are used in our method,where the 3D hand shapes can be segmented from the cluttered backgrounds.To extract the pattern of salient 3D shape features,we propose a new descriptor-3D Shape Context,for 3D hand gesture representation.The 3D Shape Context information of each 3D point is obtained in multiple scales because both local shape context and global shape distribution are necessary for recognition.The description of all the 3D points constructs the hand gesture representation,and hand gesture recognition is explored via dynamic time warping algorithm.Extensive experiments are conducted on multiple benchmark datasets.The experimental results verify that the proposed method is robust to noise,articulated variations,and rigid transformations.Our method outperforms state-of-the-art methods in the comparisons of accuracy and efficiency.展开更多
The purpose of this research is to improve the robustness of the autonomous system in order to improve the position and velocity estimation of an Unmanned Aerial Vehicle(UAV).Therefore, new integrated SINS/GPS navigat...The purpose of this research is to improve the robustness of the autonomous system in order to improve the position and velocity estimation of an Unmanned Aerial Vehicle(UAV).Therefore, new integrated SINS/GPS navigation scheme based on Interacting Multiple Nonlinear Fuzzy Adaptive H_∞ Models(IMM-NFAH_∞) filtering technique for UAV is presented. The proposed IMM-NFAH_∞ strategy switches between two different Nonlinear Fuzzy Adaptive H_∞(NFAH_∞) filters and each NFAH_∞ filter is based on different fuzzy logic inference systems. The newly proposed technique takes into consideration the high order Taylor series terms and adapts the nonlinear H_∞ filter based on different fuzzy inference systems via adaptive filter bounds(di),along with disturbance attenuation parameter c. Simulation analysis validates the performance of the proposed algorithm, and the comparison with nonlinear H_∞(NH_∞) filter and that with different NFAH_∞ filters demonstrate the effectiveness of UAV localization utilizing IMM-NFAH_∞ filter.展开更多
文摘With the increasing of the elderly population and the growing hearth care cost, the role of service robots in aiding the disabled and the elderly is becoming important. Many researchers in the world have paid much attention to heaRthcare robots and rehabilitation robots. To get natural and harmonious communication between the user and a service robot, the information perception/feedback ability, and interaction ability for service robots become more important in many key issues.
基金supported by the Natural Science Foundation of Henan under Grant 242300421220the Henan Provincial Science and Technology Research Project under Grants 252102211047 and 252102211062+3 种基金the Jiangsu Provincial Scheme Double Initiative Plan JSS-CBS20230474the XJTLU RDF-21-02-008the Science and Technology Innovation Project of Zhengzhou University of Light Industry under Grant 23XNKJTD0205the Higher Education Teaching Reform Research and Practice Project of Henan Province under Grant 2024SJGLX0126.
文摘With the increasing importance of multimodal data in emotional expression on social media,mainstream methods for sentiment analysis have shifted from unimodal to multimodal approaches.However,the challenges of extracting high-quality emotional features and achieving effective interaction between different modalities remain two major obstacles in multimodal sentiment analysis.To address these challenges,this paper proposes a Text-Gated Interaction Network with Inter-Sample Commonality Perception(TGICP).Specifically,we utilize a Inter-sample Commonality Perception(ICP)module to extract common features from similar samples within the same modality,and use these common features to enhance the original features of each modality,thereby obtaining a richer and more complete multimodal sentiment representation.Subsequently,in the cross-modal interaction stage,we design a Text-Gated Interaction(TGI)module,which is text-driven.By calculating the mutual information difference between the text modality and nonverbal modalities,the TGI module dynamically adjusts the influence of emotional information from the text modality on nonverbal modalities.This helps to reduce modality information asymmetry while enabling full cross-modal interaction.Experimental results show that the proposed model achieves outstanding performance on both the CMU-MOSI and CMU-MOSEI baseline multimodal sentiment analysis datasets,validating its effectiveness in emotion recognition tasks.
文摘The interaction of arbitrarily distributed penny-shaped cracks in three-dimensional solids is analyzed in this paper. Using oblate spheroidal coordinates and displacement functions, an analytic method is devel- oped in which the opening and the sliding displacements on each crack surface are taken as the basic unknown functions. The basic unknown functions can be expanded in series of Legendre polynomials with unknown coefficients. Based on superposition technique, a set of governing equations for the unknown coefficients are formulated from the traction free conditions on each crack surface. The boundary collocation procedure and the average method for crack-surface tractions are used for solving the governing equations. The solution can be obtained for quite closely located cracks. Numerical examples are given for several crack problems. By comparing the present results with other existing results, one can conclude that the present method provides a direct and efficient approach to deal with three-dimensional solids containing multiple cracks.
基金Project supported by the National Natural Science Foundation of China (Grant No. 60372061)the Scientific Forefront and Interdisciplinary Innovation Project of Jilin University, China (Grant No. 200903296)
文摘We numerically investigate the formation and interaction of a parabolic-shaped pulse pair in a passively mode-locked Yb-doped fiber laser. Based on a lumped model, the parabolic-shaped pulse pair is obtained by controlling the intercavity average dispersion and gain saturation energy, Moreover, pulse repulsive and attractive motion are also achieved with different pulse separations. Simulation results show that the phase shift plays an important role in pulse interaction, and the interaction is determined by the inter-cavity average dispersion and gain saturation energy, i.e., the strength of the interaction is proportional to the gain saturation energy, a stronger gain saturation energy will result in a higher interaction intensity. On the contrary, the increase of the inter-cavity dispersion will counterbalance some interaction force. The results also show that the interaction of a parabolic-shaped pulse pair has a larger interaction distance compared to conventional solitons.
基金funded by the Natural Science Foundation of Liaoning Province(2024JH2/102600340&2024-BSLH-310)the support program for excellent young scholars at China Medical University.
文摘Medical stents have made significant strides in development,however,creating a single manufacturing material that combines size adjustability,robust strength,and degradability remains a major challenge.Here,we developed an elastomer designed for stent fabrication,featuring excellent thermo-responsive shape memory and fast self-healing.This elastomer is produced through supramolecular inter-actions between liquid crystal moieties,which exhibit strong orientation,and a polymer backbone.These supramolecular interactions provide the elastomer with remarkable mechanical strength(10.46 MPa).Interestingly,the elastomer shows excellent mesocrystalline stability and cyclability,thanks to multiple non-covalent bonds,allowing the crosslinked liquid crystalline phase to maintain integrity at temper-atures up to 285℃.Impressively,the elastomer can respond to stress and temperature changes,fully reverting to its original shape in just 25.7±0.94 s.When configured as a helical stent,its macroscopic dimensions can be adjusted to mimic the size of blood vessels in vitro.The stent exhibits rapid responsiveness at 37℃,achieving complete self-expansion within 10 s.Furthermore,it demonstrates excellent degradability,with a weight loss of only 2.75%±0.31%after 70 d.This innovation paves the way for new possibilities in the use of medical stents,particularly for the long-term treatment of coronary heart disease.
基金supported by Vietnam Ministry of Education and Training under Grant No. B2022-MDA-06
文摘The response of tunnels subjected to seismic loading is a complex mechanism and depends not only on the seismic nature but also on tunnel structure and surrounding soil properties.The individual behavior of circular,rectangular,and sub-rectangular tunnels subjected to seismic loadings has already been studied in the literature.In the present research,two case scenarios of circular,rectangular tunnels and four sub-rectangular shaped tunnels,with similar cross-section areas,were adopted to perform a comprehensive numerical investigation.The purpose of the study was to determine the mechanical behavior of tunnels of different shapes,depending upon seismic conditions.Analyses were performed by considering the influence of soil-lining interaction,soil parameters,and lining thickness,as well as lining rigidity.Computations were performed for no-slip and full-slip conditions.The results indicate that the tunnel shape design is of great importance when regarding the mechanical behavior of the surrounding soil.This concerns no-slip as well as full-slip soil-lining interaction,especially when the lining is subjected to seismic loading.Moreover,it is shown that changes in incremental bending moments for circular,rectangular and sub-rectangular tunnels that depend upon the soil-lining interaction conditions differ significantly.
基金Project(LQ12E05008)supported by Natural Science Foundation of Zhejiang Province,ChinaProject(201708330107)supported by China Scholarship Council
文摘The classic multi-mode input shapers(MMISs)are valid to decrease multi-mode residual vibration of manipulators or robots simultaneously.But these input shapers cannot suppress more residual vibration with a quick response time when the frequency bandwidth of each mode vibration is very different.The methodologies and various types of multi-mode classic and hybrid input shaping control schemes with positive impulses were introduced in this paper.Six types of two-mode hybrid input shapers with positive impulses of a 3 degree of freedom robot were established.The ability and robustness of these two-mode hybrid input shapers to suppress residual vibration were analyzed by vibration response curve and sensitivity curve via numerical simulation.The response time of the zero vibration-zero vibration and derivative(ZV-ZVD)input shaper is the fastest,but the robustness is the least.The robustness of the zero vibration and derivative-extra insensitive(ZVD-EI)input shaper is the best,while the response time is the longest.According to the frequency bandwidth at each mode and required system response time,the most appropriate multi-mode hybrid input shaper(MMHIS)can be selected in order to improve response time as much as possible under the condition of suppressing more residual vibration.
文摘In geometry processing,symmetry research benefits from global geo-metric features of complete shapes,but the shape of an object captured in real-world applications is often incomplete due to the limited sensor resolution,single viewpoint,and occlusion.Different from the existing works predicting symmetry from the complete shape,we propose a learning approach for symmetry predic-tion based on a single RGB-D image.Instead of directly predicting the symmetry from incomplete shapes,our method consists of two modules,i.e.,the multi-mod-al feature fusion module and the detection-by-reconstruction module.Firstly,we build a channel-transformer network(CTN)to extract cross-fusion features from the RGB-D as the multi-modal feature fusion module,which helps us aggregate features from the color and the depth separately.Then,our self-reconstruction net-work based on a 3D variational auto-encoder(3D-VAE)takes the global geo-metric features as input,followed by a prediction symmetry network to detect the symmetry.Our experiments are conducted on three public datasets:ShapeNet,YCB,and ScanNet,we demonstrate that our method can produce reliable and accurate results.
基金Foundation item: Supported by the National Natural Science Foundation of China (51379083) and the Specialized Research Fund for the Doctoral Program of Hiher Education (20120142110051).
文摘A pneumatic launcher is theoretically investigated to study its natural transverse vibration in water. Considering the mass effect of the sealing cover, the launcher is simplified as a uniform cantilever beam with a top point mass. By introducing the boundary and continuity conditions into the motion equation, the natural frequency equation and the mode shape function are derived. An iterative calculation method for added mass is also presented using the velocity potential function to account for the mass effect of the fluid on the launcher. The first 2 order natural frequencies and mode shapes are discussed in external flow fields and both external and internal flow fields. The results show good agreement with both natural frequencies and mode shapes between the theoretical analysis and the FEM studies. Also, the added mass is found to decrease with the increase of the mode shape orders of the launcher. And because of the larger added mass in both the external and internal flow fields than that in only the external flow field, the corresponding natural frequencies of the former are relatively smaller.
基金the the National Key R&D Program of China(No.2018YFB1004901)the Independent Innovation Team Project of Jinan City(No.2019GXRC013).
文摘At present,most experimental teaching systems lack guidance of an operator,and thus users often do not know what to do during an experiment.The user load is therefore increased,and the learning efficiency of the students is decreased.To solve the problem of insufficient system interactivity and guidance,an experimental navigation system based on multi-mode fusion is proposed in this paper.The system first obtains user information by sensing the hardware devices,intelligently perceives the user intention and progress of the experiment according to the information acquired,and finally carries out a multi-modal intelligent navigation process for users.As an innovative aspect of this study,an intelligent multi-mode navigation system is used to guide users in conducting experiments,thereby reducing the user load and enabling the users to effectively complete their experiments.The results prove that this system can guide users in completing their experiments,and can effectively reduce the user load during the interaction process and improve the efficiency.
基金supported by the National Natural Science Foundation of China under Grant 62373009.
文摘Indoor scene semantic segmentation is essential for enabling robots to understand and interact with their environments effectively.However,numerous challenges remain unresolved,particularly in single-robot systems,which often struggle with the complexity and variability of indoor scenes.To address these limitations,we introduce a novel multi-robot collaborative framework based on multiplex interactive learning(MPIL)in which each robot specialises in a distinct visual task within a unified multitask architecture.During training,the framework employs task-specific decoders and cross-task feature sharing to enhance collaborative optimisation.At inference time,robots operate independently with optimised models,enabling scalable,asynchronous and efficient deployment in real-world scenarios.Specifically,MPIL employs specially designed modules that integrate RGB and depth data,refine feature representations and facilitate the simultaneous execution of multiple tasks,such as instance segmentation,scene classification and semantic segmentation.By leveraging these modules,distinct agents within multi-robot systems can effectively handle specialised tasks,thereby enhancing the overall system's flexibility and adaptability.This collaborative effort maximises the strengths of each robot,resulting in a more comprehensive understanding of environments.Extensive experiments on two public benchmark datasets demonstrate MPIL's competitive performance compared to state-of-the-art approaches,highlighting the effectiveness and robustness of our multi-robot system in complex indoor environments.
文摘Human life is invaluable,and timely efforts are crucial to rescue individuals trapped under debris following an earthquake.To identify opportunities for improving post-earthquake search and rescue(SAR)robotics,we get insights through four different sources:(i)A literature review of disaster robotics and victim psychology,(ii)A public survey on earthquake awareness and their view of rescue robots,(iii)Semi-structured interviews with first responders,and(iv)Responses from GenAI chatbots which were prompted to assume the role of expert rescuers.The triangulated analysis show that there are challenges in mobility,autonomy,communication,situational awareness,and human-robot collaboration.The public respondents showed high acceptance of robot-assisted rescue and prioritised survivor detection,sensing,and communication as essential functionalities of robots.The experts expressed limitations in current equipment,the need for improved victim localisation,and interest in XR-based training and robot-assisted debris handling.The GenAI chatbots highlighted structural risk assessment,multi-sensor fusion,and supervised autonomy.Therefore,this study identifies critical robot features,outlines multi-modal interaction requirements,and highlights gaps in current SAR practice.These findings offer robot design directions for developing effective,trustworthy SAR robots,which can be integrated to future response disaster-workflows.
基金supported by the National Natural Science Foundation of China(61773272,61976191)the Six Talent Peaks Project of Jiangsu Province,China(XYDXX-053)Suzhou Research Project of Technical Innovation,Jiangsu,China(SYG201711)。
文摘Hand gesture recognition is a popular topic in computer vision and makes human-computer interaction more flexible and convenient.The representation of hand gestures is critical for recognition.In this paper,we propose a new method to measure the similarity between hand gestures and exploit it for hand gesture recognition.The depth maps of hand gestures captured via the Kinect sensors are used in our method,where the 3D hand shapes can be segmented from the cluttered backgrounds.To extract the pattern of salient 3D shape features,we propose a new descriptor-3D Shape Context,for 3D hand gesture representation.The 3D Shape Context information of each 3D point is obtained in multiple scales because both local shape context and global shape distribution are necessary for recognition.The description of all the 3D points constructs the hand gesture representation,and hand gesture recognition is explored via dynamic time warping algorithm.Extensive experiments are conducted on multiple benchmark datasets.The experimental results verify that the proposed method is robust to noise,articulated variations,and rigid transformations.Our method outperforms state-of-the-art methods in the comparisons of accuracy and efficiency.
基金supported by a grant from the National Natural Science Foundation of China(No.61375082)
文摘The purpose of this research is to improve the robustness of the autonomous system in order to improve the position and velocity estimation of an Unmanned Aerial Vehicle(UAV).Therefore, new integrated SINS/GPS navigation scheme based on Interacting Multiple Nonlinear Fuzzy Adaptive H_∞ Models(IMM-NFAH_∞) filtering technique for UAV is presented. The proposed IMM-NFAH_∞ strategy switches between two different Nonlinear Fuzzy Adaptive H_∞(NFAH_∞) filters and each NFAH_∞ filter is based on different fuzzy logic inference systems. The newly proposed technique takes into consideration the high order Taylor series terms and adapts the nonlinear H_∞ filter based on different fuzzy inference systems via adaptive filter bounds(di),along with disturbance attenuation parameter c. Simulation analysis validates the performance of the proposed algorithm, and the comparison with nonlinear H_∞(NH_∞) filter and that with different NFAH_∞ filters demonstrate the effectiveness of UAV localization utilizing IMM-NFAH_∞ filter.