This paper presents an innovative approach to enhance the querying capability of ChatGPT,a conversational artificial intelligence model,by incorporating voice-based interaction and a convolutional neural network(CNN)-...This paper presents an innovative approach to enhance the querying capability of ChatGPT,a conversational artificial intelligence model,by incorporating voice-based interaction and a convolutional neural network(CNN)-based impaired vision detection model.The proposed system aims to improve user experience and accessibility by allowing users to interact with ChatGPT using voice commands.Additionally,a CNN-based model is employed to detect impairments in user vision,enabling the system to adapt its responses and provide appropriate assistance.This research tackles head-on the challenges of user experience and inclusivity in artificial intelligence(AI).It underscores our commitment to overcoming these obstacles,making ChatGPT more accessible and valuable for a broader audience.The integration of voice-based interaction and impaired vision detection represents a novel approach to conversational AI.Notably,this innovation transcends novelty;it carries the potential to profoundly impact the lives of users,particularly those with visual impairments.The modular approach to system design ensures adaptability and scalability,critical for the practical implementation of these advancements.Crucially,the solution places the user at its core.Customizing responses for those with visual impairments demonstrates AI’s potential to not only understand but also accommodate individual needs and preferences.展开更多
An adaptive weighted stereo matching algorithm with multilevel and bidirectional dynamic programming based on ground control points (GCPs) is presented. To decrease time complexity without losing matching precision,...An adaptive weighted stereo matching algorithm with multilevel and bidirectional dynamic programming based on ground control points (GCPs) is presented. To decrease time complexity without losing matching precision, using a multilevel search scheme, the coarse matching is processed in typical disparity space image, while the fine matching is processed in disparity-offset space image. In the upper level, GCPs are obtained by enhanced volumetric iterative algorithm enforcing the mutual constraint and the threshold constraint. Under the supervision of the highly reliable GCPs, bidirectional dynamic programming framework is employed to solve the inconsistency in the optimization path. In the lower level, to reduce running time, disparity-offset space is proposed to efficiently achieve the dense disparity image. In addition, an adaptive dual support-weight strategy is presented to aggregate matching cost, which considers photometric and geometric information. Further, post-processing algorithm can ameliorate disparity results in areas with depth discontinuities and related by occlusions using dual threshold algorithm, where missing stereo information is substituted from surrounding regions. To demonstrate the effectiveness of the algorithm, we present the two groups of experimental results for four widely used standard stereo data sets, including discussion on performance and comparison with other methods, which show that the algorithm has not only a fast speed, but also significantly improves the efficiency of holistic optimization.展开更多
Purpose:The purpose of this study is to propose a new method for the end-to-end classification of steel surface defects.Design/methodology/approach:This study proposes an AM-AoN-SNN algorithm,which combines an attenti...Purpose:The purpose of this study is to propose a new method for the end-to-end classification of steel surface defects.Design/methodology/approach:This study proposes an AM-AoN-SNN algorithm,which combines an attention mechanism(AM)with an All-optical Neuron-based spiking neural network(AoN-SNN).The AM enhances network learning and extracts defective features,while the AoN-SNN predicts both the labels of the defects and the final labels of the images.Compared to the conventional Leaky-Integrated and Fire SNN,the AoN-SNN has improved the activation of neurons.Findings:The experimental findings on Northeast University(NEU)-CLS demonstrate that the proposed neural network detection approach outperforms other methods.Furthermore,the network’s effectiveness was tested,and the results indicate that the proposed method can achieve high detection accuracy and strong anti-interference capabilities while maintaining a basic structure.Originality/value:This study introduces a novel approach to classifying steel surface defects using a combination of a shallow AoN-SNN and a hybrid AM with different network architectures.The proposed method is the first study of SNN networks applied to this task.展开更多
基金This work was supported and funded by the Deanship of Scientific Research at Imam Mohammad Ibn Saud Islamic University(IMSIU)(Grant Number:IMSIU-RP23008).
文摘This paper presents an innovative approach to enhance the querying capability of ChatGPT,a conversational artificial intelligence model,by incorporating voice-based interaction and a convolutional neural network(CNN)-based impaired vision detection model.The proposed system aims to improve user experience and accessibility by allowing users to interact with ChatGPT using voice commands.Additionally,a CNN-based model is employed to detect impairments in user vision,enabling the system to adapt its responses and provide appropriate assistance.This research tackles head-on the challenges of user experience and inclusivity in artificial intelligence(AI).It underscores our commitment to overcoming these obstacles,making ChatGPT more accessible and valuable for a broader audience.The integration of voice-based interaction and impaired vision detection represents a novel approach to conversational AI.Notably,this innovation transcends novelty;it carries the potential to profoundly impact the lives of users,particularly those with visual impairments.The modular approach to system design ensures adaptability and scalability,critical for the practical implementation of these advancements.Crucially,the solution places the user at its core.Customizing responses for those with visual impairments demonstrates AI’s potential to not only understand but also accommodate individual needs and preferences.
基金supported by the National Natural Science Foundation of China (No.60605023,60775048)Specialized Research Fund for the Doctoral Program of Higher Education (No.20060141006)
文摘An adaptive weighted stereo matching algorithm with multilevel and bidirectional dynamic programming based on ground control points (GCPs) is presented. To decrease time complexity without losing matching precision, using a multilevel search scheme, the coarse matching is processed in typical disparity space image, while the fine matching is processed in disparity-offset space image. In the upper level, GCPs are obtained by enhanced volumetric iterative algorithm enforcing the mutual constraint and the threshold constraint. Under the supervision of the highly reliable GCPs, bidirectional dynamic programming framework is employed to solve the inconsistency in the optimization path. In the lower level, to reduce running time, disparity-offset space is proposed to efficiently achieve the dense disparity image. In addition, an adaptive dual support-weight strategy is presented to aggregate matching cost, which considers photometric and geometric information. Further, post-processing algorithm can ameliorate disparity results in areas with depth discontinuities and related by occlusions using dual threshold algorithm, where missing stereo information is substituted from surrounding regions. To demonstrate the effectiveness of the algorithm, we present the two groups of experimental results for four widely used standard stereo data sets, including discussion on performance and comparison with other methods, which show that the algorithm has not only a fast speed, but also significantly improves the efficiency of holistic optimization.
基金supported by the Jiangsu Agriculture Science and Technology Innovation Fund(Grant No.CX(21)3145).
文摘Purpose:The purpose of this study is to propose a new method for the end-to-end classification of steel surface defects.Design/methodology/approach:This study proposes an AM-AoN-SNN algorithm,which combines an attention mechanism(AM)with an All-optical Neuron-based spiking neural network(AoN-SNN).The AM enhances network learning and extracts defective features,while the AoN-SNN predicts both the labels of the defects and the final labels of the images.Compared to the conventional Leaky-Integrated and Fire SNN,the AoN-SNN has improved the activation of neurons.Findings:The experimental findings on Northeast University(NEU)-CLS demonstrate that the proposed neural network detection approach outperforms other methods.Furthermore,the network’s effectiveness was tested,and the results indicate that the proposed method can achieve high detection accuracy and strong anti-interference capabilities while maintaining a basic structure.Originality/value:This study introduces a novel approach to classifying steel surface defects using a combination of a shallow AoN-SNN and a hybrid AM with different network architectures.The proposed method is the first study of SNN networks applied to this task.