Microwave fracturing of rocks before mechanical breakage could improve the performance of mechanical excavators and reduce environmental impacts.Previous research focused on the microwave fracturing of intact rock blo...Microwave fracturing of rocks before mechanical breakage could improve the performance of mechanical excavators and reduce environmental impacts.Previous research focused on the microwave fracturing of intact rock blocks.By using an open-ended antenna,this paper investigates the effect of pre-existing joints on the microwave fracturing of the Singapore Bukit Timah granite blocks.The results show that the specimens are weakened in the manners of cracking,spalling,melting,or a combination of them.The crack number and the total crack length produced by microwave treatment of jointed rock blocks are slightly smaller than those in the intact rock blocks.The interaction between joints and microwave-induced cracks can be summarized into the following four patterns:(1)microwave-induced cracks become arrested so that the crack propagation is terminated;(2)microwave-induced cracks penetrate the joints and continue to propagate;(3)microwave-induced cracks become deflected along the joints;and(4)microwave-induced cracks propagate forward following the joints.The smaller the approach angle between the microwave-induced crack and the preexisting joint is,the more microwave-induced cracks tend to be arrested at the joint.Increasing the approach angle between the microwave-induced crack and the joint can increase the chance of microwave-induced crack penetrating the joint.The results also show that the smaller the distance is between the microwave radiation point and the joint,the easier it is for microwave-induced cracks to penetrate the joints;otherwise,the microwave-induced crack is more likely to be arrested at the pre-existing joint.展开更多
During the installation of a pipe pile,the soil around the pile will be squeezed out. This paper deals with this squeezing effect of open-ended pipe piles using the cylindrical cavity expansion theory. The characteris...During the installation of a pipe pile,the soil around the pile will be squeezed out. This paper deals with this squeezing effect of open-ended pipe piles using the cylindrical cavity expansion theory. The characteristics of soil with different tension and compression moduli and dilation are involved by applying the elastic theory with different moduli and logarithmic strain. The closed-form solutions of the radius of the plastic region,the displacement of the boundary between the plastic region and the elastic region and the expansion pressure on the external surface of the pipe piles are obtained. When obtaining these solutions,the soil plug in the open-ended pipe pile is considered by employing an incremental filling ratio to quantify the degree of soil plugging. Moreover,the effects of the ratio of tension and compression moduli,angle of dilation and incremental filling ratio on the radius of the plastic region and the expansion pressure on the external surface of the pipe pile are investigated. The parametric analyses show that it is necessary and important to consider the difference between the tension modulus and compression modulus,dilation angle and incremental filling ratio for studying the squeezing effect of open-ended pipe pile installation. It is concluded that the analytical solutions presented in this paper are suitable for studying the squeezing effect of open-ended pipe piles.展开更多
The flawed engineering practice is considered the main factor that is affecting to the development quality of engineering postgraduates.Taking Foshan Base as an example,this paper has analyzed the operational pattern,...The flawed engineering practice is considered the main factor that is affecting to the development quality of engineering postgraduates.Taking Foshan Base as an example,this paper has analyzed the operational pattern,practice teaching model,and internal governance system of the open-ended base as a new system for engineering practice and proposed several suggestions for the reformation of engineering postgraduates based on the construction effect.展开更多
Editors Yang Wang,Xi'an Jiaotong University Dongbo Shi,Shanghai Jiaotong University Ye Sun,University College London Zhesi Shen,National Science Library,CASTopic of the Special Issue What are the top questions tow...Editors Yang Wang,Xi'an Jiaotong University Dongbo Shi,Shanghai Jiaotong University Ye Sun,University College London Zhesi Shen,National Science Library,CASTopic of the Special Issue What are the top questions towards better science and innovation and the required data to answer these questions?展开更多
Editors Yang Wang,Xi'an Jiaotong University Dongbo Shi,Shanghai Jiaotong University Ye Sun,University College London Zhesi Shen,National Science Library,CAS Topic of the Special Issue What are the top questions to...Editors Yang Wang,Xi'an Jiaotong University Dongbo Shi,Shanghai Jiaotong University Ye Sun,University College London Zhesi Shen,National Science Library,CAS Topic of the Special Issue What are the top questions towards better science and innovation and the required data to answer these questions?展开更多
Medical visual question answering(MedVQA)faces unique challenges due to the high precision required for images and the specialized nature of the questions.These challenges include insufficient feature extraction capab...Medical visual question answering(MedVQA)faces unique challenges due to the high precision required for images and the specialized nature of the questions.These challenges include insufficient feature extraction capabilities,a lack of textual priors,and incomplete information fusion and interaction.This paper proposes an enhanced bootstrapping language-image pre-training(BLIP)model for MedVQA based on multimodal feature augmentation and triple-path collaborative attention(FCA-BLIP)to address these issues.First,FCA-BLIP employs a unified bootstrap multimodal model architecture that integrates ResNet and bidirectional encoder representations from Transformer(BERT)models to enhance feature extraction capabilities.It enables a more precise analysis of the details in images and questions.Next,the pre-trained BLIP model is used to extract features from image-text sample pairs.The model can understand the semantic relationships and shared information between images and text.Finally,a novel attention structure is developed to fuse the multimodal feature vectors,thereby improving the alignment accuracy between modalities.Experimental results demonstrate that the proposed method performs well in clinical visual question-answering tasks.For the MedVQA task of staging diabetic macular edema in fundus imaging,the proposed method outperforms the existing major models in several performance metrics.展开更多
The consultation intention of emergency decision-makers in urban rail transit(URT)is input into the emergency knowledge base in the form of domain questions to obtain emergency decision support services.This approach ...The consultation intention of emergency decision-makers in urban rail transit(URT)is input into the emergency knowledge base in the form of domain questions to obtain emergency decision support services.This approach facilitates the rapid collection of complete knowledge and rules to form effective decisions.However,the current structured degree of the URT emergency knowledge base remains low,and the domain questions lack labeled datasets,resulting in a large deviation between the consultation outcomes and the intended objectives.To address this issue,this paper proposes a question intention recognition model for the URT emergency domain,leveraging knowledge graph(KG)and data enhancement technology.First,a structured storage of emergency cases and emergency plans is realized based on KG.Subsequently,a comprehensive question template is developed,and the labeled dataset of emergency domain questions in URT is generated through the KG.Lastly,data enhancement is applied by prompt learning and the NLP Chinese Data Augmentation(NLPCDA)tool,and the intention recognition model combining Generalized Auto-regression Pre-training for Language Understanding(XLNet)and Recurrent Convolutional Neural Network for Text Classification(TextRCNN)is constructed.Word embeddings are generated by XLNet,context information is further captured using Bidirectional Long Short-Term Memory Neural Network(BiLSTM),and salient features are extracted with Convolutional Neural Network(CNN).Experimental results demonstrate that the proposed model can enhance the clarity of classification and the identification of domain questions,thereby providing supportive knowledge for emergency decision-making in URT.展开更多
Visual question answering(VQA)is a multimodal task,involving a deep understanding of the image scene and the question’s meaning and capturing the relevant correlations between both modalities to infer the appropriate...Visual question answering(VQA)is a multimodal task,involving a deep understanding of the image scene and the question’s meaning and capturing the relevant correlations between both modalities to infer the appropriate answer.In this paper,we propose a VQA system intended to answer yes/no questions about real-world images,in Arabic.To support a robust VQA system,we work in two directions:(1)Using deep neural networks to semantically represent the given image and question in a fine-grainedmanner,namely ResNet-152 and Gated Recurrent Units(GRU).(2)Studying the role of the utilizedmultimodal bilinear pooling fusion technique in the trade-o.between the model complexity and the overall model performance.Some fusion techniques could significantly increase the model complexity,which seriously limits their applicability for VQA models.So far,there is no evidence of how efficient these multimodal bilinear pooling fusion techniques are for VQA systems dedicated to yes/no questions.Hence,a comparative analysis is conducted between eight bilinear pooling fusion techniques,in terms of their ability to reduce themodel complexity and improve themodel performance in this case of VQA systems.Experiments indicate that these multimodal bilinear pooling fusion techniques have improved the VQA model’s performance,until reaching the best performance of 89.25%.Further,experiments have proven that the number of answers in the developed VQA system is a critical factor that a.ects the effectiveness of these multimodal bilinear pooling techniques in achieving their main objective of reducing the model complexity.The Multimodal Local Perception Bilinear Pooling(MLPB)technique has shown the best balance between the model complexity and its performance,for VQA systems designed to answer yes/no questions.展开更多
In the context of power generation companies, vast amounts of specialized data and expert knowledge have been accumulated. However, challenges such as data silos and fragmented knowledge hinder the effective utilizati...In the context of power generation companies, vast amounts of specialized data and expert knowledge have been accumulated. However, challenges such as data silos and fragmented knowledge hinder the effective utilization of this information. This study proposes a novel framework for intelligent Question-and-Answer (Q&A) systems based on Retrieval-Augmented Generation (RAG) to address these issues. The system efficiently acquires domain-specific knowledge by leveraging external databases, including Relational Databases (RDBs) and graph databases, without additional fine-tuning for Large Language Models (LLMs). Crucially, the framework integrates a Dynamic Knowledge Base Updating Mechanism (DKBUM) and a Weighted Context-Aware Similarity (WCAS) method to enhance retrieval accuracy and mitigate inherent limitations of LLMs, such as hallucinations and lack of specialization. Additionally, the proposed DKBUM dynamically adjusts knowledge weights within the database, ensuring that the most recent and relevant information is utilized, while WCAS refines the alignment between queries and knowledge items by enhanced context understanding. Experimental validation demonstrates that the system can generate timely, accurate, and context-sensitive responses, making it a robust solution for managing complex business logic in specialized industries.展开更多
To improve question answering (QA) performance based on real-world web data sets,a new set of question classes and a general answer re-ranking model are defined.With pre-defined dictionary and grammatical analysis,t...To improve question answering (QA) performance based on real-world web data sets,a new set of question classes and a general answer re-ranking model are defined.With pre-defined dictionary and grammatical analysis,the question classifier draws both semantic and grammatical information into information retrieval and machine learning methods in the form of various training features,including the question word,the main verb of the question,the dependency structure,the position of the main auxiliary verb,the main noun of the question,the top hypernym of the main noun,etc.Then the QA query results are re-ranked by question class information.Experiments show that the questions in real-world web data sets can be accurately classified by the classifier,and the QA results after re-ranking can be obviously improved.It is proved that with both semantic and grammatical information,applications such as QA, built upon real-world web data sets, can be improved,thus showing better performance.展开更多
基金Innovative and Entrepreneurial Team Program of Jiangsu Province,China,Grant/Award Number:JSSCTD202140Innovative and Entrepreneurial Doctor Program of Jiangsu Province,Grant/Award Number:KYCX20_0114National Natural Science Foundation of China,Grant/Award Numbers:41831281,52104121。
文摘Microwave fracturing of rocks before mechanical breakage could improve the performance of mechanical excavators and reduce environmental impacts.Previous research focused on the microwave fracturing of intact rock blocks.By using an open-ended antenna,this paper investigates the effect of pre-existing joints on the microwave fracturing of the Singapore Bukit Timah granite blocks.The results show that the specimens are weakened in the manners of cracking,spalling,melting,or a combination of them.The crack number and the total crack length produced by microwave treatment of jointed rock blocks are slightly smaller than those in the intact rock blocks.The interaction between joints and microwave-induced cracks can be summarized into the following four patterns:(1)microwave-induced cracks become arrested so that the crack propagation is terminated;(2)microwave-induced cracks penetrate the joints and continue to propagate;(3)microwave-induced cracks become deflected along the joints;and(4)microwave-induced cracks propagate forward following the joints.The smaller the approach angle between the microwave-induced crack and the preexisting joint is,the more microwave-induced cracks tend to be arrested at the joint.Increasing the approach angle between the microwave-induced crack and the joint can increase the chance of microwave-induced crack penetrating the joint.The results also show that the smaller the distance is between the microwave radiation point and the joint,the easier it is for microwave-induced cracks to penetrate the joints;otherwise,the microwave-induced crack is more likely to be arrested at the pre-existing joint.
文摘During the installation of a pipe pile,the soil around the pile will be squeezed out. This paper deals with this squeezing effect of open-ended pipe piles using the cylindrical cavity expansion theory. The characteristics of soil with different tension and compression moduli and dilation are involved by applying the elastic theory with different moduli and logarithmic strain. The closed-form solutions of the radius of the plastic region,the displacement of the boundary between the plastic region and the elastic region and the expansion pressure on the external surface of the pipe piles are obtained. When obtaining these solutions,the soil plug in the open-ended pipe pile is considered by employing an incremental filling ratio to quantify the degree of soil plugging. Moreover,the effects of the ratio of tension and compression moduli,angle of dilation and incremental filling ratio on the radius of the plastic region and the expansion pressure on the external surface of the pipe pile are investigated. The parametric analyses show that it is necessary and important to consider the difference between the tension modulus and compression modulus,dilation angle and incremental filling ratio for studying the squeezing effect of open-ended pipe pile installation. It is concluded that the analytical solutions presented in this paper are suitable for studying the squeezing effect of open-ended pipe piles.
基金supported by Guangdong Province Graduate Education Innovation Program(2021JGXM103)the 2020“Research on Talents”Project by the Guangdong Planning Office of Philosophy and Social Science.
文摘The flawed engineering practice is considered the main factor that is affecting to the development quality of engineering postgraduates.Taking Foshan Base as an example,this paper has analyzed the operational pattern,practice teaching model,and internal governance system of the open-ended base as a new system for engineering practice and proposed several suggestions for the reformation of engineering postgraduates based on the construction effect.
文摘Editors Yang Wang,Xi'an Jiaotong University Dongbo Shi,Shanghai Jiaotong University Ye Sun,University College London Zhesi Shen,National Science Library,CASTopic of the Special Issue What are the top questions towards better science and innovation and the required data to answer these questions?
文摘Editors Yang Wang,Xi'an Jiaotong University Dongbo Shi,Shanghai Jiaotong University Ye Sun,University College London Zhesi Shen,National Science Library,CAS Topic of the Special Issue What are the top questions towards better science and innovation and the required data to answer these questions?
基金Supported by the Program for Liaoning Excellent Talents in University(No.LR15045)the Liaoning Provincial Science and Technology Department Applied Basic Research Plan(No.101300243).
文摘Medical visual question answering(MedVQA)faces unique challenges due to the high precision required for images and the specialized nature of the questions.These challenges include insufficient feature extraction capabilities,a lack of textual priors,and incomplete information fusion and interaction.This paper proposes an enhanced bootstrapping language-image pre-training(BLIP)model for MedVQA based on multimodal feature augmentation and triple-path collaborative attention(FCA-BLIP)to address these issues.First,FCA-BLIP employs a unified bootstrap multimodal model architecture that integrates ResNet and bidirectional encoder representations from Transformer(BERT)models to enhance feature extraction capabilities.It enables a more precise analysis of the details in images and questions.Next,the pre-trained BLIP model is used to extract features from image-text sample pairs.The model can understand the semantic relationships and shared information between images and text.Finally,a novel attention structure is developed to fuse the multimodal feature vectors,thereby improving the alignment accuracy between modalities.Experimental results demonstrate that the proposed method performs well in clinical visual question-answering tasks.For the MedVQA task of staging diabetic macular edema in fundus imaging,the proposed method outperforms the existing major models in several performance metrics.
基金supported in part by the National Natural Science Foundation of China.The funding numbers 62433005,62272036,62132003,and 62173167.
文摘The consultation intention of emergency decision-makers in urban rail transit(URT)is input into the emergency knowledge base in the form of domain questions to obtain emergency decision support services.This approach facilitates the rapid collection of complete knowledge and rules to form effective decisions.However,the current structured degree of the URT emergency knowledge base remains low,and the domain questions lack labeled datasets,resulting in a large deviation between the consultation outcomes and the intended objectives.To address this issue,this paper proposes a question intention recognition model for the URT emergency domain,leveraging knowledge graph(KG)and data enhancement technology.First,a structured storage of emergency cases and emergency plans is realized based on KG.Subsequently,a comprehensive question template is developed,and the labeled dataset of emergency domain questions in URT is generated through the KG.Lastly,data enhancement is applied by prompt learning and the NLP Chinese Data Augmentation(NLPCDA)tool,and the intention recognition model combining Generalized Auto-regression Pre-training for Language Understanding(XLNet)and Recurrent Convolutional Neural Network for Text Classification(TextRCNN)is constructed.Word embeddings are generated by XLNet,context information is further captured using Bidirectional Long Short-Term Memory Neural Network(BiLSTM),and salient features are extracted with Convolutional Neural Network(CNN).Experimental results demonstrate that the proposed model can enhance the clarity of classification and the identification of domain questions,thereby providing supportive knowledge for emergency decision-making in URT.
文摘Visual question answering(VQA)is a multimodal task,involving a deep understanding of the image scene and the question’s meaning and capturing the relevant correlations between both modalities to infer the appropriate answer.In this paper,we propose a VQA system intended to answer yes/no questions about real-world images,in Arabic.To support a robust VQA system,we work in two directions:(1)Using deep neural networks to semantically represent the given image and question in a fine-grainedmanner,namely ResNet-152 and Gated Recurrent Units(GRU).(2)Studying the role of the utilizedmultimodal bilinear pooling fusion technique in the trade-o.between the model complexity and the overall model performance.Some fusion techniques could significantly increase the model complexity,which seriously limits their applicability for VQA models.So far,there is no evidence of how efficient these multimodal bilinear pooling fusion techniques are for VQA systems dedicated to yes/no questions.Hence,a comparative analysis is conducted between eight bilinear pooling fusion techniques,in terms of their ability to reduce themodel complexity and improve themodel performance in this case of VQA systems.Experiments indicate that these multimodal bilinear pooling fusion techniques have improved the VQA model’s performance,until reaching the best performance of 89.25%.Further,experiments have proven that the number of answers in the developed VQA system is a critical factor that a.ects the effectiveness of these multimodal bilinear pooling techniques in achieving their main objective of reducing the model complexity.The Multimodal Local Perception Bilinear Pooling(MLPB)technique has shown the best balance between the model complexity and its performance,for VQA systems designed to answer yes/no questions.
文摘In the context of power generation companies, vast amounts of specialized data and expert knowledge have been accumulated. However, challenges such as data silos and fragmented knowledge hinder the effective utilization of this information. This study proposes a novel framework for intelligent Question-and-Answer (Q&A) systems based on Retrieval-Augmented Generation (RAG) to address these issues. The system efficiently acquires domain-specific knowledge by leveraging external databases, including Relational Databases (RDBs) and graph databases, without additional fine-tuning for Large Language Models (LLMs). Crucially, the framework integrates a Dynamic Knowledge Base Updating Mechanism (DKBUM) and a Weighted Context-Aware Similarity (WCAS) method to enhance retrieval accuracy and mitigate inherent limitations of LLMs, such as hallucinations and lack of specialization. Additionally, the proposed DKBUM dynamically adjusts knowledge weights within the database, ensuring that the most recent and relevant information is utilized, while WCAS refines the alignment between queries and knowledge items by enhanced context understanding. Experimental validation demonstrates that the system can generate timely, accurate, and context-sensitive responses, making it a robust solution for managing complex business logic in specialized industries.
基金Microsoft Research Asia Internet Services in Academic Research Fund(No.FY07-RES-OPP-116)the Science and Technology Development Program of Tianjin(No.06YFGZGX05900)
文摘To improve question answering (QA) performance based on real-world web data sets,a new set of question classes and a general answer re-ranking model are defined.With pre-defined dictionary and grammatical analysis,the question classifier draws both semantic and grammatical information into information retrieval and machine learning methods in the form of various training features,including the question word,the main verb of the question,the dependency structure,the position of the main auxiliary verb,the main noun of the question,the top hypernym of the main noun,etc.Then the QA query results are re-ranked by question class information.Experiments show that the questions in real-world web data sets can be accurately classified by the classifier,and the QA results after re-ranking can be obviously improved.It is proved that with both semantic and grammatical information,applications such as QA, built upon real-world web data sets, can be improved,thus showing better performance.