With the widespread use of social media, the propagation of health-related rumors has become a significant public health threat. Existing methods for detecting health rumors predominantly rely on external knowledge or propagation structures, with only a few recent approaches attempting causal inference; however, these have not yet effectively integrated causal discovery with domain-specific knowledge graphs for detecting health rumors. In this study, we found that the combined use of causal discovery and domain-specific knowledge graphs can effectively identify implicit pseudo-causal logic embedded within texts, holding significant potential for health rumor detection. To this end, we propose CKDG, a dual-graph fusion framework based on causal logic and medical knowledge graphs. CKDG constructs a weighted causal graph to capture the implicit causal relationships in the text and introduces a medical knowledge graph to verify semantic consistency, thereby enhancing the ability to identify the misuse of professional terminology and pseudoscientific claims. In experiments conducted on a dataset comprising 8430 health rumors, CKDG achieved an accuracy of 91.28% and an F1 score of 90.38%, representing improvements of 5.11% and 3.29% over the best baseline, respectively. Our results indicate that the integrated use of causal discovery and domain-specific knowledge graphs offers significant advantages for health rumor detection systems. This method not only improves detection performance but also enhances the transparency and credibility of model decisions by tracing causal chains and sources of knowledge conflicts. We anticipate that this work will provide key technological support for the development of trustworthy health-information filtering systems, thereby improving the reliability of public health information on social media.
It is of great importance to obtain precise trace data, as traces are frequently the sole visible and measurable parameter in most outcrops. The manual recognition and detection of traces on high-resolution three-dimensional (3D) models are relatively straightforward but time-consuming. One potential solution to enhance this process is to use machine learning algorithms to detect the 3D traces. In this study, a unique pixel-wise texture mapper algorithm generates a dense point cloud representation of an outcrop with the precise resolution of the original textured 3D model. A virtual digital image rendering was then employed to capture virtual images of selected regions. This technique helps to overcome limitations caused by the surface morphology of the rock mass, such as restricted access, lighting conditions, and shading effects. After AI-powered trace detection on two-dimensional (2D) images, a 3D data structuring technique was applied to the selected trace pixels. In the 3D data structuring, the trace data were structured through 2D thinning, 3D reprojection, clustering, segmentation, and segment linking. Finally, the linked segments were exported as 3D polylines, with each polyline in the output corresponding to a trace. The efficacy of the proposed method was assessed using a 3D model of a real-world case study, which was used to compare the results of artificial intelligence (AI)-aided and human intelligence trace detection. Rosette diagrams, which visualize the distribution of trace orientations, confirmed the high similarity between the automatically and manually generated trace maps. In conclusion, the proposed semi-automatic method was easy to use, fast, and accurate in detecting the dominant jointing system of the rock mass.
Esophageal cancer (EC), a common malignant tumor of the digestive tract, requires early diagnosis and timely treatment to improve patient prognosis. Automated detection of EC using medical imaging has the potential to increase screening efficiency and diagnostic accuracy, thereby significantly improving long-term survival rates and the quality of life of patients. Recent advances in deep learning (DL), particularly convolutional neural networks, have demonstrated remarkable performance in medical imaging analysis. These techniques have shown significant progress in the automated identification of malignant tumors, quantitative analysis of lesions, and improvement in diagnostic accuracy and efficiency. This article comprehensively examines the research progress of DL in medical imaging for EC, covering imaging modalities such as digital pathology, endoscopy, and computed tomography. It explores the clinical value and application prospects of DL in EC screening and diagnosis. Additionally, the article addresses several critical challenges that must be overcome for the clinical translation of DL techniques, including constructing high-quality datasets, promoting multimodal feature fusion, and optimizing artificial intelligence-clinical workflow integration. By providing a detailed overview of the current state of DL in EC imaging and highlighting the key challenges and future directions, this article aims to guide future research and facilitate the clinical implementation of DL technologies in EC management, ultimately contributing to better patient outcomes.
Impact craters are important for understanding lunar geologic evolution and surface erosion rates, among other functions. However, the morphological characteristics of micro impact craters are not obvious and the craters are numerous, resulting in low detection accuracy by deep learning models. Therefore, we proposed a new multi-scale fusion crater detection algorithm (MSF-CDA) based on YOLO11 to improve the accuracy of lunar impact crater detection, especially for small craters with a diameter of < 1 km. Using the images taken by the LROC (Lunar Reconnaissance Orbiter Camera) at the Chang'e-4 (CE-4) landing area, we constructed three separate datasets for craters with diameters of 0–70 m, 70–140 m, and > 140 m. We then trained three submodels separately with these three datasets. Additionally, we designed a slicing-amplifying-slicing strategy to enhance the ability to extract features from small craters. To handle redundant predictions, we proposed a new Non-Maximum Suppression with Area Filtering method to fuse the results for overlapping targets across the multi-scale submodels. Our new MSF-CDA method achieved high detection performance, with Precision, Recall, and F1 score values of 0.991, 0.987, and 0.989, respectively, effectively addressing the problems induced by the sparse features and sample imbalance of small craters. MSF-CDA can provide strong data support for more in-depth study of the geological evolution of the lunar surface and finer geological age estimations. This strategy can also be used to detect other small objects with sparse features and sample imbalance problems. We detected approximately 500,000 impact craters in an area of approximately 214 km² around the CE-4 landing area. By statistically analyzing the new data, we updated the distribution function of the number and diameter of impact craters. Finally, we identified the most suitable lighting conditions for detecting impact crater targets by analyzing the effect of different lighting conditions on the detection accuracy.
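The crater abstract above does not spell out how its Non-Maximum Suppression with Area Filtering works, so the following is only a minimal sketch of one plausible reading: boxes whose area falls outside a submodel's assigned size range are discarded first, and ordinary greedy NMS is then applied to what remains. The function names, the `(x1, y1, x2, y2)` box format, and the `min_area`/`max_area` parameters are hypothetical and not taken from the paper.

```python
def box_area(box):
    """Area of an axis-aligned box given as (x1, y1, x2, y2)."""
    x1, y1, x2, y2 = box
    return max(0.0, x2 - x1) * max(0.0, y2 - y1)


def iou(a, b):
    """Intersection over union of two (x1, y1, x2, y2) boxes."""
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    union = box_area(a) + box_area(b) - inter
    return inter / union if union > 0 else 0.0


def nms_with_area_filter(dets, iou_thr=0.5, min_area=0.0, max_area=float("inf")):
    """dets is a list of (box, score) pairs.

    Assumed behaviour (not the paper's exact rule):
    1. drop detections whose box area lies outside [min_area, max_area];
    2. run greedy NMS, keeping the highest-scoring box of each overlapping group.
    """
    in_range = [d for d in dets if min_area <= box_area(d[0]) <= max_area]
    ordered = sorted(in_range, key=lambda d: d[1], reverse=True)
    kept = []
    for box, score in ordered:
        if all(iou(box, k[0]) < iou_thr for k in kept):
            kept.append((box, score))
    return kept
```

In a multi-scale setup, each submodel's predictions could be filtered with its own area range before the merged list goes through the suppression step; whether the paper does it in this order is an assumption.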
In modern industrial applications, ensuring the reliability of mechanical fittings is critical for maintaining operational safety and efficiency, particularly in power grid systems where split pins serve a pivotal role despite being susceptible to environmental degradation and failure. Existing UAV-based inspection systems are hampered by a low representation of split pin elements and complex backgrounds, leading to challenges in accurate fault detection and timely maintenance. To address this pressing issue, our study proposes an innovative fault detection method for split pins. The approach employs a three-step process: first, cropping operations are used to accurately isolate the fittings containing split pins; second, super-resolution reconstruction is applied to enhance image clarity and detail; and finally, an improved YOLOv8 network, augmented with inner-shape IoU and local window attention mechanisms, is utilized to refine local feature extraction and annotation accuracy. Experimental evaluations on a split pin defect dataset demonstrate robust performance, achieving an accuracy rate of 72.1% and a mean average precision (mAP) of 67.7%, thereby validating the method's effectiveness under challenging conditions. The proposed approach contributes to the field by specifically targeting the challenges associated with split pin detection in UAV-based inspections, offering a practically applicable and reliably precise method.
6G is expected to support more intelligent networks, and this trend raises the importance of self-healing capability when degradation emerges in cellular networks. As a primary component of self-healing networks, fault detection is investigated in this paper. Given its fast response and low time and computational cost, the Online Broad Learning System (OBLS) is applied, for the first time, to identify outages in cellular networks. In addition, the Automatic-constructed Online Broad Learning System (AOBLS) is put forward to rationalize its structure and consequently avoid over-fitting and under-fitting. Furthermore, a multi-layer classification structure is proposed to further improve the classification performance. To face the challenges caused by imbalanced data in fault detection problems, a novel weighting strategy is derived to obtain the Multilayer Automatic-constructed Weighted Online Broad Learning System (MAWOBLS), which is combined through ensemble learning with a retrained Support Vector Machine (SVM), denoted as EMAWOBLS, to better handle this imbalance. Simulation results show that the proposed algorithm has excellent performance in detecting faults with satisfactory time usage.
Current damage detection methods based on model updating and sensitivity Jacobian matrices show a low convergence ratio and low computational efficiency for online calculations. The aim of this paper is to construct a real-time automated damage detection method by developing a theory-assisted adaptive multiagent twin delayed deep deterministic (TA2-MATD3) policy gradient algorithm. First, the theoretical framework of reinforcement-learning-driven damage detection is established. To address the disadvantages of the traditional multiagent twin delayed deep deterministic (MATD3) method, the theory-assisted mechanism and the adaptive experience replay mechanism are introduced. Moreover, a historical residential house built in 1889 was taken as an example, using its 12-month structural health monitoring data. TA2-MATD3 was compared with existing damage detection methods in terms of the convergence ratio, online computing efficiency, and damage detection accuracy. The results show that the computational efficiency of TA2-MATD3 is approximately 117–160 times that of the traditional methods. The convergence ratio of damage detection on the training set is approximately 97%, and that on the test set is in the range of 86.2%–91.9%. In addition, the main apparent damages found in the field survey were identified by TA2-MATD3. The results indicate that the proposed method can significantly improve the online computing efficiency and damage detection accuracy. This research can provide novel perspectives for the use of reinforcement learning methods to conduct damage detection in online structural health monitoring.
The rapid proliferation of Internet of Things (IoT) devices in critical healthcare infrastructure has introduced significant security and privacy challenges that demand innovative, distributed architectural solutions. This paper proposes FE-ACS (Fog-Edge Adaptive Cybersecurity System), a novel hierarchical security framework that intelligently distributes AI-powered anomaly detection algorithms across edge, fog, and cloud layers to optimize security efficacy, latency, and privacy. Our comprehensive evaluation demonstrates that FE-ACS achieves superior detection performance with an AUC-ROC of 0.985 and an F1-score of 0.923, while maintaining significantly lower end-to-end latency (18.7 ms) compared to cloud-centric (152.3 ms) and fog-only (34.5 ms) architectures. The system exhibits exceptional scalability, supporting up to 38,000 devices with logarithmic performance degradation, a 67× improvement over conventional cloud-based approaches. By incorporating differential privacy mechanisms with balanced privacy-utility tradeoffs (ε = 1.0–1.5), FE-ACS maintains 90%–93% detection accuracy while ensuring strong privacy guarantees for sensitive healthcare data. Computational efficiency analysis reveals that our architecture achieves a detection rate of 12,400 events per second with only 12.3 mJ energy consumption per inference. In healthcare risk assessment, FE-ACS demonstrates robust operational viability with low patient safety risk (14.7%) and high system reliability (94.0%). The proposed framework represents a significant advancement in distributed security architectures, offering a scalable, privacy-preserving, and real-time solution for protecting healthcare IoT ecosystems against evolving cyber threats.
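The FE-ACS abstract reports differential privacy with ε = 1.0–1.5 but does not say which mechanism is used. As an illustration of what such a privacy budget controls, here is a minimal sketch of the textbook Laplace mechanism applied to a bounded mean; the choice of the Laplace mechanism, the function name `private_mean`, and the clipping bounds are all assumptions made for this example, not details from the paper.

```python
import random


def private_mean(values, lower, upper, epsilon, rng):
    """Release the mean of `values` with epsilon-differential privacy.

    Assumptions for this sketch: each record is clipped to [lower, upper],
    the number of records is public, and neighbouring datasets differ in one
    record, so the mean has sensitivity (upper - lower) / n.
    """
    n = len(values)
    clipped = [min(max(v, lower), upper) for v in values]
    sensitivity = (upper - lower) / n
    scale = sensitivity / epsilon  # Laplace scale b = sensitivity / epsilon
    # The difference of two i.i.d. exponentials with mean b is Laplace(0, b).
    noise = rng.expovariate(1.0 / scale) - rng.expovariate(1.0 / scale)
    return sum(clipped) / n + noise


if __name__ == "__main__":
    readings = [0.62, 0.71, 0.55, 0.90, 0.48]  # made-up normalised sensor values
    for eps in (1.0, 1.5):
        print(eps, private_mean(readings, 0.0, 1.0, eps, random.Random(7)))
```

A smaller ε gives a larger noise scale and therefore stronger privacy at the cost of accuracy, which is the privacy-utility tradeoff the abstract refers to.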
Breast cancer screening programs rely heavily on mammography for early detection; however, diagnostic performance is strongly affected by inter-reader variability, breast density, and the limitations of conventional computer-aided detection systems. Recent advances in deep learning have enabled more robust and scalable solutions for large-scale screening, yet a systematic comparison of modern object detection architectures on nationally representative datasets remains limited. This study presents a comprehensive quantitative comparison of prominent deep learning–based object detection architectures for Artificial Intelligence-assisted mammography analysis using the MammosighTR dataset, developed within the Turkish National Breast Cancer Screening Program. The dataset comprises 12,740 patient cases collected between 2016 and 2022, annotated with BI-RADS categories, breast density levels, and lesion localization labels. A total of 31 models were evaluated, including One-Stage, Two-Stage, and Transformer-based architectures, under a unified experimental framework at both patient and breast levels. The results demonstrate that Two-Stage architectures consistently outperform One-Stage models, achieving approximately 2%–4% higher Macro F1-Scores and more balanced precision–recall trade-offs, with Double-Head R-CNN and Dynamic R-CNN yielding the highest overall performance (Macro F1 ≈ 0.84–0.86). This advantage is primarily attributed to the region proposal mechanism and improved class balance inherent to Two-Stage designs. One-Stage detectors exhibited higher sensitivity and faster inference, reaching Recall values above 0.88, but experienced minor reductions in Precision and overall accuracy (≈ 1%–2%) compared with Two-Stage models. Among Transformer-based architectures, Deformable DEtection TRansformer demonstrated strong robustness and consistency across datasets, achieving Macro F1-Scores comparable to CNN-based detectors (≈ 0.83–0.85) while exhibiting minimal performance degradation under distributional shifts. Breast density–based analysis revealed increased misclassification rates in medium-density categories (types B and C), whereas Transformer-based architectures maintained more stable performance in high-density type D tissue. These findings quantitatively confirm that both architectural design and tissue characteristics play a decisive role in diagnostic accuracy. Overall, the study provides a reproducible benchmark and highlights the potential of hybrid approaches that combine the accuracy of Two-Stage detectors with the contextual modeling capability of Transformer architectures for clinically reliable breast cancer screening systems.
Traffic sign detection is an important part of autonomous driving, and its recognition accuracy and speed are directly related to road traffic safety. Although convolutional neural networks (CNNs) have made certain breakthroughs in this field, in complex scenes such as image blur and target occlusion, traffic sign detection continues to exhibit limited accuracy, accompanied by false positives and missed detections. To address these problems, a traffic sign detection algorithm, You Only Look Once-based Skip Dynamic Way (YOLO-SDW), built on You Only Look Once version 8 small (YOLOv8s), is proposed. Firstly, a Skip Connection Reconstruction (SCR) module is introduced to efficiently integrate fine-grained feature information and enhance the detection accuracy of the algorithm in complex scenes. Secondly, a C2f module based on Dynamic Snake Convolution (C2f-DySnake) is proposed to dynamically adjust the receptive field information, improve the algorithm's feature extraction ability for blurred or occluded targets, and reduce the occurrence of false detections and missed detections. Finally, the Wise Powerful IoU v2 (WPIoUv2) loss function is proposed to further improve the detection accuracy of the algorithm. Experimental results show that the mAP@0.5 of YOLO-SDW on the TT100K dataset is 89.2% and its mAP@0.5:0.95 is 68.5%, which are 4% and 3.3% higher than the YOLOv8s baseline, respectively. YOLO-SDW thus achieves higher accuracy while retaining real-time performance.
With the continuous progress of automatic driving technology, automatic driving technology standards are gradually affecting the determination of criminal responsibility for traffic accidents in China. At present, China's automatic driving technology standards are characterized by high policy relevance coexisting with low normative binding force, professionalism coexisting with barriers, and a frontier position coexisting with ambiguity. Therefore, determining criminal responsibility on the basis of automatic driving technology standards presents both theoretical and practical challenges. In this regard, the following misunderstandings should be clarified in theory: the legal order under the automatic driving technology standard is constitutional and systematic, and there is a balance between the frontier of automatic driving technology development and the lag of criminal law. An automatic driving technology risk-level system should be built to clarify the boundary of the effectiveness of criminal law norms. This essay seeks breakthroughs in application by establishing a comprehensive judgment system for risks and accidents as well as an evidence and proof system, so as to clarify the determination of criminal responsibility under automatic driving technology standards.
In printed circuit board (PCB) manufacturing, surface defects can significantly affect product quality. To address the performance degradation, high false detection rates, and missed detections caused by complex backgrounds in current intelligent inspection algorithms, this paper proposes CG-YOLOv8, a lightweight and improved model based on YOLOv8n for PCB surface defect detection. The proposed method optimizes the network architecture and compresses parameters to reduce model complexity while maintaining high detection accuracy, thereby enhancing the capability of identifying diverse defects under complex conditions. Specifically, a cascaded multi-receptive field (CMRF) module is adopted to replace the SPPF module in the backbone to improve feature perception, and an inverted residual mobile block (IRMB) is integrated into the C2f module to further enhance performance. Additionally, conventional convolution layers are replaced with Grouped Spatial Convolution (GSConv) to reduce computational cost, and a lightweight Convolutional Block Attention Module based Convolution (CBAMConv) module is introduced after GSConv to preserve accuracy through attention mechanisms. The detection head is also optimized by removing medium and large-scale detection layers, thereby enhancing the model's ability to detect small-scale defects and further reducing complexity. Experimental results show that, compared to the original YOLOv8n, the proposed CG-YOLOv8 reduces parameter count by 53.9%, improves mAP@0.5 by 2.2%, and increases precision and recall by 2.0% and 1.8%, respectively. These improvements demonstrate that CG-YOLOv8 offers an efficient and lightweight solution for PCB surface defect detection.
This study proposes a lightweight rice disease detection model optimized for edge computing environments. The goal is to enhance the You Only Look Once (YOLO) v5 architecture to achieve a balance between real-time diagnostic performance and computational efficiency. To this end, a total of 3234 high-resolution images (2400×1080) were collected of three major rice diseases (Rice Blast, Bacterial Blight, and Brown Spot) frequently found in actual rice cultivation fields. These images served as the training dataset. The proposed YOLOv5-V2 model removes the Focus layer from the original YOLOv5s and integrates ShuffleNet V2 into the backbone, resulting in both model compression and improved inference speed. Additionally, YOLOv5-P, based on PP-PicoDet, was configured as a comparative model to quantitatively evaluate performance. Experimental results demonstrated that YOLOv5-V2 achieved excellent detection performance, with an mAP@0.5 of 89.6%, mAP@0.5:0.95 of 66.7%, precision of 91.3%, and recall of 85.6%, while maintaining a lightweight model size of 6.45 MB. In contrast, YOLOv5-P exhibited a smaller model size of 4.03 MB, but showed lower performance with an mAP@0.5 of 70.3%, mAP@0.5:0.95 of 35.2%, precision of 62.3%, and recall of 74.1%. This study lays a technical foundation for the implementation of smart agriculture and real-time disease diagnosis systems by proposing a model that satisfies both accuracy and lightweight requirements.
The global population is rapidly expanding, driving an increasing demand for intelligent healthcare systems. Artificial intelligence (AI) applications in remote patient monitoring and diagnosis have achieved remarkable progress and are emerging as a major development trend. Among these applications, mouth motion tracking and mouth-state detection represent an important direction, providing valuable support for diagnosing neuromuscular disorders such as dysphagia, Bell's palsy, and Parkinson's disease. In this study, we focus on developing a real-time system capable of monitoring and detecting mouth state that can be efficiently deployed on edge devices. The proposed system integrates the Facial Landmark Detection technique with an optimized model combining a Bidirectional Gated Recurrent Unit (BiGRU) and Comprehensive Learning Particle Swarm Optimization (CLPSO). We conducted a comprehensive comparison and evaluation of the proposed model against several traditional models using multiple performance metrics, including accuracy, precision, recall, F1-score, cosine similarity, ROC–AUC, and the precision–recall curve. The proposed method achieved an accuracy of 96.57% with a precision of 98.25% on our self-collected dataset, outperforming traditional models and related works in the same field. These findings highlight the potential of the proposed approach for implementation in real-time patient monitoring systems, contributing to improved diagnostic accuracy and supporting healthcare professionals in patient treatment and care.
Automation and intelligence have become the primary trends in the design of investment casting processes. However, the design of gating and riser systems still lacks precise quantitative evaluation criteria. Numerical simulation plays a significant role in quantitatively evaluating current processes and making targeted improvements, but it cannot dynamically reflect the formation outcomes of castings under varying process conditions, which makes real-time adjustments to gating and riser designs challenging. In this study, an automated design model for gating and riser systems based on an integrated parametric 3D modeling and simulation framework is proposed, which enhances the flexibility and usability of evaluating the casting process by simulation. Firstly, geometric feature extraction technology is employed to obtain the geometric information of the target casting. Based on this information, an automated design framework for gating and riser systems is established, incorporating multiple structural parameters for real-time process control. Subsequently, the simulation results for various structural parameters are analyzed, and the influence of these parameters on casting formation is thoroughly investigated. Finally, the optimal design scheme is generated and validated through experimental verification. Simulation analysis and experimental results show that using a larger gate neck (24 mm in side length) and external risers promotes a more uniform temperature distribution and a more stable flow state, effectively eliminating shrinkage cavities and enhancing process yield by 15%.
Modern manufacturing processes have become more reliant on automation because of the accelerated transition from Industry 3.0 to Industry 4.0. Manual inspection of products on assembly lines remains inefficient, error-prone, and inconsistent, emphasizing the need for a reliable and automated inspection system. Leveraging both object detection and image segmentation approaches, this research proposes a vision-based solution for the detection of various kinds of tools in a toolkit using deep learning (DL) models. Two Intel RealSense D455f depth cameras were arranged in a top-down configuration to capture both RGB and depth images of the toolkits. After applying multiple constraints and enhancing the images through preprocessing and augmentation, a dataset consisting of 3300 annotated RGB-D images was generated. Several DL models were selected through a comprehensive assessment of mean Average Precision (mAP), precision-recall equilibrium, inference latency (target ≥ 30 FPS), and computational burden, resulting in a preference for YOLO and Region-based Convolutional Neural Network (R-CNN) variants over ViT-based models due to the latter's increased latency and resource requirements. YOLOv5, YOLOv8, YOLOv11, Faster R-CNN, and Mask R-CNN were trained on the annotated dataset and evaluated using key performance metrics (Recall, Accuracy, F1-score, and Precision). YOLOv11 delivered the most balanced results, with 93.0% precision, 89.9% recall, and a 90.6% F1-score in object detection, as well as 96.9% precision, 95.3% recall, and a 96.5% F1-score in instance segmentation, with an average inference time of 25 ms per frame (≈ 40 FPS), demonstrating real-time performance. Leveraging these results, a YOLOv11-based Windows application was successfully deployed in a real-time assembly line environment, where it accurately processed live video streams to detect and segment tools within toolkits, demonstrating its practical effectiveness in industrial automation. In addition to detection and segmentation, the application can precisely measure socket dimensions by applying edge detection techniques to YOLOv11 segmentation masks. This enables specification-level quality control directly on the assembly line and improves real-time inspection capability. The implementation is a significant step forward for intelligent manufacturing in the Industry 4.0 paradigm, providing a scalable, efficient, and accurate approach to automated inspection and dimensional verification.
Synthetic speech detection is an essential task in the field of voice security, aimed at identifying deceptive voice attacks generated by text-to-speech (TTS) systems or voice conversion (VC) systems. In this paper, we propose a synthetic speech detection model called TFTransformer, which integrates both local and global features to enhance detection capabilities by effectively modeling local and global dependencies. Structurally, the model is divided into two main components: a front-end and a back-end. The front-end of the model uses a combination of SincLayer and two-dimensional (2D) convolution to extract high-level feature maps (HFM) containing the local dependencies of the input speech signals. The back-end uses a time-frequency Transformer module to process these feature maps and further capture global dependencies. Furthermore, we propose TFTransformer-SE, which incorporates a channel attention mechanism within the 2D convolutional blocks. This enhancement aims to more effectively capture local dependencies, thereby improving the model's performance. The experiments were conducted on the ASVspoof 2021 LA dataset, and the results showed that the model achieved an equal error rate (EER) of 3.37% without data augmentation. Additionally, we evaluated the model using the ASVspoof 2019 LA dataset, achieving an EER of 0.84%, also without data augmentation. This demonstrates that combining local and global dependencies in the time-frequency domain can significantly improve detection accuracy.
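The synthetic speech abstract reports its results as an equal error rate (EER), the operating point at which the false acceptance rate (spoofed speech accepted) equals the false rejection rate (bona fide speech rejected). The sketch below shows one simple way to estimate it from two lists of detector scores; it assumes higher scores mean "more likely bona fide", uses a plain threshold sweep rather than the interpolation found in official scoring tools, and the function name is hypothetical.

```python
def equal_error_rate(bonafide_scores, spoof_scores):
    """Return (eer, threshold) from a sweep over all observed scores.

    A trial is accepted when score >= threshold.
    FRR = fraction of bona fide trials rejected.
    FAR = fraction of spoof trials accepted.
    The EER is approximated at the threshold where |FAR - FRR| is smallest.
    """
    thresholds = sorted(set(bonafide_scores) | set(spoof_scores))
    best = None
    for t in thresholds:
        frr = sum(s < t for s in bonafide_scores) / len(bonafide_scores)
        far = sum(s >= t for s in spoof_scores) / len(spoof_scores)
        gap = abs(far - frr)
        if best is None or gap < best[0]:
            best = (gap, (far + frr) / 2.0, t)
    return best[1], best[2]


if __name__ == "__main__":
    # Toy scores, invented for illustration only.
    bonafide = [0.9, 0.8, 0.7, 0.4]
    spoof = [0.6, 0.3, 0.2, 0.1]
    print(equal_error_rate(bonafide, spoof))
```

On the toy scores, a threshold of 0.6 rejects one of four bona fide trials and accepts one of four spoofed trials, so both error rates meet at 25%.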
In recent years, the rapid advancement of artificial intelligence (AI) technology has enabled AI-assisted negative screening to significantly enhance physicians' efficiency through image feature analysis and multimodal data modeling, allowing them to focus more on diagnosing positive cases. Meanwhile, multispectral imaging (MSI) integrates spectral and spatial resolution to capture subtle tissue features invisible to the human eye, providing high-resolution data support for pathological analysis. Combining AI technology with MSI and employing quantitative methods to analyze multiband biomarkers (such as absorbance differences in keratin pearls) can effectively improve diagnostic specificity and reduce subjective errors in manual slide interpretation. To address the challenge of identifying negative tissue sections, we developed a discrimination algorithm powered by MSI. We demonstrated its efficacy using cutaneous squamous cell carcinoma (cSCC) as a representative case study. The algorithm achieved 100% accuracy in excluding negative cases and effectively mitigated the false-positive problem caused by cSCC heterogeneity. We constructed a multispectral image dataset acquired at 520 nm, 600 nm, and 630 nm wavelengths. Subsequently, we employed an optimized MobileViT model for tissue classification and performed comparative analyses against other models. The experimental results showed that our optimized MobileViT model achieved superior performance in identifying negative tissue sections, with an accuracy of 100%. Thus, our results confirm the feasibility of integrating MSI with AI to exclude negative cases with perfect accuracy, offering a novel solution to alleviate the workload of pathologists.
To address the limited scale adaptation of automatic driving target detection algorithms in low-illumination environments and their shortcomings in handling target occlusion, this paper proposes YOLO-LKSDS, an automatic driving detection model. First, the Contrast-Limited Adaptive Histogram Equalisation (CLAHE) image enhancement algorithm is improved to increase image contrast and enhance the detailed features of the target. Then, on the basis of the YOLOv5 model, the K-means++ clustering algorithm is introduced to obtain suitable anchor boxes, and SPPELAN spatial pyramid pooling is improved to enhance the accuracy and robustness of the model for multi-scale target detection. Finally, an improved SEAM (Separated and Enhancement Attention Module) attention mechanism is combined with the DIOU-NMS algorithm to optimize the model's performance in occluded and dense scenes. Compared with the original model, the improved YOLO-LKSDS model achieves a 13.3% improvement in accuracy, a 1.7% improvement in mAP, and 240,000 fewer parameters on the BDD100K dataset. To validate the generalization of the improved algorithm, experiments were also run on the KITTI dataset, where accuracy improves by 21.1%, recall by 36.6%, and mAP50 by 29.5% relative to YOLOv5. Deployment of the algorithm is verified on an edge computing platform, where the average detection speed reaches 24.4 FPS while power consumption remains below 9 W, demonstrating high real-time capability and energy efficiency.
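The DIOU-NMS step mentioned above can be illustrated with a short sketch. It uses the standard Distance-IoU definition (IoU minus the squared centre distance divided by the squared diagonal of the smallest enclosing box). The box format `(x1, y1, x2, y2)`, the 0.5 threshold, and the function names are assumptions for illustration and are not taken from the paper.

```python
def iou(a, b):
    """Intersection over union of two boxes given as (x1, y1, x2, y2)."""
    iw = max(0.0, min(a[2], b[2]) - max(a[0], b[0]))
    ih = max(0.0, min(a[3], b[3]) - max(a[1], b[1]))
    inter = iw * ih
    union = (a[2] - a[0]) * (a[3] - a[1]) + (b[2] - b[0]) * (b[3] - b[1]) - inter
    return inter / union if union > 0 else 0.0


def diou(a, b):
    """Distance-IoU: IoU penalised by the normalised distance between box centres."""
    d2 = ((a[0] + a[2]) / 2 - (b[0] + b[2]) / 2) ** 2 + \
         ((a[1] + a[3]) / 2 - (b[1] + b[3]) / 2) ** 2
    # Squared diagonal of the smallest box enclosing both inputs.
    c2 = (max(a[2], b[2]) - min(a[0], b[0])) ** 2 + \
         (max(a[3], b[3]) - min(a[1], b[1])) ** 2
    return iou(a, b) - (d2 / c2 if c2 > 0 else 0.0)


def diou_nms(boxes, scores, threshold=0.5):
    """Greedy NMS that suppresses a box when its DIoU with a kept box exceeds threshold."""
    order = sorted(range(len(boxes)), key=lambda i: scores[i], reverse=True)
    keep = []
    for i in order:
        if all(diou(boxes[i], boxes[k]) <= threshold for k in keep):
            keep.append(i)
    return keep


boxes = [(0, 0, 10, 10), (1, 0, 11, 10), (20, 20, 30, 30)]
print(diou_nms(boxes, [0.9, 0.8, 0.7]))  # [0, 2]
```

Because the centre-distance penalty lowers the score of boxes whose centres are far apart, two heavily overlapping boxes around different occluded objects are less likely to suppress each other than under plain IoU.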
Ferroptosis has exhibited great potential in therapies, and intracellular reducing agents of sulfur species (RSSs) in the thiol-dependent redox systems are crucial in ferroptosis. This makes the simultaneous detection of multiple RSSs significant for evaluating ferroptosis therapy. However, traditional techniques, including fluorescent (FL) imaging and electrospray ionization-based mass spectrometry (MS) detection, cannot discriminate between different RSSs. Herein, simultaneous MS detection of multiple RSSs, including cysteine (Cys), homocysteine (Hcy), glutathione (GSH), and hydrogen sulfide (H2S), was achieved by enhancing ionization efficiency with a fluorescent probe (NBD-O-1). Through the interaction between NBD-O-1 and RSSs, a complex of each RSS with a fragment of NBD-O-1 is generated, which is easily ionized for MS detection in the negative mode. Therefore, intracellular RSSs can be readily detected after incubating HeLa cells with the NBD-O-1 probe: FL imaging shows the total RSS level, and enhanced MS detection further provides the level of each RSS. Furthermore, the RSSs during ferroptosis in HeLa cells were evaluated using the present strategy, demonstrating its potential for ferroptosis examinations. This work makes unconventional use of a fluorescent probe to enhance the detection of multiple RSSs by MS, providing significant molecular information for elucidating the ferroptosis mechanism.
Funding: Funded by the Hunan Provincial Natural Science Foundation of China (Grant No. 2025JJ70105) and the Hunan Provincial College Students' Innovation and Entrepreneurship Training Program (Project No. S202411342056). The article processing charge (APC) was funded by Project No. 2025JJ70105.
Abstract: With the widespread use of social media, the propagation of health-related rumors has become a significant public health threat. Existing methods for detecting health rumors predominantly rely on external knowledge or propagation structures, with only a few recent approaches attempting causal inference; however, these have not yet effectively integrated causal discovery with domain-specific knowledge graphs. In this study, we found that the combined use of causal discovery and domain-specific knowledge graphs can effectively identify implicit pseudo-causal logic embedded within texts, holding significant potential for health rumor detection. To this end, we propose CKDG, a dual-graph fusion framework based on causal logic and medical knowledge graphs. CKDG constructs a weighted causal graph to capture the implicit causal relationships in the text and introduces a medical knowledge graph to verify semantic consistency, thereby enhancing the ability to identify the misuse of professional terminology and pseudoscientific claims. In experiments conducted on a dataset comprising 8430 health rumors, CKDG achieved an accuracy of 91.28% and an F1 score of 90.38%, representing improvements of 5.11% and 3.29% over the best baseline, respectively. Our results indicate that the integrated use of causal discovery and domain-specific knowledge graphs offers significant advantages for health rumor detection systems. This method not only improves detection performance but also enhances the transparency and credibility of model decisions by tracing causal chains and sources of knowledge conflicts. We anticipate that this work will provide key technological support for the development of trustworthy health-information filtering systems, thereby improving the reliability of public health information on social media.
Funding: Supported by grants from the Human Resources Development program (Grant No. 20204010600250) and the Training Program of CCUS for the Green Growth (Grant No. 20214000000500) by the Korea Institute of Energy Technology Evaluation and Planning (KETEP), funded by the Ministry of Trade, Industry, and Energy of the Korean Government (MOTIE).
Abstract: Obtaining precise trace data is of great importance, as traces are frequently the sole visible and measurable parameter in most outcrops. Manual recognition and detection of traces on high-resolution three-dimensional (3D) models is relatively straightforward but time-consuming. One potential solution is to use machine learning algorithms to detect the 3D traces. In this study, a unique pixel-wise texture mapper algorithm generates a dense point cloud representation of an outcrop at the precise resolution of the original textured 3D model. Virtual digital image rendering was then employed to capture virtual images of selected regions. This technique helps to overcome limitations caused by the surface morphology of the rock mass, such as restricted access, lighting conditions, and shading effects. After AI-powered trace detection on two-dimensional (2D) images, a 3D data structuring technique was applied to the selected trace pixels: the trace data were structured through 2D thinning, 3D reprojection, clustering, segmentation, and segment linking. Finally, the linked segments were exported as 3D polylines, with each polyline in the output corresponding to a trace. The efficacy of the proposed method was assessed on a 3D model from a real-world case study, which was used to compare the results of artificial intelligence (AI)-aided and manual trace detection. Rosette diagrams, which visualize the distribution of trace orientations, confirmed the high similarity between the automatically and manually generated trace maps. In conclusion, the proposed semi-automatic method was easy to use, fast, and accurate in detecting the dominant jointing system of the rock mass.
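The rosette diagrams mentioned above are, at their core, histograms of trace orientations. The sketch below bins 2D trace segments by azimuth; it assumes segments have already been projected to map coordinates (x east, y north), and the function name and 10-degree bin width are illustrative choices rather than details from the study.

```python
import math


def orientation_histogram(segments, bin_deg=10):
    """Count trace segments per azimuth bin.

    Azimuth is measured clockwise from north and folded into [0, 180),
    since a trace has no direction. bin_deg is assumed to divide 180.
    """
    nbins = 180 // bin_deg
    counts = [0] * nbins
    for (x1, y1), (x2, y2) in segments:
        dx, dy = x2 - x1, y2 - y1
        if dx == 0 and dy == 0:
            continue  # skip zero-length segments
        azimuth = math.degrees(math.atan2(dx, dy)) % 180.0
        # The final modulo guards against rounding that lands exactly on 180.
        counts[int(azimuth // bin_deg) % nbins] += 1
    return counts


segments = [((0, 0), (0, 1)), ((0, 0), (1, 0)), ((0, 0), (-1, 0)), ((0, 0), (1, 1))]
hist = orientation_histogram(segments)
print(hist[0], hist[4], hist[9])  # 1 1 2
```

Comparing two such histograms, one from automatic detection and one from manual mapping, is one simple way to quantify the similarity that the rosette diagrams show visually. Weighting each segment by its length instead of counting it once is a common variant.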
Funding: Supported by Funding for Clinical Trials from the Nanjing Drum Tower Hospital, Affiliated Hospital of Medical School, Nanjing University, No. 2021-LCYJ-MS-11.
Abstract: Esophageal cancer (EC), a common malignant tumor of the digestive tract, requires early diagnosis and timely treatment to improve patient prognosis. Automated detection of EC using medical imaging has the potential to increase screening efficiency and diagnostic accuracy, thereby significantly improving long-term survival rates and the quality of life of patients. Recent advances in deep learning (DL), particularly convolutional neural networks, have demonstrated remarkable performance in medical imaging analysis. These techniques have shown significant progress in the automated identification of malignant tumors, quantitative analysis of lesions, and improvement in diagnostic accuracy and efficiency. This article comprehensively examines the research progress of DL in medical imaging for EC, covering various imaging modalities such as digital pathology, endoscopy, and computed tomography. It explores the clinical value and application prospects of DL in EC screening and diagnosis. Additionally, the article addresses several critical challenges that must be overcome for the clinical translation of DL techniques, including constructing high-quality datasets, promoting multimodal feature fusion, and optimizing the integration of artificial intelligence into clinical workflows. By providing a detailed overview of the current state of DL in EC imaging and highlighting the key challenges and future directions, this article aims to guide future research and facilitate the clinical implementation of DL technologies in EC management, ultimately contributing to better patient outcomes.
Funding: The National Key Research and Development Program of China (Grant No. 2022YFF0711400) and the National Space Science Data Center Youth Open Project (Grant No. NSSDC2302001).
Abstract: Impact craters are important for understanding lunar geologic evolution and surface erosion rates, among other functions. However, micro impact craters are numerous and their morphological characteristics are not obvious, resulting in low detection accuracy by deep learning models. Therefore, we proposed a new multi-scale fusion crater detection algorithm (MSF-CDA) based on YOLO11 to improve the accuracy of lunar impact crater detection, especially for small craters with a diameter of <1 km. Using images taken by the LROC (Lunar Reconnaissance Orbiter Camera) at the Chang'e-4 (CE-4) landing area, we constructed three separate datasets for craters with diameters of 0-70 m, 70-140 m, and >140 m. We then trained three submodels separately with these three datasets. Additionally, we designed a slicing-amplifying-slicing strategy to enhance the ability to extract features from small craters. To handle redundant predictions, we proposed a new Non-Maximum Suppression with Area Filtering method to fuse the results for overlapping targets across the multi-scale submodels. Our new MSF-CDA method achieved high detection performance, with Precision, Recall, and F1 score values of 0.991, 0.987, and 0.989, respectively, effectively addressing the problems induced by the limited features and sample imbalance of small craters. MSF-CDA can provide strong data support for more in-depth study of the geological evolution of the lunar surface and finer geological age estimations. This strategy can also be used to detect other small objects with limited features and sample imbalance problems. We detected approximately 500,000 impact craters in an area of approximately 214 km² around the CE-4 landing area. By statistically analyzing the new data, we updated the distribution function of the number and diameter of impact craters. Finally, we identified the most suitable lighting conditions for detecting impact crater targets by analyzing the effect of different lighting conditions on the detection accuracy.
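As a quick consistency check on the reported metrics, the F1 score is the harmonic mean of precision and recall. The short sketch below recomputes it from the two reported values; the function name is illustrative.

```python
def f1_score(precision, recall):
    """Harmonic mean of precision and recall; 0.0 when both are zero."""
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


# Precision and recall as reported in the abstract.
print(round(f1_score(0.991, 0.987), 3))  # 0.989
```

The result agrees with the reported F1 score of 0.989.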
Funding: Fundamental Research Funds for the Central Universities (2023MS134).
Abstract: In modern industrial applications, ensuring the reliability of mechanical fittings is critical for maintaining operational safety and efficiency, particularly in power grid systems, where split pins play a pivotal role despite being susceptible to environmental degradation and failure. Existing UAV-based inspection systems are hampered by a low representation of split pin elements and complex backgrounds, leading to challenges in accurate fault detection and timely maintenance. To address this issue, our study proposes a fault detection method for split pins. The approach employs a three-step process: first, cropping operations are used to accurately isolate the fittings containing split pins; second, super-resolution reconstruction is applied to enhance image clarity and detail; and finally, an improved YOLOv8 network, augmented with inner-shape IoU and local window attention mechanisms, is utilized to refine local feature extraction and annotation accuracy. Experimental evaluations on a split pin defect dataset demonstrate robust performance, achieving an accuracy rate of 72.1% and a mean average precision (mAP) of 67.7%, thereby validating the method's effectiveness under challenging conditions. The proposed approach contributes to the field by specifically targeting the challenges associated with split pin detection in UAV-based inspections, offering a practically applicable and reliably precise method.
Funding: Supported in part by the National Key Research and Development Project under Grant 2020YFB1806805 and partially funded through a grant from Qualcomm.
Abstract: 6G is expected to support more intelligent networks, and this trend makes self-healing capability important when degradation emerges in cellular networks. As a primary component of self-healing networks, fault detection is investigated in this paper. Given its fast response and low time and computational cost, the Online Broad Learning System (OBLS) is applied, for the first time, to identify outages in cellular networks. In addition, the Automatic-constructed Online Broad Learning System (AOBLS) is put forward to rationalize its structure and consequently avoid over-fitting and under-fitting. Furthermore, a multi-layer classification structure is proposed to further improve classification performance. To face the challenges caused by imbalanced data in fault detection problems, a novel weighting strategy is derived to obtain the Multilayer Automatic-constructed Weighted Online Broad Learning System (MAWOBLS), which is combined through ensemble learning with a retrained Support Vector Machine (SVM), denoted EMAWOBLS, to better handle this imbalance. Simulation results show that the proposed algorithm detects faults with excellent performance and satisfactory time usage.
Funding: Supported by the National Key Research and Development Program of China (2023YFF0906100), the National Natural Science Foundation of China (52408008), and the Key Research and Development Program of Jiangsu Province (BE2022833).
Abstract: Current damage detection methods based on model updating and sensitivity Jacobian matrices show a low convergence ratio and low computational efficiency for online calculations. The aim of this paper is to construct a real-time automated damage detection method by developing a theory-assisted adaptive multiagent twin delayed deep deterministic (TA2-MATD3) policy gradient algorithm. First, the theoretical framework of reinforcement-learning-driven damage detection is established. To address the disadvantages of the traditional multiagent twin delayed deep deterministic (MATD3) method, a theory-assisted mechanism and an adaptive experience replay mechanism are introduced. Then, a historical residential house built in 1889 was taken as an example, using its 12-month structural health monitoring data. TA2-MATD3 was compared with existing damage detection methods in terms of the convergence ratio, online computing efficiency, and damage detection accuracy. The results show that the computational efficiency of TA2-MATD3 is approximately 117–160 times that of the traditional methods. The convergence ratio of damage detection on the training set is approximately 97%, and that on the test set is in the range of 86.2%–91.9%. In addition, the main apparent damages found in the field survey were identified by TA2-MATD3. The results indicate that the proposed method can significantly improve online computing efficiency and damage detection accuracy. This research can provide novel perspectives on the use of reinforcement learning methods for damage detection in online structural health monitoring.
Funding: Supported by the Deanship of Graduate Studies and Scientific Research at Jouf University under grant No. (DGSSR-2025-02-01276).
Abstract: The rapid proliferation of Internet of Things (IoT) devices in critical healthcare infrastructure has introduced significant security and privacy challenges that demand innovative, distributed architectural solutions. This paper proposes FE-ACS (Fog-Edge Adaptive Cybersecurity System), a novel hierarchical security framework that intelligently distributes AI-powered anomaly detection algorithms across edge, fog, and cloud layers to optimize security efficacy, latency, and privacy. Our comprehensive evaluation demonstrates that FE-ACS achieves superior detection performance with an AUC-ROC of 0.985 and an F1-score of 0.923, while maintaining significantly lower end-to-end latency (18.7 ms) compared to cloud-centric (152.3 ms) and fog-only (34.5 ms) architectures. The system exhibits exceptional scalability, supporting up to 38,000 devices with logarithmic performance degradation, a 67× improvement over conventional cloud-based approaches. By incorporating differential privacy mechanisms with balanced privacy-utility tradeoffs (ε = 1.0–1.5), FE-ACS maintains 90%–93% detection accuracy while ensuring strong privacy guarantees for sensitive healthcare data. Computational efficiency analysis reveals that our architecture achieves a detection rate of 12,400 events per second with only 12.3 mJ energy consumption per inference. In healthcare risk assessment, FE-ACS demonstrates robust operational viability with low patient safety risk (14.7%) and high system reliability (94.0%). The proposed framework represents a significant advancement in distributed security architectures, offering a scalable, privacy-preserving, and real-time solution for protecting healthcare IoT ecosystems against evolving cyber threats.
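The abstract does not state which differential privacy mechanism FE-ACS uses. As an illustration of what the privacy budget ε controls, the sketch below assumes the classic Laplace mechanism, in which noise with scale sensitivity/ε is added to a numeric value before release. All names and the example value are hypothetical.

```python
import random


def laplace_noise(scale, rng):
    """Draw Laplace(0, scale) noise as the difference of two exponentials with mean `scale`."""
    return rng.expovariate(1.0 / scale) - rng.expovariate(1.0 / scale)


def privatize(value, sensitivity, epsilon, rng):
    """Laplace mechanism (assumed here, not confirmed by the source): smaller epsilon, more noise."""
    return value + laplace_noise(sensitivity / epsilon, rng)


rng = random.Random(0)
heart_rate = 72.0  # hypothetical sensor reading with assumed sensitivity 1.0
for eps in (1.0, 1.5):
    noisy = [privatize(heart_rate, 1.0, eps, rng) for _ in range(10000)]
    mean_abs_error = sum(abs(v - heart_rate) for v in noisy) / len(noisy)
    print(f"epsilon={eps}: mean absolute error is about {mean_abs_error:.2f}")
```

The expected absolute error equals the noise scale, so moving ε from 1.0 to 1.5 reduces the average perturbation from about 1.0 to about 0.67 units in this toy setting. This is the privacy-utility tradeoff the abstract refers to.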
Abstract: Breast cancer screening programs rely heavily on mammography for early detection; however, diagnostic performance is strongly affected by inter-reader variability, breast density, and the limitations of conventional computer-aided detection systems. Recent advances in deep learning have enabled more robust and scalable solutions for large-scale screening, yet systematic comparisons of modern object detection architectures on nationally representative datasets remain limited. This study presents a comprehensive quantitative comparison of prominent deep learning–based object detection architectures for Artificial Intelligence-assisted mammography analysis using the MammosighTR dataset, developed within the Turkish National Breast Cancer Screening Program. The dataset comprises 12,740 patient cases collected between 2016 and 2022, annotated with BI-RADS categories, breast density levels, and lesion localization labels. A total of 31 models were evaluated, including One-Stage, Two-Stage, and Transformer-based architectures, under a unified experimental framework at both patient and breast levels. The results demonstrate that Two-Stage architectures consistently outperform One-Stage models, achieving approximately 2%–4% higher Macro F1-Scores and more balanced precision–recall trade-offs, with Double-Head R-CNN and Dynamic R-CNN yielding the highest overall performance (Macro F1 ≈ 0.84–0.86). This advantage is primarily attributed to the region proposal mechanism and improved class balance inherent to Two-Stage designs. One-Stage detectors exhibited higher sensitivity and faster inference, reaching Recall values above 0.88, but experienced minor reductions in Precision and overall accuracy (≈1%–2%) compared with Two-Stage models. Among Transformer-based architectures, Deformable DEtection TRansformer demonstrated strong robustness and consistency across datasets, achieving Macro F1-Scores comparable to CNN-based detectors (≈0.83–0.85) while exhibiting minimal performance degradation under distributional shifts. Breast density–based analysis revealed increased misclassification rates in medium-density categories (types B and C), whereas Transformer-based architectures maintained more stable performance in high-density type D tissue. These findings quantitatively confirm that both architectural design and tissue characteristics play a decisive role in diagnostic accuracy. Overall, the study provides a reproducible benchmark and highlights the potential of hybrid approaches that combine the accuracy of Two-Stage detectors with the contextual modeling capability of Transformer architectures for clinically reliable breast cancer screening systems.
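The Macro F1-Score used for the comparison above averages per-class F1 values with equal weight, so rare classes count as much as common ones. A minimal sketch follows; the labels are toy values and the function name is illustrative, not the study's evaluation code.

```python
def macro_f1(y_true, y_pred):
    """Unweighted mean of per-class F1 scores over all classes seen in either list."""
    classes = sorted(set(y_true) | set(y_pred))
    f1_values = []
    for c in classes:
        tp = sum(t == c and p == c for t, p in zip(y_true, y_pred))
        fp = sum(t != c and p == c for t, p in zip(y_true, y_pred))
        fn = sum(t == c and p != c for t, p in zip(y_true, y_pred))
        denom = 2 * tp + fp + fn
        # F1 = 2TP / (2TP + FP + FN); defined as 0 when the class never appears.
        f1_values.append(2 * tp / denom if denom else 0.0)
    return sum(f1_values) / len(f1_values)


# Toy example: four negatives (0) and two positives (1), with one false positive.
y_true = [0, 0, 0, 0, 1, 1]
y_pred = [0, 0, 0, 1, 1, 1]
print(round(macro_f1(y_true, y_pred), 4))  # 0.8286
```

In the toy example the per-class F1 values are 6/7 and 4/5, and their unweighted mean is about 0.8286. Because each class contributes equally, the metric is well suited to imbalanced screening data, where an accuracy figure can be dominated by the majority class.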
Funding: Funded by the Key Research and Development Program of Henan Province (No. 251111211200) and the National Natural Science Foundation of China (Grant No. U2004163).
Abstract: Traffic sign detection is an important part of autonomous driving, and its recognition accuracy and speed are directly related to road traffic safety. Although convolutional neural networks (CNNs) have made certain breakthroughs in this field, traffic sign detection in complex scenes, such as image blur and target occlusion, continues to exhibit limited accuracy, accompanied by false positives and missed detections. To address these problems, a traffic sign detection algorithm, You Only Look Once-based Skip Dynamic Way (YOLO-SDW), built on You Only Look Once version 8 small (YOLOv8s), is proposed. First, a Skip Connection Reconstruction (SCR) module is introduced to efficiently integrate fine-grained feature information and enhance the detection accuracy of the algorithm in complex scenes. Second, a C2f module based on Dynamic Snake Convolution (C2f-DySnake) is proposed to dynamically adjust the receptive field information, improve the algorithm's feature extraction ability for blurred or occluded targets, and reduce the occurrence of false detections and missed detections. Finally, the Wise Powerful IoU v2 (WPIoUv2) loss function is proposed to further improve the detection accuracy of the algorithm. Experimental results show that the mAP@0.5 of YOLO-SDW on the TT100K dataset is 89.2% and its mAP@0.5:0.95 is 68.5%, which are 4% and 3.3% higher than the YOLOv8s baseline, respectively. YOLO-SDW thus achieves higher accuracy while maintaining real-time performance.
Funding: The National Social Science Foundation Youth Project of China: Research on the collaborative governance path of administrative law and criminal law against dangerous driving behaviors in the digital-intelligent society (25CFX108).
Abstract: With the continuous progress of automatic driving technology, automatic driving technology standards are gradually affecting the determination of criminal responsibility for traffic accidents in China. At present, China's automatic driving technology standards are characterized by high policy relevance coexisting with low normative binding force, professionalism coexisting with barriers, and a cutting-edge nature coexisting with ambiguity. Determining criminal responsibility on the basis of these standards therefore poses both theoretical and practical challenges. In theory, the following misunderstandings should be clarified: the legal order under automatic driving technology standards is constitutional and systematic, and a balance exists between the cutting-edge development of automatic driving technology and the lag of criminal law. A risk-level system for automatic driving technology should be built to clarify the boundary of the effectiveness of criminal law norms. In application, this essay seeks a breakthrough by establishing a comprehensive judgment system for risks and accidents together with a system of evidence and proof, so as to clarify the determination of criminal responsibility under automatic driving technology standards.
Funding: Funded by the Joint Funds of the National Natural Science Foundation of China (U2341223) and the Beijing Municipal Natural Science Foundation (No. 4232067).
Abstract: In printed circuit board (PCB) manufacturing, surface defects can significantly affect product quality. To address the performance degradation, high false detection rates, and missed detections caused by complex backgrounds in current intelligent inspection algorithms, this paper proposes CG-YOLOv8, a lightweight and improved model based on YOLOv8n for PCB surface defect detection. The proposed method optimizes the network architecture and compresses parameters to reduce model complexity while maintaining high detection accuracy, thereby enhancing the capability of identifying diverse defects under complex conditions. Specifically, a cascaded multi-receptive field (CMRF) module is adopted to replace the SPPF module in the backbone to improve feature perception, and an inverted residual mobile block (IRMB) is integrated into the C2f module to further enhance performance. Additionally, conventional convolution layers are replaced with Grouped Spatial Convolution (GSConv) to reduce computational cost, and a lightweight Convolutional Block Attention Module based Convolution (CBAMConv) module is introduced after GSConv to preserve accuracy through attention mechanisms. The detection head is also optimized by removing the medium- and large-scale detection layers, thereby enhancing the model's ability to detect small-scale defects and further reducing complexity. Experimental results show that, compared to the original YOLOv8n, the proposed CG-YOLOv8 reduces parameter count by 53.9%, improves mAP@0.5 by 2.2%, and increases precision and recall by 2.0% and 1.8%, respectively. These improvements demonstrate that CG-YOLOv8 offers an efficient and lightweight solution for PCB surface defect detection.
Abstract: This study proposes a lightweight rice disease detection model optimized for edge computing environments. The goal is to enhance the You Only Look Once (YOLO) v5 architecture to achieve a balance between real-time diagnostic performance and computational efficiency. To this end, a total of 3234 high-resolution images (2400×1080) were collected of three major rice diseases (Rice Blast, Bacterial Blight, and Brown Spot) frequently found in actual rice cultivation fields. These images served as the training dataset. The proposed YOLOv5-V2 model removes the Focus layer from the original YOLOv5s and integrates ShuffleNet V2 into the backbone, resulting in both model compression and improved inference speed. Additionally, YOLOv5-P, based on PP-PicoDet, was configured as a comparative model to quantitatively evaluate performance. Experimental results demonstrated that YOLOv5-V2 achieved excellent detection performance, with an mAP@0.5 of 89.6%, mAP@0.5–0.95 of 66.7%, precision of 91.3%, and recall of 85.6%, while maintaining a lightweight model size of 6.45 MB. In contrast, YOLOv5-P had a smaller model size of 4.03 MB but showed lower performance, with an mAP@0.5 of 70.3%, mAP@0.5–0.95 of 35.2%, precision of 62.3%, and recall of 74.1%. This study lays a technical foundation for the implementation of smart agriculture and real-time disease diagnosis systems by proposing a model that satisfies both accuracy and lightweight requirements.
Funding: Supported by the National Science and Technology Council, Taiwan, with grant numbers NSTC 114-2622-8-992-007-TD1 and 112-2811-E-992-003-MY3.
Abstract: The global population is rapidly expanding, driving an increasing demand for intelligent healthcare systems. Artificial intelligence (AI) applications in remote patient monitoring and diagnosis have achieved remarkable progress and are emerging as a major development trend. Among these applications, mouth motion tracking and mouth-state detection represent an important direction, providing valuable support for diagnosing neuromuscular disorders such as dysphagia, Bell's palsy, and Parkinson's disease. In this study, we focus on developing a real-time system capable of monitoring and detecting mouth state that can be efficiently deployed on edge devices. The proposed system integrates the Facial Landmark Detection technique with an optimized model combining a Bidirectional Gated Recurrent Unit (BiGRU) and Comprehensive Learning Particle Swarm Optimization (CLPSO). We conducted a comprehensive comparison and evaluation of the proposed model against several traditional models using multiple performance metrics, including accuracy, precision, recall, F1-score, cosine similarity, ROC–AUC, and the precision–recall curve. The proposed method achieved an accuracy of 96.57% with a precision of 98.25% on our self-collected dataset, outperforming traditional models and related works in the same field. These findings highlight the potential of the proposed approach for implementation in real-time patient monitoring systems, contributing to improved diagnostic accuracy and supporting healthcare professionals in patient treatment and care.
Funding: Financially supported by the National Key Research and Development Program of China (2022YFB3706802).
Abstract: Automation and intelligence have become the primary trends in the design of investment casting processes. However, the design of gating and riser systems still lacks precise quantitative evaluation criteria. Numerical simulation plays a significant role in quantitatively evaluating current processes and making targeted improvements, but it cannot dynamically reflect the formation outcomes of castings under varying process conditions, which makes real-time adjustments to gating and riser designs challenging. In this study, an automated design model for gating and riser systems based on an integrated parametric 3D modeling-simulation framework is proposed, which enhances the flexibility and usability of evaluating the casting process by simulation. First, geometric feature extraction technology is employed to obtain the geometric information of the target casting. Based on this information, an automated design framework for gating and riser systems is established, incorporating multiple structural parameters for real-time process control. Subsequently, the simulation results for various structural parameters are analyzed, and the influence of these parameters on casting formation is thoroughly investigated. Finally, the optimal design scheme is generated and validated through experimental verification. Simulation analysis and experimental results show that using a larger gate neck (24 mm in side length) and external risers promotes a more uniform temperature distribution and a more stable flow state, effectively eliminating shrinkage cavities and enhancing process yield by 15%.
Abstract: Modern manufacturing processes have become more reliant on automation because of the accelerated transition from Industry 3.0 to Industry 4.0. Manual inspection of products on assembly lines remains inefficient, error-prone, and inconsistent, emphasizing the need for a reliable and automated inspection system. Leveraging both object detection and image segmentation approaches, this research proposes a vision-based solution for detecting the various kinds of tools in a toolkit using deep learning (DL) models. Two Intel RealSense D455f depth cameras were arranged in a top-down configuration to capture both RGB and depth images of the toolkits. After applying multiple constraints and enhancing the images through preprocessing and augmentation, a dataset consisting of 3300 annotated RGB-D images was generated. Several DL models were selected through a comprehensive assessment of mean Average Precision (mAP), precision-recall equilibrium, inference latency (target ≥30 FPS), and computational burden, resulting in a preference for YOLO and Region-based Convolutional Neural Network (R-CNN) variants over ViT-based models due to the latter’s increased latency and resource requirements. YOLOv5, YOLOv8, YOLOv11, Faster R-CNN, and Mask R-CNN were trained on the annotated dataset and evaluated using key performance metrics (Recall, Accuracy, F1-score, and Precision). YOLOv11 demonstrated balanced excellence with 93.0% precision, 89.9% recall, and a 90.6% F1-score in object detection, as well as 96.9% precision, 95.3% recall, and a 96.5% F1-score in instance segmentation, with an average inference time of 25 ms per frame (≈40 FPS), demonstrating real-time performance. Leveraging these results, a YOLOv11-based Windows application was successfully deployed in a real-time assembly line environment, where it accurately processed live video streams to detect and segment tools within toolkits, demonstrating its practical effectiveness in industrial automation. In addition to detection and segmentation, the application can precisely measure socket dimensions by applying edge detection techniques to YOLOv11 segmentation masks. This enables specification-level quality control directly on the assembly line and improves real-time inspection capability. The implementation is a significant step forward for intelligent manufacturing in the Industry 4.0 paradigm, providing a scalable, efficient, and accurate approach to automated inspection and dimensional verification.
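The latency figures in the abstract above can be checked with a small arithmetic sketch. The function names are hypothetical and only illustrate how a per-frame inference time maps to a frame rate and to the stated ≥30 FPS target; they are not taken from the authors' application.

```python
def fps_from_latency(latency_ms: float) -> float:
    """Convert an average per-frame inference time (ms) to frames per second."""
    return 1000.0 / latency_ms


def meets_realtime_target(latency_ms: float, target_fps: float = 30.0) -> bool:
    """Check a latency against a frame-rate target (30 FPS is the target named in the abstract)."""
    return fps_from_latency(latency_ms) >= target_fps


if __name__ == "__main__":
    # 25 ms per frame is the average inference time reported for YOLOv11.
    print(fps_from_latency(25.0))
    print(meets_realtime_target(25.0))
```

Under this conversion, any model slower than roughly 33.3 ms per frame would miss the 30 FPS target.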
Funding: Supported by project ZR2022MF330 of the Shandong Provincial Natural Science Foundation and by the National Natural Science Foundation of China under Grant No. 61701286.
Abstract: Synthetic speech detection is an essential task in the field of voice security, aimed at identifying deceptive voice attacks generated by text-to-speech (TTS) or voice conversion (VC) systems. In this paper, we propose a synthetic speech detection model called TFTransformer, which integrates both local and global features to enhance detection capabilities by effectively modeling local and global dependencies. Structurally, the model is divided into two main components: a front-end and a back-end. The front-end uses a combination of SincLayer and two-dimensional (2D) convolution to extract high-level feature maps (HFM) containing the local dependencies of the input speech signals. The back-end uses a time-frequency Transformer module to process these feature maps and further capture global dependencies. Furthermore, we propose TFTransformer-SE, which incorporates a channel attention mechanism within the 2D convolutional blocks. This enhancement aims to capture local dependencies more effectively, thereby improving the model’s performance. The experiments were conducted on the ASVspoof 2021 LA dataset, and the results showed that the model achieved an equal error rate (EER) of 3.37% without data augmentation. Additionally, we evaluated the model on the ASVspoof 2019 LA dataset, achieving an EER of 0.84%, also without data augmentation. This demonstrates that combining local and global dependencies in the time-frequency domain can significantly improve detection accuracy.
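For readers unfamiliar with the equal error rate (EER) metric reported above, the following is a minimal sketch of one common way to approximate it from detector scores. It assumes higher scores mean "more likely bona fide" and uses a simple threshold sweep; the function name and the toy scores are illustrative, and this is not the official ASVspoof scoring tool.

```python
def equal_error_rate(bonafide_scores, spoof_scores):
    """Approximate the EER by sweeping thresholds over all observed scores.

    Assumption: a higher score means the utterance is more likely bona fide.
    Returns the mean of FAR and FRR at the threshold where they are closest.
    """
    thresholds = sorted(set(bonafide_scores) | set(spoof_scores))
    best_gap, eer = float("inf"), 1.0
    for t in thresholds:
        # False rejection rate: bona fide speech scored below the threshold.
        frr = sum(s < t for s in bonafide_scores) / len(bonafide_scores)
        # False acceptance rate: spoofed speech scored at or above the threshold.
        far = sum(s >= t for s in spoof_scores) / len(spoof_scores)
        gap = abs(far - frr)
        if gap < best_gap:
            best_gap, eer = gap, (far + frr) / 2
    return eer


if __name__ == "__main__":
    # Toy scores (made up for illustration only).
    bonafide = [0.9, 0.8, 0.7, 0.4]
    spoof = [0.6, 0.3, 0.2, 0.1]
    print(equal_error_rate(bonafide, spoof))
```

A lower EER means the two score distributions overlap less, so a perfectly separating detector would reach an EER of 0.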
Funding: Funded by the Natural Science Foundation of Shanghai Municipality (No. 21ZR1440500) and the Shanghai Science and Technology Commission (Grant No. 21S31902700).
Abstract: In recent years, the rapid advancement of artificial intelligence (AI) technology has enabled AI-assisted negative screening to significantly enhance physicians' efficiency through image feature analysis and multimodal data modeling, allowing them to focus more on diagnosing positive cases. Meanwhile, multispectral imaging (MSI) integrates spectral and spatial resolution to capture subtle tissue features invisible to the human eye, providing high-resolution data support for pathological analysis. Combining AI technology with MSI and employing quantitative methods to analyze multiband biomarkers (such as absorbance differences in keratin pearls) can effectively improve diagnostic specificity and reduce subjective errors in manual slide interpretation. To address the challenge of identifying negative tissue sections, we developed a discrimination algorithm powered by MSI. We demonstrated its efficacy using cutaneous squamous cell carcinoma (cSCC) as a representative case study. The algorithm achieved 100% accuracy in excluding negative cases and effectively mitigated the false-positive problem caused by cSCC heterogeneity. We constructed a multispectral image dataset acquired at wavelengths of 520 nm, 600 nm, and 630 nm. Subsequently, we employed an optimized MobileViT model for tissue classification and performed comparative analyses against other models. The experimental results showed that our optimized MobileViT model achieved superior performance in identifying negative tissue sections, with an accuracy of 100%. Thus, our results confirm the feasibility of integrating MSI with AI to exclude negative cases, offering a novel solution to alleviate the workload of pathologists.
Funding: Supported by the Key R&D Program of Shaanxi Province (No. 2025CYYBXM-078).
Abstract: To address the limited scale adaptation of autonomous driving object detection algorithms in low-illumination environments and their shortcomings in handling target occlusion, this paper proposes YOLO-LKSDS, an autonomous driving detection model. First, the Contrast-Limited Adaptive Histogram Equalisation (CLAHE) image enhancement algorithm is improved to increase image contrast and enhance the detailed features of the target. Then, on the basis of the YOLOv5 model, the K-means++ clustering algorithm is introduced to obtain suitable anchor boxes, and SPPELAN spatial pyramid pooling is improved to enhance the accuracy and robustness of the model for multi-scale target detection. Finally, an improved SEAM (Separated and Enhancement Attention Module) attention mechanism is combined with the DIoU-NMS algorithm to optimize the model’s performance in occluded and dense scenes. Compared with the original model, the improved YOLO-LKSDS model achieves a 13.3% improvement in accuracy, a 1.7% improvement in mAP, and 240,000 fewer parameters on the BDD100K dataset. To validate the generalization of the improved algorithm, we also ran experiments on the KITTI dataset, where accuracy improves by 21.1%, recall by 36.6%, and mAP50 by 29.5% relative to YOLOv5. Deployment of the algorithm was verified on an edge computing platform, where the average detection speed reaches 24.4 FPS while power consumption remains below 9 W, demonstrating high real-time capability and energy efficiency.
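The DIoU-NMS step mentioned above can be illustrated with a minimal sketch of the standard formulation, in which the suppression score is the IoU minus the squared center distance normalized by the squared diagonal of the smallest enclosing box. The box format `(x1, y1, x2, y2)`, the function names, and the 0.5 threshold are assumptions for illustration; the abstract does not give the paper's exact variant or settings.

```python
def diou(a, b):
    """Distance-IoU between two boxes given as (x1, y1, x2, y2)."""
    # Intersection area.
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    iou = inter / union if union > 0 else 0.0
    # Squared distance between box centers.
    d2 = ((a[0] + a[2]) / 2 - (b[0] + b[2]) / 2) ** 2 \
        + ((a[1] + a[3]) / 2 - (b[1] + b[3]) / 2) ** 2
    # Squared diagonal of the smallest box enclosing both.
    c2 = (max(a[2], b[2]) - min(a[0], b[0])) ** 2 \
        + (max(a[3], b[3]) - min(a[1], b[1])) ** 2
    return iou - d2 / c2 if c2 > 0 else iou


def diou_nms(boxes, scores, threshold=0.5):
    """Greedy NMS that suppresses a box when its DIoU with a kept box reaches the threshold."""
    order = sorted(range(len(boxes)), key=lambda i: scores[i], reverse=True)
    keep = []
    for i in order:
        if all(diou(boxes[i], boxes[k]) < threshold for k in keep):
            keep.append(i)
    return keep


if __name__ == "__main__":
    # Toy boxes: two heavily overlapping detections and one distant detection.
    boxes = [(0, 0, 10, 10), (1, 1, 11, 11), (20, 20, 30, 30)]
    scores = [0.9, 0.8, 0.7]
    print(diou_nms(boxes, scores))
```

Because the center-distance penalty lowers the score of overlapping boxes whose centers are far apart, this criterion is less likely than plain IoU to suppress a distinct but partially occluded object.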
Funding: Supported by the National Key Research and Development Program of China (No. 2024YFA1509600), the National Natural Science Foundation of China (Nos. 22474010 and 22274012), and the Fundamental Research Funds for the Central Universities (No. 2233300007).
Abstract: Ferroptosis has exhibited great potential in therapies, and intracellular reducing agents of sulfur species (RSSs) in the thiol-dependent redox systems are crucial in ferroptosis. This makes the simultaneous detection of multiple RSSs significant for evaluating ferroptosis therapy. However, traditional techniques, including fluorescent (FL) imaging and electrospray ionization-based mass spectrometry (MS) detection, cannot discriminate between different RSSs. Herein, simultaneous MS detection of multiple RSSs, including cysteine (Cys), homocysteine (Hcy), glutathione (GSH), and hydrogen sulfide (H₂S), was achieved by enhancing ionization efficiency with a fluorescent probe (NBD-O-1). Through the interaction between NBD-O-1 and RSSs, a complex of each RSS with a fragment of NBD-O-1 is generated, which is easily ionized for MS detection in the negative mode. Therefore, intracellular RSSs can be readily detected after incubating HeLa cells with the NBD-O-1 probe, with FL imaging showing the total RSS level and enhanced MS detection further providing the level of each individual RSS. Furthermore, the RSSs during ferroptosis in HeLa cells were evaluated using the present strategy, demonstrating its potential for ferroptosis examinations. This work makes unconventional use of a fluorescent probe to enhance the MS detection of multiple RSSs, providing significant molecular information for elucidating the ferroptosis mechanism.