An improved analytical design to investigate the static stiffness of a convoluted air spring is developed and presented in this article.An air spring provides improved ride comfort by achieving variable volume at vari...An improved analytical design to investigate the static stiffness of a convoluted air spring is developed and presented in this article.An air spring provides improved ride comfort by achieving variable volume at various strokes of the suspension.An analytical relation is derived to calculate the volume and the rate of change in the volume of the convoluted bellow with respect to various suspension heights.This expression is used in the equation to calculate the variable stiffness of the bellow.The obtained analytical characteristics are validated with a detailed experiment to test the static vertical stiffness of the air spring.The convoluted air bellow is tested in an Avery spring-testing apparatus for various loads.The bellow is modeled in the ABAQUS environment to perform finite element analysis(FEA)to understand and visualize the deflection of the bellow at various elevated internal pressures and external loads.The proposed air spring model is a fiber-reinforced rubber bellow enclosed between two metal plates.The Mooney-Rivlin material model was used to model the hyperelastic rubber material for FEA.From the results,it is observed that the experimental and analytical results match with a minor error of 7.54%.The derived relations and validations would provide design guidance at the developmental stage of air bellows.These expressions would also play a major role in designing an effective active air suspension system by accurately calculating the required stiffness at various loads.展开更多
The post-hole convolute(PHC),which is used to transport and combine the pulse power flux,is a key component in huge pulsed power generators.Current loss at the PHC is a challenging problem for researchers.To explore a...The post-hole convolute(PHC),which is used to transport and combine the pulse power flux,is a key component in huge pulsed power generators.Current loss at the PHC is a challenging problem for researchers.To explore a method of reducing the current loss,a single-hole PHC was designed for experiments on the current loss on the Qiang Guang I generator.The experimental results showed that the current loss at the single-hole PHC is related to the distance/between the vicinity of the cathode hole and the surface of the downstream side of the post.Meanwhile,a single-hole PHC with a blob cathode hole transmitted current more effectively than the PHC with a circle cathode hole.The relative current loss at the single-hole PHC with the cathode coaled w ith gold foil was about 30%-50% of that with the cathode coated with nickel and titanium foil.The gap closing speed was also obtained from the current waveforms in the experiments.The speed was 5.74-14.52 cmμs 1 which was different from the classical plasma expansion velocity of 3 cmμs 1.展开更多
We propose a novel fast numerical calculation method for the Rayleigh-Sommerfeld diffraction integral,which is developed based on the existing scaled convolution method.This approach enables fast cal-culations for gen...We propose a novel fast numerical calculation method for the Rayleigh-Sommerfeld diffraction integral,which is developed based on the existing scaled convolution method.This approach enables fast cal-culations for general cases of off-axis scenarios where the sampling intervals and numbers of the input and observation planes are unequal.Additionally,it allows for arbitrary adjustment of the sampling interval of the impulse response function,facilitating a manual trade-off between computational load and accuracy.The er-rors associated with this method,which is equivalent to interpolation,primarily arise from the discontinuities of the sampling matrix of the impulse response function on its boundaries of periodic extension.To address this issue,we propose the concept of the padding function and its construction method,and evaluate its ef-fectiveness in enhancing computational accuracy.The feasibility of the proposed method is verified by nu-merical simulation and compared with the direct integration DI-method in a simplified scenario.It shows that the proposed method has good computational accuracy for the general case where the sampling interval of the input and observation plane is not equal under non-near-field diffraction,and when the diffraction distance is large,although the computational accuracy of the proposed method cannot exceed that of the DI-method,the computational amount can be significantly reduced with almost no effect on the computational accuracy.This method provides a general numerical calculation scheme of diffraction in the non-near field case for areas such as computational holography.展开更多
Near-infrared image sensors are widely used in fields such as material identification,machine vision,and autonomous driving.Lead sulfide colloidal quantum dot-based infrared photodiodes can be integrated with sil⁃icon...Near-infrared image sensors are widely used in fields such as material identification,machine vision,and autonomous driving.Lead sulfide colloidal quantum dot-based infrared photodiodes can be integrated with sil⁃icon-based readout circuits in a single step.Based on this,we propose a photodiode based on an n-i-p structure,which removes the buffer layer and further simplifies the manufacturing process of quantum dot image sensors,thus reducing manufacturing costs.Additionally,for the noise complexity in quantum dot image sensors when capturing images,traditional denoising and non-uniformity methods often do not achieve optimal denoising re⁃sults.For the noise and stripe-type non-uniformity commonly encountered in infrared quantum dot detector imag⁃es,a network architecture has been developed that incorporates multiple key modules.This network combines channel attention and spatial attention mechanisms,dynamically adjusting the importance of feature maps to en⁃hance the ability to distinguish between noise and details.Meanwhile,the residual dense feature fusion module further improves the network's ability to process complex image structures through hierarchical feature extraction and fusion.Furthermore,the pyramid pooling module effectively captures information at different scales,improv⁃ing the network's multi-scale feature representation ability.Through the collaborative effect of these modules,the network can better handle various mixed noise and image non-uniformity issues.Experimental results show that it outperforms the traditional U-Net network in denoising and image correction tasks.展开更多
The existence of absorption and reflection of light underwater leads to problems such as color distortion and blue-green bias in underwater images.In this study,a depthwise separable convolution-based generative adver...The existence of absorption and reflection of light underwater leads to problems such as color distortion and blue-green bias in underwater images.In this study,a depthwise separable convolution-based generative adversarial network(GAN)algorithm was proposed.Taking GAN as the basic framework,it combined a depthwise separable convolution module,attention mechanism,and reconstructed convolution module to realize the enhancement of underwater degraded images.Multi-scale features were captured by the depthwise separable convolution module,and the attention mechanism was utilized to enhance attention to important features.The reconstructed convolution module further extracts and fuses local and global features.Experimental results showed that the algorithm performs well in improving the color bias and blurring of underwater images,with PSNR reaching 27.835,SSIM reaching 0.883,UIQM reaching 3.205,and UCIQE reaching 0.713.The enhanced image outperforms the comparison algorithm in both subjective and objective metrics.展开更多
Dear Editor,D2This letter presents a node feature similarity preserving graph convolutional framework P G.Graph neural networks(GNNs)have garnered significant attention for their efficacy in learning graph representat...Dear Editor,D2This letter presents a node feature similarity preserving graph convolutional framework P G.Graph neural networks(GNNs)have garnered significant attention for their efficacy in learning graph representations across diverse real-world applications.展开更多
Reliable traffic flow prediction is crucial for mitigating urban congestion.This paper proposes Attentionbased spatiotemporal Interactive Dynamic Graph Convolutional Network(AIDGCN),a novel architecture integrating In...Reliable traffic flow prediction is crucial for mitigating urban congestion.This paper proposes Attentionbased spatiotemporal Interactive Dynamic Graph Convolutional Network(AIDGCN),a novel architecture integrating Interactive Dynamic Graph Convolution Network(IDGCN)with Temporal Multi-Head Trend-Aware Attention.Its core innovation lies in IDGCN,which uniquely splits sequences into symmetric intervals for interactive feature sharing via dynamic graphs,and a novel attention mechanism incorporating convolutional operations to capture essential local traffic trends—addressing a critical gap in standard attention for continuous data.For 15-and 60-min forecasting on METR-LA,AIDGCN achieves MAEs of 0.75%and 0.39%,and RMSEs of 1.32%and 0.14%,respectively.In the 60-min long-term forecasting of the PEMS-BAY dataset,the AIDGCN out-performs the MRA-BGCN method by 6.28%,4.93%,and 7.17%in terms of MAE,RMSE,and MAPE,respectively.Experimental results demonstrate the superiority of our pro-posed model over state-of-the-art methods.展开更多
Advances in optical coherence tomography(OCT)technology allow a clear view of the vitreoretinal interface(VRI).The abnormality of the VRI is one of the common symptoms of high myopia,mainly including posterior vitreou...Advances in optical coherence tomography(OCT)technology allow a clear view of the vitreoretinal interface(VRI).The abnormality of the VRI is one of the common symptoms of high myopia,mainly including posterior vitreous detachment(PVD)and epiretinal membrane(ERM).They can cause severe damage to the structure and function of the retina,leading to permanent vision loss.Therefore,fully automated detection of abnormalities at the VRI is crucial for the management of high myopia.This paper presents a DS-YOLOv7 network aimed at accurately identifying abnormalities,including partial PVD,complete PVD,and ERM from retinal OCT images.Built upon the YOLOv7 network,the proposed model integrates the advanced dynamic snake convolution(DSConv)module to capture the curvilinear characteristics of lesions,and the mixture of attention and convolution(ACMix)module to improve the precision and robustness of feature extraction through effective fusion of self-attention mechanisms and convolution.Moreover,the introduction of the efficient complete intersection-over-union(ECIoU)loss function further enhances the coordinate regression capability of the model.Threefold cross-validation on a dataset with 1973 OCT B-scans from 46 patients shows that the DS-YOLOv7 achieved superior performance in vitreoretinal interface abnormality detection,with mAP@0.5 of 0.714,mAP@0.75 of 0.438,and mAP@0.5:0.95 of 0.424.The proposed model can provide an accurate and efficient diagnostic tool for patients with high myopia.展开更多
Dear Editor,The letter proposes a tensor low-rank orthogonal compression(TLOC)model for a convolutional neural network(CNN),which facilitates its efficient and highly-accurate low-rank representation.Model compression...Dear Editor,The letter proposes a tensor low-rank orthogonal compression(TLOC)model for a convolutional neural network(CNN),which facilitates its efficient and highly-accurate low-rank representation.Model compression is crucial for deploying deep neural network(DNN)models on resource-constrained embedded devices.展开更多
Malware poses a significant threat to the Internet of Things(IoT).It enables unauthorized access to devices in the IoT environment.The lack of unique architectural standards causes challenges in developing robust malw...Malware poses a significant threat to the Internet of Things(IoT).It enables unauthorized access to devices in the IoT environment.The lack of unique architectural standards causes challenges in developing robust malware detection(MD)models.The existing models demand substantial computational resources.This study intends to build a lightweight MD model to detect anomalies in IoT networks.The authors develop a transformation technique,converting the malware binaries into images.MobileNet V2 is fine-tuned using improved grey wolf optimization(IGWO)to extract crucial features of malicious and benign samples.The ResNeXt model is combined with the Linformer’s attention mechanism to identify Malware features.A fully connected layer is integrated with gradientweighted class activation mapping(Grad-CAM)in order to facilitate an interpretable classification model.The proposed model is evaluated using the IoT malware and the IoT-23 datasets.The model performs well on the two datasets with an accuracy of 98.94%,precision of 98.46%,recall of 98.11%,and F1-score of 98.28%on the IoT malware dataset,and an accuracy of 98.23%,precision of 96.80%,recall of 96.64%,and F1-score of 96.71%on the IoT-23 dataset,respectively.The findings indicate that the model has a high standard of classification.The lightweight architecture enables efficient deployment with an inference time of 1.42 s.Inference time has no direct impact on accuracy,precision,recall,or F1-score.However,the inference speed would warrant timely detection in latency-sensitive IoT applications.By achieving a remarkable result,the proposed study offers a comprehensive solution:a scalable,interpretable,and computationally efficient MD model for the evolving IoT landscape.展开更多
Microseismic(MS)monitoring is an effective technique to detect mining-induced rock fractures.However,recognizing grouting-induced signals is challenging due to complex geological conditions in deep rock plates.Therefo...Microseismic(MS)monitoring is an effective technique to detect mining-induced rock fractures.However,recognizing grouting-induced signals is challenging due to complex geological conditions in deep rock plates.Therefore,a hybrid model(WM-ResNet50)integrating data enhancement,a deep convolutional neural network(CNN),and convolutional block attention modules(CBAM)was proposed.Firstly,an MS system was established at the Xieqiao coal mine in Anhui Province,China.MS waveforms and injection parameters were acquired during grouting.Secondly,signals were categorized based on time-frequency characteristics to build a dataset,which was divided into training,validation,and test sets at a ratio of 4:1:1.Subsequently,the performance of WM-ResNet50 was evaluated based on indices such as individual precision,total accuracy,recall,and loss function.The results indicated that WMResNet50 achieved an average recognition accuracy of 94.38%,surpassing that of a simple CNN(90.04%),ResNet18(91.72%),and ResNet50(92.48%).Finally,WM-ResNet50 was applied to monitor the whole process at laboratory tests and field cases.Both results affirmed the feasibility and effectiveness of MS inversion in predicting actual slurry diffusion ranges within deep rock layers.By comparison,it was revealed that the MS sources classified by WM-ResNet50 matched grouting records well.A solution to address insufficient diffusion under long-borehole grouting has been proposed.WM-ResNet50′s accuracy was validated through in-situ coring and XRD analysis for cement-based hydration products.This study provides a beneficial reference for similar rock signal processing and in-field grouting practices.展开更多
In daily life,keyword spotting plays an important role in human-computer interaction.However,noise often interferes with the extraction of time-frequency information,and achieving both computational efficiency and rec...In daily life,keyword spotting plays an important role in human-computer interaction.However,noise often interferes with the extraction of time-frequency information,and achieving both computational efficiency and recognition accuracy on resource-constrained devices such as mobile terminals remains a major challenge.To address this,we propose a novel time-frequency dual-branch parallel residual network,which integrates a Dual-Branch Broadcast Residual module and a Time-Frequency Coordinate Attention module.The time-domain and frequency-domain branches are designed in parallel to independently extract temporal and spectral features,effectively avoiding the potential information loss caused by serial stacking,while enhancing information flow and multi-scale feature fusion.In terms of training strategy,a curriculum learning approach is introduced to progressively improve model robustness fromeasy to difficult tasks.Experimental results demonstrate that the proposed method consistently outperforms existing lightweight models under various signal-to-noise ratio(SNR)conditions,achieving superior far-field recognition performance on the Google Speech Commands V2 dataset.Notably,the model maintains stable performance even in low-SNR environments such as–10 dB,and generalizes well to unseen SNR conditions during training,validating its robustness to novel noise scenarios.Furthermore,the proposed model exhibits significantly fewer parameters,making it highly suitable for deployment on resource-limited devices.Overall,the model achieves a favorable balance between performance and parameter efficiency,demonstrating strong potential for practical applications.展开更多
Log anomaly detection is essential for maintaining the reliability and security of large-scale networked systems.Most traditional techniques rely on log parsing in the reprocessing stage and utilize handcrafted featur...Log anomaly detection is essential for maintaining the reliability and security of large-scale networked systems.Most traditional techniques rely on log parsing in the reprocessing stage and utilize handcrafted features that limit their adaptability across various systems.In this study,we propose a hybrid model,BertGCN,that integrates BERT-based contextual embedding with Graph Convolutional Networks(GCNs)to identify anomalies in raw system logs,thereby eliminating the need for log parsing.TheBERT module captures semantic representations of log messages,while the GCN models the structural relationships among log entries through a text-based graph.This combination enables BertGCN to capture both the contextual and semantic characteristics of log data.BertGCN showed excellent performance on the HDFS and BGL datasets,demonstrating its effectiveness and resilience in detecting anomalies.Compared to multiple baselines,our proposed BertGCN showed improved precision,recall,and F1 scores.展开更多
Stereo matching is a pivotal task in computer vision,enabling precise depth estimation from stereo image pairs,yet it encounters challenges in regions with reflections,repetitive textures,or fine structures.In this pa...Stereo matching is a pivotal task in computer vision,enabling precise depth estimation from stereo image pairs,yet it encounters challenges in regions with reflections,repetitive textures,or fine structures.In this paper,we propose a Semantic-Guided Parallax Attention Stereo Matching Network(SGPASMnet)that can be trained in unsupervised manner,building upon the Parallax Attention Stereo Matching Network(PASMnet).Our approach leverages unsupervised learning to address the scarcity of ground truth disparity in stereo matching datasets,facilitating robust training across diverse scene-specific datasets and enhancing generalization.SGPASMnet incorporates two novel components:a Cross-Scale Feature Interaction(CSFI)block and semantic feature augmentation using a pre-trained semantic segmentation model,SegFormer,seamlessly embedded into the parallax attention mechanism.The CSFI block enables effective fusion ofmulti-scale features,integrating coarse and fine details to enhance disparity estimation accuracy.Semantic features,extracted by SegFormer,enrich the parallax attention mechanism by providing high-level scene context,significantly improving performance in ambiguous regions.Our model unifies these enhancements within a cohesive architecture,comprising semantic feature extraction,an hourglass network,a semantic-guided cascaded parallax attentionmodule,outputmodule,and a disparity refinement network.Evaluations on the KITTI2015 dataset demonstrate that our unsupervised method achieves a lower error rate compared to the original PASMnet,highlighting the effectiveness of our enhancements in handling complex scenes.By harnessing unsupervised learning without ground truth disparity needed,SGPASMnet offers a scalable and robust solution for accurate stereo matching,with superior generalization across varied real-world applications.展开更多
The rapid development of information technology and accelerated digitalization have led to an explosive growth of data across various fields.As a key technology for knowledge representation and sharing,knowledge graph...The rapid development of information technology and accelerated digitalization have led to an explosive growth of data across various fields.As a key technology for knowledge representation and sharing,knowledge graphs play a crucial role by constructing structured networks of relationships among entities.However,data sparsity and numerous unexplored implicit relations result in the widespread incompleteness of knowledge graphs.In static knowledge graph completion,most existing methods rely on linear operations or simple interaction mechanisms for triple encoding,making it difficult to fully capture the deep semantic associations between entities and relations.Moreover,many methods focus only on the local information of individual triples,ignoring the rich semantic dependencies embedded in the neighboring nodes of entities within the graph structure,which leads to incomplete embedding representations.To address these challenges,we propose Two-Stage Mixer Embedding(TSMixerE),a static knowledge graph completion method based on entity context.In the unit semantic extraction stage,TSMixerE leveragesmulti-scale circular convolution to capture local features atmultiple granularities,enhancing the flexibility and robustness of feature interactions.A channel attention mechanism amplifies key channel responses to suppress noise and irrelevant information,thereby improving the discriminative power and semantic depth of feature representations.For contextual information fusion,a multi-layer self-attentionmechanism enables deep interactions among contextual cues,effectively integrating local details with global context.Simultaneously,type embeddings clarify the semantic identities and roles of each component,enhancing the model’s sensitivity and fusion capabilities for diverse information sources.Furthermore,TSMixerE constructs contextual unit sequences for entities,fully exploring neighborhood information within the graph structure to model complex semantic dependencies,thus improving the completeness and generalization of embedding representations.展开更多
This paper introduces a fuzzy C-means-based pooling layer for convolutional neural networks that explicitly models local uncertainty and ambiguity.Conventional pooling operations,such as max and average,apply rigid ag...This paper introduces a fuzzy C-means-based pooling layer for convolutional neural networks that explicitly models local uncertainty and ambiguity.Conventional pooling operations,such as max and average,apply rigid aggregation and often discard fine-grained boundary information.In contrast,our method computes soft membershipswithin each receptive field and aggregates cluster-wise responses throughmembership-weighted pooling,thereby preserving informative structure while reducing dimensionality.Being differentiable,the proposed layer operates as standard two-dimensional pooling.We evaluate our approach across various CNN backbones and open datasets,including CIFAR-10/100,STL-10,LFW,and ImageNette,and further probe small training set restrictions on MNIST and Fashion-MNIST.In these settings,the proposed pooling consistently improves accuracy and weighted F1 over conventional baselines,with particularly strong gains when training data are scarce.Even with less than 1%of the training set,ourmethodmaintains reliable performance,indicating improved sample efficiency and robustness to noisy or ambiguous local patterns.Overall,integrating soft memberships into the pooling operator provides a practical and generalizable inductive bias that enhances robustness and generalization in modern CNN pipelines.展开更多
文摘An improved analytical design to investigate the static stiffness of a convoluted air spring is developed and presented in this article.An air spring provides improved ride comfort by achieving variable volume at various strokes of the suspension.An analytical relation is derived to calculate the volume and the rate of change in the volume of the convoluted bellow with respect to various suspension heights.This expression is used in the equation to calculate the variable stiffness of the bellow.The obtained analytical characteristics are validated with a detailed experiment to test the static vertical stiffness of the air spring.The convoluted air bellow is tested in an Avery spring-testing apparatus for various loads.The bellow is modeled in the ABAQUS environment to perform finite element analysis(FEA)to understand and visualize the deflection of the bellow at various elevated internal pressures and external loads.The proposed air spring model is a fiber-reinforced rubber bellow enclosed between two metal plates.The Mooney-Rivlin material model was used to model the hyperelastic rubber material for FEA.From the results,it is observed that the experimental and analytical results match with a minor error of 7.54%.The derived relations and validations would provide design guidance at the developmental stage of air bellows.These expressions would also play a major role in designing an effective active air suspension system by accurately calculating the required stiffness at various loads.
文摘The post-hole convolute(PHC),which is used to transport and combine the pulse power flux,is a key component in huge pulsed power generators.Current loss at the PHC is a challenging problem for researchers.To explore a method of reducing the current loss,a single-hole PHC was designed for experiments on the current loss on the Qiang Guang I generator.The experimental results showed that the current loss at the single-hole PHC is related to the distance/between the vicinity of the cathode hole and the surface of the downstream side of the post.Meanwhile,a single-hole PHC with a blob cathode hole transmitted current more effectively than the PHC with a circle cathode hole.The relative current loss at the single-hole PHC with the cathode coaled w ith gold foil was about 30%-50% of that with the cathode coated with nickel and titanium foil.The gap closing speed was also obtained from the current waveforms in the experiments.The speed was 5.74-14.52 cmμs 1 which was different from the classical plasma expansion velocity of 3 cmμs 1.
文摘We propose a novel fast numerical calculation method for the Rayleigh-Sommerfeld diffraction integral,which is developed based on the existing scaled convolution method.This approach enables fast cal-culations for general cases of off-axis scenarios where the sampling intervals and numbers of the input and observation planes are unequal.Additionally,it allows for arbitrary adjustment of the sampling interval of the impulse response function,facilitating a manual trade-off between computational load and accuracy.The er-rors associated with this method,which is equivalent to interpolation,primarily arise from the discontinuities of the sampling matrix of the impulse response function on its boundaries of periodic extension.To address this issue,we propose the concept of the padding function and its construction method,and evaluate its ef-fectiveness in enhancing computational accuracy.The feasibility of the proposed method is verified by nu-merical simulation and compared with the direct integration DI-method in a simplified scenario.It shows that the proposed method has good computational accuracy for the general case where the sampling interval of the input and observation plane is not equal under non-near-field diffraction,and when the diffraction distance is large,although the computational accuracy of the proposed method cannot exceed that of the DI-method,the computational amount can be significantly reduced with almost no effect on the computational accuracy.This method provides a general numerical calculation scheme of diffraction in the non-near field case for areas such as computational holography.
基金Supported by the National key research and development program in the 14th five year plan 2021YFA1200700)the National Natural Science Foundation of China(62535018,62431025,62561160113)the Natural Science Foundation of Shanghai(23ZR1473400).
文摘Near-infrared image sensors are widely used in fields such as material identification,machine vision,and autonomous driving.Lead sulfide colloidal quantum dot-based infrared photodiodes can be integrated with sil⁃icon-based readout circuits in a single step.Based on this,we propose a photodiode based on an n-i-p structure,which removes the buffer layer and further simplifies the manufacturing process of quantum dot image sensors,thus reducing manufacturing costs.Additionally,for the noise complexity in quantum dot image sensors when capturing images,traditional denoising and non-uniformity methods often do not achieve optimal denoising re⁃sults.For the noise and stripe-type non-uniformity commonly encountered in infrared quantum dot detector imag⁃es,a network architecture has been developed that incorporates multiple key modules.This network combines channel attention and spatial attention mechanisms,dynamically adjusting the importance of feature maps to en⁃hance the ability to distinguish between noise and details.Meanwhile,the residual dense feature fusion module further improves the network's ability to process complex image structures through hierarchical feature extraction and fusion.Furthermore,the pyramid pooling module effectively captures information at different scales,improv⁃ing the network's multi-scale feature representation ability.Through the collaborative effect of these modules,the network can better handle various mixed noise and image non-uniformity issues.Experimental results show that it outperforms the traditional U-Net network in denoising and image correction tasks.
文摘The existence of absorption and reflection of light underwater leads to problems such as color distortion and blue-green bias in underwater images.In this study,a depthwise separable convolution-based generative adversarial network(GAN)algorithm was proposed.Taking GAN as the basic framework,it combined a depthwise separable convolution module,attention mechanism,and reconstructed convolution module to realize the enhancement of underwater degraded images.Multi-scale features were captured by the depthwise separable convolution module,and the attention mechanism was utilized to enhance attention to important features.The reconstructed convolution module further extracts and fuses local and global features.Experimental results showed that the algorithm performs well in improving the color bias and blurring of underwater images,with PSNR reaching 27.835,SSIM reaching 0.883,UIQM reaching 3.205,and UCIQE reaching 0.713.The enhanced image outperforms the comparison algorithm in both subjective and objective metrics.
基金supported by the National Natural Science Foundation of China(62402399)the New Chongqing Youth Innovation Talent Project(CSTB2024NSCQ-QCXMX0035)。
文摘Dear Editor,D2This letter presents a node feature similarity preserving graph convolutional framework P G.Graph neural networks(GNNs)have garnered significant attention for their efficacy in learning graph representations across diverse real-world applications.
文摘Reliable traffic flow prediction is crucial for mitigating urban congestion.This paper proposes Attentionbased spatiotemporal Interactive Dynamic Graph Convolutional Network(AIDGCN),a novel architecture integrating Interactive Dynamic Graph Convolution Network(IDGCN)with Temporal Multi-Head Trend-Aware Attention.Its core innovation lies in IDGCN,which uniquely splits sequences into symmetric intervals for interactive feature sharing via dynamic graphs,and a novel attention mechanism incorporating convolutional operations to capture essential local traffic trends—addressing a critical gap in standard attention for continuous data.For 15-and 60-min forecasting on METR-LA,AIDGCN achieves MAEs of 0.75%and 0.39%,and RMSEs of 1.32%and 0.14%,respectively.In the 60-min long-term forecasting of the PEMS-BAY dataset,the AIDGCN out-performs the MRA-BGCN method by 6.28%,4.93%,and 7.17%in terms of MAE,RMSE,and MAPE,respectively.Experimental results demonstrate the superiority of our pro-posed model over state-of-the-art methods.
基金supported by the National Natural Science Foundation of China(62271337,62371326,and 62371328)the National Key Research and Development Program of China(2019FYC1710204)+1 种基金the National Clinical Key Specialty Construction Project(10000015Z155080000004)the Natural Science Foundation of Jiangsu Province(BK20231310).
文摘Advances in optical coherence tomography(OCT)technology allow a clear view of the vitreoretinal interface(VRI).The abnormality of the VRI is one of the common symptoms of high myopia,mainly including posterior vitreous detachment(PVD)and epiretinal membrane(ERM).They can cause severe damage to the structure and function of the retina,leading to permanent vision loss.Therefore,fully automated detection of abnormalities at the VRI is crucial for the management of high myopia.This paper presents a DS-YOLOv7 network aimed at accurately identifying abnormalities,including partial PVD,complete PVD,and ERM from retinal OCT images.Built upon the YOLOv7 network,the proposed model integrates the advanced dynamic snake convolution(DSConv)module to capture the curvilinear characteristics of lesions,and the mixture of attention and convolution(ACMix)module to improve the precision and robustness of feature extraction through effective fusion of self-attention mechanisms and convolution.Moreover,the introduction of the efficient complete intersection-over-union(ECIoU)loss function further enhances the coordinate regression capability of the model.Threefold cross-validation on a dataset with 1973 OCT B-scans from 46 patients shows that the DS-YOLOv7 achieved superior performance in vitreoretinal interface abnormality detection,with mAP@0.5 of 0.714,mAP@0.75 of 0.438,and mAP@0.5:0.95 of 0.424.The proposed model can provide an accurate and efficient diagnostic tool for patients with high myopia.
基金supported by the Science and Technology Innovation Key R&D Program of Chongqing(CSTB2025TIAD-STX0032)National Key Research and Development Program of China(2024YFF0908200)+1 种基金the Chongqing Technology Innovation and Application Development Special Key Project(CSTB2024TIAD-KPX0018)the Southwest University Graduate Student Research Innovation(SWUB24051)。
文摘Dear Editor,The letter proposes a tensor low-rank orthogonal compression(TLOC)model for a convolutional neural network(CNN),which facilitates its efficient and highly-accurate low-rank representation.Model compression is crucial for deploying deep neural network(DNN)models on resource-constrained embedded devices.
基金supported by the Deanship of Scientific Research,Vice Presidency for Graduate Studies and Scientific Research,King Faisal University,Saudi Arabia[Grant No.KFU253774].
文摘Malware poses a significant threat to the Internet of Things(IoT).It enables unauthorized access to devices in the IoT environment.The lack of unique architectural standards causes challenges in developing robust malware detection(MD)models.The existing models demand substantial computational resources.This study intends to build a lightweight MD model to detect anomalies in IoT networks.The authors develop a transformation technique,converting the malware binaries into images.MobileNet V2 is fine-tuned using improved grey wolf optimization(IGWO)to extract crucial features of malicious and benign samples.The ResNeXt model is combined with the Linformer’s attention mechanism to identify Malware features.A fully connected layer is integrated with gradientweighted class activation mapping(Grad-CAM)in order to facilitate an interpretable classification model.The proposed model is evaluated using the IoT malware and the IoT-23 datasets.The model performs well on the two datasets with an accuracy of 98.94%,precision of 98.46%,recall of 98.11%,and F1-score of 98.28%on the IoT malware dataset,and an accuracy of 98.23%,precision of 96.80%,recall of 96.64%,and F1-score of 96.71%on the IoT-23 dataset,respectively.The findings indicate that the model has a high standard of classification.The lightweight architecture enables efficient deployment with an inference time of 1.42 s.Inference time has no direct impact on accuracy,precision,recall,or F1-score.However,the inference speed would warrant timely detection in latency-sensitive IoT applications.By achieving a remarkable result,the proposed study offers a comprehensive solution:a scalable,interpretable,and computationally efficient MD model for the evolving IoT landscape.
基金financial support from the National Natural Science Foundation of China(Nos.52204089,52374082)the Young Elite Scientists Sponsorship Program(No.2023QNRC001)by China Association for Science and Technology(CAST).
文摘Microseismic(MS)monitoring is an effective technique to detect mining-induced rock fractures.However,recognizing grouting-induced signals is challenging due to complex geological conditions in deep rock plates.Therefore,a hybrid model(WM-ResNet50)integrating data enhancement,a deep convolutional neural network(CNN),and convolutional block attention modules(CBAM)was proposed.Firstly,an MS system was established at the Xieqiao coal mine in Anhui Province,China.MS waveforms and injection parameters were acquired during grouting.Secondly,signals were categorized based on time-frequency characteristics to build a dataset,which was divided into training,validation,and test sets at a ratio of 4:1:1.Subsequently,the performance of WM-ResNet50 was evaluated based on indices such as individual precision,total accuracy,recall,and loss function.The results indicated that WMResNet50 achieved an average recognition accuracy of 94.38%,surpassing that of a simple CNN(90.04%),ResNet18(91.72%),and ResNet50(92.48%).Finally,WM-ResNet50 was applied to monitor the whole process at laboratory tests and field cases.Both results affirmed the feasibility and effectiveness of MS inversion in predicting actual slurry diffusion ranges within deep rock layers.By comparison,it was revealed that the MS sources classified by WM-ResNet50 matched grouting records well.A solution to address insufficient diffusion under long-borehole grouting has been proposed.WM-ResNet50′s accuracy was validated through in-situ coring and XRD analysis for cement-based hydration products.This study provides a beneficial reference for similar rock signal processing and in-field grouting practices.
文摘In daily life,keyword spotting plays an important role in human-computer interaction.However,noise often interferes with the extraction of time-frequency information,and achieving both computational efficiency and recognition accuracy on resource-constrained devices such as mobile terminals remains a major challenge.To address this,we propose a novel time-frequency dual-branch parallel residual network,which integrates a Dual-Branch Broadcast Residual module and a Time-Frequency Coordinate Attention module.The time-domain and frequency-domain branches are designed in parallel to independently extract temporal and spectral features,effectively avoiding the potential information loss caused by serial stacking,while enhancing information flow and multi-scale feature fusion.In terms of training strategy,a curriculum learning approach is introduced to progressively improve model robustness fromeasy to difficult tasks.Experimental results demonstrate that the proposed method consistently outperforms existing lightweight models under various signal-to-noise ratio(SNR)conditions,achieving superior far-field recognition performance on the Google Speech Commands V2 dataset.Notably,the model maintains stable performance even in low-SNR environments such as–10 dB,and generalizes well to unseen SNR conditions during training,validating its robustness to novel noise scenarios.Furthermore,the proposed model exhibits significantly fewer parameters,making it highly suitable for deployment on resource-limited devices.Overall,the model achieves a favorable balance between performance and parameter efficiency,demonstrating strong potential for practical applications.
基金funded by the Deanship of Scientific Research(DSR)at King Abdulaziz University,Jeddah,under grant no.(GPIP:1074-612-2024).
文摘Log anomaly detection is essential for maintaining the reliability and security of large-scale networked systems.Most traditional techniques rely on log parsing in the reprocessing stage and utilize handcrafted features that limit their adaptability across various systems.In this study,we propose a hybrid model,BertGCN,that integrates BERT-based contextual embedding with Graph Convolutional Networks(GCNs)to identify anomalies in raw system logs,thereby eliminating the need for log parsing.TheBERT module captures semantic representations of log messages,while the GCN models the structural relationships among log entries through a text-based graph.This combination enables BertGCN to capture both the contextual and semantic characteristics of log data.BertGCN showed excellent performance on the HDFS and BGL datasets,demonstrating its effectiveness and resilience in detecting anomalies.Compared to multiple baselines,our proposed BertGCN showed improved precision,recall,and F1 scores.
基金supported by the National Natural Science Foundation of China,No.62301497the Science and Technology Research Program of Henan,No.252102211024the Key Research and Development Program of Henan,No.231111212000.
文摘Stereo matching is a pivotal task in computer vision,enabling precise depth estimation from stereo image pairs,yet it encounters challenges in regions with reflections,repetitive textures,or fine structures.In this paper,we propose a Semantic-Guided Parallax Attention Stereo Matching Network(SGPASMnet)that can be trained in unsupervised manner,building upon the Parallax Attention Stereo Matching Network(PASMnet).Our approach leverages unsupervised learning to address the scarcity of ground truth disparity in stereo matching datasets,facilitating robust training across diverse scene-specific datasets and enhancing generalization.SGPASMnet incorporates two novel components:a Cross-Scale Feature Interaction(CSFI)block and semantic feature augmentation using a pre-trained semantic segmentation model,SegFormer,seamlessly embedded into the parallax attention mechanism.The CSFI block enables effective fusion ofmulti-scale features,integrating coarse and fine details to enhance disparity estimation accuracy.Semantic features,extracted by SegFormer,enrich the parallax attention mechanism by providing high-level scene context,significantly improving performance in ambiguous regions.Our model unifies these enhancements within a cohesive architecture,comprising semantic feature extraction,an hourglass network,a semantic-guided cascaded parallax attentionmodule,outputmodule,and a disparity refinement network.Evaluations on the KITTI2015 dataset demonstrate that our unsupervised method achieves a lower error rate compared to the original PASMnet,highlighting the effectiveness of our enhancements in handling complex scenes.By harnessing unsupervised learning without ground truth disparity needed,SGPASMnet offers a scalable and robust solution for accurate stereo matching,with superior generalization across varied real-world applications.
基金supported by the National Natural Science Foundation of China(No.62267005)the Chinese Guangxi Natural Science Foundation(No.2023GXNSFAA026493)+1 种基金Guangxi Collaborative Innovation Center ofMulti-Source Information Integration and Intelligent ProcessingGuangxi Academy of Artificial Intelligence.
文摘The rapid development of information technology and accelerated digitalization have led to an explosive growth of data across various fields.As a key technology for knowledge representation and sharing,knowledge graphs play a crucial role by constructing structured networks of relationships among entities.However,data sparsity and numerous unexplored implicit relations result in the widespread incompleteness of knowledge graphs.In static knowledge graph completion,most existing methods rely on linear operations or simple interaction mechanisms for triple encoding,making it difficult to fully capture the deep semantic associations between entities and relations.Moreover,many methods focus only on the local information of individual triples,ignoring the rich semantic dependencies embedded in the neighboring nodes of entities within the graph structure,which leads to incomplete embedding representations.To address these challenges,we propose Two-Stage Mixer Embedding(TSMixerE),a static knowledge graph completion method based on entity context.In the unit semantic extraction stage,TSMixerE leveragesmulti-scale circular convolution to capture local features atmultiple granularities,enhancing the flexibility and robustness of feature interactions.A channel attention mechanism amplifies key channel responses to suppress noise and irrelevant information,thereby improving the discriminative power and semantic depth of feature representations.For contextual information fusion,a multi-layer self-attentionmechanism enables deep interactions among contextual cues,effectively integrating local details with global context.Simultaneously,type embeddings clarify the semantic identities and roles of each component,enhancing the model’s sensitivity and fusion capabilities for diverse information sources.Furthermore,TSMixerE constructs contextual unit sequences for entities,fully exploring neighborhood information within the graph structure to model complex semantic dependencies,thus improving the completeness and generalization of embedding representations.
文摘This paper introduces a fuzzy C-means-based pooling layer for convolutional neural networks that explicitly models local uncertainty and ambiguity.Conventional pooling operations,such as max and average,apply rigid aggregation and often discard fine-grained boundary information.In contrast,our method computes soft membershipswithin each receptive field and aggregates cluster-wise responses throughmembership-weighted pooling,thereby preserving informative structure while reducing dimensionality.Being differentiable,the proposed layer operates as standard two-dimensional pooling.We evaluate our approach across various CNN backbones and open datasets,including CIFAR-10/100,STL-10,LFW,and ImageNette,and further probe small training set restrictions on MNIST and Fashion-MNIST.In these settings,the proposed pooling consistently improves accuracy and weighted F1 over conventional baselines,with particularly strong gains when training data are scarce.Even with less than 1%of the training set,ourmethodmaintains reliable performance,indicating improved sample efficiency and robustness to noisy or ambiguous local patterns.Overall,integrating soft memberships into the pooling operator provides a practical and generalizable inductive bias that enhances robustness and generalization in modern CNN pipelines.