573 articles found
Polarization-encodable photonic memory cells using next-generation 2D phase-change materials
1
Authors: Amin Shafiee, Linhong Chen, Mahdi Nikdast, Jie Yao. 《Nano Research》, 2025, Issue 3, pp. 510-518 (9 pages)
Integration of phase-change materials (PCMs) created a unique opportunity to implement reconfigurable photonic devices whose performance can be tuned depending on the target application. Conventional PCMs such as Ge-Sb-Te (GST) and Ge-Sb-Se-Te (GSST) rely on melt-quench and high-temperature annealing processes to change the organization of the molecules in the materials' crystal. Such a reorganization leads to different optical, electrical, and thermal properties which can be exploited to implement photonic memory cells that are able to store the data at different resistance or optical transmission levels. Despite the great promise of conventional PCMs for realizing reconfigurable photonic memories, their slow and extremely power-hungry thermal mechanisms make scaling the systems based on such devices challenging. In addition, such materials do not offer a stable multi-level response over a long period of time. To address these shortcomings, the research carried out in this study shows the proof of concept to implement next-generation photonic memory cells based on two-dimensional (2D) birefringence PCMs such as SnSe, which offer anisotropic optical properties that can be switched ferroelectrically. We demonstrate that by leveraging the ultrafast and low-power crystallographic direction change of the material, the optical polarization state of the input optical signal can be changed. This enables the implementation of next-generation high-speed polarization-encodable photonic memory cells for future photonic computing systems. Compared to the conventional PCMs, the proposed SnSe-based photonic memory cells offer an ultrafast switching and low-loss optical response relying on the ferroelectric property of SnSe to encode the data on the polarization state of the input optical signal. Such a polarization encoding scheme also reduces memory read-out errors and alleviates the scalability limitations due to the optical insertion loss often seen in optical transmission encoding.
Keywords: ferroelectric materials; phase-change materials; photonic memory; optical storage cells; polarization encodable photonic memories; optical polarization
Intelligent detection method for conveyor belt deviation based on semantic segmentation (Cited by: 1)
2
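Authors: 李南雁, 廖辉, 赵龙, 苏金辉, 蓝武生, 陈夕松. 《科技风》, 2025, Issue 3, pp. 59-61 (3 pages)
Owing to equipment aging and uneven surface loading, belt conveyors are prone to deviation, which leads to faults and material spillage. Traditional monitoring methods are costly and complex to install. This study therefore proposes an intelligent detection method based on deep learning: a semantic segmentation dataset of belt lines is built and annotated; a Unet model is used to detect the belt lines and is optimized with a MiT encoder; a pixel-position-aware loss is introduced to strengthen training; and the probabilistic Hough transform is used to extract the straight-line positions of the belt lines so that the degree of deviation can be analyzed quantitatively. Test results show that the model achieves an IoU of 61.34% on belt-line prediction at a cost of only 12.93 GFlops, giving efficient real-time performance suitable for a variety of conveyor belt scenarios.
Keywords: deep learning; semantic segmentation; MiT Encoder; machine vision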
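The IoU figure quoted in this abstract is the intersection-over-union ratio between predicted and annotated belt-line pixels. A minimal Python sketch of that metric follows, assuming binary masks stored as nested lists; the function name `iou` and the convention for two empty masks are illustrative choices, not taken from the paper.

```python
def iou(pred, target):
    """Intersection over union of two same-shaped binary masks.

    Masks are nested lists of 0/1 values (an assumption made for this
    sketch; a real pipeline would more likely use arrays).
    """
    intersection = 0
    union = 0
    for pred_row, target_row in zip(pred, target):
        for p, t in zip(pred_row, target_row):
            if p and t:
                intersection += 1
            if p or t:
                union += 1
    # Two empty masks agree perfectly; treat that case as IoU = 1.0.
    return intersection / union if union else 1.0


predicted = [[1, 1], [0, 0]]
annotated = [[1, 0], [0, 0]]
print(iou(predicted, annotated))  # one shared pixel out of two marked pixels
```

A score such as the reported 61.34% would be this ratio computed over belt-line pixels; how the paper aggregates it across images is not stated in the abstract.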
Remaining useful life prediction of aero-engines based on a Transformer encoder and handcrafted features
3
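Authors: 陈栋, 黄国勇. 《机床与液压》, PKU Core, 2025, Issue 22, pp. 54-60 (7 pages)
To predict the remaining useful life (RUL) of aircraft turbofan engines more accurately, and to fully extract and exploit both the correlations among sensor data of different dimensions and manually selected features, a prediction model is proposed that fuses a multi-layer Transformer encoder with several handcrafted features. The multi-layer encoder performs feature selection: its multi-head attention mechanism processes the engine's entire feature sequence at once, extracts the correlations among the sensor dimensions, and reassigns weights, so that the model captures the interdependencies among the features of different dimensions. Three handcrafted features are extracted for each feature dimension: the mean, the linear-regression trend coefficient, and the covariance between the sensor data and the true remaining life. The encoder output is flattened and passed through several fully connected layers, then concatenated with the handcrafted features, and a fully connected layer predicts the RUL. The proposed model was validated on FD001 of the public C-MAPSS dataset. Compared with a model without the handcrafted features, its RMSE is 18.4% lower and its Score is 30.3% lower; compared with a model that does not transpose the encoder input data, its RMSE is 20.8% lower and its Score is 44.9% lower. The encoder extracts the correlations among sensors of different dimensions, and fusing different handcrafted features improves the accuracy of engine RUL prediction. On the more complex FD004 dataset the model also achieves good prediction results, indicating good stability and generalization.
Keywords: engine remaining useful life; encoder; self-attention mechanism; feature fusion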
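The three handcrafted features named in this abstract (mean, linear-regression trend coefficient, and covariance with the true remaining life) can be sketched for a single sensor channel as below. This is a hedged illustration in plain Python: the function names, the use of the sample index as the regression variable, and the population (divide-by-n) covariance are assumptions, since the abstract does not give those details.

```python
def mean(values):
    """Arithmetic mean of a non-empty sequence."""
    return sum(values) / len(values)


def trend_slope(values):
    """Least-squares slope of values against the time index 0..n-1 (needs n >= 2)."""
    n = len(values)
    t_mean = (n - 1) / 2
    v_mean = mean(values)
    numerator = sum((t - t_mean) * (v - v_mean) for t, v in enumerate(values))
    denominator = sum((t - t_mean) ** 2 for t in range(n))
    return numerator / denominator


def covariance(xs, ys):
    """Population covariance of two equal-length sequences (assumed convention)."""
    x_mean, y_mean = mean(xs), mean(ys)
    return sum((x - x_mean) * (y - y_mean) for x, y in zip(xs, ys)) / len(xs)


def handcrafted_features(sensor, rul):
    """Return [mean, trend slope, covariance with RUL] for one sensor channel."""
    return [mean(sensor), trend_slope(sensor), covariance(sensor, rul)]


sensor_window = [1.0, 2.0, 3.0, 4.0]   # hypothetical sensor readings
true_rul = [3.0, 2.0, 1.0, 0.0]        # hypothetical remaining-life labels
print(handcrafted_features(sensor_window, true_rul))
```

In the model described above, a vector like this would be computed per feature dimension and concatenated with the flattened encoder output before the final fully connected layer.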
Joint Feature Encoding and Task Alignment Mechanism for Emotion-Cause Pair Extraction
4
Authors: Shi Li, Didi Sun. 《Computers, Materials & Continua》, SCIE, EI, 2025, Issue 1, pp. 1069-1086 (18 pages)
With the rapid expansion of social media, analyzing emotions and their causes in texts has gained significant importance. Emotion-cause pair extraction enables the identification of causal relationships between emotions and their triggers within a text, facilitating a deeper understanding of expressed sentiments and their underlying reasons. This comprehension is crucial for making informed strategic decisions in various business and societal contexts. However, recent research approaches employing multi-task learning frameworks for modeling often face challenges such as the inability to simultaneously model extracted features and their interactions, or inconsistencies in label prediction between emotion-cause pair extraction and independent assistant tasks like emotion and cause extraction. To address these issues, this study proposes an emotion-cause pair extraction methodology that incorporates joint feature encoding and task alignment mechanisms. The model consists of two primary components. First, joint feature encoding simultaneously generates features for emotion-cause pairs and clauses, enhancing feature interactions between emotion clauses, cause clauses, and emotion-cause pairs. Second, the task alignment technique is applied to reduce the labeling distance between emotion-cause pair extraction and the two assistant tasks, capturing deep semantic information interactions among tasks. The proposed method is evaluated on a Chinese benchmark corpus using 10-fold cross-validation, assessing key performance metrics such as precision, recall, and F1 score. Experimental results demonstrate that the model achieves an F1 score of 76.05%, surpassing the state-of-the-art by 1.03%. The proposed model exhibits significant improvements in emotion-cause pair extraction (ECPE) and cause extraction (CE) compared to existing methods, validating its effectiveness. This research introduces a novel approach based on joint feature encoding and task alignment mechanisms, contributing to advancements in emotion-cause pair extraction. However, the study's limitation lies in the data sources, potentially restricting the generalizability of the findings.
Keywords: Emotion-cause pair extraction; interactive information enhancement; joint feature encoding; label consistency; task alignment mechanisms
Dual encoding feature filtering generalized attention UNET for retinal vessel segmentation
5
Authors: ISLAM Md Tauhidul, WU Da-Wen, TANG Qing-Qing, ZHAO Kai-Yang, YIN Teng, LI Yan-Fei, SHANG Wen-Yi, LIU Jing-Yu, ZHANG Hai-Xian. 《四川大学学报(自然科学版)》, PKU Core, 2025, Issue 1, pp. 79-95 (17 pages)
Retinal blood vessel segmentation is crucial for diagnosing ocular and cardiovascular diseases. Although the introduction of U-Net in 2015 by Olaf Ronneberger significantly advanced this field, issues like limited training data, imbalanced data distribution, and inadequate feature extraction persist, hindering both the segmentation performance and optimal model generalization. Addressing these critical issues, the DEFFA-Unet is proposed, featuring an additional encoder to process domain-invariant pre-processed inputs, thereby providing both richer feature encoding and enhanced model generalization. A feature filtering fusion module is developed to ensure precise feature filtering and robust hybrid feature fusion. In response to the task-specific need for higher precision where false positives are very costly, traditional skip connections are replaced with the attention-guided feature reconstructing fusion module. Additionally, innovative data augmentation and balancing methods are proposed to counter data scarcity and distribution imbalance, further boosting the robustness and generalization of the model. With a comprehensive suite of evaluation metrics, extensive validations on four benchmark datasets (DRIVE, CHASEDB1, STARE, and HRF) and an SLO dataset (IOSTAR) demonstrate the proposed method's superiority over both baseline and state-of-the-art models. In particular, the proposed method significantly outperforms the compared methods in cross-validation model generalization.
Keywords: Vessel segmentation; Data balancing; Data augmentation; Dual encoder; Attention Mechanism; Model generalization
Transformer partial discharge pattern recognition method based on VMD-MPE and parallel dual branches
6
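Authors: 陈康裕, 王飞, 曾龙兴, 陈尔佳. 《电工电能新技术》, PKU Core, 2025, Issue 9, pp. 100-110 (11 pages)
To address the non-stationary and nonlinear nature of transformer partial discharge signals, this paper proposes a transformer partial discharge pattern recognition method based on variational mode decomposition (VMD), multi-scale permutation entropy (MPE), and a parallel dual-branch model. First, VMD is used to decompose the partial discharge waveform hierarchically into several band-limited intrinsic mode functions (IMFs), and MPE is used to extract deep feature information from each IMF component to build a sample set of feature vectors. Next, a parallel dual-branch model is designed: branch one extracts global features through the multi-head attention mechanism of a Transformer Encoder, while branch two uses stacked one-dimensional convolutional neural networks (1D-CNN) combined with a squeeze-and-excitation network (SENet) to further extract local feature information. A feature fusion and concatenation strategy merges the global and local features extracted by the two branches, enhancing the expressive power of pattern recognition. Experimental results show that the proposed method reaches an accuracy of 96.37% in transformer partial discharge pattern recognition with high recognition efficiency; it can effectively improve the diagnosis of transformer partial discharge faults and provides solid technical support for transformer maintenance.
Keywords: transformer partial discharge; variational mode decomposition; multi-scale permutation entropy; Transformer Encoder; one-dimensional convolutional neural network; squeeze-and-excitation network; fault diagnosis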
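Multi-scale permutation entropy, as used in this abstract for IMF feature extraction, is commonly computed by coarse-graining a series at several scales and taking the permutation entropy of each coarse-grained series. The Python sketch below follows that common definition; the embedding order of 3, unit delay, the chosen scales, and the normalization by log(order!) are assumptions, because the abstract does not report the paper's settings.

```python
import math
from collections import Counter


def coarse_grain(series, scale):
    """Average non-overlapping windows of length `scale`."""
    n = len(series) // scale
    return [sum(series[i * scale:(i + 1) * scale]) / scale for i in range(n)]


def permutation_entropy(series, order=3, normalize=True):
    """Shannon entropy of ordinal patterns of length `order` (delay 1)."""
    patterns = Counter(
        tuple(sorted(range(order), key=lambda k: series[i + k]))
        for i in range(len(series) - order + 1)
    )
    total = sum(patterns.values())
    entropy = -sum((c / total) * math.log(c / total) for c in patterns.values())
    if normalize:
        # Divide by the maximum possible entropy, log(order!).
        entropy /= math.log(math.factorial(order))
    return entropy


def multiscale_permutation_entropy(series, order=3, scales=(1, 2, 3)):
    """Permutation entropy of the coarse-grained series at each scale."""
    return [permutation_entropy(coarse_grain(series, s), order) for s in scales]


signal = [4, 7, 9, 10, 6, 11, 3, 5, 8, 2, 12, 1]  # hypothetical IMF samples
print(multiscale_permutation_entropy(signal))
```

Applied to each IMF produced by VMD, the resulting list of entropies per scale would form one segment of the feature vector described above.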
Encoding converters for quantum communication networks
7
Authors: Hua-Xing Xu, Shao-Hua Wang, Ya-Qi Song, Ping Zhang, Chang-Lei Wang. 《Chinese Physics B》, 2025, Issue 5, pp. 64-69 (6 pages)
Quantum communication networks, such as quantum key distribution (QKD) networks, typically employ the measurement-resend mechanism between two users using quantum communication devices based on different quantum encoding types. To achieve direct communication between the devices with different quantum encoding types, in this paper, we propose encoding conversion schemes between the polarization bases (rectilinear, diagonal and circular bases) and the time-bin phase bases (two phase bases and time-bin basis) and design the quantum encoding converters. The theoretical analysis of the encoding conversion schemes is given in detail, and the basis correspondence of encoding conversion and the property of bit flip are revealed. The conversion relationship between polarization bases and time-bin phase bases can be easily selected by controlling a phase shifter. Since no optical switches are used in our scheme, the converter can be operated with high speed. The converters can also be modularized, which may be utilized to realize miniaturization in the future.
Keywords: quantum communication networks; encoding conversion; polarization encoding; time-bin phase encoding
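For orientation, the six bases named in this abstract are usually written as below. This is the textbook notation rather than the paper's own: the symbols $|e\rangle$ and $|l\rangle$ for the early and late time bins are assumed labels, the sign convention for the circular states varies between authors, and which polarization basis maps to which time-bin phase basis depends on the phase-shifter setting described in the paper and is not reproduced here.

```latex
\begin{align*}
\text{rectilinear:}\quad & |H\rangle,\ |V\rangle \\
\text{diagonal:}\quad & |\pm\rangle = \tfrac{1}{\sqrt{2}}\bigl(|H\rangle \pm |V\rangle\bigr) \\
\text{circular:}\quad & |R\rangle,\ |L\rangle = \tfrac{1}{\sqrt{2}}\bigl(|H\rangle \pm i\,|V\rangle\bigr) \\
\text{time-bin:}\quad & |e\rangle,\ |l\rangle \\
\text{phase bases:}\quad & \tfrac{1}{\sqrt{2}}\bigl(|e\rangle \pm |l\rangle\bigr),\quad
                           \tfrac{1}{\sqrt{2}}\bigl(|e\rangle \pm i\,|l\rangle\bigr)
\end{align*}
```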
Graphene-Metal Hybrid Metasurface for Broadband Terahertz Logic Encoder Induced by Near-Field Coupling
8
Authors: Yufan Zhang, Longhui Zhang, Mingzhu Jiang, Chenyue Xi, Fangrong Hu, Yatao Zhou, Shangjun Lin, Xinlong Xu, Zengxiu Zhao. 《Chinese Physics Letters》, 2025, Issue 10, pp. 101-116 (16 pages)
High-performance terahertz (THz) logic gate devices are crucial components for signal processing and modulation, playing a significant role in the application of THz communication and imaging. Here, we propose a THz broadband NOR logic encoder based on a graphene-metal hybrid metasurface. The unit structure consists of two symmetrical dual-gap metal split-ring resonators (DSRRs) arranged in a staggered configuration, with graphene strips embedded in their gaps. The NOR logic gate metadevice is controlled by the bias voltages independently applied to the two electrodes. Experiments show that when the bias voltages are applied to both electrodes, the metadevice achieves the NOR logic gate within a 0.52 THz bandwidth, with an average modulation depth above 80%. The experimental results match well with theoretical simulations. Additionally, the strong near-field coupling induced by the staggered DSRRs causes redshift at both LC resonance and dipole resonance. This phenomenon was demonstrated by coupled mode theory. Besides, we analyze the surface current distribution at resonances and propose four equivalent circuit models to elucidate the physical mechanisms of modulation under distinct loaded voltage conditions. The results not only advance modulation and logic gate designs for THz communication but also demonstrate significant potential applications in 6G networks, THz imaging, and radar systems.
Keywords: signal processing; Broadband terahertz logic encoder; Near field coupling; thz broadband logic encoder; Graphene metal hybrid metasurface; bias vo; Modulation; Terahertz logic gate
A Blockchain-Based Covert Communication Model Based on Dynamic Base-K Encoding
9
Authors: Wang Zhujun, Zhang Lejun, Li Xueqing, Tian Zhihong, Su Shen, Qiu Jing, Chen Huiling, Qiu Tie, Sergey Gataullin, Guo Ran. 《China Communications》, 2025, Issue 6, pp. 319-333 (15 pages)
Blockchain, as a distributed ledger, inherently possesses tamper-resistant capabilities, creating a natural channel for covert communication. However, the immutable nature of data storage might introduce challenges to communication security. This study introduces a blockchain-based covert communication model utilizing dynamic Base-K encoding. The proposed encoding scheme utilizes the input address sequence to determine K to encode the secret message and determines the order of transactions based on K, thus ensuring effective concealment of the message. The dynamic encoding parameters enhance flexibility and address issues related to identical transaction amounts for the same secret message. Experimental results demonstrate that the proposed method maintains smooth communication and low susceptibility to tampering, achieving commendable concealment and embedding rates.
Keywords: base-K encoding; blockchain; CONCEALMENT; covert communication
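The core arithmetic behind a base-K encoding can be illustrated independently of the blockchain layer. The Python sketch below converts a byte string to base-K digits and back; it is only a generic radix conversion under stated assumptions. How the paper derives K from the input address sequence, and how digits map to transaction order, are not given in the abstract, so K is simply a parameter here and all function names are hypothetical.

```python
def to_base_k(value, k):
    """Return the base-k digits of a non-negative integer, most significant first."""
    if k < 2:
        raise ValueError("k must be at least 2")
    if value < 0:
        raise ValueError("value must be non-negative")
    if value == 0:
        return [0]
    digits = []
    while value:
        value, remainder = divmod(value, k)
        digits.append(remainder)
    return digits[::-1]


def from_base_k(digits, k):
    """Inverse of to_base_k: rebuild the integer from its base-k digits."""
    value = 0
    for digit in digits:
        value = value * k + digit
    return value


def encode_message(message, k):
    """Interpret the bytes as one big-endian integer and express it in base k."""
    return to_base_k(int.from_bytes(message, "big"), k)


def decode_message(digits, k, length):
    """Recover `length` bytes; the length is passed in so leading zero bytes survive."""
    return from_base_k(digits, k).to_bytes(length, "big")


secret = b"hi"
digits = encode_message(secret, 7)   # K = 7 is an arbitrary illustrative choice
print(digits, decode_message(digits, 7, len(secret)))
```

Because the digit sequence changes with K, the same secret produces different digits under different K values, which is the kind of variability the abstract attributes to the dynamic parameters.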
An Auto Encoder-Enhanced Stacked Ensemble for Intrusion Detection in Healthcare Networks
10
Authors: Fatma S. Alrayes, Mohammed Zakariah, Mohammed K. Alzaylaee, Syed Umar Amin, Zafar Iqbal Khan. 《Computers, Materials & Continua》, 2025, Issue 11, pp. 3457-3484 (28 pages)
Healthcare networks prove to be an urgent issue in terms of intrusion detection due to the critical consequences of cyber threats and the extreme sensitivity of medical information. The proposed Auto-Stack ID in the study is a stacked ensemble of encoder-enhanced auctions that can be used to improve intrusion detection in healthcare networks. The WUSTL-EHMS 2020 dataset trains and evaluates the model, constituting an imbalanced class distribution (87.46% normal traffic and 12.53% intrusion attacks). To address this imbalance, the study balances the effect of training bias through Stratified K-fold cross-validation (K=5), so that each class is represented similarly on training and validation splits. Second, the Auto-Stack ID method combines many base classifiers such as TabNet, LightGBM, Gaussian Naive Bayes, Histogram-Based Gradient Boosting (HGB), and Logistic Regression. We apply a two-stage training process based on the first stage, where we have base classifiers that predict out-of-fold (OOF) predictions, which we use as inputs for the second-stage meta-learner XGBoost. The meta-learner learns to refine predictions to capture complicated interactions between base models, thus improving detection accuracy without introducing bias, overfitting, or requiring domain knowledge of the meta-data. In addition, the Auto-Stack ID model got 98.41% accuracy and 93.45% F1 score, better than individual classifiers. It can identify intrusions due to its 90.55% recall and 96.53% precision with minimal false positives. These findings identify its suitability in ensuring healthcare networks' security through ensemble learning. Ongoing efforts will be deployed in real time to improve response to evolving threats.
Keywords: Intrusion detection; auto encoder; stacked ensemble; WUSTL-EHMS 2020 dataset; class imbalance; XGBoost
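The stratified K-fold step mentioned in this abstract keeps the class ratio roughly equal in every fold, which matters when only about one sample in eight is an attack. The sketch below shows the idea in plain Python with a deterministic round-robin assignment; it is an illustration under assumptions, not the authors' pipeline, and a practical version would normally shuffle within each class first. The function names are hypothetical.

```python
from collections import defaultdict


def stratified_folds(labels, k=5):
    """Assign sample indices to k folds, round-robin within each class."""
    by_class = defaultdict(list)
    for index, label in enumerate(labels):
        by_class[label].append(index)
    folds = [[] for _ in range(k)]
    for indices in by_class.values():
        for position, index in enumerate(indices):
            folds[position % k].append(index)
    return folds


def train_validation_splits(labels, k=5):
    """Yield (train_indices, validation_indices), one pair per fold."""
    folds = stratified_folds(labels, k)
    for i, validation in enumerate(folds):
        train = [idx for j, fold in enumerate(folds) if j != i for idx in fold]
        yield sorted(train), sorted(validation)


labels = [0] * 8 + [1] * 2            # hypothetical imbalanced labels
for train, validation in train_validation_splits(labels, k=2):
    print(validation, [labels[i] for i in validation])
```

In a stacking setup like the one described, each base classifier would be fitted on the training indices of a split and its predictions on the validation indices collected as out-of-fold inputs for the meta-learner.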
Image encoding-based bearing fault diagnosis:Review and challenges for high-speed trains
11
Authors: Huimin Li, Lingfeng Li, Bin Liu, Ge Xin. 《High-Speed Railway》, 2025, Issue 3, pp. 251-259 (9 pages)
High-Speed Trains (HSTs) have emerged as a mainstream mode of transportation in China, owing to their exceptional safety and efficiency. Ensuring the reliable operation of HSTs is of paramount economic and societal importance. As critical rotating mechanical components of the transmission system, bearings make their fault diagnosis a topic of extensive attention. This paper provides a systematic review of image encoding-based bearing fault diagnosis methods tailored to the condition monitoring of HSTs. First, it categorizes the image encoding techniques applied in the field of bearing fault diagnosis. Then, a review of state-of-the-art studies has been presented, encompassing both monomodal image conversion and multimodal image fusion approaches. Finally, it highlights current challenges and proposes future research directions to advance intelligent fault diagnosis in HSTs, aiming to provide a valuable reference for researchers and engineers in the field of intelligent operation and maintenance.
Keywords: High-speed trains; Image encoding; Fault diagnosis; Rotating machinery; Condition monitoring
DC Disturbance Classification Method Based on Compressed Sensing and Encoder
12
Authors: Huanan Yu, Xiang Zhang, Jian Wang. 《Energy Engineering》, 2025, Issue 12, pp. 5055-5071 (17 pages)
Recent advances in AC/DC hybrid power distribution systems have enhanced convenience in daily life. However, DC distribution introduces significant power quality challenges. To address the identification and classification of DC power quality disturbances, this paper proposes a novel methodology integrating Compressed Sensing (CS) with an enhanced Stacked Denoising Autoencoder (SDAE). The proposed approach first employs MATLAB/SIMULINK to model the DC distribution network and generate DC power quality disturbance signals. The measured original signals are then reconstructed using the compressive sensing-based generalized orthogonal matching pursuit (GOMP) algorithm to obtain sparse vectors as the final dataset. Subsequently, a Stacked Denoising Autoencoder model is constructed. The Root Mean Square Propagation (RMSprop) optimization algorithm is introduced to fine-tune network parameters, thereby reducing the probability of convergence to local optima. Finally, simulation analyses are conducted on five common types of DC power quality disturbance signals. Both raw signals and sparse vectors are utilized as datasets and fed into the encoder model. The results indicate that this method effectively reduces the feature dimensionality for DC power quality disturbance classification while improving both recognition efficiency and accuracy, with additional advantages in noise resistance.
Keywords: DC power quality; disturbance classification; compressed sensing; sparse vector; ENCODER
Research on a lightweight low-light image enhancement algorithm
13
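Authors: 单慧. 《电脑编程技巧与维护》, 2025, Issue 7, pp. 154-156 (3 pages)
A low-light image enhancement algorithm combining attention mechanisms with the U-Net structure is proposed, consisting of five modules: MSFE, Encoder, Bottleneck, Decoder, and RFRM. Joint attention, axial attention, and a detail enhancement module effectively improve image sharpness and reduce noise. Experimental results verify the effectiveness of the method.
Keywords: MSFE module; Encoder module; Bottleneck module; Decoder module; RFRM module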
LRP:learned robust data partitioning for efficient processing of large dynamic queries
14
Authors: Pengju LIU, Pan CAI, Kai ZHONG, Cuiping LI, Hong CHEN. 《Frontiers of Computer Science》, 2025, Issue 9, pp. 43-60 (18 pages)
The interconnection between query processing and data partitioning is pivotal for the acceleration of massive data processing during query execution, primarily by minimizing the number of scanned block files. Existing partitioning techniques predominantly focus on query accesses on numeric columns for constructing partitions, often overlooking non-numeric columns and thus limiting optimization potential. Additionally, these techniques, despite creating fine-grained partitions from representative queries to enhance system performance, suffer from notable performance declines due to unpredictable fluctuations in future queries. To tackle these issues, we introduce LRP, a learned robust partitioning system for dynamic query processing. LRP first proposes a method for data and query encoding that captures comprehensive column access patterns from historical queries. It then employs Multi-Layer Perceptron and Long Short-Term Memory networks to predict shifts in the distribution of historical queries. To create high-quality, robust partitions based on these predictions, LRP adopts a greedy beam search algorithm for optimal partition division and implements a data redundancy mechanism to share frequently accessed data across partitions. Experimental evaluations reveal that LRP yields partitions with more stable performance under incoming queries and significantly surpasses state-of-the-art partitioning methods.
Keywords: data partitioning; data encoding; query prediction; beam search; data redundancy
New Encoder Based on Grating Eddy-Current with Differential Structure
15
Authors: ZHANG Zaigi, Lü Na, TAO Wei, ZHAO Hui. 《Journal of Shanghai Jiaotong University (Science)》, 2025, Issue 2, pp. 337-351 (15 pages)
In response to the shortcomings of the common encoders in the industry, of which the photoelectric encoders have a poor anti-interference ability in harsh industrial environments with water, oil, dust, or strong vibrations and the magnetic encoders are too sensitive to magnetic field density, this paper designs a new differential encoder based on the grating eddy-current measurement principle, abbreviated as differential grating eddy-current encoder (DGECE). The grating eddy-current of DGECE consists of a circular array of trapezoidal reflection conductors and 16 trapezoidal coils with a special structure to form a differential relationship, which are respectively located on the code plate and the readout plate designed by a printed circuit board. The differential structure of DGECE corrects the common mode interference and the amplitude distortion due to the assembly to some extent, possesses a certain anti-interference capability, and greatly simplifies the regularization algorithm of the original data. By means of the corresponding readout circuit and demodulation algorithm, the DGECE can convert the periodic impedance variation of 16 coils into an angular output within the 360° cycle. Due to its simple manufacturing process and certain interference immunity, DGECE is easy to be integrated and mass-produced as well as applicable in the industrial spindles, especially in robot joints. This paper presents the measurement principle, implementation methods, and results of the experiment of the DGECE. The experimental results show that the accuracy of the DGECE can reach 0.237% and the measurement standard deviation can reach ±0.14° within the 360° cycle.
Keywords: ENCODER; grating eddy-current; differential structure; angle measurement
Research on Emotion Classification Supported by Multimodal Adversarial Autoencoder
16
Authors: Jing Yu. 《Journal of Electronic Research and Application》, 2025, Issue 1, pp. 270-275 (6 pages)
In this paper, the sentiment classification method of multimodal adversarial autoencoder is studied. This paper includes the introduction of the multimodal adversarial autoencoder emotion classification method and the experiment of the emotion classification method based on the encoder. The experimental analysis shows that the encoder has higher precision than other encoders in emotion classification. It is hoped that this analysis can provide some reference for the emotion classification under the current intelligent algorithm mode.
Keywords: Artificial intelligence; Multimode adversarial encoder; Sentiment classification; Evaluation criteria; Modal Settings
Multi-Scale Vision Transformer with Dynamic Multi-Loss Function for Medical Image Retrieval and Classification
17
Authors: Omar Alqahtani, Mohamed Ghouse, Asfia Sabahath, Omer Bin Hussain, Arshiya Begum. 《Computers, Materials & Continua》, 2025, Issue 5, pp. 2221-2244 (24 pages)
This paper introduces a novel method for medical image retrieval and classification by integrating a multi-scale encoding mechanism with Vision Transformer (ViT) architectures and a dynamic multi-loss function. The multi-scale encoding significantly enhances the model's ability to capture both fine-grained and global features, while the dynamic loss function adapts during training to optimize classification accuracy and retrieval performance. Our approach was evaluated on the ISIC-2018 and ChestX-ray14 datasets, yielding notable improvements. Specifically, on the ISIC-2018 dataset, our method achieves an F1-Score improvement of +4.84% compared to the standard ViT, with a precision increase of +5.46% for melanoma (MEL). On the ChestX-ray14 dataset, the method delivers an F1-Score improvement of 5.3% over the conventional ViT, with precision gains of +5.0% for pneumonia (PNEU) and +5.4% for fibrosis (FIB). Experimental results demonstrate that our approach outperforms traditional CNN-based models and existing ViT variants, particularly in retrieving relevant medical cases and enhancing diagnostic accuracy. These findings highlight the potential of the proposed method for large-scale medical image analysis, offering improved tools for clinical decision-making through superior classification and case comparison.
Keywords: Medical image retrieval; vision transformer; multi-scale encoding; multi-loss function; ISIC-2018; ChestX-ray14
Introduction to Special Issue on Fluorescent Probes for Optical Imaging and Biosensing
18
Authors: Changfeng Wu, Chenguang Wang, Wei Chen. 《Journal of Innovative Optical Health Sciences》, 2025, Issue 3, pp. 1-2 (2 pages)
Fluorescent probes have revolutionized optical imaging and biosensing by enabling real-time visualization, quantification, and tracking of biological processes at molecular and cellular levels. These probes, ranging from organic dyes to genetically encoded proteins and nanomaterials, provide unparalleled specificity, sensitivity, and multiplexing capabilities. However, challenges such as brightness, photobleaching, biocompatibility, and emission range continue to drive innovation in probe design and application. This special issue, comprising four review papers and seven original research studies, highlights cutting-edge advancements in fluorescent probe technologies and their transformative roles in super-resolution imaging, in vivo diagnostics, and cancer therapeutics.
Keywords: super resolution imaging; organic dyes; BIOSENSING; genetically encoded proteins; optical imaging; tracking biological processes; fluorescent probes
Optimized algorithm for image semantic segmentation compression algorithm in video surveillance scenarios
19
Authors: ZHANG Yangmei, ZHANG Xishan, ZHANG Shuo, LI Jintao. 《High Technology Letters》, 2025, Issue 2, pp. 194-203 (10 pages)
In recent years, video coding has been widely applied in the field of video image processing to remove redundant information and improve data transmission efficiency. However, during the video coding process, irrelevant objects such as background elements are often encoded due to environmental disturbances, resulting in the wastage of computational resources. Existing research on video coding efficiency optimization primarily focuses on optimizing encoding units during intra-frame or inter-frame prediction after the generation of coding units, neglecting the optimization of video images before coding unit generation. To address this challenge, this work proposes an image semantic segmentation compression algorithm based on macroblock encoding (ISSC-ME), which consists of three modules. (1) The semantic label generation module generates interesting object labels using a grid-based approach to reduce redundant coding of consecutive frames. (2) The image segmentation network module generates a semantic segmentation image using U-Net. (3) The macroblock coding module is a block segmentation-based video encoding and decoding algorithm used to compress images and improve video transmission efficiency. Experimental results show that the proposed image semantic segmentation optimization algorithm can reduce the computational costs, and improve the overall accuracy by 1.00% and the mean intersection over union (IoU) by 1.20%. In addition, the proposed compression algorithm utilizes macroblock fusion, resulting in the image compression rate achieving 80.64%. It has been proven that the proposed algorithm greatly reduces data storage and transmission, and enables fast image compression processing at the millisecond level.
Keywords: macroblock encoding; semantic segmentation; segmentation compression
Remote sensing image semantic segmentation algorithm based on improved DeepLabv3+
20
Authors: SONG Xirui, GE Hongwei, LI Ting. 《Journal of Measurement Science and Instrumentation》, 2025, Issue 2, pp. 205-215 (11 pages)
The convolutional neural network (CNN) method based on DeepLabv3+ has some problems in the semantic segmentation task of high-resolution remote sensing images, such as fixed receptive field size of feature extraction, lack of semantic information, high decoder magnification, and insufficient detail retention ability. A hierarchical feature fusion network (HFFNet) was proposed. Firstly, a combination of transformer and CNN architectures was employed for feature extraction from images of varying resolutions. The extracted features were processed independently. Subsequently, the features from the transformer and CNN were fused under the guidance of features from different sources. This fusion process assisted in restoring information more comprehensively during the decoding stage. Furthermore, a spatial channel attention module was designed in the final stage of decoding to refine features and reduce the semantic gap between shallow CNN features and deep decoder features. The experimental results showed that HFFNet had superior performance on UAVid, LoveDA, Potsdam, and Vaihingen datasets, and its cross-linking index was better than DeepLabv3+ and other competing methods, showing strong generalization ability.
Keywords: semantic segmentation; high-resolution remote sensing image; deep learning; transformer model; attention mechanism; feature fusion; ENCODER; DECODER