Brain tumors pose significant diagnostic challenges due to their diverse types and complex anatomical locations.Due to the increase in precision image-based diagnostic tools,driven by advancements in artificial intell...Brain tumors pose significant diagnostic challenges due to their diverse types and complex anatomical locations.Due to the increase in precision image-based diagnostic tools,driven by advancements in artificial intelligence(AI)and deep learning,there has been potential to improve diagnostic accuracy,especially with Magnetic Resonance Imaging(MRI).However,traditional state-of-the-art models lack the sensitivity essential for reliable tumor identification and segmentation.Thus,our research aims to enhance brain tumor diagnosis in MRI by proposing an advanced model.The proposed model incorporates dilated convolutions to optimize the brain tumor segmentation and classification.The proposed model is first trained and later evaluated using the BraTS 2020 dataset.In our proposed model preprocessing consists of normalization,noise reduction,and data augmentation to improve model robustness.The attention mechanism and dilated convolutions were introduced to increase the model’s focus on critical regions and capture finer spatial details without compromising image resolution.We have performed experimentation to measure efficiency.For this,we have used various metrics including accuracy,sensitivity,and curve(AUC-ROC).The proposed model achieved a high accuracy of 94%,a sensitivity of 93%,a specificity of 92%,and an AUC-ROC of 0.98,outperforming traditional diagnostic models in brain tumor detection.The proposed model accurately identifies tumor regions,while dilated convolutions enhanced the segmentation accuracy,especially for complex tumor structures.The proposed model demonstrates significant potential for clinical application,providing reliable and precise brain tumor detection in MRI.展开更多
The Bernoulli convolution ν λ measure is shown to be absolutely continuous with L 2 density for almost all 12<λ<1,and singular if λ -1 is a Pisot number. It is an open question whether the Pisot typ...The Bernoulli convolution ν λ measure is shown to be absolutely continuous with L 2 density for almost all 12<λ<1,and singular if λ -1 is a Pisot number. It is an open question whether the Pisot type Bernoulli convolutions are the only singular ones. In this paper,we construct a family of non-Pisot type Bernoulli convolutions ν λ such that their density functions,if they exist,are not L 2. We also construct other Bernolulli convolutions whose density functions,if they exist,behave rather badly.展开更多
Louis Pierre Gratiolet (1815-1865) was one of the first modern anatomists to pay attention to cerebral convolutions. Born in Sainte-Foy-la-Grande (Gironde), he moved to Paris in 1834 to study medicine, as well as comp...Louis Pierre Gratiolet (1815-1865) was one of the first modern anatomists to pay attention to cerebral convolutions. Born in Sainte-Foy-la-Grande (Gironde), he moved to Paris in 1834 to study medicine, as well as comparative anatomy under Henri de Blainville (1777-1850). In 1842, he accepted de Blainville’s offer to become his assistant at the Muséum d’histoire naturelle and progressively abandoned medicine for comparative anatomy. He undertook a detailed study of brains of human and nonhuman primates and soon realized that the organizational pattern of cerebral convolutions was so predictable that it could serve as a criterion to classify primate groups. He noted that only the deepest sulci exist in lower primate forms, while the complexity of cortical folding increases markedly in great apes and humans. Gratiolet provided the first cogent description of the lobular organization of primate cerebral hemispheres. He saw the insula as a central lobe around which revolved the frontal, parietal, temporal (temporo-sphenoidal) and occipital lobes. He correctly identified most gyri and sulci on all brain surfaces, introduced the term “plis de passage” for some interconnecting gyri, and provided the first description of the optic radiations. In the early 1860s, Gratiolet fought a highly publicized battle against Paul Broca (1824-1880) on the relationship between brain and intelligence. Gratiolet agreed that the brain was most likely the seat of intelligence, but he considered human cognition far too subtle to have any direct relationship with brain size. He argued that a detailed study of the human brain architecture would be more profitable than Broca’s vain speculations on the relationship between brain weight and intelligence, which he considered a monolithic entity. Despite remarkable scientific achievements and a unique teaching capacity, Gratiolet was unable to secure any academic position until three years before his sudden death in Paris at age 49.展开更多
Pointwise convolution is usually utilized to expand or squeeze features in modern lightweight deep models.However,it takes up most of the overall computational cost(usually more than 90%).This paper proposes a novel P...Pointwise convolution is usually utilized to expand or squeeze features in modern lightweight deep models.However,it takes up most of the overall computational cost(usually more than 90%).This paper proposes a novel Poker module to expand features by taking advantage of cheap depthwise convolution.As a result,the Poker module can greatly reduce the computational cost,and meanwhile generate a large number of effective features to guarantee the performance.The proposed module is standardized and can be employed wherever the feature expansion is needed.By varying the stride and the number of channels,different kinds of bottlenecks are designed to plug the proposed Poker module into the network.Thus,a lightweight model can be easily assembled.Experiments conducted on benchmarks reveal the effectiveness of our proposed Poker module.And our Poker Net models can reduce the computational cost by 7.1%-15.6%.Poker Net models achieve comparable or even higher recognition accuracy than previous state-of-the-art(SOTA)models on the Image Net ILSVRC2012 classification dataset.Code is available at https://github.com/diaomin/pokernet.展开更多
Text format information is full of most of the resources of Internet,which puts forward higher and higher requirements for the accuracy of text classification.Therefore,in this manuscript,firstly,we design a hybrid mo...Text format information is full of most of the resources of Internet,which puts forward higher and higher requirements for the accuracy of text classification.Therefore,in this manuscript,firstly,we design a hybrid model of bidirectional encoder representation from transformers-hierarchical attention networks-dilated convolutions networks(BERT_HAN_DCN)which based on BERT pre-trained model with superior ability of extracting characteristic.The advantages of HAN model and DCN model are taken into account which can help gain abundant semantic information,fusing context semantic features and hierarchical characteristics.Secondly,the traditional softmax algorithm increases the learning difficulty of the same kind of samples,making it more difficult to distinguish similar features.Based on this,AM-softmax is introduced to replace the traditional softmax.Finally,the fused model is validated,which shows superior performance in the accuracy rate and F1-score of this hybrid model on two datasets and the experimental analysis shows the general single models such as HAN,DCN,based on BERT pre-trained model.Besides,the improved AM-softmax network model is superior to the general softmax network model.展开更多
A discrete algorithm suitable for the computation of complex frequency-domain convolution on computers was derived. The Durbin's numerical inversion of Laplace transforms can be used to figure out the time-domain ...A discrete algorithm suitable for the computation of complex frequency-domain convolution on computers was derived. The Durbin's numerical inversion of Laplace transforms can be used to figure out the time-domain digital solution of the result of complex frequency-domain convolutions. Compared with the digital solutions and corresponding analytical solutions, it is shown that the digital solutions have high precision.展开更多
Here concerned and further investigated is a certain operator method for the computation of convolutions of polynomials.We provide a general formulation of the method with a refinement for certain old results,and also...Here concerned and further investigated is a certain operator method for the computation of convolutions of polynomials.We provide a general formulation of the method with a refinement for certain old results,and also give some new applications to convolved sums involving several noted special polynomials.The advantage of the method using operators is illustrated with concrete examples.Finally,also presented is a brief investigation on convolution polynomials having two types of summations.展开更多
Based on quantum mechanical representation and operator theory,this paper restates the two new convolutions of fractional Fourier transform(FrFT)by making full use of the conversion relationship between two mutual con...Based on quantum mechanical representation and operator theory,this paper restates the two new convolutions of fractional Fourier transform(FrFT)by making full use of the conversion relationship between two mutual conjugates:coordinate representation and momentum representation.This paper gives full play to the efficiency of Dirac notation and proves the convolutions of fractional Fourier transform from the perspective of quantum optics,a field that has been developing rapidly.These two new convolution methods have potential value in signal processing.展开更多
For a locally compact group G, L 1(G) is its group algebra and L ∞(G) is the dual of L 1(G). Lau has studied the bounded linear operators T : L ∞(G) → L ∞(G) which commute with convolutions and translations. For a...For a locally compact group G, L 1(G) is its group algebra and L ∞(G) is the dual of L 1(G). Lau has studied the bounded linear operators T : L ∞(G) → L ∞(G) which commute with convolutions and translations. For a subspace H of L ∞(G), we know that M(L ∞(G),H), the Banach algebra of all bounded linear operators on L ∞(G) into H which commute with convolutions, has been studied by Pym and Lau. In this paper, we generalize these problems to L(K)*, the dual of a hypergroup algebra L(K) in a very general setting, i. e. we do not assume that K admits a Haar measure. It should be noted that these algebras include not only the group algebra L 1(G) but also most of the semigroup algebras. Compact hypergroups have a Haar measure, however, in general it is not known that every hypergroup has a Haar measure. The lack of the Haar measure and involution presents many difficulties; however, we succeed in getting some interesting results.展开更多
We propose a novel fast numerical calculation method for the Rayleigh-Sommerfeld diffraction integral,which is developed based on the existing scaled convolution method.This approach enables fast cal-culations for gen...We propose a novel fast numerical calculation method for the Rayleigh-Sommerfeld diffraction integral,which is developed based on the existing scaled convolution method.This approach enables fast cal-culations for general cases of off-axis scenarios where the sampling intervals and numbers of the input and observation planes are unequal.Additionally,it allows for arbitrary adjustment of the sampling interval of the impulse response function,facilitating a manual trade-off between computational load and accuracy.The er-rors associated with this method,which is equivalent to interpolation,primarily arise from the discontinuities of the sampling matrix of the impulse response function on its boundaries of periodic extension.To address this issue,we propose the concept of the padding function and its construction method,and evaluate its ef-fectiveness in enhancing computational accuracy.The feasibility of the proposed method is verified by nu-merical simulation and compared with the direct integration DI-method in a simplified scenario.It shows that the proposed method has good computational accuracy for the general case where the sampling interval of the input and observation plane is not equal under non-near-field diffraction,and when the diffraction distance is large,although the computational accuracy of the proposed method cannot exceed that of the DI-method,the computational amount can be significantly reduced with almost no effect on the computational accuracy.This method provides a general numerical calculation scheme of diffraction in the non-near field case for areas such as computational holography.展开更多
Near-infrared image sensors are widely used in fields such as material identification,machine vision,and autonomous driving.Lead sulfide colloidal quantum dot-based infrared photodiodes can be integrated with sil⁃icon...Near-infrared image sensors are widely used in fields such as material identification,machine vision,and autonomous driving.Lead sulfide colloidal quantum dot-based infrared photodiodes can be integrated with sil⁃icon-based readout circuits in a single step.Based on this,we propose a photodiode based on an n-i-p structure,which removes the buffer layer and further simplifies the manufacturing process of quantum dot image sensors,thus reducing manufacturing costs.Additionally,for the noise complexity in quantum dot image sensors when capturing images,traditional denoising and non-uniformity methods often do not achieve optimal denoising re⁃sults.For the noise and stripe-type non-uniformity commonly encountered in infrared quantum dot detector imag⁃es,a network architecture has been developed that incorporates multiple key modules.This network combines channel attention and spatial attention mechanisms,dynamically adjusting the importance of feature maps to en⁃hance the ability to distinguish between noise and details.Meanwhile,the residual dense feature fusion module further improves the network's ability to process complex image structures through hierarchical feature extraction and fusion.Furthermore,the pyramid pooling module effectively captures information at different scales,improv⁃ing the network's multi-scale feature representation ability.Through the collaborative effect of these modules,the network can better handle various mixed noise and image non-uniformity issues.Experimental results show that it outperforms the traditional U-Net network in denoising and image correction tasks.展开更多
The existence of absorption and reflection of light underwater leads to problems such as color distortion and blue-green bias in underwater images.In this study,a depthwise separable convolution-based generative adver...The existence of absorption and reflection of light underwater leads to problems such as color distortion and blue-green bias in underwater images.In this study,a depthwise separable convolution-based generative adversarial network(GAN)algorithm was proposed.Taking GAN as the basic framework,it combined a depthwise separable convolution module,attention mechanism,and reconstructed convolution module to realize the enhancement of underwater degraded images.Multi-scale features were captured by the depthwise separable convolution module,and the attention mechanism was utilized to enhance attention to important features.The reconstructed convolution module further extracts and fuses local and global features.Experimental results showed that the algorithm performs well in improving the color bias and blurring of underwater images,with PSNR reaching 27.835,SSIM reaching 0.883,UIQM reaching 3.205,and UCIQE reaching 0.713.The enhanced image outperforms the comparison algorithm in both subjective and objective metrics.展开更多
Dear Editor,D2This letter presents a node feature similarity preserving graph convolutional framework P G.Graph neural networks(GNNs)have garnered significant attention for their efficacy in learning graph representat...Dear Editor,D2This letter presents a node feature similarity preserving graph convolutional framework P G.Graph neural networks(GNNs)have garnered significant attention for their efficacy in learning graph representations across diverse real-world applications.展开更多
Reliable traffic flow prediction is crucial for mitigating urban congestion.This paper proposes Attentionbased spatiotemporal Interactive Dynamic Graph Convolutional Network(AIDGCN),a novel architecture integrating In...Reliable traffic flow prediction is crucial for mitigating urban congestion.This paper proposes Attentionbased spatiotemporal Interactive Dynamic Graph Convolutional Network(AIDGCN),a novel architecture integrating Interactive Dynamic Graph Convolution Network(IDGCN)with Temporal Multi-Head Trend-Aware Attention.Its core innovation lies in IDGCN,which uniquely splits sequences into symmetric intervals for interactive feature sharing via dynamic graphs,and a novel attention mechanism incorporating convolutional operations to capture essential local traffic trends—addressing a critical gap in standard attention for continuous data.For 15-and 60-min forecasting on METR-LA,AIDGCN achieves MAEs of 0.75%and 0.39%,and RMSEs of 1.32%and 0.14%,respectively.In the 60-min long-term forecasting of the PEMS-BAY dataset,the AIDGCN out-performs the MRA-BGCN method by 6.28%,4.93%,and 7.17%in terms of MAE,RMSE,and MAPE,respectively.Experimental results demonstrate the superiority of our pro-posed model over state-of-the-art methods.展开更多
Advances in optical coherence tomography(OCT)technology allow a clear view of the vitreoretinal interface(VRI).The abnormality of the VRI is one of the common symptoms of high myopia,mainly including posterior vitreou...Advances in optical coherence tomography(OCT)technology allow a clear view of the vitreoretinal interface(VRI).The abnormality of the VRI is one of the common symptoms of high myopia,mainly including posterior vitreous detachment(PVD)and epiretinal membrane(ERM).They can cause severe damage to the structure and function of the retina,leading to permanent vision loss.Therefore,fully automated detection of abnormalities at the VRI is crucial for the management of high myopia.This paper presents a DS-YOLOv7 network aimed at accurately identifying abnormalities,including partial PVD,complete PVD,and ERM from retinal OCT images.Built upon the YOLOv7 network,the proposed model integrates the advanced dynamic snake convolution(DSConv)module to capture the curvilinear characteristics of lesions,and the mixture of attention and convolution(ACMix)module to improve the precision and robustness of feature extraction through effective fusion of self-attention mechanisms and convolution.Moreover,the introduction of the efficient complete intersection-over-union(ECIoU)loss function further enhances the coordinate regression capability of the model.Threefold cross-validation on a dataset with 1973 OCT B-scans from 46 patients shows that the DS-YOLOv7 achieved superior performance in vitreoretinal interface abnormality detection,with mAP@0.5 of 0.714,mAP@0.75 of 0.438,and mAP@0.5:0.95 of 0.424.The proposed model can provide an accurate and efficient diagnostic tool for patients with high myopia.展开更多
Dear Editor,The letter proposes a tensor low-rank orthogonal compression(TLOC)model for a convolutional neural network(CNN),which facilitates its efficient and highly-accurate low-rank representation.Model compression...Dear Editor,The letter proposes a tensor low-rank orthogonal compression(TLOC)model for a convolutional neural network(CNN),which facilitates its efficient and highly-accurate low-rank representation.Model compression is crucial for deploying deep neural network(DNN)models on resource-constrained embedded devices.展开更多
基金supported by the European University of Atlantic.
文摘Brain tumors pose significant diagnostic challenges due to their diverse types and complex anatomical locations.Due to the increase in precision image-based diagnostic tools,driven by advancements in artificial intelligence(AI)and deep learning,there has been potential to improve diagnostic accuracy,especially with Magnetic Resonance Imaging(MRI).However,traditional state-of-the-art models lack the sensitivity essential for reliable tumor identification and segmentation.Thus,our research aims to enhance brain tumor diagnosis in MRI by proposing an advanced model.The proposed model incorporates dilated convolutions to optimize the brain tumor segmentation and classification.The proposed model is first trained and later evaluated using the BraTS 2020 dataset.In our proposed model preprocessing consists of normalization,noise reduction,and data augmentation to improve model robustness.The attention mechanism and dilated convolutions were introduced to increase the model’s focus on critical regions and capture finer spatial details without compromising image resolution.We have performed experimentation to measure efficiency.For this,we have used various metrics including accuracy,sensitivity,and curve(AUC-ROC).The proposed model achieved a high accuracy of 94%,a sensitivity of 93%,a specificity of 92%,and an AUC-ROC of 0.98,outperforming traditional diagnostic models in brain tumor detection.The proposed model accurately identifies tumor regions,while dilated convolutions enhanced the segmentation accuracy,especially for complex tumor structures.The proposed model demonstrates significant potential for clinical application,providing reliable and precise brain tumor detection in MRI.
文摘The Bernoulli convolution ν λ measure is shown to be absolutely continuous with L 2 density for almost all 12<λ<1,and singular if λ -1 is a Pisot number. It is an open question whether the Pisot type Bernoulli convolutions are the only singular ones. In this paper,we construct a family of non-Pisot type Bernoulli convolutions ν λ such that their density functions,if they exist,are not L 2. We also construct other Bernolulli convolutions whose density functions,if they exist,behave rather badly.
文摘Louis Pierre Gratiolet (1815-1865) was one of the first modern anatomists to pay attention to cerebral convolutions. Born in Sainte-Foy-la-Grande (Gironde), he moved to Paris in 1834 to study medicine, as well as comparative anatomy under Henri de Blainville (1777-1850). In 1842, he accepted de Blainville’s offer to become his assistant at the Muséum d’histoire naturelle and progressively abandoned medicine for comparative anatomy. He undertook a detailed study of brains of human and nonhuman primates and soon realized that the organizational pattern of cerebral convolutions was so predictable that it could serve as a criterion to classify primate groups. He noted that only the deepest sulci exist in lower primate forms, while the complexity of cortical folding increases markedly in great apes and humans. Gratiolet provided the first cogent description of the lobular organization of primate cerebral hemispheres. He saw the insula as a central lobe around which revolved the frontal, parietal, temporal (temporo-sphenoidal) and occipital lobes. He correctly identified most gyri and sulci on all brain surfaces, introduced the term “plis de passage” for some interconnecting gyri, and provided the first description of the optic radiations. In the early 1860s, Gratiolet fought a highly publicized battle against Paul Broca (1824-1880) on the relationship between brain and intelligence. Gratiolet agreed that the brain was most likely the seat of intelligence, but he considered human cognition far too subtle to have any direct relationship with brain size. He argued that a detailed study of the human brain architecture would be more profitable than Broca’s vain speculations on the relationship between brain weight and intelligence, which he considered a monolithic entity. Despite remarkable scientific achievements and a unique teaching capacity, Gratiolet was unable to secure any academic position until three years before his sudden death in Paris at age 49.
基金supported by National Natural Science Foundation of China(Nos.61525306,61633021,61721004,61806194,U1803261 and 61976132)Major Project for New Generation of AI(No.2018AAA0100400)+2 种基金Beijing Nova Program(No.Z201100006820079)Shandong Provincial Key Research and Development Program(No.2019JZZY010119)CAS-AIR。
文摘Pointwise convolution is usually utilized to expand or squeeze features in modern lightweight deep models.However,it takes up most of the overall computational cost(usually more than 90%).This paper proposes a novel Poker module to expand features by taking advantage of cheap depthwise convolution.As a result,the Poker module can greatly reduce the computational cost,and meanwhile generate a large number of effective features to guarantee the performance.The proposed module is standardized and can be employed wherever the feature expansion is needed.By varying the stride and the number of channels,different kinds of bottlenecks are designed to plug the proposed Poker module into the network.Thus,a lightweight model can be easily assembled.Experiments conducted on benchmarks reveal the effectiveness of our proposed Poker module.And our Poker Net models can reduce the computational cost by 7.1%-15.6%.Poker Net models achieve comparable or even higher recognition accuracy than previous state-of-the-art(SOTA)models on the Image Net ILSVRC2012 classification dataset.Code is available at https://github.com/diaomin/pokernet.
基金Fundamental Research Funds for the Central University,China(No.2232018D3-17)。
文摘Text format information is full of most of the resources of Internet,which puts forward higher and higher requirements for the accuracy of text classification.Therefore,in this manuscript,firstly,we design a hybrid model of bidirectional encoder representation from transformers-hierarchical attention networks-dilated convolutions networks(BERT_HAN_DCN)which based on BERT pre-trained model with superior ability of extracting characteristic.The advantages of HAN model and DCN model are taken into account which can help gain abundant semantic information,fusing context semantic features and hierarchical characteristics.Secondly,the traditional softmax algorithm increases the learning difficulty of the same kind of samples,making it more difficult to distinguish similar features.Based on this,AM-softmax is introduced to replace the traditional softmax.Finally,the fused model is validated,which shows superior performance in the accuracy rate and F1-score of this hybrid model on two datasets and the experimental analysis shows the general single models such as HAN,DCN,based on BERT pre-trained model.Besides,the improved AM-softmax network model is superior to the general softmax network model.
文摘A discrete algorithm suitable for the computation of complex frequency-domain convolution on computers was derived. The Durbin's numerical inversion of Laplace transforms can be used to figure out the time-domain digital solution of the result of complex frequency-domain convolutions. Compared with the digital solutions and corresponding analytical solutions, it is shown that the digital solutions have high precision.
文摘Here concerned and further investigated is a certain operator method for the computation of convolutions of polynomials.We provide a general formulation of the method with a refinement for certain old results,and also give some new applications to convolved sums involving several noted special polynomials.The advantage of the method using operators is illustrated with concrete examples.Finally,also presented is a brief investigation on convolution polynomials having two types of summations.
基金National Natural Science Foundation of China(Grant Number:11304126)College Students' Innovation Training Program(Grant Number:202110299696X)。
文摘Based on quantum mechanical representation and operator theory,this paper restates the two new convolutions of fractional Fourier transform(FrFT)by making full use of the conversion relationship between two mutual conjugates:coordinate representation and momentum representation.This paper gives full play to the efficiency of Dirac notation and proves the convolutions of fractional Fourier transform from the perspective of quantum optics,a field that has been developing rapidly.These two new convolution methods have potential value in signal processing.
文摘For a locally compact group G, L 1(G) is its group algebra and L ∞(G) is the dual of L 1(G). Lau has studied the bounded linear operators T : L ∞(G) → L ∞(G) which commute with convolutions and translations. For a subspace H of L ∞(G), we know that M(L ∞(G),H), the Banach algebra of all bounded linear operators on L ∞(G) into H which commute with convolutions, has been studied by Pym and Lau. In this paper, we generalize these problems to L(K)*, the dual of a hypergroup algebra L(K) in a very general setting, i. e. we do not assume that K admits a Haar measure. It should be noted that these algebras include not only the group algebra L 1(G) but also most of the semigroup algebras. Compact hypergroups have a Haar measure, however, in general it is not known that every hypergroup has a Haar measure. The lack of the Haar measure and involution presents many difficulties; however, we succeed in getting some interesting results.
文摘We propose a novel fast numerical calculation method for the Rayleigh-Sommerfeld diffraction integral,which is developed based on the existing scaled convolution method.This approach enables fast cal-culations for general cases of off-axis scenarios where the sampling intervals and numbers of the input and observation planes are unequal.Additionally,it allows for arbitrary adjustment of the sampling interval of the impulse response function,facilitating a manual trade-off between computational load and accuracy.The er-rors associated with this method,which is equivalent to interpolation,primarily arise from the discontinuities of the sampling matrix of the impulse response function on its boundaries of periodic extension.To address this issue,we propose the concept of the padding function and its construction method,and evaluate its ef-fectiveness in enhancing computational accuracy.The feasibility of the proposed method is verified by nu-merical simulation and compared with the direct integration DI-method in a simplified scenario.It shows that the proposed method has good computational accuracy for the general case where the sampling interval of the input and observation plane is not equal under non-near-field diffraction,and when the diffraction distance is large,although the computational accuracy of the proposed method cannot exceed that of the DI-method,the computational amount can be significantly reduced with almost no effect on the computational accuracy.This method provides a general numerical calculation scheme of diffraction in the non-near field case for areas such as computational holography.
基金Supported by the National key research and development program in the 14th five year plan 2021YFA1200700)the National Natural Science Foundation of China(62535018,62431025,62561160113)the Natural Science Foundation of Shanghai(23ZR1473400).
文摘Near-infrared image sensors are widely used in fields such as material identification,machine vision,and autonomous driving.Lead sulfide colloidal quantum dot-based infrared photodiodes can be integrated with sil⁃icon-based readout circuits in a single step.Based on this,we propose a photodiode based on an n-i-p structure,which removes the buffer layer and further simplifies the manufacturing process of quantum dot image sensors,thus reducing manufacturing costs.Additionally,for the noise complexity in quantum dot image sensors when capturing images,traditional denoising and non-uniformity methods often do not achieve optimal denoising re⁃sults.For the noise and stripe-type non-uniformity commonly encountered in infrared quantum dot detector imag⁃es,a network architecture has been developed that incorporates multiple key modules.This network combines channel attention and spatial attention mechanisms,dynamically adjusting the importance of feature maps to en⁃hance the ability to distinguish between noise and details.Meanwhile,the residual dense feature fusion module further improves the network's ability to process complex image structures through hierarchical feature extraction and fusion.Furthermore,the pyramid pooling module effectively captures information at different scales,improv⁃ing the network's multi-scale feature representation ability.Through the collaborative effect of these modules,the network can better handle various mixed noise and image non-uniformity issues.Experimental results show that it outperforms the traditional U-Net network in denoising and image correction tasks.
文摘The existence of absorption and reflection of light underwater leads to problems such as color distortion and blue-green bias in underwater images.In this study,a depthwise separable convolution-based generative adversarial network(GAN)algorithm was proposed.Taking GAN as the basic framework,it combined a depthwise separable convolution module,attention mechanism,and reconstructed convolution module to realize the enhancement of underwater degraded images.Multi-scale features were captured by the depthwise separable convolution module,and the attention mechanism was utilized to enhance attention to important features.The reconstructed convolution module further extracts and fuses local and global features.Experimental results showed that the algorithm performs well in improving the color bias and blurring of underwater images,with PSNR reaching 27.835,SSIM reaching 0.883,UIQM reaching 3.205,and UCIQE reaching 0.713.The enhanced image outperforms the comparison algorithm in both subjective and objective metrics.
基金supported by the National Natural Science Foundation of China(62402399)the New Chongqing Youth Innovation Talent Project(CSTB2024NSCQ-QCXMX0035)。
文摘Dear Editor,D2This letter presents a node feature similarity preserving graph convolutional framework P G.Graph neural networks(GNNs)have garnered significant attention for their efficacy in learning graph representations across diverse real-world applications.
文摘Reliable traffic flow prediction is crucial for mitigating urban congestion.This paper proposes Attentionbased spatiotemporal Interactive Dynamic Graph Convolutional Network(AIDGCN),a novel architecture integrating Interactive Dynamic Graph Convolution Network(IDGCN)with Temporal Multi-Head Trend-Aware Attention.Its core innovation lies in IDGCN,which uniquely splits sequences into symmetric intervals for interactive feature sharing via dynamic graphs,and a novel attention mechanism incorporating convolutional operations to capture essential local traffic trends—addressing a critical gap in standard attention for continuous data.For 15-and 60-min forecasting on METR-LA,AIDGCN achieves MAEs of 0.75%and 0.39%,and RMSEs of 1.32%and 0.14%,respectively.In the 60-min long-term forecasting of the PEMS-BAY dataset,the AIDGCN out-performs the MRA-BGCN method by 6.28%,4.93%,and 7.17%in terms of MAE,RMSE,and MAPE,respectively.Experimental results demonstrate the superiority of our pro-posed model over state-of-the-art methods.
基金supported by the National Natural Science Foundation of China(62271337,62371326,and 62371328)the National Key Research and Development Program of China(2019FYC1710204)+1 种基金the National Clinical Key Specialty Construction Project(10000015Z155080000004)the Natural Science Foundation of Jiangsu Province(BK20231310).
文摘Advances in optical coherence tomography(OCT)technology allow a clear view of the vitreoretinal interface(VRI).The abnormality of the VRI is one of the common symptoms of high myopia,mainly including posterior vitreous detachment(PVD)and epiretinal membrane(ERM).They can cause severe damage to the structure and function of the retina,leading to permanent vision loss.Therefore,fully automated detection of abnormalities at the VRI is crucial for the management of high myopia.This paper presents a DS-YOLOv7 network aimed at accurately identifying abnormalities,including partial PVD,complete PVD,and ERM from retinal OCT images.Built upon the YOLOv7 network,the proposed model integrates the advanced dynamic snake convolution(DSConv)module to capture the curvilinear characteristics of lesions,and the mixture of attention and convolution(ACMix)module to improve the precision and robustness of feature extraction through effective fusion of self-attention mechanisms and convolution.Moreover,the introduction of the efficient complete intersection-over-union(ECIoU)loss function further enhances the coordinate regression capability of the model.Threefold cross-validation on a dataset with 1973 OCT B-scans from 46 patients shows that the DS-YOLOv7 achieved superior performance in vitreoretinal interface abnormality detection,with mAP@0.5 of 0.714,mAP@0.75 of 0.438,and mAP@0.5:0.95 of 0.424.The proposed model can provide an accurate and efficient diagnostic tool for patients with high myopia.
基金supported by the Science and Technology Innovation Key R&D Program of Chongqing(CSTB2025TIAD-STX0032)National Key Research and Development Program of China(2024YFF0908200)+1 种基金the Chongqing Technology Innovation and Application Development Special Key Project(CSTB2024TIAD-KPX0018)the Southwest University Graduate Student Research Innovation(SWUB24051)。
文摘Dear Editor,The letter proposes a tensor low-rank orthogonal compression(TLOC)model for a convolutional neural network(CNN),which facilitates its efficient and highly-accurate low-rank representation.Model compression is crucial for deploying deep neural network(DNN)models on resource-constrained embedded devices.