Imaging observations of solar X-ray bursts can reveal details of the energy release process and particle acceleration in flares.Most hard X-ray imagers make use of the modulation-based Fourier transform imaging method...Imaging observations of solar X-ray bursts can reveal details of the energy release process and particle acceleration in flares.Most hard X-ray imagers make use of the modulation-based Fourier transform imaging method,an indirect imaging technique that requires algorithms to reconstruct and optimize images.During the last decade,a variety of algorithms have been developed and improved.However,it is difficult to quantitatively evaluate the image quality of different solutions without a true,reference image of observation.How to choose the values of imaging parameters for these algorithms to get the best performance is also an open question.In this study,we present a detailed test of the characteristics of these algorithms,imaging dynamic range and a crucial parameter for the CLEAN method,clean beam width factor(CBWF).We first used SDO/AIA EUV images to compute DEM maps and calculate thermal X-ray maps.Then these realistic sources and several types of simulated sources are used as the ground truth in the imaging simulations for both RHESSI and ASO-S/HXI.The different solutions are evaluated quantitatively by a number of means.The overall results suggest that EM,PIXON,and CLEAN are exceptional methods for sidelobe elimination,producing images with clear source details.Although MEM_GE,MEM_NJIT,VIS_WV and VIS_CS possess fast imaging processes and generate good images,they too possess associated imperfections unique to each method.The two forward fit algorithms,VF and FF,perform differently,and VF appears to be more robust and useful.We also demonstrated the imaging capability of HXI and available HXI algorithms.Furthermore,the effect of CBWF on image quality was investigated,and the optimal settings for both RHESSI and HXI were proposed.展开更多
Crop leaf area index(LAI)and biomass are two major biophysical parameters to measure crop growth and health condition.Measuring LAI and biomass in field experiments is a destructive method.Therefore,we focused on the ...Crop leaf area index(LAI)and biomass are two major biophysical parameters to measure crop growth and health condition.Measuring LAI and biomass in field experiments is a destructive method.Therefore,we focused on the application of unmanned aerial vehicles(UAVs)in agriculture,which is a cost and labor-efficientmethod.Hence,UAV-captured multispectral images were applied to monitor crop growth,identify plant bio-physical conditions,and so on.In this study,we monitored soybean crops using UAV and field experiments.This experiment was conducted at theMAFES(Mississippi Agricultural and Forestry Experiment Station)Pontotoc Ridge-Flatwoods Branch Experiment Station.It followed a randomized block design with five cover crops:Cereal Rye,Vetch,Wheat,MC:mixed Mustard and Cereal Rye,and native vegetation.Planting was made in the fall,and three fertilizer treatments were applied:Synthetic Fertilizer,Poultry Litter,and none,applied before planting the soybean,in a full factorial combination.We monitored soybean reproductive phases at R3(initial pod development),R5(initial seed development),R6(full seed development),and R7(initial maturity)and used UAV multispectral remote sensing for soybean LAI and biomass estimations.The major goal of this study was to assess LAI and biomass estimations from UAV multispectral images in the reproductive stages when the development of leaves and biomass was stabilized.Wemade about fourteen vegetation indices(VIs)fromUAVmultispectral images at these stages to estimate LAI and biomass.Wemodeled LAI and biomass based on these remotely sensed VIs and ground-truth measurements usingmachine learning methods,including linear regression,Random Forest(RF),and support vector regression(SVR).Thereafter,the models were applied to estimate LAI and biomass.According to the model results,LAI was better estimated at the R6 stage and biomass at the R3 stage.Compared to the other models,the RF models showed better estimation,i.e.,an R^(2) of about 0.58–0.68 with an RMSE(rootmean square error)of 0.52–0.60(m^(2)/m^(2))for the LAI and about 0.44–0.64 for R^(2) and 21–26(g dry weight/5 plants)for RMSE of biomass estimation.We performed a leave-one-out cross-validation.Based on cross-validatedmodels with field experiments,we also found that the R6 stage was the best for estimating LAI,and the R3 stage for estimating crop biomass.The cross-validated RF model showed the estimation ability with an R^(2) about 0.25–0.44 and RMSE of 0.65–0.85(m^(2)/m^(2))for LAI estimation;and R^(2) about 0.1–0.31 and an RMSE of about 28–35(g dry weight/5 plants)for crop biomass estimation.This result will be helpful to promote the use of non-destructive remote sensing methods to determine the crop LAI and biomass status,which may bring more efficient crop production and management.展开更多
Accurate segmentation of camouflage objects in aerial imagery is vital for improving the efficiency of UAV-based reconnaissance and rescue missions.However,camouflage object segmentation is increasingly challenging du...Accurate segmentation of camouflage objects in aerial imagery is vital for improving the efficiency of UAV-based reconnaissance and rescue missions.However,camouflage object segmentation is increasingly challenging due to advances in both camouflage materials and biological mimicry.Although multispectral-RGB based technology shows promise,conventional dual-aperture multispectral-RGB imaging systems are constrained by imprecise and time-consuming registration and fusion across different modalities,limiting their performance.Here,we propose the Reconstructed Multispectral-RGB Fusion Network(RMRF-Net),which reconstructs RGB images into multispectral ones,enabling efficient multimodal segmentation using only an RGB camera.Specifically,RMRF-Net employs a divergentsimilarity feature correction strategy to minimize reconstruction errors and includes an efficient boundary-aware decoder to enhance object contours.Notably,we establish the first real-world aerial multispectral-RGB semantic segmentation of camouflage objects dataset,including 11 object categories.Experimental results demonstrate that RMRF-Net outperforms existing methods,achieving 17.38 FPS on the NVIDIA Jetson AGX Orin,with only a 0.96%drop in mIoU compared to the RTX 3090,showing its practical applicability in multimodal remote sensing.展开更多
Scientific and precise evaluations of the megafaunal and landform characteristics of seamounts are important guides for their protection and study.A series of manned and unmanned submersibles have provided invaluable ...Scientific and precise evaluations of the megafaunal and landform characteristics of seamounts are important guides for their protection and study.A series of manned and unmanned submersibles have provided invaluable observational imaging data for the ecological study of seamounts.However,traditional methods of artificial observation of seamount imaging data cannot accurately and efficiently determine the characteristics of megafauna and landforms.This research harnesses data-driven technology to systematically investigate the distributional traits and morphological features of megafaunal organisms,as well as the topographical characteristics,in the Caiwei Guyot region of the western Pacific’s Magellan Seamounts.To construct the landform and megafauna dataset of the Caiwei Guyot region,we used a data preprocessing technology based on image enhancement to provide high-quality imaging data for data-driven technologies.A megafaunal identification and counting algorithm based on YOLOv5(You Only Look Once Version 5)was developed to efficiently assess the abundance,variety,and dominant species of megafauna.Simultaneously,a landform three-dimensional(3D)reconstruction algorithm based on PatchmatchNet was developed to reconstruct the 3D form of the terrain accurately.This study pioneers the application of data-driven technology to deep-sea imaging within the Caiwei Guyot region,offering an innovative approach to accurately and efficiently characterize the region’s unique megafauna and landforms.展开更多
To enable proper diagnosis of a patient,medical images must demonstrate no presence of noise and artifacts.The major hurdle lies in acquiring these images in such a manner that extraneous variables,causing distortions...To enable proper diagnosis of a patient,medical images must demonstrate no presence of noise and artifacts.The major hurdle lies in acquiring these images in such a manner that extraneous variables,causing distortions in the form of noise and artifacts,are kept to a bare minimum.The unexpected change realized during the acquisition process specifically attacks the integrity of the image’s quality,while indirectly attacking the effectiveness of the diagnostic process.It is thus crucial that this is attended to with maximum efficiency at the level of pertinent expertise.The solution to these challenges presents a complex dilemma at the acquisition stage,where image processing techniques must be adopted.The necessity of this mandatory image pre-processing step underpins the implementation of traditional state-of-the-art methods to create functional and robust denoising or recovery devices.This article hereby provides an extensive systematic review of the above techniques,with the purpose of presenting a systematic evaluation of their effect on medical images under three different distributions of noise,i.e.,Gaussian,Poisson,and Rician.A thorough analysis of these methods is conducted using eight evaluation parameters to highlight the unique features of each method.The covered denoising methods are essential in actual clinical scenarios where the preservation of anatomical details is crucial for accurate and safe diagnosis,such as tumor detection in MRI and vascular imaging in CT.展开更多
Objective:In the Radiology Department of Mzuzu Central Hospital(MCH),daily training for radiographers now includes content on Computed Tomography(CT)image quality control and equipment maintenance to ensure the normal...Objective:In the Radiology Department of Mzuzu Central Hospital(MCH),daily training for radiographers now includes content on Computed Tomography(CT)image quality control and equipment maintenance to ensure the normal,continuous,and stable operation of the 16-slice spiral CT scanner.Methods:Through comprehensive analysis of relevant equipment,we have identified key parameters that significantly impact CT image quality.Innovative optimization strategies and solutions targeting these parameters have been developed and integrated into daily training programs.Furthermore,starting from an examination of prevalent failure modes observed in CT equipment,we delve into essential maintenance and preservation techniques that CT technologists must master to ensure optimal system performance.Results:(1)Crucial factors affecting CT image quality include artifacts,noise,partial volume effects,and surrounding gap phenomena,alongside spatial and density resolutions,CT dose,reconstruction algorithms,and human factors during the scanning process.In the daily training for radiographers,emphasis is placed on strictly implementing image quality control measures at every stage of the CT scanning process and skillfully applying advanced scanning and image processing techniques.By doing so,we can provide clinicians with accurate and reliable imaging references for diagnosis and treatment.(2)Strategies for CT equipment maintenance:①Environmental inspection of the CT room to ensure cleanliness and hygiene.②Rational and accurate operation,including calibration software proficiency.③Regular maintenance and servicing for minimizing machine downtime.④Maintenance of the CT X-ray tube.CT technicians can become proficient in equipment maintenance and upkeep techniques through training,which can significantly extend the service life of CT systems and reduce the occurrence of malfunctions.Conclusion:Through the regular implementation of rigorous CT image quality control training for radiology technicians,coupled with diligent and proactive CT equipment maintenance,we have observed profound and beneficial impacts on improving image quality.The accuracy and fidelity of radiological data ultimately leads to more accurate diagnoses and effective treatments.展开更多
Kinship verification is a key biometric recognition task that determines biological relationships based on physical features.Traditional methods predominantly use facial recognition,leveraging established techniques a...Kinship verification is a key biometric recognition task that determines biological relationships based on physical features.Traditional methods predominantly use facial recognition,leveraging established techniques and extensive datasets.However,recent research has highlighted ear recognition as a promising alternative,offering advantages in robustness against variations in facial expressions,aging,and occlusions.Despite its potential,a significant challenge in ear-based kinship verification is the lack of large-scale datasets necessary for training deep learning models effectively.To address this challenge,we introduce the EarKinshipVN dataset,a novel and extensive collection of ear images designed specifically for kinship verification.This dataset consists of 4876 high-resolution color images from 157 multiracial families across different regions,forming 73,220 kinship pairs.EarKinshipVN,a diverse and large-scale dataset,advances kinship verification research using ear features.Furthermore,we propose the Mixer Attention Inception(MAI)model,an improved architecture that enhances feature extraction and classification accuracy.The MAI model fuses Inceptionv4 and MLP Mixer,integrating four attention mechanisms to enhance spatial and channel-wise feature representation.Experimental results demonstrate that MAI significantly outperforms traditional backbone architectures.It achieves an accuracy of 98.71%,surpassing Vision Transformer models while reducing computational complexity by up to 95%in parameter usage.These findings suggest that ear-based kinship verification,combined with an optimized deep learning model and a comprehensive dataset,holds significant promise for biometric applications.展开更多
Objective:This study aims to develop a deep multiscale image learning system(DMILS)to differentiate malignant from benign thyroid follicular neoplasms on multiscale whole-slide images(WSIs)of intraoperative frozen pat...Objective:This study aims to develop a deep multiscale image learning system(DMILS)to differentiate malignant from benign thyroid follicular neoplasms on multiscale whole-slide images(WSIs)of intraoperative frozen pathological images.Methods:A total of 1,213 patients were divided into training and validation sets,an internal test set,a pooled external test set,and a pooled prospective test set at three centers.DMILS was constructed using a deep learningbased weakly supervised method based on multiscale WSIs at 10×,20×,and 40×magnifications.The performance of the DMILS was compared with that of a single magnification and validated in two pathologist-unidentified subsets.Results:The DMILS yielded good performance,with areas under the receiver operating characteristic curves(AUCs)of 0.848,0.857,0.810,and 0.787 in the training and validation sets,internal test set,pooled external test set,and pooled prospective test set,respectively.The AUC of the DMILS was higher than that of a single magnification,with 0.788 of 10×,0.824 of 20×,and 0.775 of 40×in the internal test set.Moreover,DMILS yielded satisfactory performance on the two pathologist-unidentified subsets.Furthermore,the most indicative region predicted by DMILS is the follicular epithelium.Conclusions:DMILS has good performance in differentiating thyroid follicular neoplasms on multiscale WSIs of intraoperative frozen pathological images.展开更多
Nonlinear transforms have significantly advanced learned image compression(LIC),particularly using residual blocks.This transform enhances the nonlinear expression ability and obtain compact feature representation by ...Nonlinear transforms have significantly advanced learned image compression(LIC),particularly using residual blocks.This transform enhances the nonlinear expression ability and obtain compact feature representation by enlarging the receptive field,which indicates how the convolution process extracts features in a high dimensional feature space.However,its functionality is restricted to the spatial dimension and network depth,limiting further improvements in network performance due to insufficient information interaction and representation.Crucially,the potential of high dimensional feature space in the channel dimension and the exploration of network width/resolution remain largely untapped.In this paper,we consider nonlinear transforms from the perspective of feature space,defining high-dimensional feature spaces in different dimensions and investigating the specific effects.Firstly,we introduce the dimension increasing and decreasing transforms in both channel and spatial dimensions to obtain high dimensional feature space and achieve better feature extraction.Secondly,we design a channel-spatial fusion residual transform(CSR),which incorporates multi-dimensional transforms for a more effective representation.Furthermore,we simplify the proposed fusion transform to obtain a slim architecture(CSR-sm),balancing network complexity and compression performance.Finally,we build the overall network with stacked CSR transforms to achieve better compression and reconstruction.Experimental results demonstrate that the proposed method can achieve superior ratedistortion performance compared to the existing LIC methods and traditional codecs.Specifically,our proposed method achieves 9.38%BD-rate reduction over VVC on Kodak dataset.展开更多
Camouflaged people are extremely expert in actively concealing themselves by effectively utilizing cover and the surrounding environment. Despite advancements in optical detection capabilities through imaging systems,...Camouflaged people are extremely expert in actively concealing themselves by effectively utilizing cover and the surrounding environment. Despite advancements in optical detection capabilities through imaging systems, including spectral, polarization, and infrared technologies, there is still a lack of effective real-time method for accurately detecting small-size and high-efficient camouflaged people in complex real-world scenes. Here, this study proposes a snapshot multispectral image-based camouflaged detection model, multispectral YOLO(MS-YOLO), which utilizes the SPD-Conv and Sim AM modules to effectively represent targets and suppress background interference by exploiting the spatial-spectral target information. Besides, the study constructs the first real-shot multispectral camouflaged people dataset(MSCPD), which encompasses diverse scenes, target scales, and attitudes. To minimize information redundancy, MS-YOLO selects an optimal subset of 12 bands with strong feature representation and minimal inter-band correlation as input. Through experiments on the MSCPD, MS-YOLO achieves a mean Average Precision of 94.31% and real-time detection at 65 frames per second, which confirms the effectiveness and efficiency of our method in detecting camouflaged people in various typical desert and forest scenes. Our approach offers valuable support to improve the perception capabilities of unmanned aerial vehicles in detecting enemy forces and rescuing personnel in battlefield.展开更多
Clear cell renal cell carcinoma(ccRCC)represents the most frequent form of renal cell carcinoma(RCC),and accurate International Society of Urological Pathology(ISUP)grading is crucial for prognosis and treatment selec...Clear cell renal cell carcinoma(ccRCC)represents the most frequent form of renal cell carcinoma(RCC),and accurate International Society of Urological Pathology(ISUP)grading is crucial for prognosis and treatment selection.This study presents a new deep network called Multi-scale Fusion Network(MsfNet),which aims to enhance the automatic ISUP grade of ccRCC with digital histopathology pathology images.The MsfNet overcomes the limitations of traditional ResNet50 by multi-scale information fusion and dynamic allocation of channel quantity.The model was trained and tested using 90 Hematoxylin and Eosin(H&E)stained whole slide images(WSIs),which were all cropped into 320×320-pixel patches at 40×magnification.MsfNet achieved a micro-averaged area under the curve(AUC)of 0.9807,a macro-averaged AUC of 0.9778 on the test dataset.The Gradient-weighted Class Activation Mapping(Grad-CAM)visually demonstrated MsfNet’s ability to distinguish and highlight abnormal areas more effectively than ResNet50.The t-Distributed Stochastic Neighbor Embedding(t-SNE)plot indicates our model can efficiently extract critical features from images,reducing the impact of noise and redundant information.The results suggest that MsfNet offers an accurate ISUP grade of ccRCC in digital images,emphasizing the potential of AI-assisted histopathological systems in clinical practice.展开更多
An extreme ultraviolet solar corona multispectral imager can allow direct observation of high temperature coronal plasma,which is related to solar flares,coronal mass ejections and other significant coronal activities...An extreme ultraviolet solar corona multispectral imager can allow direct observation of high temperature coronal plasma,which is related to solar flares,coronal mass ejections and other significant coronal activities.This manuscript proposes a novel end-to-end computational design method for an extreme ultraviolet(EUV)solar corona multispectral imager operating at wavelengths near 100 nm,including a stray light suppression design and computational image recovery.To suppress the strong stray light from the solar disk,an outer opto-mechanical structure is designed to protect the imaging component of the system.Considering the low reflectivity(less than 70%)and strong-scattering(roughness)of existing extreme ultraviolet optical elements,the imaging component comprises only a primary mirror and a curved grating.A Lyot aperture is used to further suppress any residual stray light.Finally,a deep learning computational imaging method is used to correct the individual multi-wavelength images from the original recorded multi-slit data.In results and data,this can achieve a far-field angular resolution below 7",and spectral resolution below 0.05 nm.The field of view is±3 R_(☉)along the multi-slit moving direction,where R☉represents the radius of the solar disk.The ratio of the corona's stray light intensity to the solar center's irradiation intensity is less than 10-6 at the circle of 1.3 R_(☉).展开更多
As one of the carriers for human communication and interaction, images are prone to contamination by noise during transmission and reception, which is often uncontrollable and unknown. Therefore, how to denoise images...As one of the carriers for human communication and interaction, images are prone to contamination by noise during transmission and reception, which is often uncontrollable and unknown. Therefore, how to denoise images contaminated by unknown noise has gradually become one of the research focuses. In order to achieve blind denoising and separation to restore images, this paper proposes a method for image processing based on Root Mean Square Error (RMSE) by integrating multiple filtering methods for denoising. This method includes Wavelet Filtering, Gaussian Filtering, Median Filtering, Mean Filtering, Bilateral Filtering, Adaptive Bandpass Filtering, Non-local Means Filtering and Regularization Denoising suitable for different types of noise. We can apply this method to denoise images contaminated by blind noise sources and evaluate the denoising effects using RMSE. The smaller the RMSE, the better the denoising effect. The optimal denoising result is selected through comprehensively comparing the RMSE values of all methods. Experimental results demonstrate that the proposed method effectively denoises and restores images contaminated by blind noise sources.展开更多
为了提高图像拼接速度并满足高分辨率图像的实时拼接需求,提出了一种基于ORB(Oriented Fast and Rotated Brief)算法和MSAC(M-estimator Sample Consensus)算法的快速图像拼接方法。ORB算法特征匹配速度快,能够满足实时性要求。首先采用...为了提高图像拼接速度并满足高分辨率图像的实时拼接需求,提出了一种基于ORB(Oriented Fast and Rotated Brief)算法和MSAC(M-estimator Sample Consensus)算法的快速图像拼接方法。ORB算法特征匹配速度快,能够满足实时性要求。首先采用ORB算法进行图像特征点提取;然后,采用MSAC算法对匹配点对进行优化,剔除图像拼接中的伪匹配点对,通过正确的匹配点对求解图像变换矩阵;最后,采用双线性插值融合算法消除可见接缝并去除拼接痕迹。实验结果表明,本文方法在保证图像拼接质量的同时具有更快的拼接速度。展开更多
This paper emphasizes a faster digital processing time while presenting an accurate method for identifying spinefractures in X-ray pictures. The study focuses on efficiency by utilizing many methods that include pictu...This paper emphasizes a faster digital processing time while presenting an accurate method for identifying spinefractures in X-ray pictures. The study focuses on efficiency by utilizing many methods that include picturesegmentation, feature reduction, and image classification. Two important elements are investigated to reducethe classification time: Using feature reduction software and leveraging the capabilities of sophisticated digitalprocessing hardware. The researchers use different algorithms for picture enhancement, including theWiener andKalman filters, and they look into two background correction techniques. The article presents a technique forextracting textural features and evaluates three picture segmentation algorithms and three fractured spine detectionalgorithms using transformdomain, PowerDensity Spectrum(PDS), andHigher-Order Statistics (HOS) for featureextraction.With an emphasis on reducing digital processing time, this all-encompassing method helps to create asimplified system for classifying fractured spine fractures. A feature reduction program code has been built toimprove the processing speed for picture classification. Overall, the proposed approach shows great potential forsignificantly reducing classification time in clinical settings where time is critical. In comparison to other transformdomains, the texture features’ discrete cosine transform (DCT) yielded an exceptional classification rate, and theprocess of extracting features from the transform domain took less time. More capable hardware can also result inquicker execution times for the feature extraction algorithms.展开更多
Urban tree species provide various essential ecosystem services in cities,such as regulating urban temperatures,reducing noise,capturing carbon,and mitigating the urban heat island effect.The quality of these services...Urban tree species provide various essential ecosystem services in cities,such as regulating urban temperatures,reducing noise,capturing carbon,and mitigating the urban heat island effect.The quality of these services is influenced by species diversity,tree health,and the distribution and the composition of trees.Traditionally,data on urban trees has been collected through field surveys and manual interpretation of remote sensing images.In this study,we evaluated the effectiveness of multispectral airborne laser scanning(ALS)data in classifying 24 common urban roadside tree species in Espoo,Finland.Tree crown structure information,intensity features,and spectral data were used for classification.Eight different machine learning algorithms were tested,with the extra trees(ET)algorithm performing the best,achieving an overall accuracy of 71.7%using multispectral LiDAR data.This result highlights that integrating structural and spectral information within a single framework can improve the classification accuracy.Future research will focus on identifying the most important features for species classification and developing algorithms with greater efficiency and accuracy.展开更多
基金supported by the National Key R&D Program of China 2022YFF0503002the National Natural Science Foundation of China(NSFC,Grant Nos.12333010 and 12233012)+2 种基金the Strategic Priority Research Program of the Chinese Academy of Sciences(grant No.XDB0560000)supported by the Prominent Postdoctoral Project of Jiangsu Province(2023ZB304)supported by the Strategic Priority Research Program on Space Science,the Chinese Academy of Sciences,grant No.XDA15320000.
文摘Imaging observations of solar X-ray bursts can reveal details of the energy release process and particle acceleration in flares.Most hard X-ray imagers make use of the modulation-based Fourier transform imaging method,an indirect imaging technique that requires algorithms to reconstruct and optimize images.During the last decade,a variety of algorithms have been developed and improved.However,it is difficult to quantitatively evaluate the image quality of different solutions without a true,reference image of observation.How to choose the values of imaging parameters for these algorithms to get the best performance is also an open question.In this study,we present a detailed test of the characteristics of these algorithms,imaging dynamic range and a crucial parameter for the CLEAN method,clean beam width factor(CBWF).We first used SDO/AIA EUV images to compute DEM maps and calculate thermal X-ray maps.Then these realistic sources and several types of simulated sources are used as the ground truth in the imaging simulations for both RHESSI and ASO-S/HXI.The different solutions are evaluated quantitatively by a number of means.The overall results suggest that EM,PIXON,and CLEAN are exceptional methods for sidelobe elimination,producing images with clear source details.Although MEM_GE,MEM_NJIT,VIS_WV and VIS_CS possess fast imaging processes and generate good images,they too possess associated imperfections unique to each method.The two forward fit algorithms,VF and FF,perform differently,and VF appears to be more robust and useful.We also demonstrated the imaging capability of HXI and available HXI algorithms.Furthermore,the effect of CBWF on image quality was investigated,and the optimal settings for both RHESSI and HXI were proposed.
基金This research was supported in part by a postdoctoral research fellow appointment to the Agricultural Research Service(ARS)Research Participation Program administered by the Oak Ridge Institute for Science and Education(ORISE)through an interagency agreement between the U.S.Department of Energy(DOE)and the U.S.Department of Agriculture(USDA).
文摘Crop leaf area index(LAI)and biomass are two major biophysical parameters to measure crop growth and health condition.Measuring LAI and biomass in field experiments is a destructive method.Therefore,we focused on the application of unmanned aerial vehicles(UAVs)in agriculture,which is a cost and labor-efficientmethod.Hence,UAV-captured multispectral images were applied to monitor crop growth,identify plant bio-physical conditions,and so on.In this study,we monitored soybean crops using UAV and field experiments.This experiment was conducted at theMAFES(Mississippi Agricultural and Forestry Experiment Station)Pontotoc Ridge-Flatwoods Branch Experiment Station.It followed a randomized block design with five cover crops:Cereal Rye,Vetch,Wheat,MC:mixed Mustard and Cereal Rye,and native vegetation.Planting was made in the fall,and three fertilizer treatments were applied:Synthetic Fertilizer,Poultry Litter,and none,applied before planting the soybean,in a full factorial combination.We monitored soybean reproductive phases at R3(initial pod development),R5(initial seed development),R6(full seed development),and R7(initial maturity)and used UAV multispectral remote sensing for soybean LAI and biomass estimations.The major goal of this study was to assess LAI and biomass estimations from UAV multispectral images in the reproductive stages when the development of leaves and biomass was stabilized.Wemade about fourteen vegetation indices(VIs)fromUAVmultispectral images at these stages to estimate LAI and biomass.Wemodeled LAI and biomass based on these remotely sensed VIs and ground-truth measurements usingmachine learning methods,including linear regression,Random Forest(RF),and support vector regression(SVR).Thereafter,the models were applied to estimate LAI and biomass.According to the model results,LAI was better estimated at the R6 stage and biomass at the R3 stage.Compared to the other models,the RF models showed better estimation,i.e.,an R^(2) of about 0.58–0.68 with an RMSE(rootmean square error)of 0.52–0.60(m^(2)/m^(2))for the LAI and about 0.44–0.64 for R^(2) and 21–26(g dry weight/5 plants)for RMSE of biomass estimation.We performed a leave-one-out cross-validation.Based on cross-validatedmodels with field experiments,we also found that the R6 stage was the best for estimating LAI,and the R3 stage for estimating crop biomass.The cross-validated RF model showed the estimation ability with an R^(2) about 0.25–0.44 and RMSE of 0.65–0.85(m^(2)/m^(2))for LAI estimation;and R^(2) about 0.1–0.31 and an RMSE of about 28–35(g dry weight/5 plants)for crop biomass estimation.This result will be helpful to promote the use of non-destructive remote sensing methods to determine the crop LAI and biomass status,which may bring more efficient crop production and management.
基金National Natural Science Foundation of China(Grant Nos.62005049 and 62072110)Natural Science Foundation of Fujian Province(Grant No.2020J01451).
文摘Accurate segmentation of camouflage objects in aerial imagery is vital for improving the efficiency of UAV-based reconnaissance and rescue missions.However,camouflage object segmentation is increasingly challenging due to advances in both camouflage materials and biological mimicry.Although multispectral-RGB based technology shows promise,conventional dual-aperture multispectral-RGB imaging systems are constrained by imprecise and time-consuming registration and fusion across different modalities,limiting their performance.Here,we propose the Reconstructed Multispectral-RGB Fusion Network(RMRF-Net),which reconstructs RGB images into multispectral ones,enabling efficient multimodal segmentation using only an RGB camera.Specifically,RMRF-Net employs a divergentsimilarity feature correction strategy to minimize reconstruction errors and includes an efficient boundary-aware decoder to enhance object contours.Notably,we establish the first real-world aerial multispectral-RGB semantic segmentation of camouflage objects dataset,including 11 object categories.Experimental results demonstrate that RMRF-Net outperforms existing methods,achieving 17.38 FPS on the NVIDIA Jetson AGX Orin,with only a 0.96%drop in mIoU compared to the RTX 3090,showing its practical applicability in multimodal remote sensing.
基金The Key Research and Development Program of Shandong Province of China under contract No.2020JMRH0101the National Key Research and Development Project of China under contract No.2021YFC2802100the Qingdao Natural Science Foundation under contract No.24-4-4-zrij-127-jch.
文摘Scientific and precise evaluations of the megafaunal and landform characteristics of seamounts are important guides for their protection and study.A series of manned and unmanned submersibles have provided invaluable observational imaging data for the ecological study of seamounts.However,traditional methods of artificial observation of seamount imaging data cannot accurately and efficiently determine the characteristics of megafauna and landforms.This research harnesses data-driven technology to systematically investigate the distributional traits and morphological features of megafaunal organisms,as well as the topographical characteristics,in the Caiwei Guyot region of the western Pacific’s Magellan Seamounts.To construct the landform and megafauna dataset of the Caiwei Guyot region,we used a data preprocessing technology based on image enhancement to provide high-quality imaging data for data-driven technologies.A megafaunal identification and counting algorithm based on YOLOv5(You Only Look Once Version 5)was developed to efficiently assess the abundance,variety,and dominant species of megafauna.Simultaneously,a landform three-dimensional(3D)reconstruction algorithm based on PatchmatchNet was developed to reconstruct the 3D form of the terrain accurately.This study pioneers the application of data-driven technology to deep-sea imaging within the Caiwei Guyot region,offering an innovative approach to accurately and efficiently characterize the region’s unique megafauna and landforms.
文摘To enable proper diagnosis of a patient,medical images must demonstrate no presence of noise and artifacts.The major hurdle lies in acquiring these images in such a manner that extraneous variables,causing distortions in the form of noise and artifacts,are kept to a bare minimum.The unexpected change realized during the acquisition process specifically attacks the integrity of the image’s quality,while indirectly attacking the effectiveness of the diagnostic process.It is thus crucial that this is attended to with maximum efficiency at the level of pertinent expertise.The solution to these challenges presents a complex dilemma at the acquisition stage,where image processing techniques must be adopted.The necessity of this mandatory image pre-processing step underpins the implementation of traditional state-of-the-art methods to create functional and robust denoising or recovery devices.This article hereby provides an extensive systematic review of the above techniques,with the purpose of presenting a systematic evaluation of their effect on medical images under three different distributions of noise,i.e.,Gaussian,Poisson,and Rician.A thorough analysis of these methods is conducted using eight evaluation parameters to highlight the unique features of each method.The covered denoising methods are essential in actual clinical scenarios where the preservation of anatomical details is crucial for accurate and safe diagnosis,such as tumor detection in MRI and vascular imaging in CT.
基金supported by the First Affiliated Hospital of Xi’an Jiaotong University Teaching Reform Project(Grant No.JG2023-0206 and JG2022-0324).
文摘Objective:In the Radiology Department of Mzuzu Central Hospital(MCH),daily training for radiographers now includes content on Computed Tomography(CT)image quality control and equipment maintenance to ensure the normal,continuous,and stable operation of the 16-slice spiral CT scanner.Methods:Through comprehensive analysis of relevant equipment,we have identified key parameters that significantly impact CT image quality.Innovative optimization strategies and solutions targeting these parameters have been developed and integrated into daily training programs.Furthermore,starting from an examination of prevalent failure modes observed in CT equipment,we delve into essential maintenance and preservation techniques that CT technologists must master to ensure optimal system performance.Results:(1)Crucial factors affecting CT image quality include artifacts,noise,partial volume effects,and surrounding gap phenomena,alongside spatial and density resolutions,CT dose,reconstruction algorithms,and human factors during the scanning process.In the daily training for radiographers,emphasis is placed on strictly implementing image quality control measures at every stage of the CT scanning process and skillfully applying advanced scanning and image processing techniques.By doing so,we can provide clinicians with accurate and reliable imaging references for diagnosis and treatment.(2)Strategies for CT equipment maintenance:①Environmental inspection of the CT room to ensure cleanliness and hygiene.②Rational and accurate operation,including calibration software proficiency.③Regular maintenance and servicing for minimizing machine downtime.④Maintenance of the CT X-ray tube.CT technicians can become proficient in equipment maintenance and upkeep techniques through training,which can significantly extend the service life of CT systems and reduce the occurrence of malfunctions.Conclusion:Through the regular implementation of rigorous CT image quality control training for radiology technicians,coupled with diligent and proactive CT equipment maintenance,we have observed profound and beneficial impacts on improving image quality.The accuracy and fidelity of radiological data ultimately leads to more accurate diagnoses and effective treatments.
文摘Kinship verification is a key biometric recognition task that determines biological relationships based on physical features.Traditional methods predominantly use facial recognition,leveraging established techniques and extensive datasets.However,recent research has highlighted ear recognition as a promising alternative,offering advantages in robustness against variations in facial expressions,aging,and occlusions.Despite its potential,a significant challenge in ear-based kinship verification is the lack of large-scale datasets necessary for training deep learning models effectively.To address this challenge,we introduce the EarKinshipVN dataset,a novel and extensive collection of ear images designed specifically for kinship verification.This dataset consists of 4876 high-resolution color images from 157 multiracial families across different regions,forming 73,220 kinship pairs.EarKinshipVN,a diverse and large-scale dataset,advances kinship verification research using ear features.Furthermore,we propose the Mixer Attention Inception(MAI)model,an improved architecture that enhances feature extraction and classification accuracy.The MAI model fuses Inceptionv4 and MLP Mixer,integrating four attention mechanisms to enhance spatial and channel-wise feature representation.Experimental results demonstrate that MAI significantly outperforms traditional backbone architectures.It achieves an accuracy of 98.71%,surpassing Vision Transformer models while reducing computational complexity by up to 95%in parameter usage.These findings suggest that ear-based kinship verification,combined with an optimized deep learning model and a comprehensive dataset,holds significant promise for biometric applications.
基金supported by the Taishan Scholar Project(No.ts20190991,tsqn202211378)the Key R&D Project of Shandong Province(No.2022CXPT023)the General Program of National Natural Science Foundation of China(No.82371933)。
文摘Objective:This study aims to develop a deep multiscale image learning system(DMILS)to differentiate malignant from benign thyroid follicular neoplasms on multiscale whole-slide images(WSIs)of intraoperative frozen pathological images.Methods:A total of 1,213 patients were divided into training and validation sets,an internal test set,a pooled external test set,and a pooled prospective test set at three centers.DMILS was constructed using a deep learningbased weakly supervised method based on multiscale WSIs at 10×,20×,and 40×magnifications.The performance of the DMILS was compared with that of a single magnification and validated in two pathologist-unidentified subsets.Results:The DMILS yielded good performance,with areas under the receiver operating characteristic curves(AUCs)of 0.848,0.857,0.810,and 0.787 in the training and validation sets,internal test set,pooled external test set,and pooled prospective test set,respectively.The AUC of the DMILS was higher than that of a single magnification,with 0.788 of 10×,0.824 of 20×,and 0.775 of 40×in the internal test set.Moreover,DMILS yielded satisfactory performance on the two pathologist-unidentified subsets.Furthermore,the most indicative region predicted by DMILS is the follicular epithelium.Conclusions:DMILS has good performance in differentiating thyroid follicular neoplasms on multiscale WSIs of intraoperative frozen pathological images.
基金supported by the Key Program of the National Natural Science Foundation of China(Grant No.62031013)Guangdong Province Key Construction Discipline Scientific Research Capacity Improvement Project(Grant No.2022ZDJS117).
文摘Nonlinear transforms have significantly advanced learned image compression(LIC),particularly using residual blocks.This transform enhances the nonlinear expression ability and obtain compact feature representation by enlarging the receptive field,which indicates how the convolution process extracts features in a high dimensional feature space.However,its functionality is restricted to the spatial dimension and network depth,limiting further improvements in network performance due to insufficient information interaction and representation.Crucially,the potential of high dimensional feature space in the channel dimension and the exploration of network width/resolution remain largely untapped.In this paper,we consider nonlinear transforms from the perspective of feature space,defining high-dimensional feature spaces in different dimensions and investigating the specific effects.Firstly,we introduce the dimension increasing and decreasing transforms in both channel and spatial dimensions to obtain high dimensional feature space and achieve better feature extraction.Secondly,we design a channel-spatial fusion residual transform(CSR),which incorporates multi-dimensional transforms for a more effective representation.Furthermore,we simplify the proposed fusion transform to obtain a slim architecture(CSR-sm),balancing network complexity and compression performance.Finally,we build the overall network with stacked CSR transforms to achieve better compression and reconstruction.Experimental results demonstrate that the proposed method can achieve superior ratedistortion performance compared to the existing LIC methods and traditional codecs.Specifically,our proposed method achieves 9.38%BD-rate reduction over VVC on Kodak dataset.
基金support by the National Natural Science Foundation of China (Grant No. 62005049)Natural Science Foundation of Fujian Province (Grant Nos. 2020J01451, 2022J05113)Education and Scientific Research Program for Young and Middleaged Teachers in Fujian Province (Grant No. JAT210035)。
文摘Camouflaged people are extremely expert in actively concealing themselves by effectively utilizing cover and the surrounding environment. Despite advancements in optical detection capabilities through imaging systems, including spectral, polarization, and infrared technologies, there is still a lack of effective real-time method for accurately detecting small-size and high-efficient camouflaged people in complex real-world scenes. Here, this study proposes a snapshot multispectral image-based camouflaged detection model, multispectral YOLO(MS-YOLO), which utilizes the SPD-Conv and Sim AM modules to effectively represent targets and suppress background interference by exploiting the spatial-spectral target information. Besides, the study constructs the first real-shot multispectral camouflaged people dataset(MSCPD), which encompasses diverse scenes, target scales, and attitudes. To minimize information redundancy, MS-YOLO selects an optimal subset of 12 bands with strong feature representation and minimal inter-band correlation as input. Through experiments on the MSCPD, MS-YOLO achieves a mean Average Precision of 94.31% and real-time detection at 65 frames per second, which confirms the effectiveness and efficiency of our method in detecting camouflaged people in various typical desert and forest scenes. Our approach offers valuable support to improve the perception capabilities of unmanned aerial vehicles in detecting enemy forces and rescuing personnel in battlefield.
基金supported by the Scientific Research and Innovation Team of Hebei University(IT2023B07)the Natural Science Foundation of Hebei Province(F2023201069)the Postgraduate’s Innovation Fund Project of Hebei University(HBU2024BS021).
文摘Clear cell renal cell carcinoma(ccRCC)represents the most frequent form of renal cell carcinoma(RCC),and accurate International Society of Urological Pathology(ISUP)grading is crucial for prognosis and treatment selection.This study presents a new deep network called Multi-scale Fusion Network(MsfNet),which aims to enhance the automatic ISUP grade of ccRCC with digital histopathology pathology images.The MsfNet overcomes the limitations of traditional ResNet50 by multi-scale information fusion and dynamic allocation of channel quantity.The model was trained and tested using 90 Hematoxylin and Eosin(H&E)stained whole slide images(WSIs),which were all cropped into 320×320-pixel patches at 40×magnification.MsfNet achieved a micro-averaged area under the curve(AUC)of 0.9807,a macro-averaged AUC of 0.9778 on the test dataset.The Gradient-weighted Class Activation Mapping(Grad-CAM)visually demonstrated MsfNet’s ability to distinguish and highlight abnormal areas more effectively than ResNet50.The t-Distributed Stochastic Neighbor Embedding(t-SNE)plot indicates our model can efficiently extract critical features from images,reducing the impact of noise and redundant information.The results suggest that MsfNet offers an accurate ISUP grade of ccRCC in digital images,emphasizing the potential of AI-assisted histopathological systems in clinical practice.
基金This study is partially supported by the National Natural Science Foundation of China(NSFC)(62005120,62125504).
文摘An extreme ultraviolet solar corona multispectral imager can allow direct observation of high temperature coronal plasma,which is related to solar flares,coronal mass ejections and other significant coronal activities.This manuscript proposes a novel end-to-end computational design method for an extreme ultraviolet(EUV)solar corona multispectral imager operating at wavelengths near 100 nm,including a stray light suppression design and computational image recovery.To suppress the strong stray light from the solar disk,an outer opto-mechanical structure is designed to protect the imaging component of the system.Considering the low reflectivity(less than 70%)and strong-scattering(roughness)of existing extreme ultraviolet optical elements,the imaging component comprises only a primary mirror and a curved grating.A Lyot aperture is used to further suppress any residual stray light.Finally,a deep learning computational imaging method is used to correct the individual multi-wavelength images from the original recorded multi-slit data.In results and data,this can achieve a far-field angular resolution below 7",and spectral resolution below 0.05 nm.The field of view is±3 R_(☉)along the multi-slit moving direction,where R☉represents the radius of the solar disk.The ratio of the corona's stray light intensity to the solar center's irradiation intensity is less than 10-6 at the circle of 1.3 R_(☉).
文摘As one of the carriers for human communication and interaction, images are prone to contamination by noise during transmission and reception, which is often uncontrollable and unknown. Therefore, how to denoise images contaminated by unknown noise has gradually become one of the research focuses. In order to achieve blind denoising and separation to restore images, this paper proposes a method for image processing based on Root Mean Square Error (RMSE) by integrating multiple filtering methods for denoising. This method includes Wavelet Filtering, Gaussian Filtering, Median Filtering, Mean Filtering, Bilateral Filtering, Adaptive Bandpass Filtering, Non-local Means Filtering and Regularization Denoising suitable for different types of noise. We can apply this method to denoise images contaminated by blind noise sources and evaluate the denoising effects using RMSE. The smaller the RMSE, the better the denoising effect. The optimal denoising result is selected through comprehensively comparing the RMSE values of all methods. Experimental results demonstrate that the proposed method effectively denoises and restores images contaminated by blind noise sources.
文摘为了提高图像拼接速度并满足高分辨率图像的实时拼接需求,提出了一种基于ORB(Oriented Fast and Rotated Brief)算法和MSAC(M-estimator Sample Consensus)算法的快速图像拼接方法。ORB算法特征匹配速度快,能够满足实时性要求。首先采用ORB算法进行图像特征点提取;然后,采用MSAC算法对匹配点对进行优化,剔除图像拼接中的伪匹配点对,通过正确的匹配点对求解图像变换矩阵;最后,采用双线性插值融合算法消除可见接缝并去除拼接痕迹。实验结果表明,本文方法在保证图像拼接质量的同时具有更快的拼接速度。
基金the appreciation to the Deanship of Postgraduate Studies and ScientificResearch atMajmaah University for funding this research work through the Project Number R-2024-922.
文摘This paper emphasizes a faster digital processing time while presenting an accurate method for identifying spinefractures in X-ray pictures. The study focuses on efficiency by utilizing many methods that include picturesegmentation, feature reduction, and image classification. Two important elements are investigated to reducethe classification time: Using feature reduction software and leveraging the capabilities of sophisticated digitalprocessing hardware. The researchers use different algorithms for picture enhancement, including theWiener andKalman filters, and they look into two background correction techniques. The article presents a technique forextracting textural features and evaluates three picture segmentation algorithms and three fractured spine detectionalgorithms using transformdomain, PowerDensity Spectrum(PDS), andHigher-Order Statistics (HOS) for featureextraction.With an emphasis on reducing digital processing time, this all-encompassing method helps to create asimplified system for classifying fractured spine fractures. A feature reduction program code has been built toimprove the processing speed for picture classification. Overall, the proposed approach shows great potential forsignificantly reducing classification time in clinical settings where time is critical. In comparison to other transformdomains, the texture features’ discrete cosine transform (DCT) yielded an exceptional classification rate, and theprocess of extracting features from the transform domain took less time. More capable hardware can also result inquicker execution times for the feature extraction algorithms.
文摘Urban tree species provide various essential ecosystem services in cities,such as regulating urban temperatures,reducing noise,capturing carbon,and mitigating the urban heat island effect.The quality of these services is influenced by species diversity,tree health,and the distribution and the composition of trees.Traditionally,data on urban trees has been collected through field surveys and manual interpretation of remote sensing images.In this study,we evaluated the effectiveness of multispectral airborne laser scanning(ALS)data in classifying 24 common urban roadside tree species in Espoo,Finland.Tree crown structure information,intensity features,and spectral data were used for classification.Eight different machine learning algorithms were tested,with the extra trees(ET)algorithm performing the best,achieving an overall accuracy of 71.7%using multispectral LiDAR data.This result highlights that integrating structural and spectral information within a single framework can improve the classification accuracy.Future research will focus on identifying the most important features for species classification and developing algorithms with greater efficiency and accuracy.