Remote sensing image super-resolution technology is pivotal for enhancing image quality in critical applications including environmental monitoring,urban planning,and disaster assessment.However,traditional methods ex...Remote sensing image super-resolution technology is pivotal for enhancing image quality in critical applications including environmental monitoring,urban planning,and disaster assessment.However,traditional methods exhibit deficiencies in detail recovery and noise suppression,particularly when processing complex landscapes(e.g.,forests,farmlands),leading to artifacts and spectral distortions that limit practical utility.To address this,we propose an enhanced Super-Resolution Generative Adversarial Network(SRGAN)framework featuring three key innovations:(1)Replacement of L1/L2 loss with a robust Charbonnier loss to suppress noise while preserving edge details via adaptive gradient balancing;(2)A multi-loss joint optimization strategy dynamically weighting Charbonnier loss(β=0.5),Visual Geometry Group(VGG)perceptual loss(α=1),and adversarial loss(γ=0.1)to synergize pixel-level accuracy and perceptual quality;(3)A multi-scale residual network(MSRN)capturing cross-scale texture features(e.g.,forest canopies,mountain contours).Validated on Sentinel-2(10 m)and SPOT-6/7(2.5 m)datasets covering 904 km2 in Motuo County,Xizang,our method outperforms the SRGAN baseline(SR4RS)with Peak Signal-to-Noise Ratio(PSNR)gains of 0.29 dB and Structural Similarity Index(SSIM)improvements of 3.08%on forest imagery.Visual comparisons confirm enhanced texture continuity despite marginal Learned Perceptual Image Patch Similarity(LPIPS)increases.The method significantly improves noise robustness and edge retention in complex geomorphology,demonstrating 18%faster response in forest fire early warning and providing high-resolution support for agricultural/urban monitoring.Future work will integrate spectral constraints and lightweight architectures.展开更多
Watermarking is embedding visible or invisible data within media to verify its authenticity or protect copyright.The watermark is embedded in significant spatial or frequency features of the media to make it more resi...Watermarking is embedding visible or invisible data within media to verify its authenticity or protect copyright.The watermark is embedded in significant spatial or frequency features of the media to make it more resistant to intentional or unintentional modification.Some of these features are important perceptual features according to the human visual system(HVS),which means that the embedded watermark should be imperceptible in these features.Therefore,both the designers of watermarking algorithms and potential attackers must consider these perceptual features when carrying out their actions.The two roles will be considered in this paper when designing a robust watermarking algorithm against the most harmful attacks,like volumetric scaling,histogram equalization,and non-conventional watermarking attacks like the Denoising Convolution Neural Network(DnCNN),which must be considered in watermarking algorithm design due to its rising role in the state-of-the-art attacks.The DnCNN is initialized and trained using watermarked image samples created by our proposed Covert and Severe Attacks Resistant Watermarking Algorithm(CSRWA)to prove its robustness.For this algorithm to satisfy the robustness and imperceptibility tradeoff,implementing the Dither Modulation(DM)algorithm is boosted by utilizing the Just Noticeable Distortion(JND)principle to get an improved performance in this sense.Sensitivity,luminance,inter and intra-block contrast are used to adjust the JND values.展开更多
Objective Autism spectrum disorder(ASD)is a neurodevelopmental condition characterized by difficulties with communication and social interaction,restricted and repetitive behaviors.Previous studies have indicated that...Objective Autism spectrum disorder(ASD)is a neurodevelopmental condition characterized by difficulties with communication and social interaction,restricted and repetitive behaviors.Previous studies have indicated that individuals with ASD exhibit early and lifelong attention deficits,which are closely related to the core symptoms of ASD.Basic visual attention processes may provide a critical foundation for their social communication and interaction abilities.Therefore,this study explores the behavior of children with ASD in capturing attention to changes in topological properties.Methods Our study recruited twenty-seven ASD children diagnosed by professional clinicians according to DSM-5 and twenty-eight typically developing(TD)age-matched controls.In an attention capture task,we recorded the saccadic behaviors of children with ASD and TD in response to topological change(TC)and non-topological change(nTC)stimuli.Saccadic reaction time(SRT),visual search time(VS),and first fixation dwell time(FFDT)were used as indicators of attentional bias.Pearson correlation tests between the clinical assessment scales and attentional bias were conducted.Results This study found that TD children had significantly faster SRT(P<0.05)and VS(P<0.05)for the TC stimuli compared to the nTC stimuli,while the children with ASD did not exhibit significant differences in either measure(P>0.05).Additionally,ASD children demonstrated significantly less attention towards the TC targets(measured by FFDT),in comparison to TD children(P<0.05).Furthermore,ASD children exhibited a significant negative linear correlation between their attentional bias(measured by VS)and their scores on the compulsive subscale(P<0.05).Conclusion The results suggest that children with ASD have difficulty shifting their attention to objects with topological changes during change detection.This atypical attention may affect the child’s cognitive and behavioral development,thereby impacting their social communication and interaction.In sum,our findings indicate that difficulties in attentional capture by TC may be a key feature of ASD.展开更多
Picture is a means of objects’representation and inter-subject communication,where various ways of spatial modelling interact.Unlike arbitrary signs,picture not only represents something different from itself,but sho...Picture is a means of objects’representation and inter-subject communication,where various ways of spatial modelling interact.Unlike arbitrary signs,picture not only represents something different from itself,but shows the represented objects for viewer’s perception.Therefore,the external modelling in form of a material bearer is connected always with internal modelling of depicted objects.Not only perceptual images,but also internal models of other psychical levels participate in creation and interpretation of the pictures.These are,on the one hand,images of the apperceptual and conceptual levels,where schemes of represented objects and their verbal interpretations are formed.On the other hand,the images of sensorial level participate also in the internal modelling.All these mental models interact differently by artists and viewers and influence organization of external pictorial models mediating their contacts.展开更多
Artificial sensory systems(ASS)are pivotal to next-generation extended reality technologies,now evolving into flexible platforms for comfortable wear and immersive user experiences,while ensuring high performance and ...Artificial sensory systems(ASS)are pivotal to next-generation extended reality technologies,now evolving into flexible platforms for comfortable wear and immersive user experiences,while ensuring high performance and operational reliability.To address these demands,metal-based nanoparticles(NPs),such as noble metal,oxide,and multi-elemental NPs,have been extensively incorporated into functional materials of sensory and synaptic devices due to their tunable optical,electrical,and chemical properties,enhancing sensory precision,stability,and environmental adaptability.However,traditional NP fabrication methods often involve complex processing,residual contaminants,and scalability issues,limiting their effectiveness in ASS applications.State-of-the-art laser ablation in liquids(LAL)presents a promising alternative,offering scalable production of surfactant-free NPs with customizable physicochemical properties,though their application in electronics remains underexplored.This review delves into the transformative potential of LAL-fabricated NPs in ASS,covering the fundamental mechanisms of LAL,the role of process parameters,the derivative strategies for size modulation,the diversity of metal-based NPs,their applications in sensory and synaptic devices,and the challenges and perspectives for meeting industrial standards.Bridging the gap between LAL and ASS is poised to revolutionize both industrial manufacturing and academic research by offering scalable solutions to overcome intrinsic tradeoffs between flexibility and performance,fostering innovations in human-centric,immersive electronics.展开更多
Although deep learning methods have been widely applied in slam visual odometry(VO)over the past decade with impressive improvements,the accuracy remains limited in complex dynamic environments.In this paper,a composi...Although deep learning methods have been widely applied in slam visual odometry(VO)over the past decade with impressive improvements,the accuracy remains limited in complex dynamic environments.In this paper,a composite mask-based generative adversarial network(CMGAN)is introduced to predict camera motion and binocular depth maps.Specifically,a perceptual generator is constructed to obtain the corresponding parallax map and optical flow between two neighboring frames.Then,an iterative pose improvement strategy is proposed to improve the accuracy of pose estimation.Finally,a composite mask is embedded in the discriminator to sense structural deformation in the synthesized virtual image,thereby increasing the overall structural constraints of the network model,improving the accuracy of camera pose estimation,and reducing drift issues in the VO.Detailed quantitative and qualitative evaluations on the KITTI dataset show that the proposed framework outperforms existing conventional,supervised learning and unsupervised depth VO methods,providing better results in both pose estimation and depth estimation.展开更多
Perceptual learning of orientation discrimination was investigated using cats. Two adult cats (Cat 1 and 2) were trained to monocularly discriminate between two static striped sinusoidal grates with 30° orienta...Perceptual learning of orientation discrimination was investigated using cats. Two adult cats (Cat 1 and 2) were trained to monocularly discriminate between two static striped sinusoidal grates with 30° orientation difference. After greater than 80% correct performance was reached, cats were then required to monocularly perform a discrimination between two grates with consecutively shifting orientation difference(2°, 4°, 6°, 8°, 10°, 12°, 16°, 20°, 24°, 30°) . The staircase method (two correct-down and one error-up) was applied throughout the training to track the threshold of orientation difference that cats could detect. The performance of detecting grates with varied orientation difference was measured respectively for beth trained and untrained eyes before and after training. Our results showed that the learning effect of discrimination for grates with a fixed orientation difference transferred completely from the trained eye to the untrained eye, whereas the inter-eye transfer for detecting °ates with gradually reducing orientation difference was almost nonegrates. The two opposite learning effects in the same subject strongly suggest that different information processing mechanisms might mediate the learning processes.展开更多
To further explore the human visual system( HVS),the perceptual grouping( PG), which has been proven to play an important role in the HVS, is adopted to design an effective image quality assessment( IQA) model. ...To further explore the human visual system( HVS),the perceptual grouping( PG), which has been proven to play an important role in the HVS, is adopted to design an effective image quality assessment( IQA) model. Compared with the existing fixed-window-based models, the proposed one is an adaptive window-like model that introduces the perceptual grouping strategy into the IQA model. It works as follows: first,it preprocesses the images by clustering similar pixels into a group to the greatest extent; then the structural similarity is used to compute the similarity of the superpixels between reference and distorted images; finally, it integrates all the similarity of superpixels of an image to yield a quality score. Experimental results on three databases( LIVE, IVC and MICT) showthat the proposed method yields good performance in terms of correlation with human judgments of visual quality.展开更多
Through questionnaires and perceptual maps,we made a variation analysis of landscape spatial image to the people in the Pearl River Delta from the viewpoint of the subjective perception.It is found that the people of ...Through questionnaires and perceptual maps,we made a variation analysis of landscape spatial image to the people in the Pearl River Delta from the viewpoint of the subjective perception.It is found that the people of different ages and different education levels from different areas hold diversified perceptions towards rural landscape space,and they have different ideal rural landscapes.However,there is a common point that the development of rural landscape should retain its excellent local landscape features and the local spirit,which will provide references for building the ideal rural landscape model.展开更多
In order to achieve better perceptual coding quality while using fewer bits, a novel perceptual video coding method based on the just-noticeable-distortion (JND) model and the auto-regressive (AR) model is explore...In order to achieve better perceptual coding quality while using fewer bits, a novel perceptual video coding method based on the just-noticeable-distortion (JND) model and the auto-regressive (AR) model is explored. First, a new texture segmentation method exploiting the JND profile is devised to detect and classify texture regions in video scenes. In this step, a spatial-temporal JND model is proposed and the JND energy of every micro-block unit is computed and compared with the threshold. Secondly, in order to effectively remove temporal redundancies while preserving high visual quality, an AR model is applied to synthesize the texture regions. All the parameters of the AR model are obtained by the least-squares method and each pixel in the texture region is generated as a linear combination of pixels taken from the closest forward and backward reference frames. Finally, the proposed method is compared with the H.264/AVC video coding system to demonstrate the performance. Various sequences with different types of texture regions are used in the experiment and the results show that the proposed method can reduce the bit-rate by 15% to 58% while maintaining good perceptual quality.展开更多
The severity of the hot and humid conditions to which miners are exposed increases as the depth of the work site increases.This can cause heat stress that can greatly affect the health and safety of workers.To resolve...The severity of the hot and humid conditions to which miners are exposed increases as the depth of the work site increases.This can cause heat stress that can greatly affect the health and safety of workers.To resolve this,a cooling garment has been developed that uses an atmospheric discharge of liquid CO2 to create a cool microclimate with an average temperature of 12.5(±0.4)℃ beneath the garment.To evaluate the garment's cooling efficiency,19 male subjects participated in an experimental procedure.The two modes,cooling on and off,were compared.Significant physiological differences were found between the two modes after minute 27(p<0.05)until the end of the recovery phase for the heart rate(maximum difference of 10 beats per minute)and the internal body temperature(maximum difference of 0.33℃).It was found that the modes also affected the subjects'perceptions.The ON-mode was associated with better well-being and thermal comfort,and reduced humidity sensation.Perceptions of exertion were lower in the ON-mode condition from minute 2.The findings provide strong evidence of the ability of this cooling garment to reduce heat stress in hot and humid conditions similar to those encountered in deep mines.展开更多
BACKGROUND: Conventional methods (such as occlusion therapy, fine manipulation, complementary, and alternative medicine) take effects slowly, are time and labor consuming, and have uncertain curative effects in the...BACKGROUND: Conventional methods (such as occlusion therapy, fine manipulation, complementary, and alternative medicine) take effects slowly, are time and labor consuming, and have uncertain curative effects in the treatment of amblyopia. Perceptual learning, a new method for treating amblyopia, improves the ability to process signals from the cerebral optic nerve system by specific visual stimulation and visual learning, as well as activation of the visual signal pathway utilizing brain nervous system plasticity. OBJECTIVE: This study investigated and evaluated the curative effects of perceptual learning, which can directionally increase brain plasticity, on the treatment of amblyopia in children. The relationship between curative effect and time was also analyzed. DESIGN: A self-control experiment. SETTING: Visual Science and Optometry Center, People's Hospital of Guangxi Zhuang Autonomous Region. PARTICIPANTS: A total of 125 amblyopic children (250 amblyopic eyes), 73 males, 52 females, averaging (6±2) years of age, received treatment at the Visual Science and Optometry Center, People's Hospital of Guangxi Zhuang Autonomous Region between September 2006 and February 2007 and were recruited for this study. All children presented with no structural disease of the eyeballs. Written informed consent for therapeutic regiments was obtained from each child's parent. The protocol received approval from the Hospital's Ethics Committee. METHODS: Visual function was tested with a perceptual learning system (Research Center for Human Health and Development of Sun Yat-sen University, National Engineering Technique Research Center for Medical Care Implement) for visual noise, position noise, contour discrimination, contrast sensitivity, grating stereogram, and random-dot fusion. These tests helped to evaluate the efficiency of visual information processing of these children, and to determine the degree of defects of the optic nerve cells and the connections of visual cortical neurons. According to results of visual function tests, individualized treatment was adopted for each amblyopia patient using perceptual learning system. One course of treatment lasted one month, and treatment was performed twice every day with two training procedures (each training procedure lasted for ten minutes). There was a ten-minute time interval between the two training procedures. The training treatment was performed in a quiet and dark environment. Visual acuity and recovery of visual function were tested every month. Original training procedure was continued or adjusted according to the results of visual function. MAIN OUTCOME MEASURES: Visual function change; relationship of curative effects and curative time. RESULTS: A total of 125 amblyopia children were included in the final analysis. The total efficiency of perceptual learning for treating amblyopia in children was 75.2%. Visual acuity began to greatly increase 3 months after treatment (P 〈 0.05). Visual acuity was best corrected from 0.60 ± 0.23 before treatment to 0.86 ± 0.26 after treatment (P 〈 0.05). The mean time to reach improved levels with curative effects was (2.82 ± 1.30) months, and to reach a basically cured level was (2.87 ±1.40) months. Percentage of improved visual acuity was the highest [98% (39/40)] in children that received 3 months of treatment and the lowest [55% (31/56)] in children that received 1 month of treatment (P 〈 0.05). The percentage of basically cured levels with curative effects increased with length of learning time and was the greatest in children that received 4 months of treatment [67% (31/46), P 〈 0.05]. CONCLUSION: Perceptual learning rapidly and remarkably improves visual function of amblyopia children; however, the curative effects are first apparent two and three months after intervention.展开更多
In order to understand the influence of brittleness and confining stress on rock cuttability,the indentation tests were carried out by a conical pick on the four types of rocks.Then,the experimental results were utili...In order to understand the influence of brittleness and confining stress on rock cuttability,the indentation tests were carried out by a conical pick on the four types of rocks.Then,the experimental results were utilized to take regression analysis.The eight sets of normalized regression models were established for reflecting the relationships of peak indentation force(PIF)and specific energy(SE)with brittleness index and uniaxial confining stress.The regression analyses present that these regression models have good prediction performance.The regressive results indicate that brittleness indices and uniaxial confining stress conditions have non-linear effects on the rock cuttability that is determined by PIF and SE.Finally,the multilayer perceptual neural network was used to measure the importance weights of brittleness index and uniaxial confining stress upon the influence for rock cuttability.The results indicate that the uniaxial confining stress is more significant than brittleness index for influencing the rock cuttability.展开更多
The easy generation, storage, transmission and reproduction of digital images have caused serious abuse and security problems. Assurance of the rightful ownership, integrity, and authenticity is a major concern to the...The easy generation, storage, transmission and reproduction of digital images have caused serious abuse and security problems. Assurance of the rightful ownership, integrity, and authenticity is a major concern to the academia as well as the industry. On the other hand, efficient search of the huge amount of images has become a great challenge. Image hashing is a technique suitable for use in image authentication and content based image retrieval (CBIR). In this article, we review some representative image hashing techniques proposed in the recent years, with emphases on how to meet the conflicting requirements of perceptual robustness and security. Following a brief introduction to some earlier methods, we focus on a typical two-stage structure and some geometric-distortion resilient techniques. We then introduce two image hashing approaches developed in our own research, and reveal security problems in some existing methods due to the absence of secret keys in certain stage of the image feature extraction, or availability of a large quantity of images, keys, or the hash function to the adversary. More research efforts are needed in developing truly robust and secure image hashing techniques.展开更多
In order to solve the problem of patient information security protection in medical images,whilst also taking into consideration the unchangeable particularity of medical images to the lesion area and the need for med...In order to solve the problem of patient information security protection in medical images,whilst also taking into consideration the unchangeable particularity of medical images to the lesion area and the need for medical images themselves to be protected,a novel robust watermarking algorithm for encrypted medical images based on dual-tree complex wavelet transform and discrete cosine transform(DTCWT-DCT)and chaotic map is proposed in this paper.First,DTCWT-DCT transformation was performed on medical images,and dot product was per-formed in relation to the transformation matrix and logistic map.Inverse transformation was undertaken to obtain encrypted medical images.Then,in the low-frequency part of the DTCWT-DCT transformation coefficient of the encrypted medical image,a set of 32 bits visual feature vectors that can effectively resist geometric attacks are found to be the feature vector of the encrypted medical image by using perceptual hashing.After that,different logistic initial values and growth parameters were set to encrypt the watermark,and zero-watermark technology was used to embed and extract the encrypted medical images by combining cryptography and third-party concepts.The proposed watermarking algorithm does not change the region of interest of medical images thus it does not affect the judgment of doctors.Additionally,the security of the algorithm is enhanced by using chaotic mapping,which is sensitive to the initial value in order to encrypt the medical image and the watermark.The simulation results show that the pro-posed algorithm has good homomorphism,which can not only protect the original medical image and the watermark information,but can also embed and extract the watermark directly in the encrypted image,eliminating the potential risk of decrypting the embedded watermark and extracting watermark.Compared with the recent related research,the proposed algorithm solves the contradiction between robustness and invisibility of the watermarking algorithm for encrypted medical images,and it has good results against both conventional attacks and geometric attacks.Under geometric attacks in particular,the proposed algorithm performs much better than existing algorithms.展开更多
Perceptual auditory filter banks such as Bark-scale filter bank are widely used as front-end processing in speech recognition systems.However,the problem of the design of optimized filter banks that provide higher acc...Perceptual auditory filter banks such as Bark-scale filter bank are widely used as front-end processing in speech recognition systems.However,the problem of the design of optimized filter banks that provide higher accuracy in recognition tasks is still open.Owing to spectral analysis in feature extraction,an adaptive bands filter bank (ABFB) is presented.The design adopts flexible bandwidths and center frequencies for the frequency responses of the filters and utilizes genetic algorithm (GA) to optimize the design parameters.The optimization process is realized by combining the front-end filter bank with the back-end recognition network in the performance evaluation loop.The deployment of ABFB together with zero-crossing peak amplitude (ZCPA) feature as a front process for radial basis function (RBF) system shows significant improvement in robustness compared with the Bark-scale filter bank.In ABFB,several sub-bands are still more concentrated toward lower frequency but their exact locations are determined by the performance rather than the perceptual criteria.For the ease of optimization,only symmetrical bands are considered here,which still provide satisfactory results.展开更多
基金This study was supported by:Inner Mongolia Academy of Forestry Sciences Open Research Project(Grant No.KF2024MS03)The Project to Improve the Scientific Research Capacity of the Inner Mongolia Academy of Forestry Sciences(Grant No.2024NLTS04)The Innovation and Entrepreneurship Training Program for Undergraduates of Beijing Forestry University(Grant No.X202410022268).
文摘Remote sensing image super-resolution technology is pivotal for enhancing image quality in critical applications including environmental monitoring,urban planning,and disaster assessment.However,traditional methods exhibit deficiencies in detail recovery and noise suppression,particularly when processing complex landscapes(e.g.,forests,farmlands),leading to artifacts and spectral distortions that limit practical utility.To address this,we propose an enhanced Super-Resolution Generative Adversarial Network(SRGAN)framework featuring three key innovations:(1)Replacement of L1/L2 loss with a robust Charbonnier loss to suppress noise while preserving edge details via adaptive gradient balancing;(2)A multi-loss joint optimization strategy dynamically weighting Charbonnier loss(β=0.5),Visual Geometry Group(VGG)perceptual loss(α=1),and adversarial loss(γ=0.1)to synergize pixel-level accuracy and perceptual quality;(3)A multi-scale residual network(MSRN)capturing cross-scale texture features(e.g.,forest canopies,mountain contours).Validated on Sentinel-2(10 m)and SPOT-6/7(2.5 m)datasets covering 904 km2 in Motuo County,Xizang,our method outperforms the SRGAN baseline(SR4RS)with Peak Signal-to-Noise Ratio(PSNR)gains of 0.29 dB and Structural Similarity Index(SSIM)improvements of 3.08%on forest imagery.Visual comparisons confirm enhanced texture continuity despite marginal Learned Perceptual Image Patch Similarity(LPIPS)increases.The method significantly improves noise robustness and edge retention in complex geomorphology,demonstrating 18%faster response in forest fire early warning and providing high-resolution support for agricultural/urban monitoring.Future work will integrate spectral constraints and lightweight architectures.
文摘Watermarking is embedding visible or invisible data within media to verify its authenticity or protect copyright.The watermark is embedded in significant spatial or frequency features of the media to make it more resistant to intentional or unintentional modification.Some of these features are important perceptual features according to the human visual system(HVS),which means that the embedded watermark should be imperceptible in these features.Therefore,both the designers of watermarking algorithms and potential attackers must consider these perceptual features when carrying out their actions.The two roles will be considered in this paper when designing a robust watermarking algorithm against the most harmful attacks,like volumetric scaling,histogram equalization,and non-conventional watermarking attacks like the Denoising Convolution Neural Network(DnCNN),which must be considered in watermarking algorithm design due to its rising role in the state-of-the-art attacks.The DnCNN is initialized and trained using watermarked image samples created by our proposed Covert and Severe Attacks Resistant Watermarking Algorithm(CSRWA)to prove its robustness.For this algorithm to satisfy the robustness and imperceptibility tradeoff,implementing the Dither Modulation(DM)algorithm is boosted by utilizing the Just Noticeable Distortion(JND)principle to get an improved performance in this sense.Sensitivity,luminance,inter and intra-block contrast are used to adjust the JND values.
文摘Objective Autism spectrum disorder(ASD)is a neurodevelopmental condition characterized by difficulties with communication and social interaction,restricted and repetitive behaviors.Previous studies have indicated that individuals with ASD exhibit early and lifelong attention deficits,which are closely related to the core symptoms of ASD.Basic visual attention processes may provide a critical foundation for their social communication and interaction abilities.Therefore,this study explores the behavior of children with ASD in capturing attention to changes in topological properties.Methods Our study recruited twenty-seven ASD children diagnosed by professional clinicians according to DSM-5 and twenty-eight typically developing(TD)age-matched controls.In an attention capture task,we recorded the saccadic behaviors of children with ASD and TD in response to topological change(TC)and non-topological change(nTC)stimuli.Saccadic reaction time(SRT),visual search time(VS),and first fixation dwell time(FFDT)were used as indicators of attentional bias.Pearson correlation tests between the clinical assessment scales and attentional bias were conducted.Results This study found that TD children had significantly faster SRT(P<0.05)and VS(P<0.05)for the TC stimuli compared to the nTC stimuli,while the children with ASD did not exhibit significant differences in either measure(P>0.05).Additionally,ASD children demonstrated significantly less attention towards the TC targets(measured by FFDT),in comparison to TD children(P<0.05).Furthermore,ASD children exhibited a significant negative linear correlation between their attentional bias(measured by VS)and their scores on the compulsive subscale(P<0.05).Conclusion The results suggest that children with ASD have difficulty shifting their attention to objects with topological changes during change detection.This atypical attention may affect the child’s cognitive and behavioral development,thereby impacting their social communication and interaction.In sum,our findings indicate that difficulties in attentional capture by TC may be a key feature of ASD.
文摘Picture is a means of objects’representation and inter-subject communication,where various ways of spatial modelling interact.Unlike arbitrary signs,picture not only represents something different from itself,but shows the represented objects for viewer’s perception.Therefore,the external modelling in form of a material bearer is connected always with internal modelling of depicted objects.Not only perceptual images,but also internal models of other psychical levels participate in creation and interpretation of the pictures.These are,on the one hand,images of the apperceptual and conceptual levels,where schemes of represented objects and their verbal interpretations are formed.On the other hand,the images of sensorial level participate also in the internal modelling.All these mental models interact differently by artists and viewers and influence organization of external pictorial models mediating their contacts.
基金supported by the Nano&Material Technology Development Program through the National Research Foundation of Korea(NRF)funded by the Ministry of Science and ICT(Grant Nos.RS-2024-00403639 and RS2024-00411904)。
文摘Artificial sensory systems(ASS)are pivotal to next-generation extended reality technologies,now evolving into flexible platforms for comfortable wear and immersive user experiences,while ensuring high performance and operational reliability.To address these demands,metal-based nanoparticles(NPs),such as noble metal,oxide,and multi-elemental NPs,have been extensively incorporated into functional materials of sensory and synaptic devices due to their tunable optical,electrical,and chemical properties,enhancing sensory precision,stability,and environmental adaptability.However,traditional NP fabrication methods often involve complex processing,residual contaminants,and scalability issues,limiting their effectiveness in ASS applications.State-of-the-art laser ablation in liquids(LAL)presents a promising alternative,offering scalable production of surfactant-free NPs with customizable physicochemical properties,though their application in electronics remains underexplored.This review delves into the transformative potential of LAL-fabricated NPs in ASS,covering the fundamental mechanisms of LAL,the role of process parameters,the derivative strategies for size modulation,the diversity of metal-based NPs,their applications in sensory and synaptic devices,and the challenges and perspectives for meeting industrial standards.Bridging the gap between LAL and ASS is poised to revolutionize both industrial manufacturing and academic research by offering scalable solutions to overcome intrinsic tradeoffs between flexibility and performance,fostering innovations in human-centric,immersive electronics.
基金supported by the Program of Graduate Education and Teaching Reform in Tianjin University of Technology(Nos.YBXM2204 and ZDXM2202)the National Natural Science Foundation of China(Nos.62203331 and 62103299)。
文摘Although deep learning methods have been widely applied in slam visual odometry(VO)over the past decade with impressive improvements,the accuracy remains limited in complex dynamic environments.In this paper,a composite mask-based generative adversarial network(CMGAN)is introduced to predict camera motion and binocular depth maps.Specifically,a perceptual generator is constructed to obtain the corresponding parallax map and optical flow between two neighboring frames.Then,an iterative pose improvement strategy is proposed to improve the accuracy of pose estimation.Finally,a composite mask is embedded in the discriminator to sense structural deformation in the synthesized virtual image,thereby increasing the overall structural constraints of the network model,improving the accuracy of camera pose estimation,and reducing drift issues in the VO.Detailed quantitative and qualitative evaluations on the KITTI dataset show that the proposed framework outperforms existing conventional,supervised learning and unsupervised depth VO methods,providing better results in both pose estimation and depth estimation.
基金This work was supported by the Foundationfor Key Laboratories of Anhui Province andthe Initiating Fundfor Ph.D.in AnhuiNormal University
文摘Perceptual learning of orientation discrimination was investigated using cats. Two adult cats (Cat 1 and 2) were trained to monocularly discriminate between two static striped sinusoidal grates with 30° orientation difference. After greater than 80% correct performance was reached, cats were then required to monocularly perform a discrimination between two grates with consecutively shifting orientation difference(2°, 4°, 6°, 8°, 10°, 12°, 16°, 20°, 24°, 30°) . The staircase method (two correct-down and one error-up) was applied throughout the training to track the threshold of orientation difference that cats could detect. The performance of detecting grates with varied orientation difference was measured respectively for beth trained and untrained eyes before and after training. Our results showed that the learning effect of discrimination for grates with a fixed orientation difference transferred completely from the trained eye to the untrained eye, whereas the inter-eye transfer for detecting °ates with gradually reducing orientation difference was almost nonegrates. The two opposite learning effects in the same subject strongly suggest that different information processing mechanisms might mediate the learning processes.
基金The National Natural Science Foundation of China(No.81272501)the National Basic Research Program of China(973Program)(No.2011CB707904)Taishan Scholars Program of Shandong Province,China(No.ts20120505)
文摘To further explore the human visual system( HVS),the perceptual grouping( PG), which has been proven to play an important role in the HVS, is adopted to design an effective image quality assessment( IQA) model. Compared with the existing fixed-window-based models, the proposed one is an adaptive window-like model that introduces the perceptual grouping strategy into the IQA model. It works as follows: first,it preprocesses the images by clustering similar pixels into a group to the greatest extent; then the structural similarity is used to compute the similarity of the superpixels between reference and distorted images; finally, it integrates all the similarity of superpixels of an image to yield a quality score. Experimental results on three databases( LIVE, IVC and MICT) showthat the proposed method yields good performance in terms of correlation with human judgments of visual quality.
文摘Through questionnaires and perceptual maps,we made a variation analysis of landscape spatial image to the people in the Pearl River Delta from the viewpoint of the subjective perception.It is found that the people of different ages and different education levels from different areas hold diversified perceptions towards rural landscape space,and they have different ideal rural landscapes.However,there is a common point that the development of rural landscape should retain its excellent local landscape features and the local spirit,which will provide references for building the ideal rural landscape model.
基金The National Natural Science Foundation of China (No.60472058, 60975017)
文摘In order to achieve better perceptual coding quality while using fewer bits, a novel perceptual video coding method based on the just-noticeable-distortion (JND) model and the auto-regressive (AR) model is explored. First, a new texture segmentation method exploiting the JND profile is devised to detect and classify texture regions in video scenes. In this step, a spatial-temporal JND model is proposed and the JND energy of every micro-block unit is computed and compared with the threshold. Secondly, in order to effectively remove temporal redundancies while preserving high visual quality, an AR model is applied to synthesize the texture regions. All the parameters of the AR model are obtained by the least-squares method and each pixel in the texture region is generated as a linear combination of pixels taken from the closest forward and backward reference frames. Finally, the proposed method is compared with the H.264/AVC video coding system to demonstrate the performance. Various sequences with different types of texture regions are used in the experiment and the results show that the proposed method can reduce the bit-rate by 15% to 58% while maintaining good perceptual quality.
基金funded by the Fonds de Recherche du QuébecNature et Technologies (FRQNT)
文摘The severity of the hot and humid conditions to which miners are exposed increases as the depth of the work site increases.This can cause heat stress that can greatly affect the health and safety of workers.To resolve this,a cooling garment has been developed that uses an atmospheric discharge of liquid CO2 to create a cool microclimate with an average temperature of 12.5(±0.4)℃ beneath the garment.To evaluate the garment's cooling efficiency,19 male subjects participated in an experimental procedure.The two modes,cooling on and off,were compared.Significant physiological differences were found between the two modes after minute 27(p<0.05)until the end of the recovery phase for the heart rate(maximum difference of 10 beats per minute)and the internal body temperature(maximum difference of 0.33℃).It was found that the modes also affected the subjects'perceptions.The ON-mode was associated with better well-being and thermal comfort,and reduced humidity sensation.Perceptions of exertion were lower in the ON-mode condition from minute 2.The findings provide strong evidence of the ability of this cooling garment to reduce heat stress in hot and humid conditions similar to those encountered in deep mines.
基金Grant from Major Scientific Research Program of Medical Treatment and Public Health of Guangxi Zhuang Autonomous Region, No.200730
文摘BACKGROUND: Conventional methods (such as occlusion therapy, fine manipulation, complementary, and alternative medicine) take effects slowly, are time and labor consuming, and have uncertain curative effects in the treatment of amblyopia. Perceptual learning, a new method for treating amblyopia, improves the ability to process signals from the cerebral optic nerve system by specific visual stimulation and visual learning, as well as activation of the visual signal pathway utilizing brain nervous system plasticity. OBJECTIVE: This study investigated and evaluated the curative effects of perceptual learning, which can directionally increase brain plasticity, on the treatment of amblyopia in children. The relationship between curative effect and time was also analyzed. DESIGN: A self-control experiment. SETTING: Visual Science and Optometry Center, People's Hospital of Guangxi Zhuang Autonomous Region. PARTICIPANTS: A total of 125 amblyopic children (250 amblyopic eyes), 73 males, 52 females, averaging (6±2) years of age, received treatment at the Visual Science and Optometry Center, People's Hospital of Guangxi Zhuang Autonomous Region between September 2006 and February 2007 and were recruited for this study. All children presented with no structural disease of the eyeballs. Written informed consent for therapeutic regiments was obtained from each child's parent. The protocol received approval from the Hospital's Ethics Committee. METHODS: Visual function was tested with a perceptual learning system (Research Center for Human Health and Development of Sun Yat-sen University, National Engineering Technique Research Center for Medical Care Implement) for visual noise, position noise, contour discrimination, contrast sensitivity, grating stereogram, and random-dot fusion. These tests helped to evaluate the efficiency of visual information processing of these children, and to determine the degree of defects of the optic nerve cells and the connections of visual cortical neurons. According to results of visual function tests, individualized treatment was adopted for each amblyopia patient using perceptual learning system. One course of treatment lasted one month, and treatment was performed twice every day with two training procedures (each training procedure lasted for ten minutes). There was a ten-minute time interval between the two training procedures. The training treatment was performed in a quiet and dark environment. Visual acuity and recovery of visual function were tested every month. Original training procedure was continued or adjusted according to the results of visual function. MAIN OUTCOME MEASURES: Visual function change; relationship of curative effects and curative time. RESULTS: A total of 125 amblyopia children were included in the final analysis. The total efficiency of perceptual learning for treating amblyopia in children was 75.2%. Visual acuity began to greatly increase 3 months after treatment (P 〈 0.05). Visual acuity was best corrected from 0.60 ± 0.23 before treatment to 0.86 ± 0.26 after treatment (P 〈 0.05). The mean time to reach improved levels with curative effects was (2.82 ± 1.30) months, and to reach a basically cured level was (2.87 ±1.40) months. Percentage of improved visual acuity was the highest [98% (39/40)] in children that received 3 months of treatment and the lowest [55% (31/56)] in children that received 1 month of treatment (P 〈 0.05). The percentage of basically cured levels with curative effects increased with length of learning time and was the greatest in children that received 4 months of treatment [67% (31/46), P 〈 0.05]. CONCLUSION: Perceptual learning rapidly and remarkably improves visual function of amblyopia children; however, the curative effects are first apparent two and three months after intervention.
基金Project(51904333) supported by the National Natural Science Foundation of China。
文摘In order to understand the influence of brittleness and confining stress on rock cuttability,the indentation tests were carried out by a conical pick on the four types of rocks.Then,the experimental results were utilized to take regression analysis.The eight sets of normalized regression models were established for reflecting the relationships of peak indentation force(PIF)and specific energy(SE)with brittleness index and uniaxial confining stress.The regression analyses present that these regression models have good prediction performance.The regressive results indicate that brittleness indices and uniaxial confining stress conditions have non-linear effects on the rock cuttability that is determined by PIF and SE.Finally,the multilayer perceptual neural network was used to measure the importance weights of brittleness index and uniaxial confining stress upon the influence for rock cuttability.The results indicate that the uniaxial confining stress is more significant than brittleness index for influencing the rock cuttability.
基金supported by the National Natural Science Foundation of China(Grant No.60502039),the Shanghai Rising-Star Program(Grant No.06QA14022),and the Key project of Shanghai Municipality for Basic Research (Grant No.04JC14037)
文摘The easy generation, storage, transmission and reproduction of digital images have caused serious abuse and security problems. Assurance of the rightful ownership, integrity, and authenticity is a major concern to the academia as well as the industry. On the other hand, efficient search of the huge amount of images has become a great challenge. Image hashing is a technique suitable for use in image authentication and content based image retrieval (CBIR). In this article, we review some representative image hashing techniques proposed in the recent years, with emphases on how to meet the conflicting requirements of perceptual robustness and security. Following a brief introduction to some earlier methods, we focus on a typical two-stage structure and some geometric-distortion resilient techniques. We then introduce two image hashing approaches developed in our own research, and reveal security problems in some existing methods due to the absence of secret keys in certain stage of the image feature extraction, or availability of a large quantity of images, keys, or the hash function to the adversary. More research efforts are needed in developing truly robust and secure image hashing techniques.
基金supported by the Key Research Project of Hainan Province[ZDYF2018129]the Higher Education Research Project of Hainan Province(Hnky2019-73)+3 种基金the National Natural Science Foundation of China[61762033]the Natural Science Foundation of Hainan[617175]the Special Scientific Research Project of Philosophy and Social Sciences of Chongqing Medical University[201703]the Key Research Project of Haikou College of Economics[HJKZ18-01].
文摘In order to solve the problem of patient information security protection in medical images,whilst also taking into consideration the unchangeable particularity of medical images to the lesion area and the need for medical images themselves to be protected,a novel robust watermarking algorithm for encrypted medical images based on dual-tree complex wavelet transform and discrete cosine transform(DTCWT-DCT)and chaotic map is proposed in this paper.First,DTCWT-DCT transformation was performed on medical images,and dot product was per-formed in relation to the transformation matrix and logistic map.Inverse transformation was undertaken to obtain encrypted medical images.Then,in the low-frequency part of the DTCWT-DCT transformation coefficient of the encrypted medical image,a set of 32 bits visual feature vectors that can effectively resist geometric attacks are found to be the feature vector of the encrypted medical image by using perceptual hashing.After that,different logistic initial values and growth parameters were set to encrypt the watermark,and zero-watermark technology was used to embed and extract the encrypted medical images by combining cryptography and third-party concepts.The proposed watermarking algorithm does not change the region of interest of medical images thus it does not affect the judgment of doctors.Additionally,the security of the algorithm is enhanced by using chaotic mapping,which is sensitive to the initial value in order to encrypt the medical image and the watermark.The simulation results show that the pro-posed algorithm has good homomorphism,which can not only protect the original medical image and the watermark information,but can also embed and extract the watermark directly in the encrypted image,eliminating the potential risk of decrypting the embedded watermark and extracting watermark.Compared with the recent related research,the proposed algorithm solves the contradiction between robustness and invisibility of the watermarking algorithm for encrypted medical images,and it has good results against both conventional attacks and geometric attacks.Under geometric attacks in particular,the proposed algorithm performs much better than existing algorithms.
基金Project(61072087) supported by the National Natural Science Foundation of ChinaProject(20093048) supported by Shanxi ProvincialGraduate Innovation Fund of China
文摘Perceptual auditory filter banks such as Bark-scale filter bank are widely used as front-end processing in speech recognition systems.However,the problem of the design of optimized filter banks that provide higher accuracy in recognition tasks is still open.Owing to spectral analysis in feature extraction,an adaptive bands filter bank (ABFB) is presented.The design adopts flexible bandwidths and center frequencies for the frequency responses of the filters and utilizes genetic algorithm (GA) to optimize the design parameters.The optimization process is realized by combining the front-end filter bank with the back-end recognition network in the performance evaluation loop.The deployment of ABFB together with zero-crossing peak amplitude (ZCPA) feature as a front process for radial basis function (RBF) system shows significant improvement in robustness compared with the Bark-scale filter bank.In ABFB,several sub-bands are still more concentrated toward lower frequency but their exact locations are determined by the performance rather than the perceptual criteria.For the ease of optimization,only symmetrical bands are considered here,which still provide satisfactory results.