AIM:To explore the effect of visual perception learning software training(VPT)on binocular visual function reconstruction in children with intermittent exotropia after strabismus surgery.METHODS:Ninety children with i...AIM:To explore the effect of visual perception learning software training(VPT)on binocular visual function reconstruction in children with intermittent exotropia after strabismus surgery.METHODS:Ninety children with intermittent exotropia admitted to our hospital from June 2018 to December 2018 were included,and randomly divided into VPT and control groups.Children in the control group received basic binocular vision training,while those in the VPT group received VPT after strabismus surgery.Tertiary visual function,visual perception function,Newcastle Control Score(NCS),and ocular position retraction rate were compared at 3 and 12mo after the surgery.RESULTS:At 3 and 12mo after the surgery,the proportion of simultaneous perception,binocular fusion version and binocular stereo vision in the VPT group was conspicuously higher than that in the control group(P<0.05).After the vision training,the binocular visual perception functions of children in both groups were significantly improved compared with that before training(P<0.05).Interestingly,the grating sharpness,texture perception and texture motion perception in the VPT group were dramatically better than control group(P<0.01).The NCS in the VPT group was significantly lower than that in the control group(P<0.05).The ocular position retraction rate in the VPT group was significantly lower than that in the control group at 12mo(8.89%vs 26.67%,P=0.03).CONCLUSION:VPT effectively promotes binocular visual function reconstruction in intermittent exotropia children after strabismus surgery and reduces the strabismus severity and ocular position retraction rate.展开更多
The evaluation index of camouflage patterns is important in the field of military application.It is the goal that researchers have always pursued to make the computable evaluation indicators more in line with the huma...The evaluation index of camouflage patterns is important in the field of military application.It is the goal that researchers have always pursued to make the computable evaluation indicators more in line with the human visual mechanism.In order to make the evaluation method more computationally intelligent,a Multi-Feature Camouflage Fused Index(MF-CFI)is proposed based on the comparison of grayscale,color and texture features between the target and the background.In order to verify the effectiveness of the proposed index,eye movement experiments are conducted to compare the proposed index with existing indexes including Universal Image Quality Index(UIQI),Camouflage Similarity Index(CSI)and Structural Similarity(SSIM).Twenty-four different simulated targets are designed in a grassland background,28 observers participate in the experiment and record the eye movement data during the observation process.The results show that the highest Pearson correlation coefficient is observed between MF-CFI and the eye movement data,both in the designed digital camouflage patterns and largespot camouflage patterns.Since MF-CFI is more in line with the detection law of camouflage targets in human visual perception,the proposed index can be used for the comparison and parameter optimization of camouflage design algorithms.展开更多
The human visual system,dependent on retinal cells,can be regarded as a complex combination of optical system and nervous system.Artificial retinal system could mimic the sensing and processing function of human eyes....The human visual system,dependent on retinal cells,can be regarded as a complex combination of optical system and nervous system.Artificial retinal system could mimic the sensing and processing function of human eyes.Optically stimulated synaptic devices could serve as the building blocks for artificial retinas and subsequent information transmission system to brain.Herein,photonic synaptic transistors based on polycrystalline MoS_(2),which could simulate human visual perception and brain storage,are presented.Moreover,the photodetection range from visible light to near-infrared light of MoS_(2) multilayer could extend human eyes’vision limitation to near-infrared light.Additionally,the photonic synaptic transistor shows an ultrafast speed within 5μs and ultralow power consumption under optical stimuli about 40 aJ,several orders of magnitude lower than biological synapses(50 ms and 10 fJ).Furthermore,the backgate control could act as emotional modulation of the artificial brain to enhance or suppress memory function,i.e.the intensity of photoresponse.The proposed carrier trapping/detrapping as the main working mechanism is presented for the device.In addition,synaptic functionalities including short synaptic plasticity,long synaptic plasticity and paired-pulse facilitation could be successfully simulated based on the prepared device.Furthermore,the large difference between short synaptic plasticity and long synaptic plasticity reveals the better image pre-processing function of the prepared photonic synapses.The classical Pavlovian conditioning associated with the associative learning is successfully implemented as well.Therefore,the efficient and rich functionalities demonstrate the potential of the MoS_(2) synaptic device that integrates sensing-memory-preprocessing capabilities for realizing artificial neural networks with different emotions that mimic human retina and brain.展开更多
AIM:To compare the visual perception(color and chromatic-achromatic contrast vision)of a small cohort of COVID-19 patients at the time of infection and after 6mo with that of a healthy population matched for sex and a...AIM:To compare the visual perception(color and chromatic-achromatic contrast vision)of a small cohort of COVID-19 patients at the time of infection and after 6mo with that of a healthy population matched for sex and age.METHODS:A total of 25 patients(9 females,16 males,mean age:54±10y)with COVID-19 hospitalized in the COVID-19 Unit of the University Clinical Hospital of Valladolid were recruited for this preliminary study.Visual perception,as determined by monocular measurement of contrast sensitivity function(CSF)and color vision was assessed in each patient using the Optopad test.The results obtained were then compared with those of a sample of 16 age-and sex-matched healthy controls(5 females,11 males,mean age:50±6y)in which the same measurement procedure was repeated.Statistically significant differences between groups were assessed using the Mann-Whitney U test.Measurements were repeated after a minimum follow-up period of 6mo and statistically significant differences between the two time points in each group were assessed using the Wilcoxon signed rank test.RESULTS:Discrimination thresholds(color and chromatic-achromatic contrast vision)and their corresponding sensitivity,calculated as the inverse of the discrimination threshold,were evaluated.Analysis of the data revealed higher contrast threshold results(i.e.,worse contrast sensitivity)in the COVID-19 group than in the control group for all spatial frequencies studied in the Optopad-CSF achromatic test and most of the spatial frequencies studied in the Optopad-CSF chromatic test for the red-green and blue-yellow mechanisms.In addition,color threshold results in the COVID-19 group were also significantly higher(i.e.,worse color sensitivity)for almost all color mechanisms studied in the Optopad-Color test.At 6mo,most of the differences found between the groups were maintained despite COVID-19 recovery.CONCLUSION:The present results provide preliminary evidence that visual perception may be impaired in COVID-19,even when the infection has passed.Although further research is needed to determine the precise causes of this finding,analysis of CSF and color vision could provide valuable information on the visual impact of COVID-19.展开更多
A new scheme named personalized image retrieval technique based on visual perception is proposed in this letter, whose motive is to narrow the semantic gap by directly perceiving user's visual information. It uses...A new scheme named personalized image retrieval technique based on visual perception is proposed in this letter, whose motive is to narrow the semantic gap by directly perceiving user's visual information. It uses visual attention model to segment image regions and eye-tracking technique to record fixations. Visual perception is obtained by analyzing the fixations in regions to measure gaze interests. Integrating visual perception into attention model is to detect the Regions Of Interest (ROIs), whose features are extracted and analyzed, then feedback interests to optimize the results and construct user profiles.展开更多
Vehicle recognition system (VRS) plays a very important role in the field of intelligent transportation systems.A novel and intuitive method is proposed for vehicle location.The method we provide for vehicle location ...Vehicle recognition system (VRS) plays a very important role in the field of intelligent transportation systems.A novel and intuitive method is proposed for vehicle location.The method we provide for vehicle location is based on human visual perception model technique. The perception color space HSI in this algorithm is adopted.Three color components of a color image and more potential edge patterns are integrated for solving the feature extraction problem.A fast and automatic threshold technique based on human visual perception model is also developed.The vertical edge projection and horizontal edge projection are adopted for locating left-right boundary of vehicle and top-bottom boundary of vehicle, respectively. Very promising experimental results are obtained using real-time vehicle image sequences, which have confirmed that this proposed location vehicle method is efficient and reliable, and its calculation speed meets the needs of the VRS.展开更多
The gestalt principles are perception and what is visually communicated by objects. These principles describe the visual language within which we work. Understanding how a designer realizes things visually will help d...The gestalt principles are perception and what is visually communicated by objects. These principles describe the visual language within which we work. Understanding how a designer realizes things visually will help designers communicate better. Many designers use Gestalt principles to have a structure visual stimulus, so as to create interfaces that are easy for users to understand. Because the understanding of visual perception is a critical item in any designer’s toolkit. Islamic heritage buildings also contain many architectural values, especially the Sultan Hassan Mosque in Cairo. Through this paper, we will verify the use of gestalt principles of visual perception in the building design. Because the psychological organization of the design, achieves the understanding and awareness of the current relationships between the elements, depending on the extent of regularity, accuracy, and consistency of the design elements. This makes us recognize the esthetic values of Sultan Hassan’s building and mosque. The visual delight we get from architecture begins with how we perceive it. So, in order to understand how visual delight arises, we must first understand how we perceive and interpret visual stimuli. Especially since the principles of gestalt confirm that the human brain tends to make tricks and illusions. Therefore, we found the mosque designer had exploited this fact during the design process, to exclude any possibility of the resulting misunderstanding. The designer achieved the psychological preparation of the visitor by using some principles of visual perception of the Gestalt theory, such as growth, continuity, and hierarchy.展开更多
Background Three-dimensional(3D)building models with unambiguous roof plane geometry parameters,roof structure units,and linked topology provide essential data for many applications related to human activities in urba...Background Three-dimensional(3D)building models with unambiguous roof plane geometry parameters,roof structure units,and linked topology provide essential data for many applications related to human activities in urban environments.The task of 3D reconstruction from point clouds is still in the development phase,especially the recognition and interpretation of roof topological structures.Methods This study proposes a novel visual perception-based approach to automatically decompose and reconstruct building point clouds into meaningful and simple parametric structures,while the associated mutual relationships between the roof plane geometry and roof structure units are expressed by a hierarchical topology tree.First,a roof plane extraction is performed by a multi-label graph cut energy optimization framework and a roof structure graph(RSG)model is then constructed to describe the roof topological geometry with common adjacency,symmetry,and convexity rules.Moreover,a progressive roof decomposition and refinement are performed,generating a hierarchical representation of the 3D roof structure models.Finally,a visual plane fitted residual or area constraint process is adopted to generate the RSG model with different levels of details.Results Two airborne laser scanning datasets with different point densities and roof styles were tested,and the performance evaluation metrics were obtained by International Society for Photogrammetry and Remote Sensing,achieving a correctness and accuracy of 97.7%and 0.29m,respectively.Conclusions The standardized assessment results demonstrate the effectiveness and robustness of the proposed approach,showing its ability to generate a variety of structural models,even with missing data.展开更多
The process of human natural scene categorization consists of two correlated stages: visual perception and visual cognition of natural scenes.Inspired by this fact,we propose a biologically plausible approach for natu...The process of human natural scene categorization consists of two correlated stages: visual perception and visual cognition of natural scenes.Inspired by this fact,we propose a biologically plausible approach for natural scene image classification.This approach consists of one visual perception model and two visual cognition models.The visual perception model,composed of two steps,is used to extract discriminative features from natural scene images.In the first step,we mimic the oriented and bandpass properties of human primary visual cortex by a special complex wavelets transform,which can decompose a natural scene image into a series of 2D spatial structure signals.In the second step,a hybrid statistical feature extraction method is used to generate gist features from those 2D spatial structure signals.Then we design a cognitive feedback model to realize adaptive optimization for the visual perception model.At last,we build a multiple semantics based cognition model to imitate human cognitive mode in rapid natural scene categorization.Experiments on natural scene datasets show that the proposed method achieves high efficiency and accuracy for natural scene classification.展开更多
The advantage of electronic devices is to eliminate the subjectivity. Based on individual capabilities, visual perception has its disadvantages. The aim of this study was to find out the possible shade difference betw...The advantage of electronic devices is to eliminate the subjectivity. Based on individual capabilities, visual perception has its disadvantages. The aim of this study was to find out the possible shade difference between the upper and lower teeth, visual perception versus spectrophotometry as well as proving that females are better in visual perception of colour matching. 82 subjects (dental students) of this study determined the color of each other’s natural teeth. The color of 1640 teeth (incisors, canines and premolars), was matched by visual perception and by Vita Easyshade spectrophotometer. The matching was focused on the middle third of buccal surfaces of the examined teeth. In this study, it was found that there was a difference of color between the teeth of the upper and lower jaw. The percentage of the accordance between two methods was 28.35% for all examined teeth. The color determination at central incisors showed the highest value of accordance 40.85%. Based on Vitapan 3D Mastershade, the most frequent color was 2M2.展开更多
In recent years,humanoid robots have gained significant attention due to their potential to revolutionize various industries,from healthcare to manufacturing.A key factor driving this transformation is the advancement...In recent years,humanoid robots have gained significant attention due to their potential to revolutionize various industries,from healthcare to manufacturing.A key factor driving this transformation is the advancement of visual perception systems,which are crucial for making humanoid robots more intelligent and autonomous.Despite the progress,the full potential of vision-based technologies in humanoid robots has yet to be fully realized.This review aims to provide a comprehensive overview of recent advancements in visual perception applied to humanoid robots,specifically focusing on applications in state estimation and environmental interaction.By summarizing key developments and analyzing the challenges and opportunities in these areas,this paper seeks to inspire future research that can unlock new capabilities for humanoid robots,enabling them to better navigate complex environments,perform intricate tasks,and interact seamlessly with humans.展开更多
Photonic synapses combining photosensitivity and synaptic function can efficiently perceive and memorize visual information,making them crucial for the development of artificial vision systems.However,the development ...Photonic synapses combining photosensitivity and synaptic function can efficiently perceive and memorize visual information,making them crucial for the development of artificial vision systems.However,the development of high-performance photonic synapses with low power consumption and rapid optical erasing ability remains challenging.Here,we propose a photon-modulated charging/discharging mechanism for self-powered photonic synapses.The current hysteresis enables the devices based on CsPbBr3/solvent/carbon nitride multilayer architecture to emulate synaptic behaviors,such as excitatory postsynaptic currents,paired-pulse facilitation,and long/short-term memory.Intriguingly,the unique radiation direction-dependent photocurrent endows the photonic synapses with the capability of optical writing and rapid optical erasing.Moreover,the photonic synapses exhibit exceptional performance in contrast enhancement and noise reduction owing to the notable synaptic plasticity.In simulations based on artificial neural network(ANN)algorithms,the pre-processing by our photonic synapses improves the recognition rate of handwritten digit from 11.4%(200 training epochs)to 85%(~60 training epochs).Furthermore,due to the excellent feature extraction and memory capability,an array based on the photonic synapses can imitate facial recognition of human retina without the assistance of ANN.展开更多
When presented with visual stimuli of face images,the ventral stream visual cortex of the human brain exhibits face-specific activity that is modulated by the physical properties of the input images.However,it is stil...When presented with visual stimuli of face images,the ventral stream visual cortex of the human brain exhibits face-specific activity that is modulated by the physical properties of the input images.However,it is still unclear whether this activity relates to conscious face perception.We explored this issue by using the human intracranial electroencephalography technique.Our results showed that face-specific activity in the ventral stream visual cortex was significantly higher when the subjects subjectively saw faces than when they did not,even when face stimuli were presented in both conditions.In addition,the face-specific neural activity exhibited a more reliable neural response and increased posterior-anterior direction information transfer in the“seen”condition than the“unseen”condition.Furthermore,the face-specific neural activity was significantly correlated with performance.These findings support the view that face-specific activity in the ventral stream visual cortex is linked to conscious face perception.展开更多
As an essential part of the urban landscape,linear urban landscape(LUL)is the interaction between humans and nature,which is closely associated with daily life and brings multiple characteristics to visual perception....As an essential part of the urban landscape,linear urban landscape(LUL)is the interaction between humans and nature,which is closely associated with daily life and brings multiple characteristics to visual perception.Current studies focus on complex models that describe visual perception using static viewpoints,but lossing the continuous and dynamic features of visual perception.This paper provides a general framework that can quantify dynamic visual perception based on urban morphology and improves accuracy in the descriptions of LUL linear spatial characteristics.Based on Beijing-Hangzhou Grand Canal(Hangzhou urban section),the proposed framework combines the indicators of multiple dimensions to quantify dynamic visual perception and emphasizes the continuity of LUL.To represent the dynamic visual perception and the spatial pattern characteristics of LUL,different evaluation criteria of indicators are set according to landscape scales.To minimize subjectivity and uncertainty caused by subjective cognition and fulfill the landscape pattern under different urban development policies,we set up distinct scenario preference patterns.With appropriate fine-tuning of scenario preference patterns and setting of movement types,the proposed method can be adapted to other LUL projects and aspires to provide a general methodology and scientific guidance for urban planning and landscape management.展开更多
A collage is a composite artwork made from the spatial layout of multiple pictures on a canvas,collected from the Internet or user photographs.Collages,usually made by skilled artists,involve a complex manual process,...A collage is a composite artwork made from the spatial layout of multiple pictures on a canvas,collected from the Internet or user photographs.Collages,usually made by skilled artists,involve a complex manual process,especially when searching for component pictures and adjusting their spatial layout to meet artistic requirements.In this paper,we present a visual perception driven method for automatically synthesizing visually pleasing collages.Unlike previous works,we focus on how to design a collage layout which not only provides easy access to the theme of the overall image,but also conforms to human visual perception.To achieve this goal,we formulate the generation of collages as a mapping problem:given a canvas image,first,compute a saliency map for it and a vector field for each sub-region of it.Second,using a divide-and-conquer strategy,generate a series of patch sets from the canvas image,where the salient map and the vector field are used to determine each patch’s size and direction respectively.Third,construct a Gestalt-based energy function to choose the most visually pleasing and orderly patch set as the final layout.Finally,using a semantic-color metric,map the picture set to the patch set to generate the final collage.Extensive experimental and user study results show that this method can generate visual pleasing collages.展开更多
This paper aims to investigate the changes in the virtual perception on the built heritage at the traditional core setttement of Kumbakonam Town at Tamitnadu and to analyze their implica- tions. Specifically, the majo...This paper aims to investigate the changes in the virtual perception on the built heritage at the traditional core setttement of Kumbakonam Town at Tamitnadu and to analyze their implica- tions. Specifically, the major objectives of the study are (1) to identify the architectural elements that manifest the built heritage of Kumbakonam Town and (2) to assess the contMbutions of these elements to the changes in the visuat perception of the town. To achieve these objectives, this study adopts an empirical model that analyzes the architectural elements of the buildings in the study area. Direct observations and documentations of 373 buildings are collected to analyze those etements that contribute to the changes in the visual perception on the built heritage of Kumbakonam Town. An ordinary regression model is used to examine the characteristics of the built heritage across the chariot processional route of the town. Several architectural elements, including pitasters, horizontal cornices, arched windows, and ornamental parapets, improve the image of the town. These empirical findings support the policy framework that enhances the visual perception of Kumbakonam Town.展开更多
With the rapid development and improvement of urban construction,the influence of green environmental exposure on mental health has attracted increasing attention,but the relationship between the psychological percept...With the rapid development and improvement of urban construction,the influence of green environmental exposure on mental health has attracted increasing attention,but the relationship between the psychological perception and the object index has been less explored.From the perspective of the individual and environmental perception,space optimization of cultural blocks was discussed to provide a reference for the development of other old streets.展开更多
To build robots that engage in intuitive communication with people by natural language, we are developing a new knowledge representation called conceptual network model. The conceptual network connects natural languag...To build robots that engage in intuitive communication with people by natural language, we are developing a new knowledge representation called conceptual network model. The conceptual network connects natural language concepts with visual perception including color perception, shape perception, size perception, and spatial perception. In the implementation of spatial perception, we present a computational model based on spatial template theory to interpret qualitative spatial expressions. Based on the conceptual network model, our mobile robot can understand user's instructions and recognize the object referred to by the user and perform appropriate action. Experimental results show our approach promising.展开更多
Visual depth(distance)perception is a fundamental aspect of environmental cognition,as it allows people to judge the spatial scale of their surroundings.However,estimating the depth of classical Chinese gardens is cha...Visual depth(distance)perception is a fundamental aspect of environmental cognition,as it allows people to judge the spatial scale of their surroundings.However,estimating the depth of classical Chinese gardens is challenging,especially from static viewpoints that frame the scenery.Previous studies have examined how the internal components of the scenery frame affect depth perception.Still,the role of the frame and its peripheral information as environmental background have been largely overlooked.This study investigates how depth perception at viewpoints is influenced by viewing position displacement,frame geometry,and environmental context.The authors created nine stimulus materials in a cave virtual reality environment(three image treatments×three positions).Seventy-one participants were asked to evaluate depth perception using the magnitude estimation and adjustment methods.Their eye movement behavior was also recorded using an eye-movement instrument(SensoMotoric Instruments(SMI)eye-tracking glasses,120 Hz).The results showed that participants could perceive spatial depth differences between viewing positions even when the internal viewpoint displacement was small;frame shape did not significantly affect depth perception and gaze behavior;and peripheral visual information of the frame enhanced depth perception significantly.Moreover,the form of the environmental background,especially the position of the scenery window,strongly guided the participants'gaze.These findings suggest that ambient visual information significantly impacts environmental experience,which landscape designers should consider.展开更多
With the acceleration of urban renewal,people’s demand for improving the overall urban landscape quality of the city is increasing.However,it is often found in project practice that some professionals have one-sided ...With the acceleration of urban renewal,people’s demand for improving the overall urban landscape quality of the city is increasing.However,it is often found in project practice that some professionals have one-sided understanding or misunderstanding of landscape,which directly affects the work effi ciency of landscape design and the landscape quality of design works.Based on the theory of visual perceptual dynamics,this paper explores the aesthetic basis of people’s viewing and evaluation of visual objects from the aspects of viewpoint state,visual direction state and visual impressions formed in the mind,and excavates the true connotation of landscape.The research results show that there is no socalled landscape without viewpoint.The landscape is a visual perception process in which people experience the landscape of object at the appropriate viewpoint,not just the beauty of structure and form of the object.People’s impression of landscape is mainly determined by whether the object’s viewpoint fi eld can make aesthetic subject see easily,whether there is any barrier in the direction of view,whether the size is appropriate in scale,and whether the visual impression gives people a sense of peace of mind and comfort.展开更多
文摘AIM:To explore the effect of visual perception learning software training(VPT)on binocular visual function reconstruction in children with intermittent exotropia after strabismus surgery.METHODS:Ninety children with intermittent exotropia admitted to our hospital from June 2018 to December 2018 were included,and randomly divided into VPT and control groups.Children in the control group received basic binocular vision training,while those in the VPT group received VPT after strabismus surgery.Tertiary visual function,visual perception function,Newcastle Control Score(NCS),and ocular position retraction rate were compared at 3 and 12mo after the surgery.RESULTS:At 3 and 12mo after the surgery,the proportion of simultaneous perception,binocular fusion version and binocular stereo vision in the VPT group was conspicuously higher than that in the control group(P<0.05).After the vision training,the binocular visual perception functions of children in both groups were significantly improved compared with that before training(P<0.05).Interestingly,the grating sharpness,texture perception and texture motion perception in the VPT group were dramatically better than control group(P<0.01).The NCS in the VPT group was significantly lower than that in the control group(P<0.05).The ocular position retraction rate in the VPT group was significantly lower than that in the control group at 12mo(8.89%vs 26.67%,P=0.03).CONCLUSION:VPT effectively promotes binocular visual function reconstruction in intermittent exotropia children after strabismus surgery and reduces the strabismus severity and ocular position retraction rate.
基金Natural Science Foundation of Jiangsu Province&Key Laboratory Foundation,grant number is BK20180579&6142206180204 respectively.
文摘The evaluation index of camouflage patterns is important in the field of military application.It is the goal that researchers have always pursued to make the computable evaluation indicators more in line with the human visual mechanism.In order to make the evaluation method more computationally intelligent,a Multi-Feature Camouflage Fused Index(MF-CFI)is proposed based on the comparison of grayscale,color and texture features between the target and the background.In order to verify the effectiveness of the proposed index,eye movement experiments are conducted to compare the proposed index with existing indexes including Universal Image Quality Index(UIQI),Camouflage Similarity Index(CSI)and Structural Similarity(SSIM).Twenty-four different simulated targets are designed in a grassland background,28 observers participate in the experiment and record the eye movement data during the observation process.The results show that the highest Pearson correlation coefficient is observed between MF-CFI and the eye movement data,both in the designed digital camouflage patterns and largespot camouflage patterns.Since MF-CFI is more in line with the detection law of camouflage targets in human visual perception,the proposed index can be used for the comparison and parameter optimization of camouflage design algorithms.
基金This work was financially supported by the National Key Research and Development Program of China(2019YFB2203400)the“111 Project”(B20030)+3 种基金the UESTC Shared Research Facilities of Electromagnetic Wave and Matter Interaction(Y0301901290100201)the Fundamental Research Funds for the Central Universities(ZYGX2019Z018)the National Natural Science Foundation of China(61974014)the Innovation Group Project of Sichuan Province(20CXTD0090).
文摘The human visual system,dependent on retinal cells,can be regarded as a complex combination of optical system and nervous system.Artificial retinal system could mimic the sensing and processing function of human eyes.Optically stimulated synaptic devices could serve as the building blocks for artificial retinas and subsequent information transmission system to brain.Herein,photonic synaptic transistors based on polycrystalline MoS_(2),which could simulate human visual perception and brain storage,are presented.Moreover,the photodetection range from visible light to near-infrared light of MoS_(2) multilayer could extend human eyes’vision limitation to near-infrared light.Additionally,the photonic synaptic transistor shows an ultrafast speed within 5μs and ultralow power consumption under optical stimuli about 40 aJ,several orders of magnitude lower than biological synapses(50 ms and 10 fJ).Furthermore,the backgate control could act as emotional modulation of the artificial brain to enhance or suppress memory function,i.e.the intensity of photoresponse.The proposed carrier trapping/detrapping as the main working mechanism is presented for the device.In addition,synaptic functionalities including short synaptic plasticity,long synaptic plasticity and paired-pulse facilitation could be successfully simulated based on the prepared device.Furthermore,the large difference between short synaptic plasticity and long synaptic plasticity reveals the better image pre-processing function of the prepared photonic synapses.The classical Pavlovian conditioning associated with the associative learning is successfully implemented as well.Therefore,the efficient and rich functionalities demonstrate the potential of the MoS_(2) synaptic device that integrates sensing-memory-preprocessing capabilities for realizing artificial neural networks with different emotions that mimic human retina and brain.
基金Supported by the Institute of Health CarlosⅢ(No.COV20/00539)。
文摘AIM:To compare the visual perception(color and chromatic-achromatic contrast vision)of a small cohort of COVID-19 patients at the time of infection and after 6mo with that of a healthy population matched for sex and age.METHODS:A total of 25 patients(9 females,16 males,mean age:54±10y)with COVID-19 hospitalized in the COVID-19 Unit of the University Clinical Hospital of Valladolid were recruited for this preliminary study.Visual perception,as determined by monocular measurement of contrast sensitivity function(CSF)and color vision was assessed in each patient using the Optopad test.The results obtained were then compared with those of a sample of 16 age-and sex-matched healthy controls(5 females,11 males,mean age:50±6y)in which the same measurement procedure was repeated.Statistically significant differences between groups were assessed using the Mann-Whitney U test.Measurements were repeated after a minimum follow-up period of 6mo and statistically significant differences between the two time points in each group were assessed using the Wilcoxon signed rank test.RESULTS:Discrimination thresholds(color and chromatic-achromatic contrast vision)and their corresponding sensitivity,calculated as the inverse of the discrimination threshold,were evaluated.Analysis of the data revealed higher contrast threshold results(i.e.,worse contrast sensitivity)in the COVID-19 group than in the control group for all spatial frequencies studied in the Optopad-CSF achromatic test and most of the spatial frequencies studied in the Optopad-CSF chromatic test for the red-green and blue-yellow mechanisms.In addition,color threshold results in the COVID-19 group were also significantly higher(i.e.,worse color sensitivity)for almost all color mechanisms studied in the Optopad-Color test.At 6mo,most of the differences found between the groups were maintained despite COVID-19 recovery.CONCLUSION:The present results provide preliminary evidence that visual perception may be impaired in COVID-19,even when the infection has passed.Although further research is needed to determine the precise causes of this finding,analysis of CSF and color vision could provide valuable information on the visual impact of COVID-19.
基金Supported by the National Natural Science Foundation of China (No.60472036, No.60431020, No.60402036)the Natural Science Foundation of Beijing (No.4042008)and Ph.D. Foundation of Ministry of Education (No.20040005015).
文摘A new scheme named personalized image retrieval technique based on visual perception is proposed in this letter, whose motive is to narrow the semantic gap by directly perceiving user's visual information. It uses visual attention model to segment image regions and eye-tracking technique to record fixations. Visual perception is obtained by analyzing the fixations in regions to measure gaze interests. Integrating visual perception into attention model is to detect the Regions Of Interest (ROIs), whose features are extracted and analyzed, then feedback interests to optimize the results and construct user profiles.
文摘Vehicle recognition system (VRS) plays a very important role in the field of intelligent transportation systems.A novel and intuitive method is proposed for vehicle location.The method we provide for vehicle location is based on human visual perception model technique. The perception color space HSI in this algorithm is adopted.Three color components of a color image and more potential edge patterns are integrated for solving the feature extraction problem.A fast and automatic threshold technique based on human visual perception model is also developed.The vertical edge projection and horizontal edge projection are adopted for locating left-right boundary of vehicle and top-bottom boundary of vehicle, respectively. Very promising experimental results are obtained using real-time vehicle image sequences, which have confirmed that this proposed location vehicle method is efficient and reliable, and its calculation speed meets the needs of the VRS.
文摘The gestalt principles are perception and what is visually communicated by objects. These principles describe the visual language within which we work. Understanding how a designer realizes things visually will help designers communicate better. Many designers use Gestalt principles to have a structure visual stimulus, so as to create interfaces that are easy for users to understand. Because the understanding of visual perception is a critical item in any designer’s toolkit. Islamic heritage buildings also contain many architectural values, especially the Sultan Hassan Mosque in Cairo. Through this paper, we will verify the use of gestalt principles of visual perception in the building design. Because the psychological organization of the design, achieves the understanding and awareness of the current relationships between the elements, depending on the extent of regularity, accuracy, and consistency of the design elements. This makes us recognize the esthetic values of Sultan Hassan’s building and mosque. The visual delight we get from architecture begins with how we perceive it. So, in order to understand how visual delight arises, we must first understand how we perceive and interpret visual stimuli. Especially since the principles of gestalt confirm that the human brain tends to make tricks and illusions. Therefore, we found the mosque designer had exploited this fact during the design process, to exclude any possibility of the resulting misunderstanding. The designer achieved the psychological preparation of the visitor by using some principles of visual perception of the Gestalt theory, such as growth, continuity, and hierarchy.
基金Supported by the National Natural Science Foundation of China(41901405,41725005,41531177)and the National Key Research and Development Program of China(2016YFF0103501).
文摘Background Three-dimensional(3D)building models with unambiguous roof plane geometry parameters,roof structure units,and linked topology provide essential data for many applications related to human activities in urban environments.The task of 3D reconstruction from point clouds is still in the development phase,especially the recognition and interpretation of roof topological structures.Methods This study proposes a novel visual perception-based approach to automatically decompose and reconstruct building point clouds into meaningful and simple parametric structures,while the associated mutual relationships between the roof plane geometry and roof structure units are expressed by a hierarchical topology tree.First,a roof plane extraction is performed by a multi-label graph cut energy optimization framework and a roof structure graph(RSG)model is then constructed to describe the roof topological geometry with common adjacency,symmetry,and convexity rules.Moreover,a progressive roof decomposition and refinement are performed,generating a hierarchical representation of the 3D roof structure models.Finally,a visual plane fitted residual or area constraint process is adopted to generate the RSG model with different levels of details.Results Two airborne laser scanning datasets with different point densities and roof styles were tested,and the performance evaluation metrics were obtained by International Society for Photogrammetry and Remote Sensing,achieving a correctness and accuracy of 97.7%and 0.29m,respectively.Conclusions The standardized assessment results demonstrate the effectiveness and robustness of the proposed approach,showing its ability to generate a variety of structural models,even with missing data.
文摘The process of human natural scene categorization consists of two correlated stages: visual perception and visual cognition of natural scenes.Inspired by this fact,we propose a biologically plausible approach for natural scene image classification.This approach consists of one visual perception model and two visual cognition models.The visual perception model,composed of two steps,is used to extract discriminative features from natural scene images.In the first step,we mimic the oriented and bandpass properties of human primary visual cortex by a special complex wavelets transform,which can decompose a natural scene image into a series of 2D spatial structure signals.In the second step,a hybrid statistical feature extraction method is used to generate gist features from those 2D spatial structure signals.Then we design a cognitive feedback model to realize adaptive optimization for the visual perception model.At last,we build a multiple semantics based cognition model to imitate human cognitive mode in rapid natural scene categorization.Experiments on natural scene datasets show that the proposed method achieves high efficiency and accuracy for natural scene classification.
文摘The advantage of electronic devices is to eliminate the subjectivity. Based on individual capabilities, visual perception has its disadvantages. The aim of this study was to find out the possible shade difference between the upper and lower teeth, visual perception versus spectrophotometry as well as proving that females are better in visual perception of colour matching. 82 subjects (dental students) of this study determined the color of each other’s natural teeth. The color of 1640 teeth (incisors, canines and premolars), was matched by visual perception and by Vita Easyshade spectrophotometer. The matching was focused on the middle third of buccal surfaces of the examined teeth. In this study, it was found that there was a difference of color between the teeth of the upper and lower jaw. The percentage of the accordance between two methods was 28.35% for all examined teeth. The color determination at central incisors showed the highest value of accordance 40.85%. Based on Vitapan 3D Mastershade, the most frequent color was 2M2.
基金supported by the National Natural Science Foundation of China(62306185)the Guangdong Basic and Applied Basic Research Foundation,China(2024A1515012065)the Shenzhen Science and Technology Program,China(JSGGKQTD 20221101115656029 and KJZD20230923113801004).
文摘In recent years,humanoid robots have gained significant attention due to their potential to revolutionize various industries,from healthcare to manufacturing.A key factor driving this transformation is the advancement of visual perception systems,which are crucial for making humanoid robots more intelligent and autonomous.Despite the progress,the full potential of vision-based technologies in humanoid robots has yet to be fully realized.This review aims to provide a comprehensive overview of recent advancements in visual perception applied to humanoid robots,specifically focusing on applications in state estimation and environmental interaction.By summarizing key developments and analyzing the challenges and opportunities in these areas,this paper seeks to inspire future research that can unlock new capabilities for humanoid robots,enabling them to better navigate complex environments,perform intricate tasks,and interact seamlessly with humans.
基金supported by the Natural Science Foundation of Shandong Province(ZR2021YQ32)the China Postdoctoral Science Foundation(2023M740472)+2 种基金the National Natural Science Foundation of China(62175162,62205214,and 61901222)the Taishan Scholars Program of Shandong Province(tsqn201909117)the Special Fund for Science and Technology Innovation Teams of Shanxi Province and Foundation of Shenzhen Science and Technology(20200814100534001).
文摘Photonic synapses combining photosensitivity and synaptic function can efficiently perceive and memorize visual information,making them crucial for the development of artificial vision systems.However,the development of high-performance photonic synapses with low power consumption and rapid optical erasing ability remains challenging.Here,we propose a photon-modulated charging/discharging mechanism for self-powered photonic synapses.The current hysteresis enables the devices based on CsPbBr3/solvent/carbon nitride multilayer architecture to emulate synaptic behaviors,such as excitatory postsynaptic currents,paired-pulse facilitation,and long/short-term memory.Intriguingly,the unique radiation direction-dependent photocurrent endows the photonic synapses with the capability of optical writing and rapid optical erasing.Moreover,the photonic synapses exhibit exceptional performance in contrast enhancement and noise reduction owing to the notable synaptic plasticity.In simulations based on artificial neural network(ANN)algorithms,the pre-processing by our photonic synapses improves the recognition rate of handwritten digit from 11.4%(200 training epochs)to 85%(~60 training epochs).Furthermore,due to the excellent feature extraction and memory capability,an array based on the photonic synapses can imitate facial recognition of human retina without the assistance of ANN.
基金supported by the Science and Technology Innovation 2030-Brain Science and Brain-Inspired Intelligence Project (2021ZD0200200)the National Natural Science Foundation of China (62327805,82151307,and 32271085)the Beijing Natural Science Foundation (5244049).
文摘When presented with visual stimuli of face images,the ventral stream visual cortex of the human brain exhibits face-specific activity that is modulated by the physical properties of the input images.However,it is still unclear whether this activity relates to conscious face perception.We explored this issue by using the human intracranial electroencephalography technique.Our results showed that face-specific activity in the ventral stream visual cortex was significantly higher when the subjects subjectively saw faces than when they did not,even when face stimuli were presented in both conditions.In addition,the face-specific neural activity exhibited a more reliable neural response and increased posterior-anterior direction information transfer in the“seen”condition than the“unseen”condition.Furthermore,the face-specific neural activity was significantly correlated with performance.These findings support the view that face-specific activity in the ventral stream visual cortex is linked to conscious face perception.
基金This work was supported by the Scientific Research Foun-dation of Graduate School of Southeast University(No.YBJJ1601).
文摘As an essential part of the urban landscape,linear urban landscape(LUL)is the interaction between humans and nature,which is closely associated with daily life and brings multiple characteristics to visual perception.Current studies focus on complex models that describe visual perception using static viewpoints,but lossing the continuous and dynamic features of visual perception.This paper provides a general framework that can quantify dynamic visual perception based on urban morphology and improves accuracy in the descriptions of LUL linear spatial characteristics.Based on Beijing-Hangzhou Grand Canal(Hangzhou urban section),the proposed framework combines the indicators of multiple dimensions to quantify dynamic visual perception and emphasizes the continuity of LUL.To represent the dynamic visual perception and the spatial pattern characteristics of LUL,different evaluation criteria of indicators are set according to landscape scales.To minimize subjectivity and uncertainty caused by subjective cognition and fulfill the landscape pattern under different urban development policies,we set up distinct scenario preference patterns.With appropriate fine-tuning of scenario preference patterns and setting of movement types,the proposed method can be adapted to other LUL projects and aspires to provide a general methodology and scientific guidance for urban planning and landscape management.
基金supported by the National Natural Science Foundation of China(No.61772440)the Aeronautical Science Foundation of China(No.20165168007)Science and Technology of Electrooptic Control Laboratory.
文摘A collage is a composite artwork made from the spatial layout of multiple pictures on a canvas,collected from the Internet or user photographs.Collages,usually made by skilled artists,involve a complex manual process,especially when searching for component pictures and adjusting their spatial layout to meet artistic requirements.In this paper,we present a visual perception driven method for automatically synthesizing visually pleasing collages.Unlike previous works,we focus on how to design a collage layout which not only provides easy access to the theme of the overall image,but also conforms to human visual perception.To achieve this goal,we formulate the generation of collages as a mapping problem:given a canvas image,first,compute a saliency map for it and a vector field for each sub-region of it.Second,using a divide-and-conquer strategy,generate a series of patch sets from the canvas image,where the salient map and the vector field are used to determine each patch’s size and direction respectively.Third,construct a Gestalt-based energy function to choose the most visually pleasing and orderly patch set as the final layout.Finally,using a semantic-color metric,map the picture set to the patch set to generate the final collage.Extensive experimental and user study results show that this method can generate visual pleasing collages.
文摘This paper aims to investigate the changes in the virtual perception on the built heritage at the traditional core setttement of Kumbakonam Town at Tamitnadu and to analyze their implica- tions. Specifically, the major objectives of the study are (1) to identify the architectural elements that manifest the built heritage of Kumbakonam Town and (2) to assess the contMbutions of these elements to the changes in the visuat perception of the town. To achieve these objectives, this study adopts an empirical model that analyzes the architectural elements of the buildings in the study area. Direct observations and documentations of 373 buildings are collected to analyze those etements that contribute to the changes in the visual perception on the built heritage of Kumbakonam Town. An ordinary regression model is used to examine the characteristics of the built heritage across the chariot processional route of the town. Several architectural elements, including pitasters, horizontal cornices, arched windows, and ornamental parapets, improve the image of the town. These empirical findings support the policy framework that enhances the visual perception of Kumbakonam Town.
文摘With the rapid development and improvement of urban construction,the influence of green environmental exposure on mental health has attracted increasing attention,but the relationship between the psychological perception and the object index has been less explored.From the perspective of the individual and environmental perception,space optimization of cultural blocks was discussed to provide a reference for the development of other old streets.
文摘To build robots that engage in intuitive communication with people by natural language, we are developing a new knowledge representation called conceptual network model. The conceptual network connects natural language concepts with visual perception including color perception, shape perception, size perception, and spatial perception. In the implementation of spatial perception, we present a computational model based on spatial template theory to interpret qualitative spatial expressions. Based on the conceptual network model, our mobile robot can understand user's instructions and recognize the object referred to by the user and perform appropriate action. Experimental results show our approach promising.
基金supported by the National Natural Science Foundation of China(Grant No.52408006,52038007)。
文摘Visual depth(distance)perception is a fundamental aspect of environmental cognition,as it allows people to judge the spatial scale of their surroundings.However,estimating the depth of classical Chinese gardens is challenging,especially from static viewpoints that frame the scenery.Previous studies have examined how the internal components of the scenery frame affect depth perception.Still,the role of the frame and its peripheral information as environmental background have been largely overlooked.This study investigates how depth perception at viewpoints is influenced by viewing position displacement,frame geometry,and environmental context.The authors created nine stimulus materials in a cave virtual reality environment(three image treatments×three positions).Seventy-one participants were asked to evaluate depth perception using the magnitude estimation and adjustment methods.Their eye movement behavior was also recorded using an eye-movement instrument(SensoMotoric Instruments(SMI)eye-tracking glasses,120 Hz).The results showed that participants could perceive spatial depth differences between viewing positions even when the internal viewpoint displacement was small;frame shape did not significantly affect depth perception and gaze behavior;and peripheral visual information of the frame enhanced depth perception significantly.Moreover,the form of the environmental background,especially the position of the scenery window,strongly guided the participants'gaze.These findings suggest that ambient visual information significantly impacts environmental experience,which landscape designers should consider.
基金Sponsored by National Natural Science Foundation of China(51608237)Social Science Planning Project of Jiangxi Province(15YS39)Natural Science Foundation of Jiangxi Province(20161BAB216120).
文摘With the acceleration of urban renewal,people’s demand for improving the overall urban landscape quality of the city is increasing.However,it is often found in project practice that some professionals have one-sided understanding or misunderstanding of landscape,which directly affects the work effi ciency of landscape design and the landscape quality of design works.Based on the theory of visual perceptual dynamics,this paper explores the aesthetic basis of people’s viewing and evaluation of visual objects from the aspects of viewpoint state,visual direction state and visual impressions formed in the mind,and excavates the true connotation of landscape.The research results show that there is no socalled landscape without viewpoint.The landscape is a visual perception process in which people experience the landscape of object at the appropriate viewpoint,not just the beauty of structure and form of the object.People’s impression of landscape is mainly determined by whether the object’s viewpoint fi eld can make aesthetic subject see easily,whether there is any barrier in the direction of view,whether the size is appropriate in scale,and whether the visual impression gives people a sense of peace of mind and comfort.