Funding: Supported by the National Natural Science Foundation of China, No. 82271115 (to MY).
Abstract: Synaptic plasticity is essential for maintaining neuronal function in the central nervous system and serves as a critical indicator of the effects of neurodegenerative disease. Glaucoma directly impairs retinal ganglion cells and their axons, leading to axonal transport dysfunction and subsequently causing secondary damage to the anterior or posterior ends of the visual system. Accordingly, recent evidence indicates that glaucoma is a degenerative disease of the central nervous system that causes damage throughout the visual pathway. However, the effects of glaucoma on synaptic plasticity in the primary visual cortex remain unclear. In this study, we established a mouse model of unilateral chronic ocular hypertension by injecting magnetic microbeads into the anterior chamber of one eye. We found that, after 4 weeks of chronic ocular hypertension, the neuronal somas were smaller in the superior colliculus and lateral geniculate body regions of the brain contralateral to the affected eye. This was accompanied by glial cell activation and increased expression of inflammatory factors. After 8 weeks of ocular hypertension, we observed reductions in the numbers of excitatory and inhibitory synapses and dendritic spines, together with activation of glial cells, in the primary visual cortex contralateral to the affected eye. These findings suggest that glaucoma not only directly damages the retina but also induces alterations in synapses and dendritic spines in the primary visual cortex, providing new insights into the pathogenesis of glaucoma.
Abstract: AIM: To compare the visual outcomes between bilateral implantation of Tecnis ZXR00 extended depth-of-focus (EDOF) intraocular lenses (IOLs) and mixed implantation of Tecnis ZXR00 (EDOF) with Tecnis ZMB00 (bifocal) IOLs. METHODS: This postoperative cross-sectional study enrolled patients who underwent phacoemulsification combined with IOL implantation. Patients were divided into two groups: the bilateral ZXR00 group (ZXR00-only group) and the mixed IOL group (ZXR00+ZMB00 group). Primary outcome measures included uncorrected and corrected distance visual acuity (UDVA, CDVA), uncorrected and distance-corrected near visual acuity (UNVA, DCNVA), uncorrected and distance-corrected intermediate visual acuity (UIVA, DCIVA), and defocus curves. Secondary outcome measures were visual quality, spectacle independence, patient satisfaction, photic phenomena, and stereopsis. RESULTS: A total of 47 patients (94 eyes) were included, with 26 patients (11 males, 15 females) in the ZXR00-only group (mean age: 62.73±7.24 y) and 21 patients (7 males, 14 females) in the mixed group (mean age: 65.71±9.16 y). There was no statistically significant difference in age between the two groups (P=0.218). The mixed group showed significantly better binocular DCNVA than the ZXR00-only group (P=0.002). Defocus curve analysis revealed that the mixed group exhibited superior performance at −2.5 to −4.0 D but inferior performance at −0.5 and −1.5 D. Near stereoacuity was significantly poorer in the mixed group (Randot: 5.589±0.744 vs 6.240±0.394 ln arcsec; Contour: 4.966±0.973 vs 5.740±0.833 ln arcsec; both P<0.01). Both groups achieved high levels of spectacle independence and patient satisfaction, with no significant differences in photic phenomena or questionnaire scores. CONCLUSION: Mixed implantation of EDOF and bifocal IOLs improves near visual acuity but may compromise near stereopsis. This approach provides a viable option for patients prioritizing near vision; however, caution is recommended for individuals requiring fine stereoscopic vision for daily or professional tasks.
Abstract: AIM: To evaluate the clinical characteristics and risk factors associated with visual prognosis in patients with open globe injuries (OGIs) treated at Vietnam National Eye Hospital. METHODS: A prospective observational study included patients with OGIs treated between June 2023 and June 2024. Data on demographics, injury features, and clinical findings were extracted from medical records. Poor visual outcome was defined as final best-corrected visual acuity (BCVA) worse than 20/400 or no light perception. Multivariable logistic regression was performed to identify independent risk factors. RESULTS: Among 509 patients (636 eyes), the mean age was 35.13 y (range 20–51 y), and 67.6% were male. After treatment, the proportion of eyes achieving ≥20/40 increased from 12.6% to 42.1%, while no light perception decreased from 29.1% to 9.4%. Independent predictors of poor visual outcomes included delayed admission [>4 h, odds ratio (OR)=3.33, 95% confidence interval (CI): 1.76–6.33, P<0.001], Zone III injury (OR=5.90, 95%CI: 2.85–12.24, P<0.001), wound length >10 mm (OR=2.59, 95%CI: 1.60–4.18, P<0.001), relative afferent pupillary defect (RAPD, OR=1.65, 95%CI: 1.03–2.64, P=0.039), endophthalmitis (OR=1.75, 95%CI: 1.01–3.03, P=0.047), retinal detachment (OR=3.32, 95%CI: 2.02–5.45, P<0.001), and eyelid lacerations associated with OGIs (OR=1.94, 95%CI: 1.13–3.33, P=0.016). Vitreous hemorrhage (OR=0.44, 95%CI: 0.22–0.89, P=0.023) was associated with better outcomes, and female gender appeared protective. CONCLUSION: Poor visual outcomes remain common after OGIs, despite improved visual acuity in many cases. Several clinical and injury-related factors are strongly associated with prognosis. Early recognition of these predictors can support risk stratification and improve trauma care in similar settings.
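As a reading aid for the odds ratios above, the following minimal sketch shows how a reported OR and its 95% CI relate to a log-odds coefficient, its standard error, and a two-sided P value. It assumes the interval is a Wald-type interval symmetric on the log scale, which the abstract does not state; `wald_summary` is a hypothetical helper name, and the numbers plugged in are the Zone III injury figures quoted in the abstract.

```python
import math


def wald_summary(odds_ratio, ci_low, ci_high, z_crit=1.96):
    """Recover (beta, se, z, p) from an odds ratio and its 95% CI.

    Assumption: the CI is exp(beta +/- z_crit * se), i.e. a Wald interval.
    """
    beta = math.log(odds_ratio)  # log-odds coefficient
    # Width of the CI on the log scale equals 2 * z_crit * se.
    se = (math.log(ci_high) - math.log(ci_low)) / (2 * z_crit)
    z = beta / se  # Wald z statistic
    # Two-sided P value under a standard normal reference distribution.
    p = math.erfc(abs(z) / math.sqrt(2))
    return beta, se, z, p


# Zone III injury as reported above: OR=5.90, 95%CI 2.85-12.24.
beta, se, z, p = wald_summary(5.90, 2.85, 12.24)
print(f"beta={beta:.3f}, se={se:.3f}, z={z:.2f}, p={p:.2g}")
```

Under that assumption the midpoint of the log-scale interval should sit close to ln(OR), which is a quick consistency check on any reported OR and CI pair.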
Abstract: The intersection of visual impairment and mental health has profound effects on quality of life and warrants attention from healthcare providers, educators, and policymakers. Globally, 20 million children under the age of 14 are affected, and older adults also experience significant psychological impacts, including depression, anxiety, and cognitive impairment. The implications of vision-related challenges extend far beyond mere sight. Depression and anxiety, exacerbated by social isolation and reduced physical activity, underscore the need for comprehensive interventions that address both medical and psychosocial dimensions. By recognizing the profound impact of ocular morbidities such as strabismus, myopia, glaucoma, and age-related macular degeneration on mental health, and by investing in effective treatments and inclusive practices, society can pave the way for a healthier, more equitable future for affected individuals. There is evidence that myopic children experience a higher prevalence of depressive symptoms than their peers without myopia, and that interventions such as the correction of strabismus can enhance psychological outcomes, demonstrating the value of an integrated management approach.
Funding: Supported by the Faculty of Medicine, Prince of Songkla University. Wainipitapong S has received grants from the Faculty of Medicine, Prince of Songkla University.
Abstract: AIM: To investigate the clinical characteristics and treatment outcomes, including visual function and overall survival (OS), of patients with ocular adnexal diffuse large B-cell lymphoma (OA-DLBCL). METHODS: This retrospective cohort study enrolled 29 patients diagnosed with OA-DLBCL based on histopathological biopsy between 2006 and 2023. Patients were stratified into two subgroups: primary OA-DLBCL (no prior history of lymphoma) and secondary OA-DLBCL (history of DLBCL at non-ocular adnexal sites). OS was defined as the time interval from OA-DLBCL diagnosis to death from any cause. Survival analysis was performed using the Kaplan–Meier method, and prognostic factors affecting OS were identified using multivariate Cox proportional hazards regression with a stepwise selection approach. RESULTS: The cohort included 24 patients with primary OA-DLBCL (13 males, 11 females; mean age: 61.36±18.29 y) and 5 patients with secondary OA-DLBCL (2 males, 3 females; mean age: 50.94±18.17 y). Among the primary OA-DLBCL subgroup, 12 patients (50%) presented with advanced disease (Ann Arbor stage IIIE–IV), and 16 patients (66%) were classified as T4 disease according to the tumor-node-metastasis (TNM) staging system. The mean final visual acuity was 1.72±1.10 in the primary group and 0.90±1.18 in the secondary group. The 5-year OS rate for the entire cohort was 27.7%. Multivariate analysis identified five factors significantly associated with poor survival outcomes: epiphora [adjusted hazard ratio (aHR), 36.95], atherosclerotic cardiovascular disease (aHR, 10.08), human immunodeficiency virus (HIV) infection (aHR, 12.47), M1 stage (aHR, 6.99), and secondary OA-DLBCL (aHR, 6.03; all P<0.05). The median OS was 1.68 y for primary OA-DLBCL and 1.12 y for secondary OA-DLBCL. CONCLUSION: A substantial proportion of patients with primary OA-DLBCL present with advanced-stage disease at diagnosis. Epiphora, atherosclerotic cardiovascular disease, HIV infection, M1 stage, and secondary OA-DLBCL are independent prognostic factors for poor survival outcomes. These findings emphasize the urgent need for optimized therapeutic strategies and early screening protocols to improve the management of OA-DLBCL, particularly in developing countries.
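The Kaplan–Meier method named in the abstract above can be illustrated with a minimal sketch of the product-limit estimator. The follow-up times and event flags below are invented toy values, not data from the study, and `kaplan_meier` is a hypothetical helper name introduced only for this illustration.

```python
def kaplan_meier(times, events):
    """Return [(t, S(t))] at each distinct observed event time.

    times  -- follow-up time per patient (for example, in years)
    events -- 1 if death was observed at that time, 0 if censored
    """
    # Distinct times at which at least one death occurred, in ascending order.
    event_times = sorted({t for t, e in zip(times, events) if e == 1})
    survival = 1.0
    curve = []
    for t in event_times:
        # Patients still under observation just before time t.
        at_risk = sum(1 for u in times if u >= t)
        # Deaths observed exactly at time t.
        deaths = sum(1 for u, e in zip(times, events) if u == t and e == 1)
        # Product-limit update: multiply by the conditional survival at t.
        survival *= 1.0 - deaths / at_risk
        curve.append((t, survival))
    return curve


# Toy data (hypothetical, NOT from the study): six patients.
toy_times = [1.0, 2.0, 2.0, 3.0, 4.0, 5.0]
toy_events = [1, 1, 0, 1, 0, 1]
km_curve = kaplan_meier(toy_times, toy_events)
for t, s in km_curve:
    print(f"t={t:.1f}  S(t)={s:.3f}")
```

Censored patients (event flag 0) still count in the at-risk set up to their last follow-up, which is what distinguishes this estimator from a simple proportion of survivors.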
Abstract: Fig. 1. The GenomeSyn tool for visualizing genome synteny and characterizing structural variations. A: The first type of synteny visualization map shows detailed information for two or three genomes and can display structural variations and other annotation information. B: The second type of visualization map is simple and shows only the synteny relationship between the chromosomes of two or three genomes. C: Multiplatform general GenomeSyn submission page, applicable to Windows, Mac and web platforms; other analysis files can be entered in the "other" option. The publisher would like to apologise for any inconvenience caused.
Funding: Supported by the National Natural Science Foundation of China (Grant No. 62033007) and the Major Fundamental Research Program of Shandong Province (Grant No. ZR2023ZD37).
Abstract: Siamese tracking algorithms usually take convolutional neural networks (CNNs) as feature extractors owing to their capability of extracting deep discriminative features. However, the convolution kernels in CNNs have limited receptive fields, making it difficult to capture global feature dependencies, which are important for object detection, especially when the target undergoes large-scale variations or movement. In view of this, we develop a novel network called effective convolution mixed Transformer Siamese network (SiamCMT) for visual tracking, which integrates CNN-based and Transformer-based architectures to capture both local information and long-range dependencies. Specifically, we design a Transformer-based module named lightweight multi-head attention (LWMHA), which can be flexibly embedded into stage-wise CNNs and improves the network's representation ability. Additionally, we introduce a stage-wise feature aggregation mechanism that integrates features learned from multiple stages. By leveraging both location and semantic information, this mechanism helps SiamCMT to better locate the target. Moreover, to distinguish the contributions of different channels, a channel-wise attention mechanism is introduced to enhance the important channels and suppress the others. Extensive experiments on seven challenging benchmarks, i.e., OTB2015, UAV123, GOT10K, LaSOT, DTB70, UAVTrack112_L, and VOT2018, demonstrate the effectiveness of the proposed algorithm. In particular, the proposed method outperforms the baseline by 3.5% and 3.1% in terms of precision and success rates, respectively, with a real-time speed of 59.77 FPS on UAV123.
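The abstract does not describe how its channel-wise attention is built, so the following is only a generic illustration of the idea of enhancing some channels and suppressing others. It sketches squeeze-and-excitation-style gating in plain Python; the function name `channel_attention`, the optional `weights` matrix standing in for a learned layer, and the toy feature map are all assumptions, not SiamCMT's actual design.

```python
import math


def channel_attention(feature_map, weights=None):
    """Gate each channel of a C x H x W nested-list feature map.

    weights -- optional C x C matrix standing in for a learned layer
               (hypothetical; identity is used when omitted).
    Returns (gates, scaled_feature_map).
    """
    num_channels = len(feature_map)
    # Squeeze: global average pooling gives one scalar per channel.
    pooled = [
        sum(sum(row) for row in ch) / (len(ch) * len(ch[0]))
        for ch in feature_map
    ]
    if weights is None:
        weights = [
            [1.0 if i == j else 0.0 for j in range(num_channels)]
            for i in range(num_channels)
        ]
    # Excite: linear mix of pooled values, then a sigmoid gate in (0, 1).
    logits = [
        sum(weights[i][j] * pooled[j] for j in range(num_channels))
        for i in range(num_channels)
    ]
    gates = [1.0 / (1.0 + math.exp(-z)) for z in logits]
    # Rescale every value in a channel by that channel's gate.
    scaled = [
        [[g * v for v in row] for row in ch]
        for g, ch in zip(gates, feature_map)
    ]
    return gates, scaled


# Toy input: two 2x2 channels, one strongly positive and one strongly negative.
toy_map = [
    [[2.0, 2.0], [2.0, 2.0]],
    [[-2.0, -2.0], [-2.0, -2.0]],
]
gates, scaled = channel_attention(toy_map)
print(gates)
```

In a trained network the gate would come from learned parameters rather than an identity mapping; the sketch only shows the pool, gate, and rescale pattern.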
Funding: Funded by the Key Research and Development Program of Hubei Province, China (Grant No. 2023BEB024), the Young and Middle-aged Scientific and Technological Innovation Team Plan in Higher Education Institutions in Hubei Province, China (Grant No. T2023007), and the key projects of the Hubei Provincial Department of Education (No. D20161403).
Abstract: With the rapid development of intelligent video surveillance technology, pedestrian re-identification has become increasingly important in multi-camera surveillance systems. This technology plays a critical role in enhancing public safety. However, traditional methods typically process images and text separately, applying upstream models directly to downstream tasks. This approach significantly increases the complexity of model training and computational costs. Furthermore, the common class imbalance in existing training datasets limits model performance improvement. To address these challenges, we propose an innovative framework named Person Re-ID Network Based on Visual Prompt Technology and Multi-Instance Negative Pooling (VPM-Net). First, we incorporate the Contrastive Language-Image Pre-training (CLIP) pre-trained model to accurately map visual and textual features into a unified embedding space, effectively mitigating inconsistencies in data distribution and the training process. To enhance model adaptability and generalization, we introduce an efficient and task-specific Visual Prompt Tuning (VPT) technique, which improves the model's relevance to specific tasks. Additionally, we design two key modules: the Knowledge-Aware Network (KAN) and the Multi-Instance Negative Pooling (MINP) module. The KAN module significantly enhances the model's understanding of complex scenarios through deep contextual semantic modeling. The MINP module handles samples, effectively improving the model's ability to distinguish fine-grained features. The experimental outcomes across diverse datasets underscore the remarkable performance of VPM-Net. These results demonstrate the unique advantages and robust reliability of VPM-Net in fine-grained retrieval tasks.
Funding: Supported by the Natural Science Foundation of Xinjiang Uygur Autonomous Region under grant number 2022D01B186.
Abstract: Visual Place Recognition (VPR) technology aims to use visual information to judge the location of agents, and it plays an irreplaceable role in tasks such as loop closure detection and relocation. Previous VPR algorithms emphasize the extraction and integration of general image features while ignoring the mining of salient features that play a key role in discrimination for VPR tasks. To this end, this paper proposes a Domain-invariant Information Extraction and Optimization Network (DIEONet) for VPR. The core of the algorithm is a newly designed Domain-invariant Information Mining Module (DIMM) and a Multi-sample Joint Triplet Loss (MJT Loss). Specifically, DIMM incorporates the interdependence between different spatial regions of the feature map in the cascaded convolutional unit group, which enhances the model's attention to the domain-invariant static object class. MJT Loss introduces the “joint processing of multiple samples” mechanism into the original triplet loss and adds a new distance constraint term for “positive and negative” samples, so that the model can avoid falling into a local optimum during training. We demonstrate the effectiveness of our algorithm by conducting extensive experiments on several authoritative benchmarks. In particular, the proposed method achieves the best performance on the TokyoTM dataset with a Recall@1 metric of 92.89%.
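For readers unfamiliar with the "original triplet loss" that MJT Loss extends, a minimal sketch of the standard margin-based form follows. It does not implement MJT Loss itself, whose multi-sample mechanism and extra distance constraint are not specified in the abstract; the function names, the margin value, and the toy descriptors are assumptions chosen for illustration.

```python
import math


def euclidean(u, v):
    """Euclidean distance between two equal-length descriptor vectors."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(u, v)))


def triplet_loss(anchor, positive, negative, margin=0.1):
    """Standard triplet loss: max(0, d(a, p) - d(a, n) + margin).

    The loss is zero once the negative is farther from the anchor
    than the positive by at least the margin.
    """
    return max(
        0.0,
        euclidean(anchor, positive) - euclidean(anchor, negative) + margin,
    )


# Toy 2-D descriptors (hypothetical values).
anchor = [0.0, 0.0]
positive = [3.0, 4.0]        # distance 5 from the anchor
easy_negative = [6.0, 8.0]   # distance 10: already well separated
hard_negative = [0.0, 1.0]   # distance 1: closer than the positive

easy_loss = triplet_loss(anchor, positive, easy_negative)
hard_loss = triplet_loss(anchor, positive, hard_negative)
print(easy_loss, hard_loss)
```

Only the hard negative produces a non-zero loss, which is why triplet-based training depends heavily on which samples are drawn; the abstract's joint processing of multiple samples targets that sensitivity.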
Funding: Supported by the National Natural Science Foundation of China (Grant Nos. 61773091 and 62476045), the LiaoNing Revitalization Talents Program (Grant No. XLYC1807106), and the Program for the Outstanding Innovative Teams of Higher Learning Institutions of Liaoning (Grant No. LR2016070).
Abstract: Complex network modeling characterizes system relationships and structures, while network visualization enables intuitive analysis and interpretation of these patterns. However, existing network visualization tools exhibit significant limitations in representing attributes of complex networks at various scales, particularly failing to provide advanced visual representations of specific nodes and edges, community affiliation attribution, and global scalability. These limitations substantially impede the intuitive analysis and interpretation of complex network patterns through visual representation. To address these limitations, we propose SFFSlib, a multi-scale network visualization framework incorporating novel methods to highlight attribute representation in diverse network scenarios and optimize structural feature visualization. Notably, we have enhanced the visualization of pivotal details at different scales across diverse network scenarios. The visualization algorithms proposed within SFFSlib were applied to real-world datasets and benchmarked against conventional layout algorithms. The experimental results reveal that SFFSlib significantly enhances the clarity of visualizations across different scales, offering a practical solution for advancing network attribute representation and improving overall visualization quality.