期刊文献+
共找到504篇文章
< 1 2 26 >
每页显示 20 50 100
Corner-Based Image Alignment using Pyramid Structure with Gradient Vector Similarity
1
作者 Chin-Sheng Chen Kang-Yi Peng +1 位作者 Chien-Liang Huang Chun-Wei Yeh 《Journal of Signal and Information Processing》 2013年第3期114-119,共6页
This paper presents a corner-based image alignment algorithm based on the procedures of corner-based template matching and geometric parameter estimation. This algorithm consists of two stages: 1) training phase, and ... This paper presents a corner-based image alignment algorithm based on the procedures of corner-based template matching and geometric parameter estimation. This algorithm consists of two stages: 1) training phase, and 2) matching phase. In the training phase, a corner detection algorithm is used to extract the corners. These corners are then used to build the pyramid images. In the matching phase, the corners are obtained using the same corner detection algorithm. The similarity measure is then determined by the differences of gradient vector between the corners obtained in the template image and the inspection image, respectively. A parabolic function is further applied to evaluate the geometric relationship between the template and the inspection images. Results show that the corner-based template matching outperforms the original edge-based template matching in efficiency, and both of them are robust against non-liner light changes. The accuracy and precision of the corner-based image alignment are competitive to that of edge-based image alignment under the same environment. In practice, the proposed algorithm demonstrates its precision, efficiency and robustness in image alignment for real world applications. 展开更多
关键词 corner-based image alignment CORNER Detection Edge-Based TEMPLATE Matching Gradient Vector
在线阅读 下载PDF
Multi-level distribution alignment-based domain adaptation for segmentation of 3D neuronal soma images
2
作者 Li Ma Xuantai Xu Xiaoquan Yang 《Journal of Innovative Optical Health Sciences》 2025年第6期69-85,共17页
Deep learning networks are increasingly exploited in the field of neuronal soma segmentation.However,annotating dataset is also an expensive and time-consuming task.Unsupervised domain adaptation is an effective metho... Deep learning networks are increasingly exploited in the field of neuronal soma segmentation.However,annotating dataset is also an expensive and time-consuming task.Unsupervised domain adaptation is an effective method to mitigate the problem,which is able to learn an adaptive segmentation model by transferring knowledge from a rich-labeled source domain.In this paper,we propose a multi-level distribution alignment-based unsupervised domain adaptation network(MDA-Net)for segmentation of 3D neuronal soma images.Distribution alignment is performed in both feature space and output space.In the feature space,features from different scales are adaptively fused to enhance the feature extraction capability for small target somata and con-strained to be domain invariant by adversarial adaptation strategy.In the output space,local discrepancy maps that can reveal the spatial structures of somata are constructed on the predicted segmentation results.Then thedistribution alignment is performed on the local discrepancies maps across domains to obtain a superior discrepancy map in the target domain,achieving refined segmentation performance of neuronal somata.Additionally,after a period of distribution align-ment procedure,a portion of target samples with high confident pseudo-labels are selected as training data,which assist in learning a more adaptive segmentation network.We verified the superiority of the proposed algorithm by comparing several domain adaptation networks on two 3D mouse brain neuronal somata datasets and one macaque brain neuronal soma dataset. 展开更多
关键词 Unsupervised domain adaptation multi-level distribution alignment pseudo-labels 3D neuronal soma images
原文传递
Multi-domain abdomen image alignment based on multi-scale diffeomorphic jointed network
3
作者 LU Zhengwei WANG Yong +3 位作者 GUAN Qiu CHEN Yizhou LIU Dongchun XU Xinli 《Optoelectronics Letters》 EI 2022年第10期628-634,共7页
Recently, the generative adversarial network(GAN) has been extensively applied to the cross-modality conversion of medical images and has shown outstanding performance than other image conversion algorithms. Hence, we... Recently, the generative adversarial network(GAN) has been extensively applied to the cross-modality conversion of medical images and has shown outstanding performance than other image conversion algorithms. Hence, we propose a novel GAN-based multi-domain registration method named multiscale diffeomorphic jointed network of registration and synthesis(MDJRS-Net). The deviation of the generator of the GAN-based approach affects the alignment phase, so a joint training strategy is introduced to improve the performance of the generator, which feedbacks the structural loss contained in the deformation field. Meanwhile, the nature of diffeomorphism can enable the network to generate deformation fields with more anatomical properties. The average dice score(Dice) is improved by 1.95% for the computer tomography venous(CTV) to magnetic resonance imaging(MRI) registration task and by 1.92% for the CTV to computer tomography plain(CTP) task compared with the other methods. 展开更多
关键词 NETWORK alignment image
原文传递
A Two-Stage Algorithm of High Resolution Image Alignment for Mobile Applications
4
作者 Ren-You Huang Lan-Rong Dung Tang-Suan Hong 《Journal of Computer and Communications》 2016年第4期36-51,共16页
Global motion estimation (GME) algorithms are widely applied to computer vision and video processing. In the previous works, the image resolutions are usually low for the real-time requirement (e.g. video stabilizatio... Global motion estimation (GME) algorithms are widely applied to computer vision and video processing. In the previous works, the image resolutions are usually low for the real-time requirement (e.g. video stabilization). However, in some mobile devices applications (e.g. image sequence panoramic stitching), the high resolution is necessary to obtain satisfactory quality of panoramic image. However, the computational cost will become too expensive to be suitable for the low power consumption requirement of mobile device. The full search algorithm can obtain the global minimum with extremely computational cost, while the typical fast algorithms may suffer from the local minimum problem. This paper proposed a fast algorithm to deal with 2560 × 1920 high-resolution (HR) image sequences. The proposed method estimates the motion vector by a two-level coarse-to-fine scheme which only exploits sparse reference blocks (25 blocks in this paper) in each level to determine the global motion vector, thus the computational costs are significantly decreased. In order to increase the effective search range and robustness, the predictive motion vector (PMV) technique is used in this work. By the comparisons of computational complexity, the proposed algorithm costs less addition operations than the typical Three-Step Search algorithm (TSS) for estimating the global motion of the HR images without the local minimum problem. The quantitative evaluations show that our method is comparable to the full search algorithm (FSA) which is considered to be the golden baseline. 展开更多
关键词 Global Motion Estimation Block Matching High Resolution image alignment Mobile Applications
在线阅读 下载PDF
UniTrans:Unified Parameter-Efficient Transfer Learning and Multimodal Alignment for Large Multimodal Foundation Model
5
作者 Jiakang Sun Ke Chen +3 位作者 Xinyang He Xu Liu Ke Li Cheng Peng 《Computers, Materials & Continua》 2025年第4期219-238,共20页
With the advancements in parameter-efficient transfer learning techniques,it has become feasible to leverage large pre-trained language models for downstream tasks under low-cost and low-resource conditions.However,ap... With the advancements in parameter-efficient transfer learning techniques,it has become feasible to leverage large pre-trained language models for downstream tasks under low-cost and low-resource conditions.However,applying this technique to multimodal knowledge transfer introduces a significant challenge:ensuring alignment across modalities while minimizing the number of additional parameters required for downstream task adaptation.This paper introduces UniTrans,a framework aimed at facilitating efficient knowledge transfer across multiple modalities.UniTrans leverages Vector-based Cross-modal Random Matrix Adaptation to enable fine-tuning with minimal parameter overhead.To further enhance modality alignment,we introduce two key components:the Multimodal Consistency Alignment Module and the Query-Augmentation Side Network,specifically optimized for scenarios with extremely limited trainable parameters.Extensive evaluations on various cross-modal downstream tasks demonstrate that our approach surpasses state-of-the-art methods while using just 5%of their trainable parameters.Additionally,it achieves superior performance compared to fully fine-tuned models on certain benchmarks. 展开更多
关键词 Parameter-efficient transfer learning multimodal alignment image captioning image-text retrieval visual question answering
在线阅读 下载PDF
Imaging alignment of rotational state-selected CH_3I molecule
6
作者 Le-Le Song Yan-Hui Wang +9 位作者 Xiao-Chun Wang Hong-Tao Sun Lan-Hai He Si-Zuo Luo Wen-Hui Hu Dong-Xu Li Wen-Hui Zhu Ya-Nan Sun Da-Jun Ding Fu-Chun Liu 《Chinese Physics B》 SCIE EI CAS CSCD 2019年第2期167-172,共6页
We experimentally and numerically investigate CH_3I molecular alignment by using a femtosecond laser and a hexapole. The hexapole provides the single |111〉rotational state condition at 4.5-kV hexapole rod voltage. Ba... We experimentally and numerically investigate CH_3I molecular alignment by using a femtosecond laser and a hexapole. The hexapole provides the single |111〉rotational state condition at 4.5-kV hexapole rod voltage. Based on this single rotational state, an enhanced alignment degree of 0.73 is achieved. Our experimental results are in agreement with the simulation results. We experimentally obtain the ion velocity map images and show the influence of the initial rotational-state population. With the I+ion images and angular distributions at different pump-probe delay time, the alignment and anti-alignment phenomena are further demonstrated. The molecules will be under field-free conditions when the laser effect disappears completely at the full revival time. Our work shows that the quantum control and spatial control on CH_3I molecules can be realized and molecular coordinate frame can be obtained for further molecular experiment. 展开更多
关键词 hexapole state selection VELOCITY MAP imagING alignment
原文传递
Alignment-dependent ionization of molecules in near-circularly polarized intense laser fields
7
作者 Jie Liu Yong-Kang Zhang Xiao-Lei Hao 《Chinese Physics B》 2025年第5期347-354,共8页
The alignment-dependent photoelectron spectrum is a valuable tool for mapping out the electronic structure of molecular orbitals.However,this approach may not be applicable to all molecules,such as CO_(2),as the ioniz... The alignment-dependent photoelectron spectrum is a valuable tool for mapping out the electronic structure of molecular orbitals.However,this approach may not be applicable to all molecules,such as CO_(2),as the ionization process in a linearly polarized laser field involves contributions from orbitals other than the highest occupied molecular orbital(HOMO).Here,we conducted a theoretical investigation into the ionization process of N_(2) and CO_(2) in near-circularly polarized laser field using the Coulomb-corrected strong-field approximation(CCSFA)method for molecules.In particular,we introduced a generalized dressed state into the CCSFA method in order to account for the impact of the laser field on the molecular initial state.The simulated alignment-dependent photoelectron momentum distribution(PMD)of the two molecules exhibited markedly disparate behaviors,which were in excellent agreement with the previous experimental observations reported in[Phys.Rev.A 102,013117(2020)].Our findings indicate that under a near-circularly polarized laser field,the alignment-dependent PMD of molecules is primarily sourced from the HOMO,in contrast to the situation under a linearly polarized laser field.Moreover,a satisfactory correlation between the alignment-dependent angular distribution and the orbital symmetry was observed,which suggests an effective approach for molecular orbital imaging. 展开更多
关键词 alignment Coulomb-corrected strong-field approximation(CCSFA) photoelectron momentum distribution(PMD) image
原文传递
FPCNet-based change detection for remote sensing images
8
作者 LI Jiying WANG Qi SHI Hongping 《Journal of Measurement Science and Instrumentation》 2025年第3期371-383,共13页
The objective of this study is to address semantic misalignment and insufficient accuracy in edge detail and discrimination detection,which are common issues in deep learning-based change detection methods relying on ... The objective of this study is to address semantic misalignment and insufficient accuracy in edge detail and discrimination detection,which are common issues in deep learning-based change detection methods relying on encoding and decoding frameworks.In response to this,we propose a model called FlowDual-PixelClsObjectMec(FPCNet),which innovatively incorporates dual flow alignment technology in the decoding stage to rectify semantic discrepancies through streamlined feature correction fusion.Furthermore,the model employs an object-level similarity measurement coupled with pixel-level classification in the PixelClsObjectMec(PCOM)module during the final discrimination stage,significantly enhancing edge detail detection and overall accuracy.Experimental evaluations on the change detection dataset(CDD)and building CDD demonstrate superior performance,with F1 scores of 95.1%and 92.8%,respectively.Our findings indicate that the FPCNet outperforms the existing algorithms in stability,robustness,and other key metrics. 展开更多
关键词 remote sensing image change detection semantic misalignment dual flow alignment deep supervised discrimination
在线阅读 下载PDF
Advancing Sports Image Classification and Analysis:Effective Data Augmentation and Feature Alignment Strategies
9
作者 Ping Liu Chao Zhao +2 位作者 Bin Zang Sifeng Wang Shigen Shen 《Tsinghua Science and Technology》 2026年第1期577-589,共13页
Sport plays a crucial role in society,influencing physical health,entertainment,and community engagement.As artificial intelligence advances,the ability to classify sport images accurately becomes increasingly crucial... Sport plays a crucial role in society,influencing physical health,entertainment,and community engagement.As artificial intelligence advances,the ability to classify sport images accurately becomes increasingly crucial.Effective sport image classification enhances applications,such as performance analysis,athlete tracking,and fan engagement.Despite its significance,current methods face challenges due to limited labeled datasets and issues with feature misalignment.This paper introduces a novel Contrastive Language-Image Pre-training(CLIP)based framework specifically designed for sport image classification.By incorporating data augmentation techniques,the approach addresses data sparsity and enriches the diversity of image-text pairings,reducing the need for extensive manual annotation.Additionally,feature alignment strategies tackle text-image misalignment issues that affect classification accuracy.This approach fills a significant research gap and offers practical solutions to improve classification performance in sport image analysis.The results of extensive experiments validate the effectiveness of the framework,demonstrating its potential to advance sports analytics and contribute to more precise and scalable solutions in sport image classification. 展开更多
关键词 sport image classification sports analytics data augmentation feature alignment
原文传递
Novel registration algorithm for 3-D images captured from multiple views of object surface
10
作者 衡伟 《Journal of Southeast University(English Edition)》 EI CAS 2005年第4期411-413,共3页
A novel algorithm of 3-D surface image registration is proposed. It makes use of the array information of 3-D points and takes vector/vertex-like features as the basis of the matching. That array information of 3-D po... A novel algorithm of 3-D surface image registration is proposed. It makes use of the array information of 3-D points and takes vector/vertex-like features as the basis of the matching. That array information of 3-D points can be easily obtained when capturing original 3-D images. The iterative least-mean-squared (LMS) algorithm is applied to optimizing adaptively the transformation matrix parameters. These can effectively improve the registration performance and hurry up the matching process. Experimental results show that it can reach a good subjective impression on aligned 3-D images. Although the algorithm focuses primarily on the human head model, it can also be used for other objects with small modifications. 展开更多
关键词 image alignment 3-D image 3-D capture image registration iterative least-mean-squared algorithm
在线阅读 下载PDF
A survey on image and video stitching 被引量:9
11
作者 Wei LYU Zhong ZHOU +1 位作者 Lang CHEN Yi ZHOU 《Virtual Reality & Intelligent Hardware》 2019年第1期55-83,共29页
Image/video stitching is a technology for solving the field of view(FOV)limitation of images/videos.It stitches multiple overlapping images/videos to generate a wide-FOV image/video,and has been used in various fields... Image/video stitching is a technology for solving the field of view(FOV)limitation of images/videos.It stitches multiple overlapping images/videos to generate a wide-FOV image/video,and has been used in various fields such as sports broadcasting,video surveillance,street view,and entertainment.This survey reviews image/video stitching algorithms,with a particular focus on those developed in recent years.Image stitching first calculates the corresponding relationships between multiple overlapping images,deforms and aligns the matched images,and then blends the aligned images to generate a wide-FOV image.A seamless method is always adopted to eliminate such potential flaws as ghosting and blurring caused by parallax or objects moving across the overlapping regions.Video stitching is the further extension of image stitching.It usually stitches selected frames of original videos to generate a stitching template by performing image stitching algorithms,and the subsequent frames can then be stitched according to the template.Video stitching is more complicated with moving objects or violent camera movement,because these factors introduce jitter,shakiness,ghosting,and blurring.Foreground detection technique is usually combined into stitching to eliminate ghosting and blurring,while video stabilization algorithms are adopted to solve the jitter and shakiness.This paper further discusses panoramic stitching as a special-extension of image/video stitching.Panoramic stitching is currently the most widely used application in stitching.This survey reviews the latest image/video stitching methods,and introduces the fundamental principles/advantages/weaknesses of image/video stitching algorithms.Image/video stitching faces long-term challenges such as wide baseline,large parallax,and low-texture problem in the overlapping region.New technologies may present new opportunities to address these issues,such as deep learning-based semantic correspondence,and 3D image stitching.Finally,this survey discusses the challenges of image/video stitching and proposes potential solutions. 展开更多
关键词 image stitching Video stitching Panoramic stitching REGISTRATION alignment Mesh optimization Deep learning 3D stitching
在线阅读 下载PDF
Planning Margins to CTV for Image-Guided Whole Pelvis Prostate Cancer Intensity-Modulated Radiotherapy
12
作者 Zhendong Wang Kelin Wang +5 位作者 Fritz A. Lerma Bei Liu Pradip Amin Byongyong Yi Georges Hobeika Cedric Yu 《International Journal of Medical Physics, Clinical Engineering and Radiation Oncology》 2012年第2期23-31,共9页
Purpose: We investigated the margin recipes with different alignment techniques in the image-guided intensity-modulated radiotherapy (IMRT) of whole pelvis prostate cancer patients. Materials and Methods: Forty-eight ... Purpose: We investigated the margin recipes with different alignment techniques in the image-guided intensity-modulated radiotherapy (IMRT) of whole pelvis prostate cancer patients. Materials and Methods: Forty-eight computed tomography (CT) scans of eight prostate cancer patients were investigated. Each patient had an initial planning CT scan and 5 consecutive serial CT scans during the course of treatment, all of which were acquired using 3 mm slice separation and 0.94 mm resolution in the axial plane at 120 kVp, on a PQ 5000 CT scanner. Three different whole pelvis planning margin recipes, ranging from 3 to 13 mm, were investigated. A unique IMRT plan was created with each PTV on the initial CT scan, and was then registered to the 5 serial CT scans, by bony alignment or by prostate gland-based alignment. The dose computed on each serial CT scans was accumulated back to the initial CT scan using deformable image registration for final dosimetric evaluation of the interplay of the margin selection and alignment methods. Results: Bony alignment and prostate gland-based alignment gave very similar result to the pelvic lymphatic nodes (PLNs), regardless of its margin around. The prostate gland-based alignment greatly enhanced the coverage to the prostate and SV, especially with small margins. Meanwhile, the soft-tissue alignment also raised the incidental dose to the rectum and reduces the dose to the bladder. With small to intermediate margins, only soft-tissue alignment gave acceptable mean coverage to SV. Margin of 13mm or more was needed for PLNs to maintain good target coverage. Conclusion: We commend prostate-based alignment along with margins less than or equal to 5mm around prostate and SV, and margins greater than or equal to 13 mm around the vascular spaces. 展开更多
关键词 Prostate Cancer WHOLE PELVIS image Guidance IMRT Margin alignment
暂未订购
Use of Image processing software in Hip Joint surgery
13
作者 Rashmi Uddanwadiker 《Advances in Bioscience and Biotechnology》 2011年第2期68-74,共7页
The scope of this project was to investigate the possibility of application of Image Processing Technique in the field of Shaft Alignment process. Misalignment of shaft using image processing software Visionbuilder wa... The scope of this project was to investigate the possibility of application of Image Processing Technique in the field of Shaft Alignment process. Misalignment of shaft using image processing software Visionbuilder was calculated. The further purpose of this project was to check whether the image processing technique can be used in bone transplant surgery. The model of the hip was used for the experimentation purpose. Image processing software Visionbuilder was used to match the profiles of the bone before implant and bone after implant. 展开更多
关键词 image Processing SHAFT alignment HIP Joint BONE TRANSPLANT
暂未订购
Determination of Phase Transitions of <i>p</i>,<i>n</i>-Alkyloxy Benzoic Acid Mesogens Using Legendre Moments and Image Analysis
14
作者 S. Sreehari Sastry C. Nageswara Rao +2 位作者 K. Mallika S. Lakshminarayan Ha Sie Tiong 《World Journal of Condensed Matter Physics》 2013年第1期54-61,共8页
Phase transition temperatures of p,n-alkyloxy benzoic acids (nOBA, n = 3 to 10 and 12) are investigated basing on the textural image analysis of liquid crystal. The analysis is carried out by the computation of Legend... Phase transition temperatures of p,n-alkyloxy benzoic acids (nOBA, n = 3 to 10 and 12) are investigated basing on the textural image analysis of liquid crystal. The analysis is carried out by the computation of Legendre moments. Textures of the homeotropically aligned compounds are recorded as a function of temperature using POM in arthroscopic mode attached to the hot stage and high resolution camera. A recurrence formula is used to compute the liquid crystal textures based on Legendre polynomial. The discontinuities and fluctuations in the values of Legendre moments as a function of temperature are related to the phase transition temperatures of the sample. This method is successful in conforming or detecting the phase transition temperatures and the present findings are comparable with literature. 展开更多
关键词 Alkyloxybenzoic Acids HOMEOTROPIC alignment Phase Transitions Textures Legendre Moments image ANALYSIS
在线阅读 下载PDF
基于多模态语义信息的文本生成图像方法
15
作者 杨冰 周家辉 +1 位作者 姚金良 向学勤 《浙江大学学报(工学版)》 北大核心 2026年第2期360-369,共10页
针对文本语义与图像语义不一致以及图像细节表现不足的问题,提出新的文本生成图像方法.基于多模态语义信息建立鉴别依据,在文本语义基础上引入真实图像语义,以解决文本描述信息密度低的问题,有效缓解生成图像细节缺失或失真的现象.在生... 针对文本语义与图像语义不一致以及图像细节表现不足的问题,提出新的文本生成图像方法.基于多模态语义信息建立鉴别依据,在文本语义基础上引入真实图像语义,以解决文本描述信息密度低的问题,有效缓解生成图像细节缺失或失真的现象.在生成器中集成可变形卷积和星模块卷积,增强生成器表达能力,提高生成图像的细节表现和整体质量.为了验证所提方法的有效性,在CUB数据集和COCO数据集上进行模型训练及评估.与生成式对抗对比语言-图像预训练模型(GALIP)相比,所提方法在保证高效生成的同时,在细节表现、语义一致性及整体质量上具有显著优势. 展开更多
关键词 文本生成图像 多模态语义 可变形卷积 星模块卷积 语义对齐鉴别器
在线阅读 下载PDF
隐式特征图引导的文生图:三向注意力融合
16
作者 马栋林 马晓珍 赵宏 《计算机技术与发展》 2026年第2期101-108,共8页
针对多阶段生成模型因显式中间图像导致误差累积的问题,提出基于隐式特征图与三向注意力融合的双阶段解耦框架。在隐式特征图生成阶段,通过递归注意力迭代生成64×64非可视化特征图,替代AttnGAN等模型的显式中间图像,有效规避多阶... 针对多阶段生成模型因显式中间图像导致误差累积的问题,提出基于隐式特征图与三向注意力融合的双阶段解耦框架。在隐式特征图生成阶段,通过递归注意力迭代生成64×64非可视化特征图,替代AttnGAN等模型的显式中间图像,有效规避多阶段可视化误差传递;在三向注意力增强阶段,扩展Triplet Attention为通道-空间-文本交互机制,实现像素级细粒度控制。实验表明,在细粒度要求最高的CUB数据集上,CLIP分数达0.82,优于基线模型AttnGAN(0.62)、MediaPipe(0.71)等模型,较基线模型提升32.3%;IS分数提升至5.05,较基线模型提升15.8%,且模型推理速度较StackGAN++等多阶段生成方法提升20%。在NVIDIA RTX 4090(24 GB显存)环境下,单张图像生成时间降低至0.96秒,该框架有效解决了多阶段误差传递与细粒度对齐问题。 展开更多
关键词 文本到图像生成 三向注意力 隐式特征图 细粒度对齐 双阶段生成
在线阅读 下载PDF
图文跨模态检索双重过滤与动态补全的注意力区域优化方法研究
17
作者 孟凡奇 田凯迪 田研 《现代信息科技》 2026年第1期41-46,共6页
当前图文跨模态检索存在两个主要瓶颈:传统注意力机制往往包含大量冗余区域,引入无关语义噪声;过度筛选又会导致有效区域不足,造成关键视觉信息丢失。这两种情况均会显著降低模型的匹配精度与鲁棒性。针对该问题,提出一种双重优化策略:... 当前图文跨模态检索存在两个主要瓶颈:传统注意力机制往往包含大量冗余区域,引入无关语义噪声;过度筛选又会导致有效区域不足,造成关键视觉信息丢失。这两种情况均会显著降低模型的匹配精度与鲁棒性。针对该问题,提出一种双重优化策略:首先通过双重过滤机制自适应保留高响应区域,有效抑制冗余噪声;同时创新性引入Top-K动态补全方法,在检测到特征不足时自动补充关键语义区域。实验验证表明,该方法在保持特征选择精度的同时,有效避免关键信息丢失,显著提升了模型在复杂场景下的跨模态匹配性能。 展开更多
关键词 跨模态检索 图文检索 特征对齐 阈值过滤 注意力优化
在线阅读 下载PDF
机器人辅助与传统全膝关节置换临床功能及影像学的差异
18
作者 刘浩洋 李宏旭 +3 位作者 周宇 岳德波 王佰亮 马金辉 《中国组织工程研究》 北大核心 2026年第15期3936-3945,共10页
背景:随着机器人辅助全膝关节置换的兴起,它在植入物定位精度上的优势备受关注。然而,关于机器人全膝关节置换在改善患者术后关节活动度、假体翻修率、下肢力线和患者满意度方面是否优于传统全膝关节置换,现有证据尚不明确。目的:探讨M... 背景:随着机器人辅助全膝关节置换的兴起,它在植入物定位精度上的优势备受关注。然而,关于机器人全膝关节置换在改善患者术后关节活动度、假体翻修率、下肢力线和患者满意度方面是否优于传统全膝关节置换,现有证据尚不明确。目的:探讨Mako机器人辅助与传统全膝关节置换在临床功能与影像学上的差异及临床意义。方法:回顾性分析中日友好医院2023年1月至2024年9月接受Mako机器人辅助全膝关节置换的66例骨关节炎患者(机器人组);根据年龄、性别、体质量指数匹配59例传统全膝关节置换患者(传统组)。记录手术时间、出血量、止血带使用时间以及两组患者术前、术后3,6,12个月的临床结果,包括美国特种外科医院膝关节评分、西安大略和麦克马斯特大学骨关节炎指数、美国膝关节学会评分(功能)及膝关节活动度;术前及术后1周拍摄双下肢前后负重位全长X射线片,测量两组患者髋膝踝角、股骨外翻角、股骨远端外侧角、股骨近端外侧角和胫骨近端内侧角并进行对比。结果与结论:①机器人组较传统组手术时间更长,但止血带使用时间更短,失血量相当;②术后1周两组活动度均改善,传统组略胜一筹,但无显著性差异(P>0.05);③美国特种外科医院膝关节评分显示两组术后功能提升相近,无显著性差异(P>0.05);美国膝关节学会功能评分在术后1年时,机器人组优于传统组(P<0.05);西安大略和麦克马斯特大学骨关节炎指数表明两组术后疼痛和功能均改善,但机器人组在术后6个月和1年表现更优(P<0.05);④X射线片结果显示所有患者下肢力线改善,假体位置良好,随访期间未发生严重不良事件;⑤影像学测量显示,机器人组术后髋膝踝角、胫骨近端内侧角和股骨近端外侧角显著改善(P<0.05),而股骨外翻角和股骨远端外侧角无显著变化(P>0.05);传统组髋膝踝角和胫骨近端内侧角改善显著(P<0.05),但股骨外翻角、股骨近端外侧角和股骨远端外侧角无显著变化(P>0.05);两组术后影像学指标比较无显著性差异(P>0.05),术前与术后差值比较除股骨外翻角和股骨近端外侧角外,其余指标亦无显著性差异(P>0.05);⑥术后并发症发生率在两组间无显著性差异(P>0.05);⑦提示Mako机器人辅助全膝关节置换在术中角度、力线和平衡调整上优于传统全膝关节置换,提升了术中截骨、假体置入的精确性和个性化;术后下肢力线恢复、临床效果不劣于传统全膝关节置换,具有较好的应用前景。 展开更多
关键词 机器人辅助 全膝关节置换 影像学 下肢力线 活动度 膝关节功能
暂未订购
基于多模态交叉对齐网络的小样本图像语义分割
19
作者 周莹 赵国栋 《燕山大学学报》 北大核心 2026年第1期76-84,94,共10页
针对现有小样本图像语义分割方法对未知图片中目标定位不精确的问题,提出一种基于多模态交叉对齐网络的小样本图像语义分割方法。首先,利用一组共享权重的主干网络将支持图片和查询图片映射到深度特征空间,提取图片在视觉维度的编码特... 针对现有小样本图像语义分割方法对未知图片中目标定位不精确的问题,提出一种基于多模态交叉对齐网络的小样本图像语义分割方法。首先,利用一组共享权重的主干网络将支持图片和查询图片映射到深度特征空间,提取图片在视觉维度的编码特征。其次,利用预训练的CLIP文本编码器将支持图片中的目标类信息编码到文本空间中,捕获目标类对应的文本语义。再次,利用交叉注意力机制建立文本和视觉空间的特征交互,促进不同模态间的语义对齐。最后,利用临时预测的查询掩码建立反向交叉指导策略,指导原始支持图片中已知目标的掩码预测。在开源的PASCAL和COCO数据集上进行了对比实验和消融实验,实验结果验证了所设计方法在处理查询图片中未知目标时的优越性。 展开更多
关键词 图像语义分割 小样本学习 多模态交叉对齐 反向交叉指导 CLIP
在线阅读 下载PDF
基于虚拟像平面模型的RGB-T图像生成方法
20
作者 蔡鑫洋 李巍岷 +1 位作者 张瀛煜 刘昶 《仪表技术》 2026年第1期50-54,共5页
针对可见光与热红外(RGB-Thermal,RGB-T)图像的融合问题,提出一种基于虚拟像平面模型的RGB-T图像生成方法。采用带有发热元件的平面棋盘格标定板,完成可见光相机与热红外面阵传感器的联合标定;在畸变校正基础上,将可见光图像重投影至虚... 针对可见光与热红外(RGB-Thermal,RGB-T)图像的融合问题,提出一种基于虚拟像平面模型的RGB-T图像生成方法。采用带有发热元件的平面棋盘格标定板,完成可见光相机与热红外面阵传感器的联合标定;在畸变校正基础上,将可见光图像重投影至虚拟像平面,得到对应的虚拟图像;将热红外图像投影至同一虚拟像平面,实现可见光与热红外图像的像素级对齐,从而构建目标的四通道RGB-T图像。实验结果表明,所提方法能够有效融合目标的颜色信息与温度特征,有助于提升热源目标的检测性能。 展开更多
关键词 图像融合 虚拟像平面模型 RGB-T图像 像素级对齐 热目标检测
原文传递
上一页 1 2 26 下一页 到第
使用帮助 返回顶部