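Abstract: Objective: As film content grows more complex and diverse, movie scene segmentation has become an important task for understanding film structure and supporting multimedia applications. To make shot feature extraction and feature association more effective and to strengthen context awareness over shot sequences, a hybrid-architecture movie scene segmentation method, the hybrid architecture scene segmentation network (HASSNet), is proposed. Methods: First, a pre-training plus fine-tuning strategy is adopted: the model is pre-trained without supervision on a large amount of movie data that has no scene labels, so that it learns effective shot feature representations and association properties, and is then fine-tuned on scene-labeled data to further improve performance. Second, the architecture combines a state space model with a self-attention model, with a Shot Mamba module designed for shot feature extraction and a Scene Transformer module designed for feature association; Shot Mamba extracts feature representations by modeling shot images as patches, while Scene Transformer uses attention to model associations among the features of different shots. Finally, three unsupervised loss functions are used for pre-training to improve shot feature extraction and association, and Focal Loss is used for fine-tuning to mitigate the loss of accuracy caused by class imbalance. Results: Experiments show that HASSNet significantly improves scene segmentation accuracy on three datasets. On MovieNet, a representative movie scene segmentation dataset, AP (average precision), mIoU (mean intersection over union), AUC-ROC (area under the receiver operating characteristic curve), and F1 improve by 1.66%, 10.54%, 0.21%, and 16.83%, respectively, over state-of-the-art scene segmentation methods, which verifies that HASSNet effectively improves the accuracy of scene boundary localization. Conclusion: HASSNet effectively combines pre-training with fine-tuning and, by drawing on the characteristics of the hybrid state space and self-attention models, strengthens the context awareness of shots and makes movie scene segmentation more accurate.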
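The abstract above names Focal Loss as the fine-tuning loss used to counter class imbalance (scene boundaries are rare compared with non-boundary shots). A minimal sketch of the standard binary focal loss follows, to show how easy examples are down-weighted. The function names and the defaults `alpha=0.25` and `gamma=2.0` are illustrative assumptions; the abstract does not state the values or the exact variant HASSNet uses.

```python
import math


def binary_focal_loss(p, y, alpha=0.25, gamma=2.0, eps=1e-7):
    """Standard binary focal loss for a single prediction.

    p: predicted probability that the shot is a scene boundary (positive class).
    y: ground-truth label, 1 for boundary and 0 for non-boundary.
    alpha, gamma: assumed illustrative defaults, not values from the paper.
    """
    # Clamp the probability so that log() never receives 0.
    p = min(max(p, eps), 1.0 - eps)
    # p_t is the probability the model assigns to the true class.
    p_t = p if y == 1 else 1.0 - p
    # alpha_t balances the positive and negative classes.
    alpha_t = alpha if y == 1 else 1.0 - alpha
    # (1 - p_t)^gamma shrinks the loss of well-classified examples.
    return -alpha_t * (1.0 - p_t) ** gamma * math.log(p_t)


def mean_focal_loss(probs, labels, alpha=0.25, gamma=2.0):
    """Average focal loss over a sequence of shot-level predictions."""
    losses = [binary_focal_loss(p, y, alpha, gamma) for p, y in zip(probs, labels)]
    return sum(losses) / len(losses)
```

With `gamma=0` the modulating factor disappears and the expression reduces to class-weighted cross-entropy; larger `gamma` values concentrate the training signal on hard, misclassified shots.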
Funding: Supported by the National Natural Science Foundation of China (Grant No. 41606209); the National Key Research and Development Program of China (Grant No. 2016YFB0501501); the Fujian Provincial Key Laboratory of Photonics Technology and the Key Laboratory of Optoelectronic Science and Technology for Medicine of Ministry of Education, Fujian Normal University, China (Grant No. JYG1707); the Polar Science Strategic Research Foundation of China (Grant No. 20150312); the Fundamental Research Funds for the Henan Provincial Colleges and Universities (Grant No. 2015QNJH16); and the Science and Technology Project of Zhengzhou Science and Technology Bureau (Grant No. 20150251).
Abstract: Microwave radiometer SSM/I data and scatterometer QuikSCAT data have been widely used to detect near-surface snowmelt on ice sheets because both are sensitive to liquid water in snow. To improve the accuracy of near-surface snowmelt detection over the Antarctic ice sheet, a new synergistic detection method is proposed that exploits the complementary strengths of SSM/I data (high reliability) and QuikSCAT data (high sensitivity). The method uses an edge detection model to extract edge information automatically and derive the distribution of the Antarctic snowmelt onset date, snowmelt duration, and snowmelt end date. Verification against the automatic weather stations (AWS) at Butler Island and Larsen Ice Shelf shows that the proposed method improves detection accuracy from about 75% to 86%. The algorithm can also be applied to other regions, providing methodological support and a supplement for global snowmelt detection.
Abstract: The persistent interest in building practical frameworks for software reuse has existed for as long as software itself. In principle, specific robust frameworks are built to ensure a high level of reusability from one project to the next. The inevitable tendency of projects to demand extensive modifications, despite being designed for maximum reusability, remains strong evidence of this fact. Software reusability makes the study of stable analysis, architecture, and design patterns an area of great interest. By extrapolating the stability concepts of the software stability model and the Knowledge Maps, we attempt to arrive at software designs that do not require uncontrolled modifications, changes, or extensions. Such patterns work as a kind of framework into which new requirements can be incorporated, depending only on the evolution of the context in which it is applied. The applicability of this approach is demonstrated in the paper.