摘要
提出一种字幕区域分割算法.首先对图像做小波变换和重构,并抽取字幕区域特征,再分块计算统计特征;然后对子块进行K均值聚类,实现字幕区域分割.与已有算法相比,该算法简单,不需要设置阈值.实验结果表明,即使在复杂背景下,对于字体、大小和位置都不确定的字幕,该算法仍具有良好的分割效果.
An algorithm is proposed in the paper to segment caption region. By the algorithm, firstly, the caption features are extracted by wavelet transformation and reconstruction on the image, and secondly, statistic features are calculated block by block. At last blocks are classified by K-mean clustering method. In comparison with other algorithms, the algorithm is simpler and no requirement for setting any threshold. Experimental results show that the proposed algorithm performs well, even for captions with unknown font, size, and position.
出处
《计算机辅助设计与图形学学报》
EI
CSCD
北大核心
2006年第10期1508-1512,共5页
Journal of Computer-Aided Design & Computer Graphics
基金
江苏省自然科学基金(BK2004137)
关键词
字幕分割
小波变换
K均值聚类
caption extraction
wavelet transform
K-mean clustering