期刊文献+
共找到74,006篇文章
< 1 2 250 >
每页显示 20 50 100
一种HEVC帧内预测模式决策的快速算法
1
作者 王爽 刘家良 +1 位作者 张海坤 胡越黎 《上海大学学报(自然科学版)》 北大核心 2025年第3期561-570,共10页
高效视频编码(high efficiency video coding,HEVC)相较于上一代编码标准H.264降低了约50%的比特率,但为了提高帧内预测的准确性,HEVC提出的35种预测模式导致计算量大幅增加,对软件和硬件实现均构成了挑战.针对该问题,在HEVC的基础上提... 高效视频编码(high efficiency video coding,HEVC)相较于上一代编码标准H.264降低了约50%的比特率,但为了提高帧内预测的准确性,HEVC提出的35种预测模式导致计算量大幅增加,对软件和硬件实现均构成了挑战.针对该问题,在HEVC的基础上提出了一种依据图片纹理方向,结合预测模式之间的关联性来确定帧内预测模式的快速算法.实验结果表明,本算法与HEVC参考软件HM16.20相比,在BD-Rate损失仅为5.79%的情况下,节省46%以上的编码时间,显著降低了帧内预测模式决策的复杂度,便于在嵌入式系统等硬件资源有限的端侧实现算法落地. 展开更多
关键词 高效视频编码 帧内预测 角度模式 预测模式决策
在线阅读 下载PDF
基于H.265/HEVC的快速帧内编码研究 被引量:1
2
作者 马振华 贾华宇 罗飚 《现代电子技术》 北大核心 2025年第8期51-55,共5页
随着视频应用和新兴业务的快速发展,对视频编码速度和质量的要求也不断提高。为了降低H.265/HEVC的帧内编码复杂度,提出一种基于最有可能模式(MPM)的模糊搜索算法,通过减少搜索候选模式的数量来降低计算复杂度;同时提出一种简化编码单... 随着视频应用和新兴业务的快速发展,对视频编码速度和质量的要求也不断提高。为了降低H.265/HEVC的帧内编码复杂度,提出一种基于最有可能模式(MPM)的模糊搜索算法,通过减少搜索候选模式的数量来降低计算复杂度;同时提出一种简化编码单元划分过程的方法,利用相邻编码单元率失真代价计算的阈值,提前终止编码单元划分,避免了传统算法的遍历划分,提高了编码效率。实验结果表明,所提算法与HEVC传统模型比较,能够平均降低39.38%的编码时间,而码率只增加了1.62%,峰值信噪比差值仅降低0.085 dB。在保证视频质量的前提下,所提算法大幅降低了编码的复杂度。 展开更多
关键词 hevc 视频编码 帧内编码 最有可能模式 编码单元划分 率失真优化
在线阅读 下载PDF
基于决策树的HEVC到VVC转码算法
3
作者 许世扬 肖广 滕国伟 《工业控制计算机》 2025年第1期66-67,128,共3页
H.266/VVC优秀的编码效率促使人们将H.265/HEVC编码的视频内容转码到H.266/VVC标准。然而H.266/VVC新引入的四叉树嵌套多类型树(QTMT)编码单元(CU)划分技术显著增加了VVC编码可能的划分模式,并导致HEVC到VVC的转码的高复杂度。因此提出... H.266/VVC优秀的编码效率促使人们将H.265/HEVC编码的视频内容转码到H.266/VVC标准。然而H.266/VVC新引入的四叉树嵌套多类型树(QTMT)编码单元(CU)划分技术显著增加了VVC编码可能的划分模式,并导致HEVC到VVC的转码的高复杂度。因此提出了基于深度重用和决策树的HEVC到VVC转码算法。首先通过HEVC的划分深度缩小了VVC编码单元划分的深度范围,随后使用决策树判断剩余深度上CU分裂的可能性,并对不太可能的划分深度执行简化测试。实验结果显示,和原版的转码算法相比,提出的算法平均降低了45.53%的转码复杂度,而BDBR仅增加了0.80%。 展开更多
关键词 H.265/hevc H.266/VVC 转码
在线阅读 下载PDF
高效HEVC编码器的硬件架构设计
4
作者 黄晖 施隆照 黄霖 《中国集成电路》 2025年第3期35-42,共8页
高效视频编码标准(High Efficiency Video Coding,HEVC)作为H.264/AVC的继任者,提高了约2倍的编码效率。但其编码数据的计算复杂度和依赖性的增加,使视频编码器在硬件实现上更加困难。尤其是对编码器视频数据的处理和存取以及编码器内... 高效视频编码标准(High Efficiency Video Coding,HEVC)作为H.264/AVC的继任者,提高了约2倍的编码效率。但其编码数据的计算复杂度和依赖性的增加,使视频编码器在硬件实现上更加困难。尤其是对编码器视频数据的处理和存取以及编码器内部状态控制的实现带来挑战。本文基于HEVC的宏块编码流程,提出了一种满足整体编码器实时高效运行的视频数据的存取结构和协调编码器各模块的顶层控制的方案。整个设计基于VCS和VIVADO的联合仿真环境验证功能的正确性。并在Xilinx公司的VCU118型号的FPGA上完成上板验证。测试结果表明,综合后的编码器的主频为100 MHz,可以满足编码器实现1080P30@fps的编码需求。 展开更多
关键词 视频编码 hevc DDR FPGA
在线阅读 下载PDF
Separate Source Channel Coding Is Still What You Need:An LLM-Based Rethinking 被引量:3
5
作者 REN Tianqi LI Rongpeng +5 位作者 ZHAO Mingmin CHEN Xianfu LIU Guangyi YANG Yang ZHAO Zhifeng ZHANG Honggang 《ZTE Communications》 2025年第1期30-44,共15页
Along with the proliferating research interest in semantic communication(Sem Com),joint source channel coding(JSCC)has dominated the attention due to the widely assumed existence in efficiently delivering information ... Along with the proliferating research interest in semantic communication(Sem Com),joint source channel coding(JSCC)has dominated the attention due to the widely assumed existence in efficiently delivering information semantics.Nevertheless,this paper challenges the conventional JSCC paradigm and advocates for adopting separate source channel coding(SSCC)to enjoy a more underlying degree of freedom for optimization.We demonstrate that SSCC,after leveraging the strengths of the Large Language Model(LLM)for source coding and Error Correction Code Transformer(ECCT)complemented for channel coding,offers superior performance over JSCC.Our proposed framework also effectively highlights the compatibility challenges between Sem Com approaches and digital communication systems,particularly concerning the resource costs associated with the transmission of high-precision floating point numbers.Through comprehensive evaluations,we establish that assisted by LLM-based compression and ECCT-enhanced error correction,SSCC remains a viable and effective solution for modern communication systems.In other words,separate source channel coding is still what we need. 展开更多
关键词 separate source channel coding(SSCC) joint source channel coding(JSCC) end-to-end communication system Large Language Model(LLM) lossless text compression Error Correction Code Transformer(ECCT)
在线阅读 下载PDF
基于能耗优化的无人战车HEVC解码感知算法
6
作者 李正雄 李明月 +2 位作者 王孟 杨振宇 刘辉 《火力与指挥控制》 北大核心 2025年第1期138-145,共8页
为了优化无人战车在解码压缩视频时产生的能耗,提出一种改变视频码流的HEVC解码复杂度感知算法,以降低解码复杂度,并限制编码效率的影响。确定解码复杂性,视频质量和比特率的比例,再通过解码复杂度感知算法,生成解码复杂度-码率-失真模... 为了优化无人战车在解码压缩视频时产生的能耗,提出一种改变视频码流的HEVC解码复杂度感知算法,以降低解码复杂度,并限制编码效率的影响。确定解码复杂性,视频质量和比特率的比例,再通过解码复杂度感知算法,生成解码复杂度-码率-失真模型优化的HEVC比特流。与HM16.0和OpenHEVC解码器实现相比,该算法产生的比特流在相同视频质量下分别降低了29.19%和13.24%的解码复杂度,编码效率影响最小。相比于HM16.0编码,该算法能耗减少了15%。 展开更多
关键词 无人战车 解码能耗 能耗优化 复杂度-码率-失真模型 hevc
在线阅读 下载PDF
一种HEVC帧内预测模式提前判决装置
7
作者 池承利 《中国集成电路》 2025年第10期40-45,共6页
目前市面上HEVC(High Efficiency Video Coding)实时编码器要求能够实现在500 MHz时钟的情况下,完成4k 30 fps(Frames Per Second)及以下图像的实时编码。由于HEVC帧内预测模式有35种,并且预测单元PU(Prediction Unit)分为4×4、8&#... 目前市面上HEVC(High Efficiency Video Coding)实时编码器要求能够实现在500 MHz时钟的情况下,完成4k 30 fps(Frames Per Second)及以下图像的实时编码。由于HEVC帧内预测模式有35种,并且预测单元PU(Prediction Unit)分为4×4、8×8、16×16、32×32、64×64这么多层,对于实时编码是一个很大的挑战,因此需要进行帧内预测模式的初步选择,减少RDO(Rate distortion optimization)中帧内模式的数量,降低硬件开销和满足实时性。本文提供一种HEVC帧内预测模式提前判决装置PRE_INTRA(Previous Intra Prediction),使用原始数据替代重构数据,从帧内35种预测模式中,使用SAD(Sum of Absolute Differences)算法的方式选择出亮度6种模式,色度一种模式,供RDO判决模块进行选择。实验结果表明:提出的算法与HM已有快速算法相比,PSNR(Peak signal-to-noise ratio)平均下降0.02 dB,输出码率平均增加0.22%,但是可以满足HEVC实施编码器性能要求。 展开更多
关键词 hevc 帧内预测 提前判决 模式选择 模式判决 预测角度 实时编码
在线阅读 下载PDF
Tunable reflective spin-decoupled encoding metasurface based on Dirac semimetals
8
作者 HAO Xiao-yu ZHENG Si-yu +6 位作者 WANG Yu LIU Yang LIU Meng ZHANG Yu-ping ZHANG Jin-juan ZHAN Yi ZHANG Hui-yun 《中国光学(中英文)》 北大核心 2025年第4期968-978,共11页
Multiple functional metasurfaces with high information capacity have attracted considerable attention from researchers.This study proposes a 2-bit tunable spin-decoupled coded metasurface designed for the terahertz ba... Multiple functional metasurfaces with high information capacity have attracted considerable attention from researchers.This study proposes a 2-bit tunable spin-decoupled coded metasurface designed for the terahertz band,which utilizes the tunable properties of Dirac semimetals(DSM)to create a novel multilayer structure.By incorporating both geometric and propagating phases into the metasurface design,we can effectively control the electromagnetic wave.When the Fermi level(EF)of the DSM is set at 6 meV,the electromagnetic wave is manipulated by the gold patch embedded in the DSM film,operating at a frequency of 1.3 THz.When the EF of the DSM is set at 80 meV,the electromagnetic wave is manipulated by the DSM patch,operating at a frequency of 1.4 THz.Both modes enable independent control of beam splitting under left-rotating circularly polarized(LCP)and rightrotating circularly polarized(RCP)wave excitation,resulting in the generation of vortex beams with distinct orbital angular momentum(OAM)modes.The findings of this study hold significant potential for enhancing information capacity and polarization multiplexing techniques in wireless communications. 展开更多
关键词 coding metasurface dirac semimetal spin decoupling circular polarization TUNABLE
在线阅读 下载PDF
基于PUPM的HEVC视频隐写发展进程
9
作者 于子超 于丽芳 《北京印刷学院学报》 2025年第3期22-29,共8页
隐写术是信息安全领域的一个热门研究方向。由于视频媒体的广泛使用,视频隐写术受到了研究领域的广泛关注。在视频隐写术中,HEVC编码视频中的基于预测单元划分模式(Prediction Unit Partition Mode,简称PUPM)的视频隐写术以其更高的视... 隐写术是信息安全领域的一个热门研究方向。由于视频媒体的广泛使用,视频隐写术受到了研究领域的广泛关注。在视频隐写术中,HEVC编码视频中的基于预测单元划分模式(Prediction Unit Partition Mode,简称PUPM)的视频隐写术以其更高的视觉质量成为研究人员关注的热点之一。本文主要研究了基于PUPM的视频隐写术。首先,讨论了基于PUPM的隐写术的基本原理和评价标准。其次,根据不同的技术特点,将基于PUPM域的隐写分为三类:传统的PUPM隐写、基于编码的PUPM隐写和基于最小化嵌入失真框架的自适应PUPM隐写。说明了上述代表性方法的优缺点。最后,提出了基于多因素的失真函数设计、基于深度学习的PUPM隐写以及将基于PUPM的隐写从实验室应用到现实世界等三个未来的研究方向。 展开更多
关键词 PUPM hevc视频隐写
在线阅读 下载PDF
基于HEVC技术的直播电视信号优化策略研究
10
作者 穆娜娜 《电视技术》 2025年第3期72-74,共3页
随着高清和超高清直播需求的增长,传统编码技术在压缩效率和传输质量方面逐渐显现出局限性。针对这一问题,提出基于高效视频编码(High Efficiency Video Coding,HEVC)技术的优化策略,以期改善直播电视信号的压缩效率、传输延迟及画质清... 随着高清和超高清直播需求的增长,传统编码技术在压缩效率和传输质量方面逐渐显现出局限性。针对这一问题,提出基于高效视频编码(High Efficiency Video Coding,HEVC)技术的优化策略,以期改善直播电视信号的压缩效率、传输延迟及画质清晰度。实验结果表明,HEVC技术结合快速用户数据报协议互联网连接(Quick User Datagram Protocol Internet Connections,QUIC)协议和硬件加速技术,可显著提高高分辨率场景下的信号稳定性和实时性。 展开更多
关键词 直播电视信号 高效视频编码(hevc) 快速用户数据报协议互联网连接(QUIC) 硬件加速技术
在线阅读 下载PDF
Research on deep learning decoding method for polar codes in ACO-OFDM spatial optical communication system
11
作者 LIU Kangrui LI Ming +2 位作者 CHEN Sizhe QU Jiashun ZHOU Ming’ou 《Optoelectronics Letters》 2025年第7期427-433,共7页
Aiming at the problem that the bit error rate(BER)of asymmetrically clipped optical orthogonal frequency division multiplexing(ACO-OFDM)space optical communication system is significantly affected by different turbule... Aiming at the problem that the bit error rate(BER)of asymmetrically clipped optical orthogonal frequency division multiplexing(ACO-OFDM)space optical communication system is significantly affected by different turbulence intensities,the deep learning technique is proposed to the polarization code decoding in ACO-OFDM space optical communication system.Moreover,this system realizes the polarization code decoding and signal demodulation without frequency conduction with superior performance and robustness compared with the performance of traditional decoder.Simulations under different turbulence intensities as well as different mapping orders show that the convolutional neural network(CNN)decoder trained under weak-medium-strong turbulence atmospheric channels achieves a performance improvement of about 10^(2)compared to the conventional decoder at 4-quadrature amplitude modulation(4QAM),and the BERs for both 16QAM and 64QAM are in between those of the conventional decoder. 展开更多
关键词 frequency conduction polar codes deep learning signal demodulation deep learning technique DEcoding ACO OFDM polarization code decoding
原文传递
Quasi-Orthogonal Space-Time Coding for Backscatter Communications with Multiple-Antenna Tags
12
作者 Cao Shuiling Wang Gongpu +2 位作者 Gao Jie Kuang Lei Chintha Tellambura 《China Communications》 2025年第7期186-194,共9页
Existing orthogonal space-time block coding(OSTBC)schemes for backscatter communication systems cannot achieve a full transmission code rate when the tag is equipped with more than two antennas.In this paper,we propos... Existing orthogonal space-time block coding(OSTBC)schemes for backscatter communication systems cannot achieve a full transmission code rate when the tag is equipped with more than two antennas.In this paper,we propose a quasi-orthogonal spacetime block code(QOSTBC)that can achieve a full transmission code rate for backscatter communication systems with a four-antenna tag and then extend the scheme to support tags with 2i antennas.Specifically,we first present the system model for the backscatter system.Next,we propose the QOSTBC scheme to encode the tag signals.Then,we provide the corresponding maximum likelihood detection algorithms to recover the tag signals.Finally,simulation results are provided to demonstrate that our proposed QOSTBC scheme and the detection algorithm can achieve a better transmission code rate or symbol error rate performance for backscatter communication systems compared with benchmark schemes. 展开更多
关键词 backscatter communications channel coding Internet of Things(IoT) multiple antennas quasi-orthogonal space-time block code
在线阅读 下载PDF
Semantic Secure Communication Based on the Joint Source-Channel Coding
13
作者 Yifeng Lin Yuer Yang +2 位作者 Jianxiang Xie Tong Ji Peiya Li 《Computers, Materials & Continua》 2025年第8期2865-2882,共18页
Semantic secure communication is an emerging field that combines the principles of source-channel coding with the need for secure data transmission.It is of great significance in modern communications to protect the c... Semantic secure communication is an emerging field that combines the principles of source-channel coding with the need for secure data transmission.It is of great significance in modern communications to protect the confidentiality and privacy of sensitive information and prevent information leaks and malicious attacks.This paper presents a novel approach to semantic secure communication through the utilization of joint source-channel coding,which is based on the design of an automated joint source-channel coding algorithm and an encryption and decryption algorithm based on semantic security.The traditional and state-of-the-art joint source-channel coding algorithms are selected as two baselines for different comparison purposes.Experimental results demonstrate that our proposed algorithm outperforms the first baseline algorithm,the traditional source-channel coding,by 61.21%in efficiency under identical channel conditions(SNR=15 dB).In security,our proposed method can resist 2 more types of attacks compared to the two baselines,exhibiting nearly no increases in time consumption and error rate compared to the state-of-the-art joint source-channel coding algorithm while the secure semantic communication is supported. 展开更多
关键词 Secure semantic communication joint source-channel coding(JSCC) automaticed joint source-channel coding algorithm
在线阅读 下载PDF
Broadband polarization-independent terahertz multifunctional liquid crystal coding metasurface based on topological optimization
14
作者 Yu Chen Wu-Hao Cao +4 位作者 Jia-Qi Li Ming-Zhe Zhang Xin-Yi Du Ding-Shan Gao Pei-Li Li 《Chinese Physics B》 2025年第4期432-440,共9页
A broadband polarization-independent terahertz multifunctional coding metasurface based on topological optimization using liquid crystal(LC)is proposed.The metasurface can achieve reconfigurability for beam steering a... A broadband polarization-independent terahertz multifunctional coding metasurface based on topological optimization using liquid crystal(LC)is proposed.The metasurface can achieve reconfigurability for beam steering and vortex beam generation within a frequency range of 0.68 THz–0.72 THz.Firstly,the metasurface unit is topologically optimized using the non-dominant sequencing genetic algorithms(NSGA-II)multi-objective optimization algorithm.By applying the LC’s electrically tunable refractive index properties,the metasurface unit enables polarization-independent 2-bit coding within a frequency range of 0.68 THz–0.72 THz.Then,based on the designed metasurface unit,the array arrangement of the metasurface is reverse-designed to achieve beam steering and vortex beam generation.The results show that,for beam steering,not only can polarization-independent steering of both single-and multi-beam be achieved within the 35°elevation angle range,but also independent control of the target angle of each beam in the multi-beam steering.For vortex beam generation,the metasurfaces can achieve the generation of single-and multi-vortex beams with topological charges l=±1,±2 within the 35elevation angle range,and the generation angles of each vortex beam in the multi-vortex beam can be independently controlled.This provides flexibility and diversity in the generation of vortex beams.Therefore,the proposed terahertz LC metasurface can realize flexible control of reconfigurable functions and has certain application prospects in terahertz communication,phased array radar,and vortex radar. 展开更多
关键词 coding metasurfaces polarization-independent TERAHERTZ topology optimization
原文传递
Deep learning-enabled inverse design of polarization-selective structural color based on coding metasurface
15
作者 Haolin Yang Bo Ni +2 位作者 Junhong Guo Hua Zhou Jianhua Chang 《Chinese Physics B》 2025年第5期311-318,共8页
Structural colors based on metasurfaces have very promising applications in areas such as optical image encryption and color printing.Herein,we propose a deep learning-enabled reverse design of polarization-selective ... Structural colors based on metasurfaces have very promising applications in areas such as optical image encryption and color printing.Herein,we propose a deep learning-enabled reverse design of polarization-selective structural color based on coding metasurface.In this study,the long short-term memory(LSTM)neural network is presented to enable the forward and inverse mapping between coding metasurface structure and corresponding color.The results show that the method can achieve 98%accuracy for the forward prediction of color and 93%accuracy for the inverse design of the structure.Moreover,a cascaded architecture is adopted to train the inverse neural network model,which can solve the nonuniqueness problem of the polarization-selective color reverse design.This study provides a new path for the application and development of structural colors. 展开更多
关键词 deep learning inverse design coding metasurface structural color polarization-selective
原文传递
Energy focusing of flexural waves via algorithmically optimized coding metasurface lenses
16
作者 Zi-Rui Wang Di-Chao Chen +1 位作者 Rui Hong Da-Jian Wu 《Chinese Physics B》 2025年第9期277-282,共6页
Efficient elastic wave focusing is crucial in materials and physical engineering.Elastic coding metasurfaces,which are innovative planar artificial structures,show great potential for use in the field of wave focusing... Efficient elastic wave focusing is crucial in materials and physical engineering.Elastic coding metasurfaces,which are innovative planar artificial structures,show great potential for use in the field of wave focusing.However,elastic coding lenses(ECLs)still suffer from low focusing performance,thickness comparable to wavelength,and frequency sensitivity.Here,we consider both the structural and material properties of the coding unit,thus realizing further compression of the thickness of the ECL.We chose the simplest ECL,which consists of only two encoding units.The coding unit 0 is a straight structure constructed using a carbon fiber reinforced composite material,and the coding unit 1 is a zigzag structure constructed using an aluminum material,and the thickness of the ECL constructed using them is only 1/8 of the wavelength.Based on the theoretical design,the arrangement of coding units is further optimized using genetic algorithms,which significantly improves the focusing performance of the lens at different focus and frequencies.This study provides a more effective way to control vibration and noise in advanced structures. 展开更多
关键词 coding metasurface elastic wave focusing genetic algorithm
原文传递
Review for wireless communication technology based on digital encoding metasurfaces
17
作者 Haojie Zhan Manna Gu +10 位作者 Ying Tian Huizhen Feng Mingmin Zhu Haomiao Zhou Yongxing Jin Ying Tang Chenxia Li Bo Fang Zhi Hong Xufeng Jing Le Wang 《Opto-Electronic Advances》 2025年第7期51-106,共56页
Metasurfaces offer exceptional capabilities for controlling electromagnetic waves,enabling the realization of unique electromagnetic properties.As communication technology continues to evolve,metasurfaces present prom... Metasurfaces offer exceptional capabilities for controlling electromagnetic waves,enabling the realization of unique electromagnetic properties.As communication technology continues to evolve,metasurfaces present promising applications in wireless communications.This paper reviews the latest advancements in metasurface research within the communication sector,explores metasurface-based wireless relay technologies,and summarizes various wireless communication methods employing different types of metasurfaces across diverse modulation schemes.This paper provides a detailed discussion on the design of wireless communication systems based on coding metasurfaces to simplify transmitter architecture,as well as the development of intelligent coding metasurfaces in the communication field.It also elaborates on the application of vector vortex light fields in metasurface communication.Finally,it offers a forward-looking perspective on wireless communication systems that incorporate coded metasurfaces.This review aims to furnish researchers with a thorough understanding of the current state and future directions of coded metasurface applications in communications. 展开更多
关键词 coding metasurface RIS wireless communications signal modulation TRANSMITTER vortex light
在线阅读 下载PDF
Text-and-Timbre-Based Speech Semantic Coding for Ultra-Low-Bitrate Communications
18
作者 Yang Xiaoniu Qian Liping +2 位作者 Lyu Sikai Wang Qian Wang Wei 《China Communications》 2025年第1期7-24,共18页
To address the contradiction between the explosive growth of wireless data and the limited spectrum resources,semantic communication has been emerging as a promising communication paradigm.In this paper,we thus design... To address the contradiction between the explosive growth of wireless data and the limited spectrum resources,semantic communication has been emerging as a promising communication paradigm.In this paper,we thus design a speech semantic coded communication system,referred to as Deep-STS(i.e.,Deep-learning based Speech To Speech),for the lowbandwidth speech communication.Specifically,we first deeply compress the speech data through extracting the textual information from the speech based on the conformer encoder and connectionist temporal classification decoder at the transmitter side of Deep-STS system.In order to facilitate the final speech timbre recovery,we also extract the short-term timbre feature of speech signals only for the starting 2s duration by the long short-term memory network.Then,the Reed-Solomon coding and hybrid automatic repeat request protocol are applied to improve the reliability of transmitting the extracted text and timbre feature over the wireless channel.Third,we reconstruct the speech signal by the mel spectrogram prediction network and vocoder,when the extracted text is received along with the timbre feature at the receiver of Deep-STS system.Finally,we develop the demo system based on the USRP and GNU radio for the performance evaluation of Deep-STS.Numerical results show that the ac-Received:Jan.17,2024 Revised:Jun.12,2024 Editor:Niu Kai curacy of text extraction approaches 95%,and the mel cepstral distortion between the recovered speech signal and the original one in the spectrum domain is less than 10.Furthermore,the experimental results show that the proposed Deep-STS system can reduce the total delay of speech communication by 85%on average compared to the G.723 coding at the transmission rate of 5.4 kbps.More importantly,the coding rate of the proposed Deep-STS system is extremely low,only 0.2 kbps for continuous speech communication.It is worth noting that the Deep-STS with lower coding rate can support the low-zero-power speech communication,unveiling a new era in ultra-efficient coded communications. 展开更多
关键词 low coding rate semantic communication speech recognition speech synthesis
在线阅读 下载PDF
Deep Learning Based Progressive Joint Source-Channel Coding for Wireless Image Transmission
19
作者 Yuan Hongjie Xu Weizhang +2 位作者 Wei Lingzhou Yu Xingle Yin Hang 《China Communications》 2025年第5期189-203,共15页
Deep learning-based Joint Source-Channel Coding(JSCC)is a crucial component in semantic communication,and recent research has made significant progress in adapting to different channels.In this paper,we propose a mult... Deep learning-based Joint Source-Channel Coding(JSCC)is a crucial component in semantic communication,and recent research has made significant progress in adapting to different channels.In this paper,we propose a multi-stage progressive technique called Deep learning based Progressive Joint Source-Channel Coding(DP-JSCC).This approach partitions the source into multiple stages and transmits the signals continuously.The receiver gradually enhances the quality of image reconstruction by progressively receiving the signals,offering greater flexibility compared to existing dynamic rate transmission methods.The model adopts a lightweight architectural design,where we introduce an efficient module called the Inverted Shuffle Attention Bottleneck(ISAB)and incorporate self-attention mechanisms in the encoding and decoding process to capture signal correlations and establish long-range dependencies.Additionally,we introduce the Progressive Focus Weight Allocation(PFWA)method to improve the image reconstruction capability in progressive transmission tasks.These design enhance the expressive capacity of the model.Simulation results demonstrate that DP-JSCC can flexibly adjust the transmission rate according to requirements without the need for retraining or deployment,enabling continuous optimization of signals at different rates.Furthermore,compared to stateof-the-art JSCC methods,DP-JSCC exhibits advantages in terms of computational complexity,parameter count,and reconstruction performance. 展开更多
关键词 BROADCASTING joint source-channel coding progressive refinement wireless image transmission
在线阅读 下载PDF
Facial Video Semantic Coding for Semantic Communication
20
作者 Du Qiyuan Duan Yiping Tao Xiaoming 《China Communications》 2025年第6期83-100,共18页
Multimedia semantic communication has been receiving increasing attention due to its significant enhancement of communication efficiency.Semantic coding,which is oriented towards extracting and encoding the key semant... Multimedia semantic communication has been receiving increasing attention due to its significant enhancement of communication efficiency.Semantic coding,which is oriented towards extracting and encoding the key semantics of video for transmission,is a key aspect in the framework of multimedia semantic communication.In this paper,we propose a facial video semantic coding method with low bitrate based on the temporal continuity of video semantics.At the sender’s end,we selectively transmit facial keypoints and deformation information,allocating distinct bitrates to different keypoints across frames.Compressive techniques involving sampling and quantization are employed to reduce the bitrate while retaining facial key semantic information.At the receiver’s end,a GAN-based generative network is utilized for reconstruction,effectively mitigating block artifacts and buffering problems present in traditional codec algorithms under low bitrates.The performance of the proposed approach is validated on multiple datasets,such as VoxCeleb and TalkingHead-1kH,employing metrics such as LPIPS,DISTS,and AKD for assessment.Experimental results demonstrate significant advantages over traditional codec methods,achieving up to approximately 10-fold bitrate reduction in prolonged,stable head pose scenarios across diverse conversational video settings. 展开更多
关键词 facial video semantic coding semantic communications talking head video compression
在线阅读 下载PDF
上一页 1 2 250 下一页 到第
使用帮助 返回顶部