2,076 articles found
Human Perception of Group Synchronization Error in Remote Learning: Dependencies of Voice and Video Contents in One-Way Communication
1
Authors: Hay Mar Mo Mo Lwin, Yutaka Ishibashi, Khin Than Mya. 《International Journal of Communications, Network and System Sciences》, 2022, No. 3, pp. 31-42 (12 pages)
This paper examines how voice and video contents affect human perception of group (or inter-destination) synchronization error in remote learning, using Quality of Experience (QoE) assessment. The assessment uses two videos and three voices (two voices for one video and one voice for the other). We also investigate the influence of silence periods in the voices and of temporal relations between the voices and videos (called tightly-coupled and loosely-coupled contents here). The voices are spoken by a teacher according to the videos. Each subject, acting as a student, watches each lecture video with the corresponding explanation voice and then answers whether he or she perceives the group synchronization error. The results show that silence periods lower the perception rate of the error, and that the error is easier to perceive for tightly-coupled contents than for loosely-coupled ones.
Keywords: Remote Learning; Voice; Video; Group Synchronization Error; Human Perception; QoE Assessment
Application of Audio/Video Compressed Data Stream Playback Technology in the VB Environment
2
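Authors: 顾善发, 张中元. 《青岛建筑工程学院学报》, 2001, No. 3, pp. 56-59 (4 pages)
Describes how, on the Windows operating system, VB's built-in facilities and existing controls can be combined with flexible calls to Windows dynamic link libraries to carry out development
Keywords: MPEG; audio/video data stream; dynamic link library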
Real-time Audio & Video Transmission System Based on Visible Light Communication (Cited by: 3)
3
Authors: Yingjie He, Liwei Ding, Yuxian Gong, Yongjin Wang. 《Optics and Photonics Journal》, 2013, No. 2, pp. 153-157 (5 pages)
With the increasing popularity of solid-state lighting devices, Visible Light Communication (VLC) is globally recognized as an advanced and promising technology for short-range, high-speed, large-capacity wireless data transmission. In this paper, we propose a prototype of a real-time audio and video broadcast system using inexpensive, commercially available light emitting diode (LED) lamps. Experimental results show that real-time, high-quality audio and video can be achieved at a maximum distance of 3 m through proper layout of the LED sources and improved light concentration. A lighting model for a room environment is designed and simulated, and it indicates a close relationship between the layout of the light sources and the distribution of illuminance.
Keywords: Visible Light Communications; LED; Real-time Video and Audio Broadcast System; Light Source Arrangement; Illuminance Distribution
Integrating Audio-Visual Features and Text Information for Story Segmentation of News Video (Cited by: 1)
4
Authors: Liu Hua-yong, Zhou Dong-ru (School of Computer, Wuhan University, Wuhan 430072, Hubei, China). 《Wuhan University Journal of Natural Sciences》 (CAS), 2003, No. 04A, pp. 1070-1074 (5 pages)
Video data are composed of multimodal information streams, including visual, auditory and textual streams, so this paper describes an approach to story segmentation for news video using multimodal analysis. The proposed approach detects the topic-caption frames and integrates them with silence clip detection results and shot segmentation results to locate the news story boundaries. Integrating audio-visual features and text information overcomes the weakness of approaches that use only image analysis techniques. On test data with 135,400 frames, an accuracy rate of 85.8% and a recall rate of 97.5% are obtained when detecting the boundaries between news stories. The experimental results show that the approach is valid and robust.
Keywords: news video; story segmentation; audio-visual features analysis; text detection
Content-Based Hierarchical Analysis of News Video Using Audio and Visual Information
5
Authors: Yu Jun-qing, Zhou Dong-ru, Jin Ye, Liu Hua-yong. 《Wuhan University Journal of Natural Sciences》 (EI, CAS), 2001, No. 4, pp. 779-783 (5 pages)
A schema for content-based analysis of broadcast news video is presented. First, we separate commercials from news using audiovisual features. Then, we automatically organize news programs into a content hierarchy at various levels of abstraction by effectively integrating the video, audio, and text data available from the news programs. Based on these news video structure and content analysis technologies, a TV news video library is generated, from which users can retrieve specific news stories on demand.
Keywords: content-based; audio; news video; segmentation
Study on an Audio and Video Network Monitoring System for Weather Modification Operation
6
Authors: Yilin Wang, Xueyi Xu, Desheng Xu, Changzong Miao, Gang Zhao. 《Meteorological and Environmental Research》 (CAS), 2013, No. 1, pp. 5-7 (3 pages)
An audio and video network monitoring system for weather modification operations, which transmits information over 3G, ADSL and the Internet, has been developed and applied in the weather modification operations of Tai'an City. The 3G audio and video all-in-one machine tightly integrates all the front-end devices used for audio and video collection, communication, power supply and information storage. Its advantages include wireless video transmission, clear two-way voice intercom with the command center, waterproof and dustproof construction, simple operation, good portability, and long working hours. The system's compressed stream is transmitted with dynamic bandwidth, and the compression rate varies from 32 kbps to 4 Mbps under different network conditions. The system has a forwarding mode: monitoring information from each front-end monitoring point is transmitted to the command center server over 3G/ADSL, the server encodes and decodes it again, and back-end users then retrieve images from the server. This addresses the 3G network stoppage caused by many users calling front-end video at the same time. The system has been applied in surface weather modification operations in Tai'an City and has contributed greatly to transmitting operation orders in real time, monitoring, standardizing and recording the operating process, and improving operating safety.
Keywords: Weather modification operation; Network monitoring; Audio and video; Integration; China
Design and Implementation of Multilevel Access Control in Synchronized Audio to Audio Steganography Using Symmetric Polynomial Scheme
7
Authors: Jeddy Nafeesa Begum, Krishnan Kumar, Vembu Sumathy. 《Journal of Information Security》, 2010, No. 1, pp. 29-40 (12 pages)
Steganography techniques are used in multimedia data transfer to prevent adversaries from eavesdropping. Synchronized audio-to-audio steganography deals with recording the secret audio, hiding it in another audio file, and subsequently sending it to multiple receivers. This paper proposes multilevel access control in synchronized audio steganography, so that audio files meant for users of a lower-level class can be listened to by higher-level users, whereas the reverse is not allowed. To provide multilevel access control, a symmetric polynomial based scheme is used. The steganography scheme makes it possible to hide the audio in different bit locations of the host media without inviting suspicion. The secret file is embedded in a cover media with a key. At the receiving end, the key can be derived by all classes that are higher in the hierarchy using the symmetric polynomial, and the audio file is played. The system is implemented and found to be secure, fast and scalable. Simulation results show that the system is dynamic in nature and allows any type of hierarchy. The proposed approach performs well even during frequent member joins and leaves. The computation cost is reduced because the same algorithm is used for key computation and descendant key derivation. The steganography technique used in this paper does not use the conventional LSBs; it uses two bit positions, and the hidden data starts only from a frame dictated by the key. Hence the quality of the stego data is improved.
Keywords: Steganography; Multilevel Access Control; Synchronized Audio; Symmetric Polynomial; Dynamic; Scalable
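The embedding idea in the abstract above (two non-LSB bit positions, with hiding that starts at a key-dependent frame) can be illustrated with a minimal sketch. It is not the authors' scheme: the bit positions `(1, 2)`, the SHA-256 rule in `start_index`, and the function names `embed` and `extract` are all assumptions made for illustration, and the symmetric polynomial key derivation is left out entirely (the key is taken as already known).

```python
import hashlib

# Assumed for illustration: hide data in bits 1 and 2 of each 16-bit sample,
# leaving the least significant bit (bit 0) untouched.
BIT_POSITIONS = (1, 2)


def start_index(key: bytes, n_samples: int, n_needed: int) -> int:
    """Map the key to the first sample used for hiding (hypothetical rule)."""
    if n_needed > n_samples:
        raise ValueError("cover audio is too short for the secret")
    digest = hashlib.sha256(key).digest()
    return int.from_bytes(digest[:4], "big") % (n_samples - n_needed + 1)


def embed(cover: list[int], secret: bytes, key: bytes) -> list[int]:
    """Return a copy of `cover` (unsigned 16-bit samples) carrying `secret`."""
    bits = [(byte >> s) & 1 for byte in secret for s in range(7, -1, -1)]
    per_sample = len(BIT_POSITIONS)
    n_needed = len(bits) // per_sample  # 8 bits per byte, so this divides evenly
    start = start_index(key, len(cover), n_needed)
    stego = list(cover)
    for i in range(n_needed):
        sample = stego[start + i]
        for j, pos in enumerate(BIT_POSITIONS):
            bit = bits[i * per_sample + j]
            # Clear the target bit, then write the secret bit into it.
            sample = (sample & ~(1 << pos)) | (bit << pos)
        stego[start + i] = sample
    return stego


def extract(stego: list[int], n_bytes: int, key: bytes) -> bytes:
    """Recover `n_bytes` of hidden data using the same key."""
    per_sample = len(BIT_POSITIONS)
    n_needed = n_bytes * 8 // per_sample
    start = start_index(key, len(stego), n_needed)
    bits = [(stego[start + i] >> pos) & 1
            for i in range(n_needed) for pos in BIT_POSITIONS]
    out = bytearray()
    for k in range(0, len(bits), 8):
        value = 0
        for bit in bits[k:k + 8]:
            value = (value << 1) | bit
        out.append(value)
    return bytes(out)
```

In this sketch each secret byte occupies four consecutive samples, and a receiver without the key does not know where the hidden data begins. Each modified sample changes by at most 6, since only bits 1 and 2 are rewritten.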
audio and video
8
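Author: 杨承辉. 《语言教育》, 1993, No. 8, p. 32 (1 page)
Having seen a television advertisement for a certain brand of electrical appliance that contained the two words audio and video, I would like to borrow a corner of your journal to discuss them. Needless to say, both words relate to electrical appliances: audio has to do with "sound", and video with "image". Audio refers to frequencies produced by sound, mechanical means, or electricity (audio frequency); sound waves at these frequencies vibrate fifteen to twenty thousand times per second, the so-called low frequencies, which humans can hear. In electrical products, audio refers specifically to the sound-producing part of a record player, radio, or television set, and what people usually call sound equipment is audio equipment. Audio is therefore now used broadly for anything related to sound reproduction.
Keywords: audio; video; audio equipment; low frequency; television; twenty thousand; 迪安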
Audio Description for Educational Videos on COVID-19 Response: A Corpus-Based Study on Linguistic and Textual Idiosyncrasies
9
Author: XIONG Ling-song. 《Journal of Literature and Art Studies》, 2023, No. 4, pp. 276-285 (10 pages)
Audio description (AD), unlike interlingual translation and interpretation, is subject to unique constraints as a spoken text. With AD, educational videos on COVID-19 anti-virus measures are made accessible to the visually disadvantaged. In this study, a corpus of AD for COVID-19 educational videos is developed, named the "Audio Description Corpus of COVID-19 Educational Videos" (ADCCEV). Drawing on the Textual and Linguistic Audio Description Matrix (TLADM) model, this paper aims to identify the linguistic and textual idiosyncrasies of AD on the COVID-19 response released by the New Zealand Government. The study finds that, linguistically, the AD script uses a mix of complete sentences and phrases, most of them in the Present Simple tense. Present participles and the "with" structure are used for brevity. Vocabulary is diverse, with simpler words for animated explainers. Third-person pronouns are common in educational videos. Color words are a salient feature of AD, where "yellow" denotes urgency and "red" indicates importance, negativity, and hostility. As for textual idiosyncrasies, coherence is achieved through intermodal components that align with the video's mood and style. AD style varies with the video's purpose, from informative to narrative or expressive.
Keywords: audio description; COVID-19; educational videos; corpus-based study
Stylistic Analysis of Internet News: Taking Internet Video News and Internet Audio News as Examples
10
Author: 周逸轩. 《海外英语》, 2019, No. 9, pp. 212-213 (2 pages)
With the rapid development of the Internet around the world, networks now transmit all kinds of information to people. Net news, also called cyber news, is affecting how people express themselves in everyday English. A large number of cyber words, phrases and even sentences that differ from conventional English have formed and become popular in the cyber world. This paper discusses the different markers of net news, taking Internet video news and Internet audio news as examples, so that readers can fully understand the properties of net news.
Keywords: Internet News; Internet Video News; Internet Audio News; Stylistics; Features of Internet News
McIntosh (麦景图) MVP851 DVD Audio/Video Player
11
《音响世界》, 2003, No. 9, p. 7 (1 page)
Keywords: McIntosh (麦景图公司); MVP851; DVD audio/video player; features
跃威 USB VIDEO AUDIO Extender
12
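Author: Shawn. 《数字世界》, 2007, No. 8, p. 67 (1 page)
Honey, I Extended the Computer. Technological progress sometimes catches people off guard. While I was still hesitating over whether to spend a "fortune" on an HDTV, I found that a USB VIDEO AUDIO extender is all it takes to extend the computer in the study into the living room, and every problem was solved.
Keywords: video; audio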
DVD AUDIO/VIDEO Players
13
《音响世界》, 2004, No. 1, pp. 26-28 (3 pages)
Keywords: DVD audio/video player; SONY DVP-NS999ES; DENON DVD-2900; ONKYO DV-SP800
Agilent Launches the Audio/Video 8 Analog Card for the 93000 SOC System
14
《电子产品与技术》, 2004, No. 10, p. 87 (1 page)
Keywords: Agilent Technologies; audio/video8; SOC; 93000 series; analog test card
PIONEER (先锋) DV-S733A Progressive-Scan DVD AUDIO/VIDEO/SACD Player
15
《音响世界》, 2002, No. 8, p. 6 (1 page)
Keywords: Pioneer (先锋公司); DV-S733A; progressive scan; DVD audio/video/SACD
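Interview with 贾毅阳, Head of Professional Audio (Audio for Video) Sales for Sennheiser in Mainland China, and 储海涛, Head of Sales for Neumann in Mainland China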
16
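Author: 曹徐洋. 《现代电视技术》, 2023, No. 9, pp. 48-49 (2 pages)
During BIRTV2023, in the on-site interview room of 《现代电视技术》 at the China Media Group booth, this journal interviewed 贾毅阳, head of Professional Audio (Audio for Video) sales for Sennheiser in mainland China, and 储海涛, head of sales for Neumann in mainland China. The interview covered the two brands' product highlights, strengths, and market positioning. 曹徐洋: At this year's BIRTV exhibition, the Sennheiser and Neumann booths both displayed many excellent products. Which of them are the main launches? Please describe their key highlights.
Keywords: professional audio; Sennheiser; BIRTV; on-site interview; market positioning; audio; video; China Media Group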
The Audiovisual Sensory Trap of Short Videos: Mechanisms Influencing the Willingness to Adopt False Information
17
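Authors: 张志霞, 李洁, 董婳婳, 张新生. 《情报杂志》 (PKU Core), 2026, No. 1, pp. 161-167 (7 pages)
[Purpose] The multimodal nature of short-video dissemination heightens the risk that false information covertly manipulates users' cognition. To open the black box of the "sensory trap", this study explores how the audiovisual features of false short videos influence users' willingness to adopt information, with the aim of providing a theoretical basis for relevant institutions and platforms to govern false information. [Method] Based on the SOR theoretical framework, a chain-transmission model of "audiovisual features, emotional response, behavioral decision" is constructed. Using 723 false short videos from the Douyin platform as the sample, computer vision techniques quantify video color attributes, audio mining techniques parse acoustic features, and SnowNLP performs sentiment analysis of comments. Multiple regression analysis and the Bootstrap method test the direct effects of audiovisual features on users' willingness to adopt information and the mediating effect of emotion. [Results/Conclusions] Visual and auditory features affect willingness to adopt false information asymmetrically. Among visual features, the warm-color ratio significantly and positively drives willingness to adopt false information, whereas saturation and brightness have a negative, inhibiting effect; among auditory features, vocal loudness and music tempo both significantly reduce willingness to adopt false information. User emotion mediates between visual features and willingness to adopt information, but its effect on the transmission path of auditory features is not significant.
Keywords: short video; false information; audiovisual features; willingness to adopt information; user emotion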
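The Bootstrap test of a mediating effect mentioned in the abstract above can be sketched as follows. This is a generic percentile-bootstrap illustration under stated assumptions, not the authors' analysis: a single predictor `x` (for example one visual feature), a single mediator `m` (emotion), an outcome `y` (adoption willingness), and the function names `indirect_effect` and `bootstrap_ci` are all hypothetical.

```python
import random
from statistics import mean


def _cov(u, v):
    """Population covariance of two equal-length sequences."""
    mu, mv = mean(u), mean(v)
    return sum((a - mu) * (b - mv) for a, b in zip(u, v)) / len(u)


def indirect_effect(x, m, y):
    """Return a*b: a is the slope of M on X, b the coefficient of M in Y ~ X + M."""
    sxx, smm, sxm = _cov(x, x), _cov(m, m), _cov(x, m)
    sxy, smy = _cov(x, y), _cov(m, y)
    a = sxm / sxx
    # Closed-form OLS solution for the M coefficient with two predictors.
    b = (smy * sxx - sxy * sxm) / (sxx * smm - sxm ** 2)
    return a * b


def bootstrap_ci(x, m, y, n_boot=2000, alpha=0.05, seed=0):
    """Percentile bootstrap confidence interval for the indirect effect."""
    rng = random.Random(seed)
    n = len(x)
    estimates = []
    for _ in range(n_boot):
        # Resample cases with replacement, keeping (x, m, y) triples together.
        idx = [rng.randrange(n) for _ in range(n)]
        estimates.append(indirect_effect([x[i] for i in idx],
                                         [m[i] for i in idx],
                                         [y[i] for i in idx]))
    estimates.sort()
    lo = estimates[int(alpha / 2 * n_boot)]
    hi = estimates[int((1 - alpha / 2) * n_boot) - 1]
    return lo, hi
```

Under this convention, mediation is usually read as supported when the interval returned by `bootstrap_ci` excludes zero. A real analysis would also include control variables and several features at once, which this sketch omits.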
A Representative of All-Digital Multimedia Technology: The DVI (Digital Video Interactive) System (Cited by: 1)
18
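Author: 陈一民. 《上海大学学报(自然科学版)》 (CAS, CSCD), 1995, No. 5, pp. 557-563 (7 pages)
This paper outlines the development of Digital Video Interactive (DVI) technology. It discusses in detail the hardware structure and operating principles of DVI, the composition of the DVI system software platform, and the composition and principles of its core software, AVK.
Keywords: multimedia; Digital Video Interactive; all-digital multimedia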
Speech-Controlled Adversarial Generation of Talking-Face Video Based on Modal Affine Fusion
19
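Authors: 陈诗航, 孙玉宝. 《计算机工程》 (PKU Core), 2026, No. 2, pp. 393-403 (11 pages)
Generating talking-face video from speech is a current research hotspot. It involves processing both the audio and visual modalities, and the key problem is aligning lip motion during speech with the input audio. To address this problem, an end-to-end speech-controlled generative adversarial model for talking-face video is proposed, consisting mainly of a modal affine fusion generator, a visual quality discriminator, and a lip-sync discriminator. Through the modal affine fusion block (MAFBlock), the affine-fusion generator injects audio information during face feature decoding, effectively fusing audio and face information so that the audio better controls talking-face video generation. Spatial attention and channel attention mechanisms are introduced to strengthen the model's focus on local regions. The two discriminators improve generation quality and lip-sync rate: the lip-sync discriminator constrains lip motion by judging the similarity between audio and lip shape, giving finer control over lip movement without changing the overall contour or facial details, while the visual quality discriminator judges the realism of generated images to improve image quality. Comparative experiments against several representative existing models on two audiovisual datasets show that the model achieves an LSE-C score of 8.128 and an LSE-D score of 6.112 on the LRS2 validation set, improvements of 4.3% and 4.4% over the baseline, respectively; on the LRS3 validation set it achieves an LSE-C score of 7.963 and an LSE-D score of 6.259, improvements of 6.2% and 6.9% over the baseline, respectively.
Keywords: talking-face generation; video generation; lip synchronization; audio-driven generation; spatial attention; channel attention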
An Attention-Based Method for Audio-Driven Digital Face Video Generation
20
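Authors: 郭星星, 肖雁南, 温佩芝, 徐智, 黄文明. 《计算机科学》 (PKU Core), 2026, No. 2, pp. 245-252 (8 pages)
The main difficulty in audio-driven digital face video generation is aligning information from two different modalities, audio and video, to achieve lip-audio synchronization. Most existing techniques were developed on English datasets; because Chinese and English pronunciation differ, applying them directly to Chinese audio-driven digital face video generation yields blurred teeth and insufficient video sharpness. Building on the GAN framework, an attention-based method for audio-driven digital face video generation, M-CSAWav2Lip, is proposed. MFCC and Mel spectrogram features are fused for audio feature extraction, exploiting the temporal dynamics of MFCC and the frequency resolution of the Mel spectrogram to capture subtle variations in speech. During digital face generation, a network architecture based on attention mechanisms and residual connections is used; weighted channel and spatial attention emphasize important features and improve the capture of key audio and video features, so that Chinese audio and video information is effectively encoded and fused to generate lip movements and facial video matching the speech content. Finally, the method is trained and tested on a self-built Chinese dataset and on general-purpose datasets. Experimental results show that the lip-synchronized digital face videos it generates improve to some extent in both accuracy and quality.
Keywords: audio-driven; lip-audio synchronization; audio feature extraction; digital face generation; attention mechanism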