28,014 articles found
RSG-Conformer: ReLU-Based Sparse and Grouped Conformer for Audio-Visual Speech Recognition
1
Authors: Yewei Xiao, Xin Du, Wei Zeng 《Computers, Materials & Continua》 2026, No. 3, pp. 1325-1348 (24 pages)
Audio-visual speech recognition (AVSR), which integrates audio and visual modalities to improve recognition performance and robustness in noisy or adverse acoustic conditions, has attracted significant research interest. However, Conformer-based architectures remain computationally expensive because the space and time complexity of their softmax-based attention mechanisms grows quadratically with sequence length. In addition, Conformer-based architectures may not provide sufficient flexibility for modeling local dependencies at different granularities. To mitigate these limitations, this study introduces a novel AVSR framework based on a ReLU-based Sparse and Grouped Conformer (RSG-Conformer) architecture. Specifically, we propose a Global-enhanced Sparse Attention (GSA) module incorporating an efficient context restoration block to recover lost contextual cues. Concurrently, a Grouped-scale Convolution (GSC) module replaces the standard Conformer convolution module, providing adaptive local modeling across varying temporal resolutions. Furthermore, we integrate a Refined Intermediate Contextual CTC (RIC-CTC) supervision strategy. This approach applies progressively increasing loss weights combined with convolution-based context aggregation, thereby further relaxing the conditional independence constraint inherent in standard CTC frameworks. Evaluations on the LRS2 and LRS3 benchmarks validate the efficacy of our approach, with word error rates (WERs) reduced to 1.8% and 1.5%, respectively. These results demonstrate its state-of-the-art performance in AVSR tasks.
Keywords: audio-visual speech recognition; Conformer; CTC; sparse attention
Cultivation of Students’ Critical Thinking Ability in College English Audio-Visual and Oral Teaching (Cited by: 1)
2
Author: Hui Zhang 《Journal of Contemporary Educational Research》 2025, No. 6, pp. 36-41 (6 pages)
With the increasingly prominent trend of globalization, English, as the common language of international communication, plays an increasingly important role in university education. As a key link in English teaching, the college English audio-visual and oral course not only imparts language knowledge and skills, but also shoulders the important task of cultivating students’ critical thinking. As one of the essential core qualities of modern talents, critical thinking ability plays an irreplaceable role in deepening students’ understanding of English knowledge, improving their intercultural communication ability, and cultivating innovative thinking. This paper expounds the significance of cultivating students’ critical thinking ability in college English audio-visual and oral teaching. Drawing on practical teaching experience and cutting-edge education theory, it puts forward a series of innovative teaching strategies for cultivating that ability, in order to provide new ideas and practical guidance for improving college English teaching quality and developing students’ comprehensive quality.
Keywords: critical thinking ability college English audio-visual and oral teaching
Fully distributed edge-based adaptive Nash equilibrium seeking with input constraints
3
Authors: Shengli DU, Shuo LI, Tianli XU, Honggui HAN, Junfei QIAO 《Science China(Technological Sciences)》 2026, No. 3, pp. 214-224 (11 pages)
The present study investigates fully distributed Nash equilibrium (NE) seeking in networked non-cooperative games, with particular emphasis on actuator limitations. Existing distributed NE seeking approaches often overlook practical input constraints or rely on centralized information. To address these issues, a novel edge-based double-layer adaptive control framework is proposed. Specifically, adaptive scaling parameters are embedded into the edge weights of the communication graph, enabling a fully distributed scheme that avoids dependence on centralized or global knowledge. Every participant modifies its strategy by exclusively utilizing local information and communicating with its neighbors to iteratively approach the NE. By incorporating damping terms into the design of the adaptive parameters, the proposed approach effectively suppresses unbounded parameter growth and consequently guarantees the boundedness of the adaptive gains. In addition, to account for actuator saturation, the proposed distributed NE seeking approach incorporates a saturation function, which ensures that control inputs do not exceed allowable ranges. A rigorous Lyapunov-based analysis guarantees the convergence and boundedness of all system variables. Finally, simulation results validate the efficacy and theoretical soundness of the proposed approach.
Keywords: distributed NE seeking; networked games; bounded control input; double-layer adaptive law
Litter input manipulations differentially regulated CO₂, CH₄ and N₂O emissions from subalpine coniferous and broad-leaf forest soils
4
Authors: Baoshan Huang, Xiuxian Men, Yong Bao, Deping Zhai, Xiaoli Cheng 《Journal of Forestry Research》 2026, No. 2, pp. 158-171 (14 pages)
Soil greenhouse gas (GHG) emissions contribute profoundly to global warming; however, how plant detritus input alters GHG emissions is poorly understood. Here, we used detritus input and removal treatments (i.e., DIRT: control, CK; double litter, DL; no roots with double litter, NRDL; no litter, NL; no roots, NR; no roots and no litter, NRNL) to assess the effects of litter and root inputs on soil CO₂, CH₄, and N₂O fluxes in a coniferous (Pinus yunnanensis) and a broad-leaf (Quercus pannosa) forest in a subalpine region in southwestern China. Litter addition increased CO₂ emissions by an average of 22.22%, but did not significantly alter CH₄ uptake or N₂O emissions compared with the CK. Litter removal (NL and NRNL) significantly reduced CO₂ emissions by an average of 30.22% and N₂O emissions by an average of 31.16% from both forest soils, but did not significantly affect soil CH₄ uptake. Root removal (NR and NRNL) generally decreased all three soil GHG fluxes. Changes in β-1,4-glucosidase (BG) involved in C and phospholipid fatty acid (PLFAs) biomass were projected to influence CO₂ emissions, while soil microclimates (temperature and moisture) combined with BG activity mainly regulated CH₄ uptake. Alterations in dissolved organic nitrogen, microbial biomass nitrogen and BG were mainly responsible for changes in N₂O emissions. Interestingly, coniferous forest soil seemed to promote CH₄ uptake more than the broad-leaf forest soil, but CO₂ and N₂O fluxes were not significantly affected by forest type. As expected, litter addition significantly increased the warming potential, while litter removal lowered it. These findings reveal the divergent roles of plant detritus input and forest type in shaping soil GHG fluxes, thereby providing insights into forest management and into predicting the contributions of subalpine forests to global warming.
Keywords: detritus input and removal treatment; edaphic properties; microbial activities; soil greenhouse gas; subalpine forests
Deep Audio-visual Learning: A Survey (Cited by: 6)
5
Authors: Hao Zhu, Man-Di Luo, Rui Wang, Ai-Hua Zheng, Ran He 《International Journal of Automation and Computing》 EI CSCD 2021, No. 3, pp. 351-376 (26 pages)
Audio-visual learning, aimed at exploiting the relationship between audio and visual modalities, has drawn considerable attention since deep learning started to be used successfully. Researchers tend to leverage these two modalities to improve the performance of previously considered single-modality tasks or to address new challenging problems. In this paper, we provide a comprehensive survey of recent audio-visual learning development. We divide the current audio-visual learning tasks into four different subfields: audio-visual separation and localization, audio-visual correspondence learning, audio-visual generation, and audio-visual representation learning. State-of-the-art methods, as well as the remaining challenges of each subfield, are further discussed. Finally, we summarize the commonly used datasets and challenges.
Keywords: deep audio-visual learning; audio-visual separation and localization; correspondence learning; generative models; representation learning
Integrating Audio-Visual Features and Text Information for Story Segmentation of News Video (Cited by: 1)
6
Authors: Liu Hua-yong, Zhou Dong-ru (School of Computer, Wuhan University, Wuhan 430072, Hubei, China) 《Wuhan University Journal of Natural Sciences》 CAS 2003, No. 04A, pp. 1070-1074 (5 pages)
Video data are composed of multimodal information streams, including visual, auditory and textual streams, so this paper describes an approach to story segmentation for news video based on multimodal analysis. The proposed approach detects the topic-caption frames and integrates them with silence-clip detection results and shot segmentation results to locate the news story boundaries. The integration of audio-visual features and text information overcomes the weakness of approaches that use only image analysis techniques. On test data with 135,400 frames, the approach detects the boundaries between news stories with an accuracy rate of 85.8% and a recall rate of 97.5%. The experimental results show that the approach is valid and robust.
Keywords: news video; story segmentation; audio-visual features analysis; text detection
AV-FDTI: Audio-visual fusion for drone threat identification (Cited by: 1)
7
Authors: Yizhuo Yang, Shenghai Yuan, Jianfei Yang, Thien Hoang Nguyen, Muqing Cao, Thien-Minh Nguyen, Han Wang, Lihua Xie 《Journal of Automation and Intelligence》 2024, No. 3, pp. 144-151 (8 pages)
In response to the evolving challenges posed by small unmanned aerial vehicles (UAVs), which have the potential to transport harmful payloads or cause significant damage, we present AV-FDTI, an innovative Audio-Visual Fusion system designed for Drone Threat Identification. AV-FDTI leverages the fusion of audio and omnidirectional camera feature inputs, providing a comprehensive solution to enhance the precision and resilience of drone classification and 3D localization. Specifically, AV-FDTI employs a CRNN network to capture vital temporal dynamics within the audio domain and utilizes a pretrained ResNet50 model for image feature extraction. Furthermore, we adopt a visual information entropy and cross-attention-based mechanism to enhance the fusion of visual and audio data. Notably, our system is trained based on automated Leica tracking annotations, offering accurate ground truth data with millimeter-level accuracy. Comprehensive comparative evaluations demonstrate the superiority of our solution over the existing systems. In our commitment to advancing this field, we will release this work as open-source code and wearable AV-FDTI design, contributing valuable resources to the research community.
Keywords: audio-visual fusion; anti-UAV; multi-modal fusion; classification; 3D localization; self-attention
A Review on Audio-visual Translation Studies
8
Author: 李瑶 《语言与文化研究》 2008, No. 1, pp. 146-150 (5 pages)
This paper presents a thorough review of audio-visual translation studies in China and abroad. Reviewing foreign achievements in this specific field of translation studies can shed some light on audio-visual translation practice and research in China. The review of Chinese scholars’ audio-visual translation studies is intended to indicate potential directions and guidelines for further development, as well as aspects that have been neglected. Based on a summary of the relevant studies, possible topics for further study are proposed.
Keywords: audio-visual translation subtitling dubbing
Audio-visual emotion recognition with multilayer boosted HMM
9
Authors: 吕坤, 贾云得, 张欣 《Journal of Beijing Institute of Technology》 EI CAS 2013, No. 1, pp. 89-93 (5 pages)
Emotion recognition has become an important task of modern human-computer interaction. A multilayer boosted HMM (MBHMM) classifier for automatic audio-visual emotion recognition is presented in this paper. A modified Baum-Welch algorithm is proposed for component HMM learning, and adaptive boosting (AdaBoost) is used to train ensemble classifiers for different layers (cues). Except for the first layer, the initial weights of training samples in the current layer are decided by the recognition results of the ensemble classifier in the upper layer. Thus the training procedure using the current cue can focus more on the samples that were difficult for the previous cue. Our MBHMM classifier combines these ensemble classifiers and takes advantage of the complementary information from multiple cues and modalities. Experimental results on audio-visual emotion data collected in Wizard of Oz scenarios and labeled under two types of emotion category sets demonstrate that our approach is effective and promising.
Keywords: emotion recognition; audio-visual fusion; Baum-Welch algorithm; multilayer boosted HMM; Wizard of Oz scenario
The Audio-Visual Performance Highlighted Craze in Chicago During Chinese New Year
10
《China & The World Cultural Exchange》 2019, No. 2, pp. 38-39 (2 pages)
On February 10, 2019 (US Central Time), the China National Peking Opera Company (CNPOC) and the Hubei Chime Bells National Chinese Orchestra presented a fantastic audio-visual performance of Chinese Peking Opera and Chinese chime bells for the American audience at the world's top-level Buntrock Hall at Symphony Center.
Keywords: audio-visual performance Chicago Chinese New Year
Research on National Identity Based on National Audio-Visual Works: Taking Inner Mongolia as an Example
11
Authors: LIU Haitao, ZHANG Pei 《Cultural and Religious Studies》 2021, No. 8, pp. 391-396 (6 pages)
Mongolian audio-visual works are an important vehicle for exploring the true significance of this national culture. This paper argues that the Mongolian people in Inner Mongolia constantly strengthen individuals' sense of identity with the ethnic group as a whole through the influence of film, television and music, and on this basis continually develop a new culture in line with modern and contemporary life to further enhance their sense of belonging to the ethnic nation.
Keywords: Mongolian audio-visual works national identity
Application of Task-based Teaching Method to College Audio-visual English Teaching
12
Author: Liguo Shi 《International Journal of Technology Management》 2015, No. 9, pp. 65-67 (3 pages)
Based on the current situation of college audio-visual English teaching in China, this article points out that avoidance in class is a serious phenomenon in college audio-visual English teaching. After further analysis, and taking into account the characteristics of college English audio-visual teaching in China, it proposes applying the task-based teaching method to college audio-visual English teaching and sets out the steps involved, in an attempt to alleviate avoidance among students.
Keywords: task-based teaching method; college English; audio-visual English teaching
Integrating Zhuang Culture Into College English Audio-Visual Speaking Course: A Multicultural Perspective
13
Authors: LUO Mei, CHEN Yingzhu 《Cultural and Religious Studies》 2024, No. 12, pp. 801-805 (5 pages)
Zhuang culture, a representative of the native ethnic culture of Guangxi, China, is of great significance to Chinese culture. In order to promote traditional culture, enrich the teaching content of the College English Audio-Visual Speaking Course, and enhance the intercultural communication ability of college students, this paper, from a multicultural perspective, explores the classroom practices of integrating indigenous Zhuang cultural elements into the College English Audio-Visual Speaking Course, providing new perspectives and a reference for multicultural education in foreign languages.
Keywords: Zhuang culture; College English Audio-Visual Speaking Course; classroom practice; multicultural perspective
Teaching Strategies of Visual Interpretation and Audio-visual Interpretation
14
Author: DONG Yusa 《外文科技期刊数据库(文摘版)教育科学》 2021, No. 3, pp. 113-117 (5 pages)
This paper distinguishes audio-visual interpretation from visual interpretation and shows that the two belong to different categories in both essence and working method, so that learners do not misunderstand or confuse them. It also notes that there are some misconceptions in how the two are taught. The paper then explores teaching methods for visual interpretation and audio-visual interpretation, with the aim of making the teaching process more reasonable and scientific.
Keywords: audio-visual interpretation; visual interpretation; teaching
Prioritized MPEG-4 Audio-Visual Objects Streaming over the DiffServ
15
Authors: 黄天云, 郑婵 《Journal of Electronic Science and Technology of China》 2005, No. 4, pp. 314-320 (7 pages)
The object-based scalable coding in MPEG-4 is investigated, and a prioritized transmission scheme for MPEG-4 audio-visual objects (AVOs) over the DiffServ network with a QoS guarantee is proposed. MPEG-4 AVOs are extracted and classified into different groups according to their priority values and scalable layers (visual importance). These priority values are mapped to the IP DiffServ per-hop behaviors (PHBs). The scheme can selectively discard packets of low importance in order to avoid network congestion. Simulation results show that, compared with the 'best-effort' manner, the quality of the received video adapts gracefully to the network state. Also, by allowing the content provider to define the prioritization of each audio-visual object, the adaptive transmission of object-based scalable video can be customized based on the content.
Keywords: video streaming; quality of service (QoS); MPEG-4; audio-visual objects (AVOs); DiffServ; prioritization
Event-Triggered Adaptive Control of Noncanonical Nonlinear Systems With Hysteresis Inputs
16
Authors: Guanyu Lai, Kairong Zeng, Yonghua Wang, Tao Zhang, Hanzhen Xiao 《IEEE/CAA Journal of Automatica Sinica》 2025, No. 8, pp. 1739-1741 (3 pages)
Dear Editor, It is well known that event-triggered control (ETC) is an effective approach for addressing networked control problems in Industry 5.0. Its feasibility, however, has so far been restricted to canonical nonlinear systems. Considering this, a gradient-based adaptive ETC scheme for noncanonical nonlinear systems is newly developed in this letter, in which hysteresis input constraints are also considered. By proper decomposition, the technical issue of handling ETC-induced measurement errors and hysteresis inputs can be transformed into a problem of robustness to bounded disturbance-like terms, which is then addressed by integrating a switching modification strategy into the adaptive design and developing a novel augmented error-based analysis framework. Experimental results based on a practical piezoactuator confirm the effectiveness of the proposed scheme.
Keywords: hysteresis input constraints; event triggered control; adaptive control; hysteresis inputs; networked control problems; noncanonical nonlinear systems; gradient based adaptive scheme; canonical nonlinear systems
High-efficient single-phase, non-isolated, multi-input microinverter with common ground for photovoltaic systems
17
Authors: Anees Alhasi, Patrick Chi-Kwong Luk, Khalifa Aliyu Ibrahim, Zhenhua Luo 《Journal of Electronic Science and Technology》 2025, No. 4, pp. 46-64 (19 pages)
Single-phase, non-isolated microinverters used in photovoltaic (PV) systems commonly encounter two persistent challenges: high-frequency leakage current and fluctuating power delivery. This paper presents a novel single-phase, non-isolated, multi-input microinverter topology with a common-ground structure that effectively eliminates ground leakage current without requiring additional active components. The proposed microinverter architecture integrates a dual-boost configuration and uses only four active switches. The low component count helps enhance reliability, reduce cost, and simplify the overall system design. With one, two, or four PV inputs, it can operate without interruption under unbalanced voltage or partial shading, and even if some inputs drop to zero. A tailored modulation scheme minimizes conduction losses while maintaining a stable direct-current (DC)-link voltage, and a decoupling capacitor efficiently absorbs the single-phase pulsating power, thus overcoming one major limitation in existing microinverter designs. Validated with a 1-kW GaN-based prototype, both the simulated and experimental results demonstrate its high efficiency, robustness, and practical suitability for cost-effective PV applications, with a peak efficiency of 94.8%.
Keywords: dual-boost; leakage current elimination; multiple input microinverter; non-isolated; photovoltaic; single-boost
Investigation of SAW heat input on modified 9Cr-1Mo steel: microstructure, mechanical properties, and residual stress
18
Authors: Joydeep Roy, Pritam Das, Raja Chakrabarti 《China Welding》 2025, No. 3, pp. 207-216 (10 pages)
This study investigates the impact of welding heat input on weldments of modified 9Cr-1Mo (P91) steel, a high-strength material that requires high-energy welding processes such as submerged arc welding. In the as-welded condition, P91 steel welds primarily consist of untempered martensite, which transforms into tempered martensite during post-weld heat treatment (PWHT). Electron spectroscopy analysis reveals the presence of M₂₃C₆ and MX carbonitride precipitates at grain boundaries. Increasing the heat input leads to greater quantities of precipitates in the prior austenite grain boundaries, which can affect material properties. Weldment hardness profiles exhibit modest improvements, while ultimate tensile strength and toughness decrease with higher welding heat input, potentially due to the formation of a ferritic phase. Residual stress distributions are noticeably influenced by the welding heat input level.
Keywords: P91 steel; heat input; microstructure; mechanical properties; residual stress