With the continuous advancement of unmanned technology in various application domains, the development and deployment of blind-spot-free panoramic video systems have gained increasing importance. Such systems are particularly critical in battlefield environments, where advanced panoramic video processing and wireless communication technologies are essential to enable remote control and autonomous operation of unmanned ground vehicles (UGVs). However, conventional video surveillance systems suffer from several limitations, including limited field of view, high processing latency, low reliability, excessive resource consumption, and significant transmission delays. These shortcomings impede the widespread adoption of UGVs in battlefield settings. To overcome these challenges, this paper proposes a novel multi-channel video capture and stitching system designed for real-time video processing. The system integrates the Speeded-Up Robust Features (SURF) algorithm and the Fast Library for Approximate Nearest Neighbors (FLANN) algorithm to execute essential operations such as feature detection, descriptor computation, image matching, homography estimation, and seamless image fusion. The fused panoramic video is then encoded and assembled to produce a seamless output devoid of stitching artifacts and shadows. Furthermore, H.264 video compression is employed to reduce the data size of the video stream without sacrificing visual quality. Using the Real-Time Streaming Protocol (RTSP), the compressed stream is transmitted efficiently, supporting real-time remote monitoring and control of UGVs in dynamic battlefield environments. Experimental results indicate that the proposed system achieves high stability, flexibility, and low latency. With a wireless link latency of 30 ms, the end-to-end video transmission latency remains around 140 ms, enabling smooth video communication. The system can tolerate packet loss rates (PLR) of up to 20% while maintaining usable video quality (with latency around 200 ms). These properties make it well-suited for mobile communication scenarios demanding high real-time video performance.
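The matching and homography steps named in this abstract can be illustrated with a small sketch. It is not the paper's implementation: a brute-force nearest-neighbour search stands in for FLANN, toy 2-D tuples stand in for SURF descriptors, and the ratio test with its 0.75 threshold is an assumed filtering step that the abstract does not mention. The function names `match_descriptors` and `apply_homography` are hypothetical.

```python
import math


def match_descriptors(desc_a, desc_b, ratio=0.75):
    """Brute-force nearest-neighbour matching with a ratio test.

    Returns (index_in_a, index_in_b) pairs. The 0.75 ratio is an
    illustrative assumption, not a value taken from the paper.
    """
    matches = []
    for i, da in enumerate(desc_a):
        # Distance from this descriptor to every candidate in the other image.
        dists = sorted((math.dist(da, db), j) for j, db in enumerate(desc_b))
        if len(dists) < 2:
            continue
        (best, j), (second, _) = dists[0], dists[1]
        # Keep the match only if the best candidate is clearly closer
        # than the runner-up; ambiguous matches are dropped.
        if best < ratio * second:
            matches.append((i, j))
    return matches


def apply_homography(H, point):
    """Map an (x, y) point through a 3x3 homography given as nested lists."""
    x, y = point
    u = H[0][0] * x + H[0][1] * y + H[0][2]
    v = H[1][0] * x + H[1][1] * y + H[1][2]
    w = H[2][0] * x + H[2][1] * y + H[2][2]
    return (u / w, v / w)


# Toy descriptors: the third left descriptor is nearly equidistant from two
# right descriptors, so the ratio test rejects it.
desc_left = [(0.0, 0.0), (5.0, 5.0), (2.0, 2.0)]
desc_right = [(0.1, 0.0), (5.0, 5.1), (4.0, 4.0)]
pairs = match_descriptors(desc_left, desc_right)

# A pure translation by 100 pixels in x, written as a homography.
shift = [[1.0, 0.0, 100.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0]]
warped = apply_homography(shift, (10.0, 20.0))
```

In a real stitching pipeline the homography would be estimated from the surviving matches (typically with a robust estimator) rather than written by hand as it is here.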
Scalable simulation leveraging real-world data plays an essential role in advancing autonomous driving, owing to its efficiency and applicability in both training and evaluating algorithms. Consequently, there has been increasing attention on generating highly realistic and consistent driving videos, particularly those involving viewpoint changes guided by the control commands or trajectories of ego vehicles. However, current reconstruction approaches, such as Neural Radiance Fields and 3D Gaussian Splatting, frequently suffer from limited generalization and depend on substantial input data. Meanwhile, 2D generative models, though capable of producing unknown scenes, still have room for improvement in terms of coherence and visual realism. To overcome these challenges, we introduce GenScene, a world model that synthesizes front-view driving videos conditioned on trajectories. A new temporal module is presented to improve video consistency by extracting the global context of each frame, calculating relationships of frames using these global representations, and fusing frame contexts accordingly. Moreover, we propose an innovative attention mechanism that computes relations of pixels within each frame and pixels in the corresponding window range of the initial frame. Extensive experiments show that our approach surpasses various state-of-the-art models in driving video generation, and the introduced modules contribute significantly to model performance. This work establishes a new paradigm for goal-oriented video synthesis in autonomous driving, which facilitates on-demand simulation to expedite algorithm development.
Background: This study aims to investigate the underlying mechanisms between parental marital conflict and adolescent short video dependence by constructing a chain mediation model, focusing on the mediating roles of experiential avoidance and emotional disturbance (anxiety, depression, and stress). Methods: Conducted in January 2025, the research recruited 4125 adolescents from multiple Chinese provinces through convenience sampling; after data cleaning, 3957 valid participants (1959 males, 1998 females) were included. Using a cross-sectional design, measures included parental marital conflict, experiential avoidance, anxiety, depression, stress, and short video dependence. Results: Pearson correlation analysis revealed significant positive correlations among all variables. Mediation analysis using the SPSS PROCESS macro showed that parental marital conflict directly predicted short video dependence (β=0.269, p<0.001), and also significantly predicted experiential avoidance (β=0.519, p<0.001), anxiety (β=0.072, p<0.001), depression (β=0.067, p<0.001), and stress (β=0.048, p<0.05). Experiential avoidance further predicted anxiety (β=0.521, p<0.001), depression (β=0.489, p<0.001), stress (β=0.408, p<0.001), and short video dependence (β=0.244, p<0.001). While both anxiety (β=0.050, p<0.05) and depression (β=0.116, p<0.001) positively predicted short video dependence, stress did not (β=0.019, p=0.257). Overall, experiential avoidance, anxiety, depression, and stress significantly mediated the relationship between parental marital conflict and short video dependence. Conclusion: These findings confirm that parental marital conflict not only directly influences adolescent short video dependence but also operates through a chain mediation pathway involving experiential avoidance and emotional disturbance, highlighting central psychological mechanisms and providing theoretical support for integrated mental health and behavioral interventions.
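As a reading aid for the chain mediation result, the sketch below multiplies the standardized path coefficients quoted in the abstract along each pathway. This is the standard product-of-coefficients point estimate only. The abstract does not report these products or any confidence intervals for them, so the numbers are illustrative arithmetic, not results from the study. The dictionary keys and the `indirect_effect` helper are hypothetical names.

```python
from math import prod

# Standardized path coefficients as quoted in the abstract.
paths = {
    ("conflict", "avoidance"): 0.519,
    ("avoidance", "dependence"): 0.244,
    ("avoidance", "anxiety"): 0.521,
    ("avoidance", "depression"): 0.489,
    ("anxiety", "dependence"): 0.050,
    ("depression", "dependence"): 0.116,
}


def indirect_effect(chain):
    """Product of the path coefficients along a chain of variables."""
    return prod(paths[(a, b)] for a, b in zip(chain, chain[1:]))


# Simple mediation through experiential avoidance.
via_avoidance = indirect_effect(["conflict", "avoidance", "dependence"])

# Chain mediation: avoidance first, then an emotional-disturbance variable.
via_anxiety = indirect_effect(
    ["conflict", "avoidance", "anxiety", "dependence"])
via_depression = indirect_effect(
    ["conflict", "avoidance", "depression", "dependence"])
```

Under these point estimates the simple path through experiential avoidance (about 0.127) is larger than either chained path (about 0.014 via anxiety and 0.029 via depression); significance would require the bootstrap intervals that the abstract does not give.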
Background: In the Chinese context, the impact of short video applications on the psychological well-being of older adults is contested. While often examined through a pathological lens of addiction, this perspective may overlook paradoxical, context-dependent positive outcomes. Therefore, the main objective of this study is to challenge the traditional Compensatory Internet Use Theory by proposing and testing a chained mediation model that explores a paradoxical pathway from social support to life satisfaction via problematic social media use. Methods: Data were collected between July and August 2025 via the Credamo online survey platform, yielding 384 valid responses from Chinese older adults aged 60 and above. Key constructs were assessed using the Social Support Rating Scale (SSRS), Bergen Social Media Addiction Scale (BSMAS), Simplified UCLA Loneliness Scale, and Satisfaction with Life Scale (SWLS). A chained mediation model was tested using stepwise regression and non-parametric bootstrapping (5000 resamples), controlling for age, gender, household income, and health status. Results: The analysis revealed a paradoxical pathway, which was clarified by a key statistical suppression effect. Social support significantly and positively predicted problematic usage (β=0.157, p=0.002). After controlling for the suppressor effect of social support, problematic usage in turn negatively predicted social connectedness (β=−0.177, p<0.001). Finally, reduced social connectedness, reflecting a state of solitude, positively predicted life satisfaction (β=−0.227, p<0.001). Conclusion: The findings suggest that for older adults with sufficient offline social support, these resources may serve a “social empowerment” function. This empowerment allows behaviors measured as “problematic usage” to be theoretically reframed as a form of “deep immersive entertainment”. This immersion appears to occur alongside a state of “high-quality solitude”, which ultimately is associated with higher life satisfaction. This study provides a novel, non-pathological theoretical perspective on the consequences of high engagement with emerging social media, offering empirical grounds for non-abstinence-based intervention strategies.
Multimedia semantic communication has been receiving increasing attention due to its significant enhancement of communication efficiency. Semantic coding, which is oriented towards extracting and encoding the key semantics of video for transmission, is a key aspect in the framework of multimedia semantic communication. In this paper, we propose a facial video semantic coding method with low bitrate based on the temporal continuity of video semantics. At the sender’s end, we selectively transmit facial keypoints and deformation information, allocating distinct bitrates to different keypoints across frames. Compressive techniques involving sampling and quantization are employed to reduce the bitrate while retaining facial key semantic information. At the receiver’s end, a GAN-based generative network is utilized for reconstruction, effectively mitigating block artifacts and buffering problems present in traditional codec algorithms under low bitrates. The performance of the proposed approach is validated on multiple datasets, such as VoxCeleb and TalkingHead-1kH, employing metrics such as LPIPS, DISTS, and AKD for assessment. Experimental results demonstrate significant advantages over traditional codec methods, achieving up to approximately 10-fold bitrate reduction in prolonged, stable head pose scenarios across diverse conversational video settings.
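To make the idea of allocating distinct bitrates to different keypoints concrete, here is a minimal sketch of uniform scalar quantization with a per-keypoint bit depth. It rests on assumptions that are not in the abstract: keypoint coordinates are normalized to [-1, 1], the quantizer is uniform, and the bit depths are chosen by hand. The helper names are hypothetical, and the paper's actual sampling and quantization scheme may differ.

```python
def quantize(value, bits, lo=-1.0, hi=1.0):
    """Map a coordinate in [lo, hi] to an integer code of the given bit depth."""
    levels = (1 << bits) - 1
    clipped = min(max(value, lo), hi)
    return round((clipped - lo) / (hi - lo) * levels)


def dequantize(code, bits, lo=-1.0, hi=1.0):
    """Recover an approximate coordinate from its integer code."""
    levels = (1 << bits) - 1
    return lo + code / levels * (hi - lo)


def encode_keypoints(keypoints, bits_per_point):
    """Quantize each (x, y) keypoint with its own bit depth."""
    return [(quantize(x, b), quantize(y, b))
            for (x, y), b in zip(keypoints, bits_per_point)]


# Hypothetical frame: the first keypoint (e.g. a mouth corner) gets more
# bits than the second, which is assumed to move less.
keypoints = [(0.30, -0.42), (0.75, 0.10)]
bits_per_point = [8, 4]
codes = encode_keypoints(keypoints, bits_per_point)
payload_bits = sum(2 * b for b in bits_per_point)
decoded = [(dequantize(cx, b), dequantize(cy, b))
           for (cx, cy), b in zip(codes, bits_per_point)]
```

The reconstruction error of each coordinate is bounded by half a quantization step, so lowering the bit depth of a keypoint trades precision for bitrate in a predictable way.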
The application of short videos in agricultural scenarios has become a new form of productive force driving agricultural development, injecting new vitality and opportunities into traditional agriculture. These videos leverage the unique expressive logic of the platform by adopting a small entry point and prioritizing dissemination rate. They are strategically planned in terms of content, visuals, and interaction to cater to users' needs for relaxation, knowledge acquisition, social sharing, agricultural product marketing, and talent display. Through careful design, full creativity, rich emotion, and the creation of distinct character personalities, these videos deliver positive, entertaining, informative, and opinion-driven agricultural content. The production and operation of agricultural short videos can be effectively optimized by analyzing the characteristics of both popular and less popular videos, and by utilizing smart tools and trending topics.
Objectives: Medical students often rely on recreational internet media to relieve the stress caused by immense academic and life pressures, and among these media, short-form videos, which are an emerging digital medium, have gradually become the mainstream choice of students to relieve their stress. However, the addiction caused by their usage has attracted the widespread attention of both academia and society, which is why the purpose of this study is to systematically explore the underlying mechanisms that link perceived stress, entertainment gratification, emotional gratification, short-form video usage intensity, and short-form video addiction based on multiple theoretical frameworks including the Compensatory Internet Use Model (CIU), the Interaction of Person-Affect-Cognition-Execution Model (I-PACE), and the Use and Gratification Theory (UGT). Methods: A hypothetical model with 9 research hypotheses was constructed. Taking medical students from Chinese universities as the research subjects, 1057 valid responses were collected through an online questionnaire survey, including 358 males and 658 females. Structural equation modelling (SEM) was performed using the AMOS software to test the research hypotheses. Results: (1) Perceived stress positively predicted entertainment gratification and emotional gratification (β=0.72, p<0.001; β=0.61, p<0.001); (2) Entertainment gratification and emotional gratification positively influenced short-form video usage intensity (β=0.35, p<0.001; β=0.19, p<0.001); (3) Entertainment gratification and emotional gratification positively predicted short-form video addiction (β=0.40, p<0.001; β=0.17, p<0.001); (4) Short-form video usage intensity positively influenced short-form video addiction (β=0.36, p<0.001); and (5) Perceived stress exerted an indirect but positive effect on both short-form video usage intensity and short-form video addiction, mediated by entertainment and emotional gratification (β=0.37, p<0.001; β=0.52, p<0.001). Conclusion: The mechanisms that underlie medical students’ short-form video addiction in stressful situations were revealed in this study. It was found that stress enhances medical students’ need for entertainment and emotional online compensation, prompting more frequent short-form video usage and ultimately leading to addiction. These results underscore the need to address the stressors faced by medical students. Effective interventions should prioritise stress management strategies and promote healthier alternative coping mechanisms to mitigate the risk of addiction.
Video action recognition (VAR) aims to analyze dynamic behaviors in videos and achieve semantic understanding. VAR faces challenges such as temporal dynamics, action-scene coupling, and the complexity of human interactions. Existing methods can be categorized into motion-level, event-level, and story-level ones based on spatiotemporal granularity. However, single-modal approaches struggle to capture complex behavioral semantics and human factors. Therefore, in recent years, vision-language models (VLMs) have been introduced into this field, providing new research perspectives for VAR. In this paper, we systematically review spatiotemporal hierarchical methods in VAR and explore how the introduction of large models has advanced the field. Additionally, we propose the concept of “Factor” to identify and integrate key information from both visual and textual modalities, enhancing multimodal alignment. We also summarize various multimodal alignment methods and provide in-depth analysis and insights into future research directions.
Objectives: Short video addiction has emerged as a significant public health issue in recent years, with a growing trend toward severity. However, research on the causes and impacts of short video addiction remains limited, and understanding of the variable “TikTok brain” is still in its infancy. Therefore, based on the Stimulus-Organism-Behavior-Consequence (SOBC) framework, we proposed six research hypotheses and constructed a model to explore the relationships between short video usage intensity, TikTok brain, short video addiction, and decreased attention control. Methods: Given that students are considered a high-risk group for excessive short video use, we recruited 1086 valid participants from Chinese student users, including 609 males (56.1%) and 477 females (43.9%), with an average participant age of 19.84 years, to test the hypotheses. Results: (1) Short video usage intensity was positively related to short video addiction, TikTok brain, and decreased attention control; (2) TikTok brain was positively related to short video addiction and decreased attention control; and (3) Short video addiction was positively related to decreased attention control. Conclusions: These findings suggest that although excessive use of short video applications brings negative consequences, users still spend significant amounts of time on these platforms, indicating a need for strict self-regulation of usage time.
Video synopsis is an effective way to summarize long surveillance recordings. The omnidirectional view allows the observer to select the desired field of view (FoV) from the different FoVs available for spherical surveillance video. By choosing to watch one portion, the observer misses out on the events occurring elsewhere in the spherical scene. This causes the observer to experience fear of missing out (FOMO). Hence, a novel personalized video synopsis approach for the generation of non-spherical videos has been introduced to address this issue. It also includes an action recognition module that makes it easy to display necessary actions by prioritizing them. This work jointly optimizes multiple objectives, minimizing some and maximizing others, including the activity loss, collision, temporal consistency, length, show, and important action costs. The performance of the proposed framework is evaluated through extensive simulation and compared with state-of-the-art video synopsis optimization algorithms. Experimental results suggest that some constraints are better optimized by using the latest metaheuristic optimization algorithms to generate compact personalized synopsis videos from spherical surveillance videos.
Internal learning-based video inpainting methods have shown promising results by exploiting the intrinsic properties of the video to fill in the missing region without external dataset supervision. However, existing internal learning-based video inpainting methods tend to produce inconsistent structures or blurry textures due to insufficient utilisation of motion priors within the video sequence. In this paper, the authors propose a new internal learning-based video inpainting model called appearance consistency and motion coherence network (ACMC-Net), which not only learns the recurrence of the appearance prior but also captures the motion coherence prior to improve the quality of the inpainting results. In ACMC-Net, a transformer-based appearance network is developed to capture global context information within the video frame for representing appearance consistency accurately. Additionally, a novel motion coherence learning scheme is proposed to learn the motion prior in a video sequence effectively. Finally, the learnt internal appearance consistency and motion coherence are implicitly propagated to the missing regions to complete the inpainting. Extensive experiments conducted on the DAVIS dataset show that the proposed model obtains superior performance in terms of quantitative measurements and produces more visually plausible results compared with the state-of-the-art methods.
The rapid development of short video platforms poses new challenges for traditional recommendation systems. Recommender systems typically depend on two types of user behavior feedback to construct user interest profiles: explicit feedback (interactive behavior), which significantly influences users’ short-term interests, and implicit feedback (viewing time), which substantially affects their long-term interests. However, the previous model fails to distinguish between these two feedback methods, leading it to predict only the overall preferences of users based on extensive historical behavior sequences. Consequently, it cannot differentiate between users’ long-term and short-term interests, resulting in low accuracy in describing users’ interest states and predicting the evolution of their interests. This paper introduces a video recommendation model called CAT-MFRec (Cross Attention Transformer-Mixed Feedback Recommendation) designed to differentiate between explicit and implicit user feedback within the DIEN (Deep Interest Evolution Network) framework. This study emphasizes the separate learning of the two types of behavioral feedback, effectively integrating them through the cross-attention mechanism. Additionally, it leverages the long sequence dependence capabilities of Transformer technology to accurately construct user interest profiles and predict the evolution of user interests. Experimental results indicate that CAT-MFRec significantly outperforms existing recommendation methods across various performance indicators. This advancement offers new theoretical and practical insights for the development of video recommendations, particularly in addressing complex and dynamic user behavior patterns.
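The cross-attention step that integrates the two feedback streams can be sketched with plain scaled dot-product attention. This is a generic illustration, not CAT-MFRec itself: it omits learned projection matrices and multiple heads, and it assumes that explicit-feedback embeddings act as queries over implicit-feedback keys and values, a direction the abstract does not specify. The toy embeddings are invented for the example.

```python
import math


def softmax(xs):
    """Numerically stable softmax over a list of floats."""
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]


def cross_attention(queries, keys, values):
    """Scaled dot-product attention from one sequence onto another.

    Each query attends over all keys; the output row is the
    attention-weighted average of the value vectors.
    """
    d = len(keys[0])
    out = []
    for q in queries:
        scores = [sum(qi * ki for qi, ki in zip(q, k)) / math.sqrt(d)
                  for k in keys]
        weights = softmax(scores)
        out.append([sum(w * v[c] for w, v in zip(weights, values))
                    for c in range(len(values[0]))])
    return out


# Toy embeddings (assumed): one explicit-feedback event attends over
# two implicit-feedback (viewing-time) events.
explicit = [[1.0, 0.0]]
implicit_keys = [[1.0, 1.0], [1.0, 1.0]]
implicit_values = [[2.0, 0.0], [0.0, 2.0]]
fused = cross_attention(explicit, implicit_keys, implicit_values)
```

Because both keys are identical in this toy input, the attention weights are uniform and the fused vector is the plain average of the two value vectors; distinct keys would shift the weights toward the more similar event.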
Airway management plays a crucial role in providing adequate oxygenation and ventilation to patients during various medical procedures and emergencies. When patients have a limited mouth opening due to factors such as trauma, inflammation, or anatomical abnormalities, airway management becomes challenging. A commonly utilized method to overcome this challenge is the use of video laryngoscopy (VL), which employs a specialized device equipped with a camera and a light source to allow a clear view of the larynx and vocal cords. VL overcomes the limitations of direct laryngoscopy in patients with limited mouth opening, enabling better visualization and successful intubation. Various types of VL blades are available. We devised a novel flangeless video laryngoscope for use in patients with a limited mouth opening and then tested it on a manikin.
Semantic segmentation is a core task in computer vision that allows AI models to interact with and understand their surrounding environment. Similarly to how humans subconsciously segment scenes, this ability is crucial for scene understanding. However, a challenge many semantic learning models face is the lack of data. Existing video datasets are limited to short, low-resolution videos that are not representative of real-world examples. Thus, one of our key contributions is a customized semantic segmentation version of the Walking Tours Dataset that features hour-long, high-resolution, real-world data from tours of different cities. Additionally, we evaluate the performance of the open-vocabulary semantic model OpenSeeD on our custom dataset and discuss future implications.
Objective: The purpose of this study was to evaluate health education using videos and leaflets for preconception care (PCC) awareness among adolescent females up to six months after the health education. Methods: The subjects were female university students living in the Kinki area. A longitudinal survey was conducted on 67 members in the intervention group, who received the health education, and 52 members in the control group, who did not receive the health education. The primary outcome measures were knowledge of PCC and the subscales of the Health Promotion Lifestyle Profile. Surveys were conducted before, after, and six months after the intervention in the intervention group, and an initial survey and survey six months later were conducted in the control group. Cochran’s Q test, Bonferroni’s multiple comparison test, and McNemar’s test were used to analyze the knowledge of PCC data. The Health Awareness, Nutrition, and Stress Management subscales of the Health Promotion Lifestyle Profile were analyzed by paired t-test, and comparisons between the intervention and control groups were performed using the two-way repeated measures analysis of variance. Results: In the intervention group of 67 people, the number of subjects who answered “correct” for five of the nine items concerning knowledge of PCC increased immediately after the health education (P = 0.006) but decreased for five items from immediately after the health education to six months later (P = 0.043). In addition, the number of respondents who answered “correct” for “low birth weight infants and future lifestyle-related diseases” (P = 0.016) increased after six months compared with before the health education. For the 52 subjects in the control group, there was no change in the number of subjects who answered “correct” for eight out of the nine items after six months. There was also no increase in scores for the Health Promotion Lifestyle Profile after six months for either the intervention or control group. Conclusion: Providing health education about PCC using videos and leaflets to adolescent females was shown to enhance the knowledge of PCC immediately after the education.
Objective: The objective of this study is to determine the effect of nurse-led instructional video (NLIV) on anxiety, satisfaction, and recovery among mothers admitted for cesarean section (CS). Materials and Methods: A quasi-experimental design was carried out on the mothers scheduled for CS. Eighty participants were selected by a purposive sampling technique and divided (40 participants in each group) into an experimental group and a control group. The NLIV was shown to the experimental group, and routine care was provided for the control group. Modified hospital anxiety scale (HADS), scale for measuring maternal satisfaction in cesarean birth, and obstetric quality of recovery following cesarean delivery were used to assess anxiety, satisfaction, and recovery. Results: Both the experimental and control groups showed significant reductions in anxiety by the first postintervention day (P<0.001), with the experimental group experiencing a greater mean reduction (mean difference [MD]=4.37) than the control group (MD=3.35), but the intergroup difference was not statistically significant (P>0.05). The experimental group reported significantly higher satisfaction scores (175.55±9.42) on the 3rd postoperative day compared to the control group (151.93±14.89; P<0.001). Similarly, the experimental group’s recovery scores (79.90±6.24) were considerably higher than those of the control group (62.45±15.18; P<0.001). On the 3rd postintervention day, satisfaction was significantly associated with age (P<0.001), and recovery with gravidity (P<0.05). Conclusions: NLIV can be used in the preoperative period to reduce anxiety related to CS and to improve satisfaction and recovery after the CS.
Objective: While there is consensus regarding a positive effect of video gaming on dexterity, little is known regarding how much traditional laparoscopic practice can or should be substituted with video gaming. This study was designed to assess the effects of varying the amount of traditional practice in a lap box trainer and video gaming on performance in two fundamentals of laparoscopic surgery core tasks. Methods: Undergraduate and medical students were recruited and randomized into one of four groups: a control group, a lap box group, a video game group, and a combined group with 50% of the time allocated to each modality. Performance in the peg transfer and precision cutting tasks was assessed both prior to and following the 6 training sessions. Results: Peg transfer performance significantly improved in the lap box group (168.4±70.6 s vs. 332.9±178.2 s, p<0.001), video game group (176.7±53.3 s vs. 300.0±101.2 s, p<0.001) and combined group (214.2±86.9 s vs. 406.8±239.5 s, p=0.002) after training. Similar improvements were also observed in precision cutting performance in the lap box group (413.1±138.4 s vs. 614.3±211.4 s, p=0.002), video game group (434.1±150.8 s vs. 609.2±233.2 s, p=0.007) and combined group (469.2±185.3 s vs. 663.8±296.3 s, p=0.020). When analyzing improvements in performance across three different training groups compared with the control group, we found that both the lap box group (p<0.001) and the combined group (p<0.001) showed better improvement in both tasks, and the video game group had significantly better outcomes in the precision cutting task (p=0.003). Conclusion: Traditional lap box training remains the most effective method for improving the performance of simulated laparoscopic surgery. Video games can be encouraged to enhance skills retention and supplement simulated practice outside of a formal training curriculum.
Against the background of the competency-based transformation of higher education, the bioengineering major urgently needs to solve problems such as the disconnect between practical teaching and industry needs and the lack of resources. This paper proposed supplementing traditional experimental teaching with video resources to construct a closed-loop model of "theoretical instruction, case analysis, video demonstration, and reflective application". Through the development of instructional videos covering core techniques such as PCR, Western blot, CRISPR-Cas9, cell culture, HPLC, GMP operations, and bioinformatics analysis, teaching costs can be reduced, spatiotemporal constraints can be overcome, and process visualization can be enhanced, thereby supporting students in mastering the entire workflow of modern biomanufacturing. The paper further explored resource development pathways, university-enterprise collaboration mechanisms, and curriculum integration strategies, offering actionable solutions for practical teaching reform.
The Double Take column looks at a single topic from an African and Chinese perspective.This month,we explore how we can cope with the influence of short videos.
Funding: Supported by the National Natural Science Foundation of China (Grant No. 72334003), the National Key Research and Development Program of China (Grant No. 2022YFB2702804), the Shandong Key Research and Development Program (Grant No. 2020ZLYS09), and the Jinan Program (Grant No. 2021GXRC084-2).
Abstract: With the continuous advancement of unmanned technology in various application domains, the development and deployment of blind-spot-free panoramic video systems have gained increasing importance. Such systems are particularly critical in battlefield environments, where advanced panoramic video processing and wireless communication technologies are essential to enable remote control and autonomous operation of unmanned ground vehicles (UGVs). However, conventional video surveillance systems suffer from several limitations, including limited field of view, high processing latency, low reliability, excessive resource consumption, and significant transmission delays. These shortcomings impede the widespread adoption of UGVs in battlefield settings. To overcome these challenges, this paper proposes a novel multi-channel video capture and stitching system designed for real-time video processing. The system integrates the Speeded-Up Robust Features (SURF) algorithm and the Fast Library for Approximate Nearest Neighbors (FLANN) algorithm to execute essential operations such as feature detection, descriptor computation, image matching, homography estimation, and seamless image fusion. The fused panoramic video is then encoded and assembled to produce a seamless output devoid of stitching artifacts and shadows. Furthermore, H.264 video compression is employed to reduce the data size of the video stream without sacrificing visual quality. Using the Real-Time Streaming Protocol (RTSP), the compressed stream is transmitted efficiently, supporting real-time remote monitoring and control of UGVs in dynamic battlefield environments. Experimental results indicate that the proposed system achieves high stability, flexibility, and low latency. With a wireless link latency of 30 ms, the end-to-end video transmission latency remains around 140 ms, enabling smooth video communication. The system can tolerate packet loss rates (PLR) of up to 20% while maintaining usable video quality (with latency around 200 ms). These properties make it well-suited for mobile communication scenarios demanding high real-time video performance.
Funding: Supported by the Cultivation Program for Major Scientific Research Projects of Harbin Institute of Technology (ZDXMPY20180109).
Abstract: Scalable simulation leveraging real-world data plays an essential role in advancing autonomous driving, owing to its efficiency and applicability in both training and evaluating algorithms. Consequently, there has been increasing attention on generating highly realistic and consistent driving videos, particularly those involving viewpoint changes guided by the control commands or trajectories of ego vehicles. However, current reconstruction approaches, such as Neural Radiance Fields and 3D Gaussian Splatting, frequently suffer from limited generalization and depend on substantial input data. Meanwhile, 2D generative models, though capable of producing unknown scenes, still have room for improvement in terms of coherence and visual realism. To overcome these challenges, we introduce GenScene, a world model that synthesizes front-view driving videos conditioned on trajectories. A new temporal module is presented to improve video consistency by extracting the global context of each frame, calculating relationships of frames using these global representations, and fusing frame contexts accordingly. Moreover, we propose an innovative attention mechanism that computes relations of pixels within each frame and pixels in the corresponding window range of the initial frame. Extensive experiments show that our approach surpasses various state-of-the-art models in driving video generation, and the introduced modules contribute significantly to model performance. This work establishes a new paradigm for goal-oriented video synthesis in autonomous driving, which facilitates on-demand simulation to expedite algorithm development.
Abstract: Background: This study aims to investigate the underlying mechanisms between parental marital conflict and adolescent short video dependence by constructing a chain mediation model, focusing on the mediating roles of experiential avoidance and emotional disturbance (anxiety, depression, and stress). Methods: Conducted in January 2025, the research recruited 4125 adolescents from multiple Chinese provinces through convenience sampling; after data cleaning, 3957 valid participants (1959 males, 1998 females) were included. Using a cross-sectional design, measures included parental marital conflict, experiential avoidance, anxiety, depression, stress, and short video dependence. Results: Pearson correlation analysis revealed significant positive correlations among all variables. Mediation analysis using the SPSS PROCESS macro showed that parental marital conflict directly predicted short video dependence (β=0.269, p<0.001), and also significantly predicted experiential avoidance (β=0.519, p<0.001), anxiety (β=0.072, p<0.001), depression (β=0.067, p<0.001), and stress (β=0.048, p<0.05). Experiential avoidance further predicted anxiety (β=0.521, p<0.001), depression (β=0.489, p<0.001), stress (β=0.408, p<0.001), and short video dependence (β=0.244, p<0.001). While both anxiety (β=0.050, p<0.05) and depression (β=0.116, p<0.001) positively predicted short video dependence, stress did not (β=0.019, p=0.257). Overall, experiential avoidance, anxiety, depression, and stress significantly mediated the relationship between parental marital conflict and short video dependence. Conclusion: These findings confirm that parental marital conflict not only directly influences adolescent short video dependence but also operates through a chain mediation pathway involving experiential avoidance and emotional disturbance, highlighting central psychological mechanisms and providing theoretical support for integrated mental health and behavioral interventions.
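As a rough illustration of how a chain mediation model of this kind is usually read, the indirect effect along a route is the product of the path coefficients on that route. The Python sketch below multiplies the standardized coefficients quoted in the abstract above. The variable names and the `indirect_effect` helper are hypothetical, and the resulting products are illustrative arithmetic only, not values reported by the authors (the abstract gives no indirect-effect estimates or confidence intervals).

```python
# Standardized path coefficients as quoted in the abstract.
# The short variable names are hypothetical labels chosen for this sketch.
paths = {
    ("conflict", "avoidance"): 0.519,
    ("conflict", "anxiety"): 0.072,
    ("conflict", "depression"): 0.067,
    ("avoidance", "anxiety"): 0.521,
    ("avoidance", "depression"): 0.489,
    ("avoidance", "dependence"): 0.244,
    ("anxiety", "dependence"): 0.050,
    ("depression", "dependence"): 0.116,
}


def indirect_effect(route):
    """Multiply the coefficients of consecutive steps along a route."""
    effect = 1.0
    for src, dst in zip(route, route[1:]):
        effect *= paths[(src, dst)]
    return effect


# Illustrative routes from parental marital conflict to short video dependence.
routes = [
    ("conflict", "avoidance", "dependence"),
    ("conflict", "anxiety", "dependence"),
    ("conflict", "depression", "dependence"),
    ("conflict", "avoidance", "anxiety", "dependence"),
    ("conflict", "avoidance", "depression", "dependence"),
]

for route in routes:
    print(" -> ".join(route), round(indirect_effect(route), 4))
```

In practice the PROCESS macro also reports bootstrap confidence intervals for each indirect effect, which a simple product of point estimates cannot provide.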
Funding: Funded by the Guangxi Philosophy and Social Science Research Project, grant number 24XWC002.
Abstract: Background: In the Chinese context, the impact of short video applications on the psychological well-being of older adults is contested. While often examined through a pathological lens of addiction, this perspective may overlook paradoxical, context-dependent positive outcomes. Therefore, the main objective of this study is to challenge the traditional Compensatory Internet Use Theory by proposing and testing a chained mediation model that explores a paradoxical pathway from social support to life satisfaction via problematic social media use. Methods: Data were collected between July and August 2025 via the Credamo online survey platform, yielding 384 valid responses from Chinese older adults aged 60 and above. Key constructs were assessed using the Social Support Rating Scale (SSRS), Bergen Social Media Addiction Scale (BSMAS), Simplified UCLA Loneliness Scale, and Satisfaction with Life Scale (SWLS). A chained mediation model was tested using stepwise regression and non-parametric bootstrapping (5000 resamples), controlling for age, gender, household income, and health status. Results: The analysis revealed a paradoxical pathway, which was clarified by a key statistical suppression effect. Social support significantly and positively predicted problematic usage (β=0.157, p=0.002). After controlling for the suppressor effect of social support, problematic usage in turn negatively predicted social connectedness (β=−0.177, p<0.001). Finally, reduced social connectedness, reflecting a state of solitude, positively predicted life satisfaction (β=−0.227, p<0.001). Conclusion: The findings suggest that for older adults with sufficient offline social support, these resources may serve a “social empowerment” function. This empowerment allows behaviors measured as “problematic usage” to be theoretically reframed as a form of “deep immersive entertainment”. This immersion appears to occur alongside a state of “high-quality solitude”, which ultimately is associated with higher life satisfaction. This study provides a novel, non-pathological theoretical perspective on the consequences of high engagement with emerging social media, offering empirical grounds for non-abstinence-based intervention strategies.
Funding: Supported by the National Natural Science Foundation of China (Nos. NSFC 61925105, 62322109, 62171257 and U22B2001), the Xplorer Prize in Information and Electronics Technologies, and the Tsinghua University (Department of Electronic Engineering)-Nantong Research Institute for Advanced Communication Technologies Joint Research Center for Space, Air, Ground and Sea Cooperative Communication Network Technology.
Abstract: Multimedia semantic communication has been receiving increasing attention due to its significant enhancement of communication efficiency. Semantic coding, which is oriented towards extracting and encoding the key semantics of video for transmission, is a key aspect in the framework of multimedia semantic communication. In this paper, we propose a facial video semantic coding method with low bitrate based on the temporal continuity of video semantics. At the sender’s end, we selectively transmit facial keypoints and deformation information, allocating distinct bitrates to different keypoints across frames. Compressive techniques involving sampling and quantization are employed to reduce the bitrate while retaining facial key semantic information. At the receiver’s end, a GAN-based generative network is utilized for reconstruction, effectively mitigating block artifacts and buffering problems present in traditional codec algorithms under low bitrates. The performance of the proposed approach is validated on multiple datasets, such as VoxCeleb and TalkingHead-1kH, employing metrics such as LPIPS, DISTS, and AKD for assessment. Experimental results demonstrate significant advantages over traditional codec methods, achieving up to approximately 10-fold bitrate reduction in prolonged, stable head pose scenarios across diverse conversational video settings.
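The abstract above does not specify its sampling and quantization scheme, so the following Python sketch only illustrates the general idea under stated assumptions: keypoint coordinates normalized to [-1, 1], uniform scalar quantization, and temporal subsampling that sends keypoints for one frame in every `keep_every`. All function names, parameters, and numbers here are hypothetical and are not taken from the paper.

```python
def quantize(value, bits, lo=-1.0, hi=1.0):
    """Map a coordinate in [lo, hi] to an integer code of `bits` bits."""
    levels = (1 << bits) - 1
    clipped = min(max(value, lo), hi)  # out-of-range inputs are clamped
    return round((clipped - lo) / (hi - lo) * levels)


def dequantize(code, bits, lo=-1.0, hi=1.0):
    """Recover an approximate coordinate from its integer code."""
    levels = (1 << bits) - 1
    return lo + code / levels * (hi - lo)


def bitrate_bps(num_keypoints, bits, fps, keep_every):
    """Bits per second when (x, y) of each keypoint is sent every `keep_every` frames."""
    bits_per_sent_frame = num_keypoints * 2 * bits
    return bits_per_sent_frame * fps / keep_every


# Hypothetical example: one keypoint coordinate sent with 8 bits.
x = 0.3
code = quantize(x, bits=8)
x_hat = dequantize(code, bits=8)
print(code, x_hat, abs(x - x_hat))

# Hypothetical budget: 10 keypoints, 8 bits per coordinate, 25 fps, 1 frame in 5 sent.
print(bitrate_bps(num_keypoints=10, bits=8, fps=25, keep_every=5))
```

With uniform quantization the reconstruction error per coordinate is bounded by half a quantization step, so fewer bits can be assigned to keypoints whose small displacements matter less, which is one plausible reading of "allocating distinct bitrates to different keypoints".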
Abstract: The application of short videos in agricultural scenarios has become a new form of productive force driving agricultural development, injecting new vitality and opportunities into traditional agriculture. These videos leverage the unique expressive logic of the platform by adopting a small entry point and prioritizing dissemination rate. They are strategically planned in terms of content, visuals, and interaction to cater to users' needs for relaxation, knowledge acquisition, social sharing, agricultural product marketing, and talent display. Through careful design, full creativity, rich emotion, and the creation of distinct character personalities, these videos deliver positive, entertaining, informative, and opinion-driven agricultural content. The production and operation of agricultural short videos can be effectively optimized by analyzing the characteristics of both popular and less popular videos, and utilizing smart tools and trending topics.
Abstract: Objectives: Medical students often rely on recreational internet media to relieve the stress caused by immense academic and life pressures, and among these media, short-form videos, an emerging digital medium, have gradually become students' mainstream choice for stress relief. However, the addiction caused by their usage has attracted widespread attention from both academia and society. The purpose of this study is therefore to systematically explore the underlying mechanisms that link perceived stress, entertainment gratification, emotional gratification, short-form video usage intensity, and short-form video addiction, based on multiple theoretical frameworks including the Compensatory Internet Use Model (CIU), the Interaction of Person-Affect-Cognition-Execution Model (I-PACE), and the Use and Gratification Theory (UGT). Methods: A hypothetical model with 9 research hypotheses was constructed. Taking medical students from Chinese universities as the research subjects, 1057 valid responses were collected through an online questionnaire survey, including 358 males and 658 females. Structural equation modelling (SEM) was performed using the AMOS software to test the research hypotheses. Results: (1) Perceived stress positively predicted entertainment gratification and emotional gratification (β=0.72, p<0.001; β=0.61, p<0.001); (2) Entertainment gratification and emotional gratification positively influenced short-form video usage intensity (β=0.35, p<0.001; β=0.19, p<0.001); (3) Entertainment gratification and emotional gratification positively predicted short-form video addiction (β=0.40, p<0.001; β=0.17, p<0.001); (4) Short-form video usage intensity positively influenced short-form video addiction (β=0.36, p<0.001); and (5) Perceived stress exerted an indirect but positive effect on both short-form video usage intensity and short-form video addiction, mediated by entertainment and emotional gratification (β=0.37, p<0.001; β=0.52, p<0.001). Conclusion: The mechanisms that underlie medical students’ short-form video addiction in stressful situations were revealed in this study. It was found that stress enhances medical students’ need for entertainment and emotional online compensation, prompting more frequent short-form video usage and ultimately leading to addiction. These results underscore the need to address the stressors faced by medical students. Effective interventions should prioritise stress management strategies and promote healthier alternative coping mechanisms to mitigate the risk of addiction.
Funding: Supported by the Zhejiang Provincial Natural Science Foundation of China (No. LQ23F030001), the National Natural Science Foundation of China (No. 62406280), the Autism Research Special Fund of Zhejiang Foundation for Disabled Persons (No. 2023008), the Liaoning Province Higher Education Innovative Talents Program Support Project (No. LR2019058), the Liaoning Province Joint Open Fund for Key Scientific and Technological Innovation Bases (No. 2021-KF-12-05), the Central Guidance on Local Science and Technology Development Fund of Liaoning Province (No. 2023JH6/100100066), the Key Laboratory for Biomedical Engineering of Ministry of Education, Zhejiang University, China, and in part by the Open Research Fund of the State Key Laboratory of Cognitive Neuroscience and Learning.
Abstract: Video action recognition (VAR) aims to analyze dynamic behaviors in videos and achieve semantic understanding. VAR faces challenges such as temporal dynamics, action-scene coupling, and the complexity of human interactions. Existing methods can be categorized into motion-level, event-level, and story-level ones based on spatiotemporal granularity. However, single-modal approaches struggle to capture complex behavioral semantics and human factors. Therefore, in recent years, vision-language models (VLMs) have been introduced into this field, providing new research perspectives for VAR. In this paper, we systematically review spatiotemporal hierarchical methods in VAR and explore how the introduction of large models has advanced the field. Additionally, we propose the concept of “Factor” to identify and integrate key information from both visual and textual modalities, enhancing multimodal alignment. We also summarize various multimodal alignment methods and provide in-depth analysis and insights into future research directions.
Funding: Supported by the International Joint Research Project of Huiyan International College, Faculty of Education, Beijing Normal University (Grant Number: ICER202102).
Abstract: Objectives: Short video addiction has emerged as a significant public health issue in recent years, with a growing trend toward severity. However, research on the causes and impacts of short video addiction remains limited, and understanding of the variable “TikTok brain” is still in its infancy. Therefore, based on the Stimulus-Organism-Behavior-Consequence (SOBC) framework, we proposed six research hypotheses and constructed a model to explore the relationships between short video usage intensity, TikTok brain, short video addiction, and decreased attention control. Methods: Given that students are considered a high-risk group for excessive short video use, we recruited 1086 valid participants from Chinese student users, including 609 males (56.1%) and 477 females (43.9%), with an average participant age of 19.84 years, to test the hypotheses. Results: (1) Short video usage intensity was positively related to short video addiction, TikTok brain, and decreased attention control; (2) TikTok brain was positively related to short video addiction and decreased attention control; and (3) Short video addiction was positively related to decreased attention control. Conclusions: These findings suggest that although excessive use of short video applications brings negative consequences, users still spend significant amounts of time on these platforms, indicating a need for strict self-regulation of usage time.
Abstract: Video synopsis is an effective way to easily summarize long-recorded surveillance videos. The omnidirectional view allows the observer to select the desired field of view (FoV) from the different FoVs available for spherical surveillance video. By choosing to watch one portion, the observer misses out on the events occurring somewhere else in the spherical scene. This causes the observer to experience fear of missing out (FOMO). Hence, a novel personalized video synopsis approach for the generation of non-spherical videos has been introduced to address this issue. It also includes an action recognition module that makes it easy to display necessary actions by prioritizing them. This work jointly minimizes or maximizes multiple cost terms, namely activity loss, collision, temporal consistency, length, show, and important action costs. The performance of the proposed framework is evaluated through extensive simulation and compared with the state-of-the-art video synopsis optimization algorithms. Experimental results suggest that some constraints are better optimized by using the latest metaheuristic optimization algorithms to generate compact personalized synopsis videos from spherical surveillance videos.
Funding: Shenzhen Science and Technology Programme, Grant/Award Number: JCYJ20230807120800001; 2023 Shenzhen sustainable supporting funds for colleges and universities, Grant/Award Number: 20231121165240001; Guangdong Provincial Key Laboratory of Ultra High Definition Immersive Media Technology, Grant/Award Number: 2024B1212010006.
Abstract: Internal learning-based video inpainting methods have shown promising results by exploiting the intrinsic properties of the video to fill in the missing region without external dataset supervision. However, existing internal learning-based video inpainting methods can produce inconsistent structures or blurry textures due to the insufficient utilisation of motion priors within the video sequence. In this paper, the authors propose a new internal learning-based video inpainting model called appearance consistency and motion coherence network (ACMC-Net), which can not only learn the recurrence of the appearance prior but also capture the motion coherence prior, improving the quality of the inpainting results. In ACMC-Net, a transformer-based appearance network is developed to capture global context information within the video frame for representing appearance consistency accurately. Additionally, a novel motion coherence learning scheme is proposed to learn the motion prior in a video sequence effectively. Finally, the learnt internal appearance consistency and motion coherence are implicitly propagated to the missing regions to complete the inpainting. Extensive experiments conducted on the DAVIS dataset show that the proposed model achieves superior performance in terms of quantitative measurements and produces more visually plausible results compared with the state-of-the-art methods.
Funding: Supported by the National Natural Science Foundation of China (62072416), the Key Research and Development Special Project of Henan Province (221111210500), and the Key Technologies R&D Program of Henan Province (232102211053, 242102211071).
Abstract: The rapid development of short video platforms poses new challenges for traditional recommendation systems. Recommender systems typically depend on two types of user behavior feedback to construct user interest profiles: explicit feedback (interactive behavior), which significantly influences users’ short-term interests, and implicit feedback (viewing time), which substantially affects their long-term interests. However, previous models fail to distinguish between these two feedback types, so they predict only the overall preferences of users based on extensive historical behavior sequences. Consequently, they cannot differentiate between users’ long-term and short-term interests, resulting in low accuracy in describing users’ interest states and predicting the evolution of their interests. This paper introduces a video recommendation model called CAT-MFRec (Cross Attention Transformer-Mixed Feedback Recommendation), designed to differentiate between explicit and implicit user feedback within the DIEN (Deep Interest Evolution Network) framework. This study emphasizes the separate learning of the two types of behavioral feedback, effectively integrating them through the cross-attention mechanism. Additionally, it leverages the long sequence dependence capabilities of Transformer technology to accurately construct user interest profiles and predict the evolution of user interests. Experimental results indicate that CAT-MFRec significantly outperforms existing recommendation methods across various performance indicators. This advancement offers new theoretical and practical insights for the development of video recommendations, particularly in addressing complex and dynamic user behavior patterns.
Abstract: Airway management plays a crucial role in providing adequate oxygenation and ventilation to patients during various medical procedures and emergencies. When patients have a limited mouth opening due to factors such as trauma, inflammation, or anatomical abnormalities, airway management becomes challenging. A commonly utilized method to overcome this challenge is the use of video laryngoscopy (VL), which employs a specialized device equipped with a camera and a light source to allow a clear view of the larynx and vocal cords. VL overcomes the limitations of direct laryngoscopy in patients with limited mouth opening, enabling better visualization and successful intubation. Various types of VL blades are available. We devised a novel flangeless video laryngoscope for use in patients with a limited mouth opening and then tested it on a manikin.
Abstract: Semantic segmentation is a core task in computer vision that allows AI models to interact with and understand their surrounding environment. Similarly to how humans subconsciously segment scenes, this ability is crucial for scene understanding. However, a challenge many semantic learning models face is the lack of data. Existing video datasets are limited to short, low-resolution videos that are not representative of real-world examples. Thus, one of our key contributions is a customized semantic segmentation version of the Walking Tours Dataset that features hour-long, high-resolution, real-world data from tours of different cities. Additionally, we evaluate the performance of the open-vocabulary semantic model OpenSeeD on our own custom dataset and discuss future implications.
Abstract: Objective: The purpose of this study was to evaluate health education using videos and leaflets for preconception care (PCC) awareness among adolescent females up to six months after the health education. Methods: The subjects were female university students living in the Kinki area. A longitudinal survey was conducted on 67 members in the intervention group, who received the health education, and 52 members in the control group, who did not receive the health education. The primary outcome measures were knowledge of PCC and the subscales of the Health Promotion Lifestyle Profile. Surveys were conducted before, after, and six months after the intervention in the intervention group, and an initial survey and survey six months later were conducted in the control group. Cochran’s Q test, Bonferroni’s multiple comparison test, and McNemar’s test were used to analyze the knowledge of PCC data. The Health Awareness, Nutrition, and Stress Management subscales of the Health Promotion Lifestyle Profile were analyzed by paired t-test, and comparisons between the intervention and control groups were performed using the two-way repeated measures analysis of variance. Results: In the intervention group of 67 people, the number of subjects who answered “correct” for five of the nine items concerning knowledge of PCC increased immediately after the health education (P = 0.006) but decreased for five items from immediately after the health education to six months later (P = 0.043). In addition, the number of respondents who answered “correct” for “low birth weight infants and future lifestyle-related diseases” (P = 0.016) increased after six months compared with before the health education. For the 52 subjects in the control group, there was no change in the number of subjects who answered “correct” for eight out of the nine items after six months. There was also no increase in scores for the Health Promotion Lifestyle Profile after six months for either the intervention or control group. Conclusion: Providing health education about PCC using videos and leaflets to adolescent females was shown to enhance the knowledge of PCC immediately after the education.
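For readers unfamiliar with the paired tests named in the abstract above, the Python sketch below shows one common form of McNemar's test (the exact binomial version) together with a Bonferroni adjustment. The counts used are invented purely for illustration and do not come from the study, and the abstract does not say which variant of McNemar's test the authors applied.

```python
from math import comb


def mcnemar_exact(b, c):
    """Two-sided exact McNemar p-value from the two discordant-pair counts.

    b: subjects who changed from incorrect to correct
    c: subjects who changed from correct to incorrect
    Concordant pairs (no change) do not enter the test.
    """
    n = b + c
    if n == 0:
        return 1.0
    k = min(b, c)
    # Under the null hypothesis each discordant pair is a fair coin flip.
    tail = sum(comb(n, i) for i in range(k + 1)) / 2 ** n
    return min(1.0, 2 * tail)


def bonferroni(p_value, num_tests):
    """Bonferroni-adjusted p-value for `num_tests` comparisons."""
    return min(1.0, p_value * num_tests)


# Hypothetical counts for one knowledge item, before vs. after the education.
p = mcnemar_exact(b=10, c=2)
print(p, bonferroni(p, num_tests=3))
```

The Bonferroni step matters here because each knowledge item is compared across three time points, so an unadjusted p-value would overstate the evidence for any single pairwise change.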
Abstract: Objective: The objective of this study is to determine the effect of a nurse-led instructional video (NLIV) on anxiety, satisfaction, and recovery among mothers admitted for cesarean section (CS). Materials and Methods: A quasi-experimental study was carried out on mothers scheduled for CS. Eighty participants were selected by purposive sampling and divided (40 participants in each group) into an experimental group and a control group. The NLIV was shown to the experimental group, and routine care was provided for the control group. The modified hospital anxiety scale (HADS), the scale for measuring maternal satisfaction in cesarean birth, and the obstetric quality of recovery following cesarean delivery were used to assess anxiety, satisfaction, and recovery. Results: Both the experimental and control groups showed significant reductions in anxiety by the first postintervention day (P<0.001), with the experimental group experiencing a greater mean reduction (mean difference [MD]=4.37) than the control group (MD=3.35), but the intergroup difference was not statistically significant (P>0.05). The experimental group reported significantly higher satisfaction scores (175.55±9.42) on the 3rd postoperative day compared to the control group (151.93±14.89; P<0.001). Similarly, the experimental group’s recovery scores (79.90±6.24) were considerably higher than those of the control group (62.45±15.18; P<0.001). On the 3rd postintervention day, satisfaction was significantly associated with age (P<0.001), and recovery with gravidity (P<0.05). Conclusions: NLIV can be used in the preoperative period to reduce anxiety related to CS and to improve satisfaction and recovery after the CS.
Funding: Financial support from the China Scholarship Council (Grant No. 202106370009) and the Alberta Innovates Graduate Student Scholarship.
Abstract: Objective: While there is consensus regarding a positive effect of video gaming on dexterity, little is known regarding how much traditional laparoscopic practice can or should be substituted with video gaming. This study was designed to assess the effects of varying the amount of traditional practice in a lap box trainer and video gaming on performance in two fundamentals of laparoscopic surgery core tasks. Methods: Undergraduate and medical students were recruited and randomized into one of four groups: a control group, a lap box group, a video game group, and a combined group with 50% of the time allocated to each modality. Performance in the peg transfer and precision cutting tasks was assessed both prior to and following the 6 training sessions. Results: Peg transfer performance significantly improved in the lap box group (168.4±70.6 s vs. 332.9±178.2 s, p<0.001), video game group (176.7±53.3 s vs. 300.0±101.2 s, p<0.001) and combined group (214.2±86.9 s vs. 406.8±239.5 s, p=0.002) after training. Similar improvements were also observed in precision cutting performance in the lap box group (413.1±138.4 s vs. 614.3±211.4 s, p=0.002), video game group (434.1±150.8 s vs. 609.2±233.2 s, p=0.007) and combined group (469.2±185.3 s vs. 663.8±296.3 s, p=0.020). When analyzing improvements in performance across three different training groups compared with the control group, we found that both the lap box group (p<0.001) and the combined group (p<0.001) showed better improvement in both tasks, and the video game group had significantly better outcomes in the precision cutting task (p=0.003). Conclusion: Traditional lap box training remains the most effective method for improving the performance of simulated laparoscopic surgery. Video games can be encouraged to enhance skills retention and supplement simulated practice outside of a formal training curriculum.
Funding: Supported by the Undergraduate Education and Teaching Reform Research Project of Chengdu University (XJJG-20242025228), the Sichuan Genuine Medicinal Materials and Traditional Chinese Medicine Innovation Team (SCCXTD-2025-19), and the Sichuan Science and Technology Program (2021YFYZ0012).
Abstract: Against the background of the competency-based transformation of higher education, the bioengineering major urgently needs to solve problems such as the disconnect between practical teaching and industry needs and the lack of resources. This paper proposes supplementing traditional experimental teaching with video resources to construct a closed-loop model of "theoretical instruction, case analysis, video demonstration, and reflective application". Through the development of instructional videos covering core techniques such as PCR, Western blot, CRISPR-Cas9, cell culture, HPLC, GMP operations, and bioinformatics analysis, teaching costs can be reduced, spatiotemporal constraints can be overcome, and process visualization can be enhanced, thereby supporting students in mastering the entire workflow of modern biomanufacturing. The paper further explores resource development pathways, university-enterprise collaboration mechanisms, and curriculum integration strategies, offering actionable solutions for practical teaching reform.
Abstract: The Double Take column looks at a single topic from an African and Chinese perspective. This month, we explore how we can cope with the influence of short videos.