The rapid progress of cloud technology has attracted a growing number of video providers to consider deploying their streaming services onto cloud platform for more cost-effective, scalable and reliable performance. I...The rapid progress of cloud technology has attracted a growing number of video providers to consider deploying their streaming services onto cloud platform for more cost-effective, scalable and reliable performance. In this paper, we utilize Markov decision process model to formulate the dynamic deployment of cloud-based video services over multiple geographically distributed datacenters. We focus on maximizing the average profits for the video service provider over a long run and introduce an average performance criteria which reflects the cost and user experience jointly. We develop an optimal algorithm based on the sensitivity analysis and sample-based policy iteration to obtain the optimal video placement and request dispatching strategy. We demonstrate the optimality of our algorithm with theoretical proof and specify the practical feasibility of our algorithm. We conduct simulations to evaluate the performance of our algorithm and the results show that our strategy can effectively cut down the total cost and guarantee users' quality of experience (QoE).展开更多
With the rapid development of generative artificial intelligence(GenAI),the task of story visualization,which transforms natural language narratives into coherent and consistent image sequences,has attracted growing r...With the rapid development of generative artificial intelligence(GenAI),the task of story visualization,which transforms natural language narratives into coherent and consistent image sequences,has attracted growing research attention.However,existing methods still face limitations in balancing multi-frame character consistency and generation efficiency,which restricts their feasibility for large-scale practical applications.To address this issue,this study proposes a modular cloud-based distributed system built on Stable Diffusion.By separating the character generation and story generation processes,and integratingmulti-feature control techniques,a cachingmechanism,and an asynchronous task queue architecture,the system enhances generation efficiency and scalability.The experimental design includes both automated and human evaluations of character consistency,performance testing,and multinode simulation.The results show that the proposed system outperforms the baseline model StoryGen in both CLIP-I and human evaluation metrics.In terms of performance,under the experimental environment of this study,dual-node deployment reduces average waiting time by approximately 19%,while the four-node simulation further reduces it by up to 65%.Overall,this study demonstrates the advantages of cloud-distributed GenAI in maintaining character consistency and reducing generation latency,highlighting its potential value inmulti-user collaborative story visualization applications.展开更多
In order to improve the efficiency of cloud-based web services,an improved plant growth simulation algorithm scheduling model.This model first used mathematical methods to describe the relationships between cloud-base...In order to improve the efficiency of cloud-based web services,an improved plant growth simulation algorithm scheduling model.This model first used mathematical methods to describe the relationships between cloud-based web services and the constraints of system resources.Then,a light-induced plant growth simulation algorithm was established.The performance of the algorithm was compared through several plant types,and the best plant model was selected as the setting for the system.Experimental results show that when the number of test cloud-based web services reaches 2048,the model being 2.14 times faster than PSO,2.8 times faster than the ant colony algorithm,2.9 times faster than the bee colony algorithm,and a remarkable 8.38 times faster than the genetic algorithm.展开更多
Cloud-based video communication and networking has emerged as a promising new research paradigm to significantly improve the quality of experience for video consumers.An architectural overview of this promising resear...Cloud-based video communication and networking has emerged as a promising new research paradigm to significantly improve the quality of experience for video consumers.An architectural overview of this promising research area was presented.This overview with an end-to-end partition of the cloud-based video system into major blocks with respect to their locations from the center of the cloud to the edge of the cloud was started.Following this partition,existing research efforts on how the principles of cloud computing can provide unprecedented support to 1)video servers,2)content delivery networks,and 3)edge networks within the global cloud video ecosystems were examined.Moreover,a case study was exemplfied on an edge cloud assisted HTTP adaptive video streaming to demonstrate the effectiveness of cloud computing support.Finally,by envisioning a list of future research topics in cloud-based video communication and networking a coclusion is made.展开更多
Precisely forecasting the performance of Deep Learning(DL)models,particularly in critical areas such as Uniform Resource Locator(URL)-based threat detection,aids in improving systems developed for difficult tasks.In c...Precisely forecasting the performance of Deep Learning(DL)models,particularly in critical areas such as Uniform Resource Locator(URL)-based threat detection,aids in improving systems developed for difficult tasks.In cybersecurity,recognizing harmful URLs is vital to lowering risks associated with phishing,malware,and other online-based attacks.Since it directly affects the model’s capacity to differentiate between benign and harmful URLs,finding the optimum mix of hyperparameters in DL models is a significant difficulty.Two commonly used architectures for sequential and spatial data processing,Long Short-Term Memory(LSTM)/Gated Recurrent Unit(GRU)and Convolutional Neural Network(CNN)/Long Short-Term Memory(LSTM)models are targeted in this study to have higher predictive capacity by modifying crucial hyperparameters such as learning rate,batch size,and dropout rate using cloud capability.Research finds the best settings for the models by testing 50 dropout rates(between 0.1 and 0.5)with different learning rates and batch sizes.Performances were measured in the form of accuracy,precision,recall,F1-score,and errors such as Mean Absolute Error(MAE),Mean Squared Error(MSE),Root Mean Squared Error(RMSE)and Mean Absolute Percent Error(MAPE).In our results,CNN/LSTM performed better often than LSTM/GRU,with up to 10%better F1-score and much lower MAPE when the learning rate was 0.001 and the dropout rate was 0.2.These results show the value of fine-tuning hyperparameters to increase model performance and reduce errors.Higher on many of the parameters,CNN/LSTM architecture became obvious as the more trustworthy one.It also discussed the importance of DL in enhancing URL attack detection mechanisms to provide increased accuracy and precision for real-world cybersecurity.展开更多
The integration of 5G technology with cloud-based control systems in industrial robots holds significant promise for the future of industrial automation.With its ultra-low latency,high data transfer speeds,and massive...The integration of 5G technology with cloud-based control systems in industrial robots holds significant promise for the future of industrial automation.With its ultra-low latency,high data transfer speeds,and massive connectivity,5G is poised to revolutionize real-time communication and coordination in manufacturing environments.This paper explores the prospects and challenges of applying 5G technology in industrial robots,focusing on cloud-based control systems that enable scalable,flexible,and efficient operations.Key advantages of 5G,including improved communication speed,enhanced real-time control,scalability,and predictive maintenance capabilities,are discussed.However,the transition to 5G also presents challenges,such as network reliability,security concerns,integration with legacy systems,and high implementation costs.The paper also examines case studies in the automotive,electronics,and aerospace industries,providing real-world examples of 5G adoption in industrial automation.The conclusion highlights key insights and outlines potential research directions for overcoming existing barriers and fully realizing the potential of 5G technology in industrial robot control.展开更多
The exponential growth of video content has driven significant advancements in video summarization techniques in recent years.Breakthroughs in deep learning have been particularly transformative,enabling more effectiv...The exponential growth of video content has driven significant advancements in video summarization techniques in recent years.Breakthroughs in deep learning have been particularly transformative,enabling more effective detection of key information and creating new possibilities for video synopsis.To summarize recent progress and accelerate research in this field,this paper provides a comprehensive review of deep learning-based video summarization methods developed over the past decade.We begin by examining the research landscape of video abstraction technologies and identifying core challenges in video summarization.Subsequently,we systematically analyze prevailing deep learning frameworks and methodologies employed in current video summarization systems,offering researchers a clear roadmap of the field's evelution.Unlike previous review works,we first classify research papers based on the structural hierarchy of the video(from frame-level to shot-level to video-level),then further categorize them according to the summary backbone model(feature extraction and spatiotemporal modeling).This approach provides a more systematic and hierarchical organization of the documents.Following this comprehensive review,we summarize the benchmark datasets and evaluation metrics commonly employed in the field.Finally,we analyze persistent challenges and propose insightful directions for future research,providing a forward-looking perspective on video summarization technologies.This systematic literature review is of great reference value to new researchers exploring the fields of deep learning and video summarization.展开更多
With the continuous advancement of unmanned technology in various application domains,the development and deployment of blind-spot-free panoramic video systems have gained increasing importance.Such systems are partic...With the continuous advancement of unmanned technology in various application domains,the development and deployment of blind-spot-free panoramic video systems have gained increasing importance.Such systems are particularly critical in battlefield environments,where advanced panoramic video processing and wireless communication technologies are essential to enable remote control and autonomous operation of unmanned ground vehicles(UGVs).However,conventional video surveillance systems suffer from several limitations,including limited field of view,high processing latency,low reliability,excessive resource consumption,and significant transmission delays.These shortcomings impede the widespread adoption of UGVs in battlefield settings.To overcome these challenges,this paper proposes a novel multi-channel video capture and stitching system designed for real-time video processing.The system integrates the Speeded-Up Robust Features(SURF)algorithm and the Fast Library for Approximate Nearest Neighbors(FLANN)algorithm to execute essential operations such as feature detection,descriptor computation,image matching,homography estimation,and seamless image fusion.The fused panoramic video is then encoded and assembled to produce a seamless output devoid of stitching artifacts and shadows.Furthermore,H.264 video compression is employed to reduce the data size of the video stream without sacrificing visual quality.Using the Real-Time Streaming Protocol(RTSP),the compressed stream is transmitted efficiently,supporting real-time remote monitoring and control of UGVs in dynamic battlefield environments.Experimental results indicate that the proposed system achieves high stability,flexibility,and low latency.With a wireless link latency of 30 ms,the end-to-end video transmission latency remains around 140 ms,enabling smooth video communication.The system can tolerate packet loss rates(PLR)of up to 20%while maintaining usable video quality(with latency around 200 ms).These properties make it well-suited for mobile communication scenarios demanding high real-time video performance.展开更多
Scalable simulation leveraging real-world data plays an essential role in advancing autonomous driving,owing to its efficiency and applicability in both training and evaluating algorithms.Consequently,there has been i...Scalable simulation leveraging real-world data plays an essential role in advancing autonomous driving,owing to its efficiency and applicability in both training and evaluating algorithms.Consequently,there has been increasing attention on generating highly realistic and consistent driving videos,particularly those involving viewpoint changes guided by the control commands or trajectories of ego vehicles.However,current reconstruction approaches,such as Neural Radiance Fields and 3D Gaussian Splatting,frequently suffer from limited generalization and depend on substantial input data.Meanwhile,2D generative models,though capable of producing unknown scenes,still have room for improvement in terms of coherence and visual realism.To overcome these challenges,we introduce GenScene,a world model that synthesizes front-view driving videos conditioned on trajectories.A new temporal module is presented to improve video consistency by extracting the global context of each frame,calculating relationships of frames using these global representations,and fusing frame contexts accordingly.Moreover,we propose an innovative attention mechanism that computes relations of pixels within each frame and pixels in the corresponding window range of the initial frame.Extensive experiments show that our approach surpasses various state-of-the-art models in driving video generation,and the introduced modules contribute significantly to model performance.This work establishes a new paradigm for goal-oriented video synthesis in autonomous driving,which facilitates on-demand simulation to expedite algorithm development.展开更多
Video emotion recognition is widely used due to its alignment with the temporal characteristics of human emotional expression,but existingmodels have significant shortcomings.On the one hand,Transformermultihead self-...Video emotion recognition is widely used due to its alignment with the temporal characteristics of human emotional expression,but existingmodels have significant shortcomings.On the one hand,Transformermultihead self-attention modeling of global temporal dependency has problems of high computational overhead and feature similarity.On the other hand,fixed-size convolution kernels are often used,which have weak perception ability for emotional regions of different scales.Therefore,this paper proposes a video emotion recognition model that combines multi-scale region-aware convolution with temporal interactive sampling.In terms of space,multi-branch large-kernel stripe convolution is used to perceive emotional region features at different scales,and attention weights are generated for each scale feature.In terms of time,multi-layer odd-even down-sampling is performed on the time series,and oddeven sub-sequence interaction is performed to solve the problem of feature similarity,while reducing computational costs due to the linear relationship between sampling and convolution overhead.This paper was tested on CMU-MOSI,CMU-MOSEI,and Hume Reaction.The Acc-2 reached 83.4%,85.2%,and 81.2%,respectively.The experimental results show that the model can significantly improve the accuracy of emotion recognition.展开更多
Background:This study aims to investigate the underlying mechanisms between parental marital conflict and adolescent short video dependence by constructing a chain mediation model,focusing on the mediating roles of ex...Background:This study aims to investigate the underlying mechanisms between parental marital conflict and adolescent short video dependence by constructing a chain mediation model,focusing on the mediating roles of experiential avoidance and emotional disturbance(anxiety,depression,and stress).Methods:Conducted in January 2025,the research recruited 4125 adolescents from multiple Chinese provinces through convenience sampling;after data cleaning,3957 valid participants(1959 males,1998 females)were included.Using a cross-sectional design,measures included parental marital conflict,experiential avoidance,anxiety,depression,stress,and short video dependence.Results:Pearson correlation analysis revealed significant positive correlations among all variables.Mediation analysis using the SPSS PROCESS macro showed that parental marital conflict directly predicted short video dependence(β=0.269,p<0.001),and also significantly predicted experiential avoidance(β=0.519,p<0.001),anxiety(β=0.072,p<0.001),depression(β=0.067,p<0.001),and stress(β=0.048,p<0.05).Experiential avoidance further predicted anxiety(β=0.521,p<0.001),depression(β=0.489,p<0.001),stress(β=0.408,p<0.001),and short video dependence(β=0.244,p<0.001).While both anxiety(β=0.050,p<0.05)and depression(β=0.116,p<0.001)positively predicted short video dependence,stress did not(β=0.019,p=0.257).Overall,experiential avoidance,anxiety,depression,and stress significantly mediated the relationship between parental marital conflict and short video dependence.Conclusion:These findings confirm that parental marital conflict not only directly influences adolescent short video dependence but also operates through a chain mediation pathway involving experiential avoidance and emotional disturbance,highlighting central psychological mechanisms and providing theoretical support for integrated mental health and behavioral interventions.展开更多
Background:In the Chinese context,the impact of short video applications on the psychological well-being of older adults is contested.While often examined through a pathological lens of addiction,this perspective may ...Background:In the Chinese context,the impact of short video applications on the psychological well-being of older adults is contested.While often examined through a pathological lens of addiction,this perspective may overlook paradoxical,context-dependent positive outcomes.Therefore,the main objective of this study is to challenge the traditional Compensatory Internet Use Theory by proposing and testing a chained mediation model that explores a paradoxical pathway from social support to life satisfaction via problematic social media use.Methods:Data were collected between July and August 2025 via the Credamo online survey platform,yielding 384 valid responses from Chinese older adults aged 60 and above.Key constructs were assessed using the Social Support Rating Scale(SSRS),Bergen Social Media Addiction Scale(BSMAS),Simplified UCLA Loneliness Scale,and Satisfaction with Life Scale(SWLS).A chained mediation model was tested using stepwise regression and non-parametric bootstrapping(5000 resamples),controlling for age,gender,household income,and health status.Results:The analysis revealed a paradoxical pathway,which was clarified by a key statistical suppression effect.Social support significantly and positively predicted problematic usage(β=0.157,p=0.002).After controlling for the suppressor effect of social support,problematic usage in turn negatively predicted social connectedness(β=−0.177,p<0.001).Finally,reduced social connectedness—reflecting a state of solitude—positively predicted life satisfaction(β=−0.227,p<0.001).Conclusion:The findings suggest that for older adults with sufficient offline social support,these resources may serve a“social empowerment”function.This empowerment allows behaviors measured as“problematic usage”to be theoretically reframed as a form of“deep immersive entertainment”.This immersion appears to occur alongside a state of“high-quality solitude”,which ultimately is associated with higher life satisfaction.This study provides a novel,non-pathological theoretical perspective on the consequences of high engagement with emerging social media,offering empirical grounds for non-abstinence-based intervention strategies.展开更多
The concept of sharing of personal health data over cloud storage in a healthcare-cyber physical system has become popular in recent times as it improves access quality.The privacy of health data can only be preserved...The concept of sharing of personal health data over cloud storage in a healthcare-cyber physical system has become popular in recent times as it improves access quality.The privacy of health data can only be preserved by keeping it in an encrypted form,but it affects usability and flexibility in terms of effective search.Attribute-based searchable encryption(ABSE)has proven its worth by providing fine-grained searching capabilities in the shared cloud storage.However,it is not practical to apply this scheme to the devices with limited resources and storage capacity because a typical ABSE involves serious computations.In a healthcare cloud-based cyber-physical system(CCPS),the data is often collected by resource-constraint devices;therefore,here also,we cannot directly apply ABSE schemes.In the proposed work,the inherent computational cost of the ABSE scheme is managed by executing the computationally intensive tasks of a typical ABSE scheme on the blockchain network.Thus,it makes the proposed scheme suitable for online storage and retrieval of personal health data in a typical CCPS.With the assistance of blockchain technology,the proposed scheme offers two main benefits.First,it is free from a trusted authority,which makes it genuinely decentralized and free from a single point of failure.Second,it is computationally efficient because the computational load is now distributed among the consensus nodes in the blockchain network.Specifically,the task of initializing the system,which is considered the most computationally intensive,and the task of partial search token generation,which is considered as the most frequent operation,is now the responsibility of the consensus nodes.This eliminates the need of the trusted authority and reduces the burden of data users,respectively.Further,in comparison to existing decentralized fine-grained searchable encryption schemes,the proposed scheme has achieved a significant reduction in storage and computational cost for the secret key associated with users.It has been verified both theoretically and practically in the performance analysis section.展开更多
This paper presents a cloud-based data-driven design optimization system,named DADOS,to help engineers and researchers improve a design or product easily and efficiently.DADOS has nearly 30 key algorithms,including th...This paper presents a cloud-based data-driven design optimization system,named DADOS,to help engineers and researchers improve a design or product easily and efficiently.DADOS has nearly 30 key algorithms,including the design of experiments,surrogate models,model validation and selection,prediction,optimization,and sensitivity analysis.Moreover,it also includes an exclusive ensemble surrogate modeling technique,the extended hybrid adaptive function,which can make use of the advantages of each surrogate and eliminate the effort of selecting the appropriate individual surrogate.To improve ease of use,DADOS provides a user-friendly graphical user interface and employed flow-based programming so that users can conduct design optimization just by dragging,dropping,and connecting algorithm blocks into a workflow instead of writing massive code.In addition,DADOS allows users to visualize the results to gain more insights into the design problems,allows multi-person collaborating on a project at the same time,and supports multi-disciplinary optimization.This paper also details the architecture and the user interface of DADOS.Two examples were employed to demonstrate how to use DADOS to conduct data-driven design optimization.Since DADOS is a cloud-based system,anyone can access DADOS at www.dados.com.cn using their web browser without the need for installation or powerful hardware.展开更多
With the rapid development of computer technology, cloud-based services have become a hot topic. They not only provide users with convenience, but also bring many security issues, such as data sharing and privacy issu...With the rapid development of computer technology, cloud-based services have become a hot topic. They not only provide users with convenience, but also bring many security issues, such as data sharing and privacy issue. In this paper, we present an access control system with privilege separation based on privacy protection(PS-ACS). In the PS-ACS scheme, we divide users into private domain(PRD) and public domain(PUD) logically. In PRD, to achieve read access permission and write access permission, we adopt the Key-Aggregate Encryption(KAE) and the Improved Attribute-based Signature(IABS) respectively. In PUD, we construct a new multi-authority ciphertext policy attribute-based encryption(CP-ABE) scheme with efficient decryption to avoid the issues of single point of failure and complicated key distribution, and design an efficient attribute revocation method for it. The analysis and simulation result show that our scheme is feasible and superior to protect users' privacy in cloud-based services.展开更多
Cloud-based satellite and terrestrial spectrum shared networks(CB-STSSN)combines the triple advantages of efficient and flexible net-work management of heterogeneous cloud access(H-CRAN),vast coverage of satellite net...Cloud-based satellite and terrestrial spectrum shared networks(CB-STSSN)combines the triple advantages of efficient and flexible net-work management of heterogeneous cloud access(H-CRAN),vast coverage of satellite networks,and good communication quality of terrestrial networks.Thanks to the complementary coverage characteristics,any-time and anywhere high-speed communications can be achieved to meet the various needs of users.The scarcity of spectrum resources is a common prob-lem in both satellite and terrestrial networks.In or-der to improve resource utilization,the spectrum is shared not only within each component but also be-tween satellite beams and terrestrial cells,which intro-duces inter-component interferences.To this end,this paper first proposes an analytical framework which considers the inter-component interferences induced by spectrum sharing(SS).An intelligent SS scheme based on radio map(RM)consisting of LSTM-based beam prediction(BP),transfer learning-based spec-trum prediction(SP)and joint non-preemptive prior-ity and preemptive priority(J-NPAP)-based propor-tional fair spectrum allocation is than proposed.The simulation result shows that the spectrum utilization rate of CB-STSSN is improved and user blocking rate and waiting probability are decreased by the proposed scheme.展开更多
An e-tag used on the freeway is a kind of passive sensors composed of sensors and radio- frequency identification (RFID) tags. The principle of the electronic toll collection system is that the sensor emits radio wa...An e-tag used on the freeway is a kind of passive sensors composed of sensors and radio- frequency identification (RFID) tags. The principle of the electronic toll collection system is that the sensor emits radio waves touching the e-tag within a certain range, the e-tag will respond to the radio waves by induction, and the sensor will read and write information of the vehicles. Although the RFID technology is popularly used in campus management systems, there is no e-tag technology application used in a campus parking system. In this paper, we use the e-tag technology on a campus parking management system based on the cloud-based construction. By this, it helps to achieve automated and standardized management of the campus parking system, enhance management efficiency, reduce the residence time of the vehicles at the entrances and exits, and improve the efficiency of vehicles parked at the same time.展开更多
Mixed redundancy strategies are generally used in cloud-based systems,with different node switch mechanisms from traditional fault-tolerant strategies.Existing studies often concentrate on optimizing a single strategy...Mixed redundancy strategies are generally used in cloud-based systems,with different node switch mechanisms from traditional fault-tolerant strategies.Existing studies often concentrate on optimizing a single strategy in cloud computing environment and ignore the impact of mixed redundancy strategies.Therefore,a model is proposed to evaluate and optimize the reliability and performance of cloud-based degraded systems subject to a mixed active and cold standby redundancy strategy.In this strategy,node switching is triggered by a continual monitoring and detection mechanism when active nodes fail.To evaluate the transient availability and the expected job completion rate of systems with such kind of strategy,a continuous-time Markov chain model is built on the state transition process and a numerical method is used to solve the model.To choose the optimal redundancy for the mixed strategy under system constraints,a greedy search algorithm is proposed after sensitivity analysis.Illustrative examples were presented to explain the process of calculating the transient probability of each system state and in turn,the availability and performance of the whole system.It was shown that the near-optimal redundancy solution could be obtained using the optimizationmethod.The comparison with optimization of the traditional mixed redundancy strategy proved that the system behavior was different using different kinds of mixed strategies and less redundancy was assigned for the new type of mixed strategy under the same system constraint.展开更多
The rapid development of short video platforms poses new challenges for traditional recommendation systems.Recommender systems typically depend on two types of user behavior feedback to construct user interest profile...The rapid development of short video platforms poses new challenges for traditional recommendation systems.Recommender systems typically depend on two types of user behavior feedback to construct user interest profiles:explicit feedback(interactive behavior),which significantly influences users’short-term interests,and implicit feedback(viewing time),which substantially affects their long-term interests.However,the previous model fails to distinguish between these two feedback methods,leading it to predict only the overall preferences of users based on extensive historical behavior sequences.Consequently,it cannot differentiate between users’long-term and shortterm interests,resulting in low accuracy in describing users’interest states and predicting the evolution of their interests.This paper introduces a video recommendationmodel calledCAT-MFRec(CrossAttention Transformer-Mixed Feedback Recommendation)designed to differentiate between explicit and implicit user feedback within the DIEN(Deep Interest Evolution Network)framework.This study emphasizes the separate learning of the two types of behavioral feedback,effectively integrating them through the cross-attention mechanism.Additionally,it leverages the long sequence dependence capabilities of Transformer technology to accurately construct user interest profiles and predict the evolution of user interests.Experimental results indicate that CAT-MF Rec significantly outperforms existing recommendation methods across various performance indicators.This advancement offers new theoretical and practical insights for the development of video recommendations,particularly in addressing complex and dynamic user behavior patterns.展开更多
基金supported by the State Key Program of National Natural Science Foundation of China(No.61233003)National Natural Science Foundation of China(No.61503358)
文摘The rapid progress of cloud technology has attracted a growing number of video providers to consider deploying their streaming services onto cloud platform for more cost-effective, scalable and reliable performance. In this paper, we utilize Markov decision process model to formulate the dynamic deployment of cloud-based video services over multiple geographically distributed datacenters. We focus on maximizing the average profits for the video service provider over a long run and introduce an average performance criteria which reflects the cost and user experience jointly. We develop an optimal algorithm based on the sensitivity analysis and sample-based policy iteration to obtain the optimal video placement and request dispatching strategy. We demonstrate the optimality of our algorithm with theoretical proof and specify the practical feasibility of our algorithm. We conduct simulations to evaluate the performance of our algorithm and the results show that our strategy can effectively cut down the total cost and guarantee users' quality of experience (QoE).
文摘With the rapid development of generative artificial intelligence(GenAI),the task of story visualization,which transforms natural language narratives into coherent and consistent image sequences,has attracted growing research attention.However,existing methods still face limitations in balancing multi-frame character consistency and generation efficiency,which restricts their feasibility for large-scale practical applications.To address this issue,this study proposes a modular cloud-based distributed system built on Stable Diffusion.By separating the character generation and story generation processes,and integratingmulti-feature control techniques,a cachingmechanism,and an asynchronous task queue architecture,the system enhances generation efficiency and scalability.The experimental design includes both automated and human evaluations of character consistency,performance testing,and multinode simulation.The results show that the proposed system outperforms the baseline model StoryGen in both CLIP-I and human evaluation metrics.In terms of performance,under the experimental environment of this study,dual-node deployment reduces average waiting time by approximately 19%,while the four-node simulation further reduces it by up to 65%.Overall,this study demonstrates the advantages of cloud-distributed GenAI in maintaining character consistency and reducing generation latency,highlighting its potential value inmulti-user collaborative story visualization applications.
基金Shanxi Province Higher Education Science and Technology Innovation Fund Project(2022-676)Shanxi Soft Science Program Research Fund Project(2016041008-6)。
文摘In order to improve the efficiency of cloud-based web services,an improved plant growth simulation algorithm scheduling model.This model first used mathematical methods to describe the relationships between cloud-based web services and the constraints of system resources.Then,a light-induced plant growth simulation algorithm was established.The performance of the algorithm was compared through several plant types,and the best plant model was selected as the setting for the system.Experimental results show that when the number of test cloud-based web services reaches 2048,the model being 2.14 times faster than PSO,2.8 times faster than the ant colony algorithm,2.9 times faster than the bee colony algorithm,and a remarkable 8.38 times faster than the genetic algorithm.
基金supported by National Science Foundation Grants(ECCS-1405594).
文摘Cloud-based video communication and networking has emerged as a promising new research paradigm to significantly improve the quality of experience for video consumers.An architectural overview of this promising research area was presented.This overview with an end-to-end partition of the cloud-based video system into major blocks with respect to their locations from the center of the cloud to the edge of the cloud was started.Following this partition,existing research efforts on how the principles of cloud computing can provide unprecedented support to 1)video servers,2)content delivery networks,and 3)edge networks within the global cloud video ecosystems were examined.Moreover,a case study was exemplfied on an edge cloud assisted HTTP adaptive video streaming to demonstrate the effectiveness of cloud computing support.Finally,by envisioning a list of future research topics in cloud-based video communication and networking a coclusion is made.
文摘Precisely forecasting the performance of Deep Learning(DL)models,particularly in critical areas such as Uniform Resource Locator(URL)-based threat detection,aids in improving systems developed for difficult tasks.In cybersecurity,recognizing harmful URLs is vital to lowering risks associated with phishing,malware,and other online-based attacks.Since it directly affects the model’s capacity to differentiate between benign and harmful URLs,finding the optimum mix of hyperparameters in DL models is a significant difficulty.Two commonly used architectures for sequential and spatial data processing,Long Short-Term Memory(LSTM)/Gated Recurrent Unit(GRU)and Convolutional Neural Network(CNN)/Long Short-Term Memory(LSTM)models are targeted in this study to have higher predictive capacity by modifying crucial hyperparameters such as learning rate,batch size,and dropout rate using cloud capability.Research finds the best settings for the models by testing 50 dropout rates(between 0.1 and 0.5)with different learning rates and batch sizes.Performances were measured in the form of accuracy,precision,recall,F1-score,and errors such as Mean Absolute Error(MAE),Mean Squared Error(MSE),Root Mean Squared Error(RMSE)and Mean Absolute Percent Error(MAPE).In our results,CNN/LSTM performed better often than LSTM/GRU,with up to 10%better F1-score and much lower MAPE when the learning rate was 0.001 and the dropout rate was 0.2.These results show the value of fine-tuning hyperparameters to increase model performance and reduce errors.Higher on many of the parameters,CNN/LSTM architecture became obvious as the more trustworthy one.It also discussed the importance of DL in enhancing URL attack detection mechanisms to provide increased accuracy and precision for real-world cybersecurity.
文摘The integration of 5G technology with cloud-based control systems in industrial robots holds significant promise for the future of industrial automation.With its ultra-low latency,high data transfer speeds,and massive connectivity,5G is poised to revolutionize real-time communication and coordination in manufacturing environments.This paper explores the prospects and challenges of applying 5G technology in industrial robots,focusing on cloud-based control systems that enable scalable,flexible,and efficient operations.Key advantages of 5G,including improved communication speed,enhanced real-time control,scalability,and predictive maintenance capabilities,are discussed.However,the transition to 5G also presents challenges,such as network reliability,security concerns,integration with legacy systems,and high implementation costs.The paper also examines case studies in the automotive,electronics,and aerospace industries,providing real-world examples of 5G adoption in industrial automation.The conclusion highlights key insights and outlines potential research directions for overcoming existing barriers and fully realizing the potential of 5G technology in industrial robot control.
基金supported by UKRI(EP/Z000025/1)Horizon Europe Programme under the MSCA grant for the ACMod project(101130271)。
文摘The exponential growth of video content has driven significant advancements in video summarization techniques in recent years.Breakthroughs in deep learning have been particularly transformative,enabling more effective detection of key information and creating new possibilities for video synopsis.To summarize recent progress and accelerate research in this field,this paper provides a comprehensive review of deep learning-based video summarization methods developed over the past decade.We begin by examining the research landscape of video abstraction technologies and identifying core challenges in video summarization.Subsequently,we systematically analyze prevailing deep learning frameworks and methodologies employed in current video summarization systems,offering researchers a clear roadmap of the field's evelution.Unlike previous review works,we first classify research papers based on the structural hierarchy of the video(from frame-level to shot-level to video-level),then further categorize them according to the summary backbone model(feature extraction and spatiotemporal modeling).This approach provides a more systematic and hierarchical organization of the documents.Following this comprehensive review,we summarize the benchmark datasets and evaluation metrics commonly employed in the field.Finally,we analyze persistent challenges and propose insightful directions for future research,providing a forward-looking perspective on video summarization technologies.This systematic literature review is of great reference value to new researchers exploring the fields of deep learning and video summarization.
基金supported by the National Natural Science Foundation of China(Grant No.72334003)the National Key Research and Development Program of China(Grant No.2022YFB2702804)+1 种基金the Shandong Key Research and Development Program(Grant No.2020ZLYS09)the Jinan Program(Grant No.2021GXRC084-2).
文摘With the continuous advancement of unmanned technology in various application domains,the development and deployment of blind-spot-free panoramic video systems have gained increasing importance.Such systems are particularly critical in battlefield environments,where advanced panoramic video processing and wireless communication technologies are essential to enable remote control and autonomous operation of unmanned ground vehicles(UGVs).However,conventional video surveillance systems suffer from several limitations,including limited field of view,high processing latency,low reliability,excessive resource consumption,and significant transmission delays.These shortcomings impede the widespread adoption of UGVs in battlefield settings.To overcome these challenges,this paper proposes a novel multi-channel video capture and stitching system designed for real-time video processing.The system integrates the Speeded-Up Robust Features(SURF)algorithm and the Fast Library for Approximate Nearest Neighbors(FLANN)algorithm to execute essential operations such as feature detection,descriptor computation,image matching,homography estimation,and seamless image fusion.The fused panoramic video is then encoded and assembled to produce a seamless output devoid of stitching artifacts and shadows.Furthermore,H.264 video compression is employed to reduce the data size of the video stream without sacrificing visual quality.Using the Real-Time Streaming Protocol(RTSP),the compressed stream is transmitted efficiently,supporting real-time remote monitoring and control of UGVs in dynamic battlefield environments.Experimental results indicate that the proposed system achieves high stability,flexibility,and low latency.With a wireless link latency of 30 ms,the end-to-end video transmission latency remains around 140 ms,enabling smooth video communication.The system can tolerate packet loss rates(PLR)of up to 20%while maintaining usable video quality(with latency around 200 ms).These properties make it well-suited for mobile communication scenarios demanding high real-time video performance.
基金supported by the Cultivation Program for Major Scientific Research Projects of Harbin Institute of Technology(ZDXMPY20180109).
文摘Scalable simulation leveraging real-world data plays an essential role in advancing autonomous driving,owing to its efficiency and applicability in both training and evaluating algorithms.Consequently,there has been increasing attention on generating highly realistic and consistent driving videos,particularly those involving viewpoint changes guided by the control commands or trajectories of ego vehicles.However,current reconstruction approaches,such as Neural Radiance Fields and 3D Gaussian Splatting,frequently suffer from limited generalization and depend on substantial input data.Meanwhile,2D generative models,though capable of producing unknown scenes,still have room for improvement in terms of coherence and visual realism.To overcome these challenges,we introduce GenScene,a world model that synthesizes front-view driving videos conditioned on trajectories.A new temporal module is presented to improve video consistency by extracting the global context of each frame,calculating relationships of frames using these global representations,and fusing frame contexts accordingly.Moreover,we propose an innovative attention mechanism that computes relations of pixels within each frame and pixels in the corresponding window range of the initial frame.Extensive experiments show that our approach surpasses various state-of-the-art models in driving video generation,and the introduced modules contribute significantly to model performance.This work establishes a new paradigm for goal-oriented video synthesis in autonomous driving,which facilitates on-demand simulation to expedite algorithm development.
基金supported,in part,by the National Nature Science Foundation of China under Grant 62272236,62376128in part,by the Natural Science Foundation of Jiangsu Province under Grant BK20201136,BK20191401.
文摘Video emotion recognition is widely used due to its alignment with the temporal characteristics of human emotional expression,but existingmodels have significant shortcomings.On the one hand,Transformermultihead self-attention modeling of global temporal dependency has problems of high computational overhead and feature similarity.On the other hand,fixed-size convolution kernels are often used,which have weak perception ability for emotional regions of different scales.Therefore,this paper proposes a video emotion recognition model that combines multi-scale region-aware convolution with temporal interactive sampling.In terms of space,multi-branch large-kernel stripe convolution is used to perceive emotional region features at different scales,and attention weights are generated for each scale feature.In terms of time,multi-layer odd-even down-sampling is performed on the time series,and oddeven sub-sequence interaction is performed to solve the problem of feature similarity,while reducing computational costs due to the linear relationship between sampling and convolution overhead.This paper was tested on CMU-MOSI,CMU-MOSEI,and Hume Reaction.The Acc-2 reached 83.4%,85.2%,and 81.2%,respectively.The experimental results show that the model can significantly improve the accuracy of emotion recognition.
文摘Background:This study aims to investigate the underlying mechanisms between parental marital conflict and adolescent short video dependence by constructing a chain mediation model,focusing on the mediating roles of experiential avoidance and emotional disturbance(anxiety,depression,and stress).Methods:Conducted in January 2025,the research recruited 4125 adolescents from multiple Chinese provinces through convenience sampling;after data cleaning,3957 valid participants(1959 males,1998 females)were included.Using a cross-sectional design,measures included parental marital conflict,experiential avoidance,anxiety,depression,stress,and short video dependence.Results:Pearson correlation analysis revealed significant positive correlations among all variables.Mediation analysis using the SPSS PROCESS macro showed that parental marital conflict directly predicted short video dependence(β=0.269,p<0.001),and also significantly predicted experiential avoidance(β=0.519,p<0.001),anxiety(β=0.072,p<0.001),depression(β=0.067,p<0.001),and stress(β=0.048,p<0.05).Experiential avoidance further predicted anxiety(β=0.521,p<0.001),depression(β=0.489,p<0.001),stress(β=0.408,p<0.001),and short video dependence(β=0.244,p<0.001).While both anxiety(β=0.050,p<0.05)and depression(β=0.116,p<0.001)positively predicted short video dependence,stress did not(β=0.019,p=0.257).Overall,experiential avoidance,anxiety,depression,and stress significantly mediated the relationship between parental marital conflict and short video dependence.Conclusion:These findings confirm that parental marital conflict not only directly influences adolescent short video dependence but also operates through a chain mediation pathway involving experiential avoidance and emotional disturbance,highlighting central psychological mechanisms and providing theoretical support for integrated mental health and behavioral interventions.
基金funded by the Guangxi Philosophy and Social Science Research Project,grant number 24XWC002.
文摘Background:In the Chinese context,the impact of short video applications on the psychological well-being of older adults is contested.While often examined through a pathological lens of addiction,this perspective may overlook paradoxical,context-dependent positive outcomes.Therefore,the main objective of this study is to challenge the traditional Compensatory Internet Use Theory by proposing and testing a chained mediation model that explores a paradoxical pathway from social support to life satisfaction via problematic social media use.Methods:Data were collected between July and August 2025 via the Credamo online survey platform,yielding 384 valid responses from Chinese older adults aged 60 and above.Key constructs were assessed using the Social Support Rating Scale(SSRS),Bergen Social Media Addiction Scale(BSMAS),Simplified UCLA Loneliness Scale,and Satisfaction with Life Scale(SWLS).A chained mediation model was tested using stepwise regression and non-parametric bootstrapping(5000 resamples),controlling for age,gender,household income,and health status.Results:The analysis revealed a paradoxical pathway,which was clarified by a key statistical suppression effect.Social support significantly and positively predicted problematic usage(β=0.157,p=0.002).After controlling for the suppressor effect of social support,problematic usage in turn negatively predicted social connectedness(β=−0.177,p<0.001).Finally,reduced social connectedness—reflecting a state of solitude—positively predicted life satisfaction(β=−0.227,p<0.001).Conclusion:The findings suggest that for older adults with sufficient offline social support,these resources may serve a“social empowerment”function.This empowerment allows behaviors measured as“problematic usage”to be theoretically reframed as a form of“deep immersive entertainment”.This immersion appears to occur alongside a state of“high-quality solitude”,which ultimately is associated with higher life satisfaction.This study provides a novel,non-pathological theoretical perspective on the consequences of high engagement with emerging social media,offering empirical grounds for non-abstinence-based intervention strategies.
文摘The concept of sharing of personal health data over cloud storage in a healthcare-cyber physical system has become popular in recent times as it improves access quality.The privacy of health data can only be preserved by keeping it in an encrypted form,but it affects usability and flexibility in terms of effective search.Attribute-based searchable encryption(ABSE)has proven its worth by providing fine-grained searching capabilities in the shared cloud storage.However,it is not practical to apply this scheme to the devices with limited resources and storage capacity because a typical ABSE involves serious computations.In a healthcare cloud-based cyber-physical system(CCPS),the data is often collected by resource-constraint devices;therefore,here also,we cannot directly apply ABSE schemes.In the proposed work,the inherent computational cost of the ABSE scheme is managed by executing the computationally intensive tasks of a typical ABSE scheme on the blockchain network.Thus,it makes the proposed scheme suitable for online storage and retrieval of personal health data in a typical CCPS.With the assistance of blockchain technology,the proposed scheme offers two main benefits.First,it is free from a trusted authority,which makes it genuinely decentralized and free from a single point of failure.Second,it is computationally efficient because the computational load is now distributed among the consensus nodes in the blockchain network.Specifically,the task of initializing the system,which is considered the most computationally intensive,and the task of partial search token generation,which is considered as the most frequent operation,is now the responsibility of the consensus nodes.This eliminates the need of the trusted authority and reduces the burden of data users,respectively.Further,in comparison to existing decentralized fine-grained searchable encryption schemes,the proposed scheme has achieved a significant reduction in storage and computational cost for the secret key associated with users.It has been verified both theoretically and practically in the performance analysis section.
基金Supported by National Key Research and Development Program of China (Grant No.2018YFB1700704)National Natural Science Foundation of China (Grant No.52075068)。
文摘This paper presents a cloud-based data-driven design optimization system,named DADOS,to help engineers and researchers improve a design or product easily and efficiently.DADOS has nearly 30 key algorithms,including the design of experiments,surrogate models,model validation and selection,prediction,optimization,and sensitivity analysis.Moreover,it also includes an exclusive ensemble surrogate modeling technique,the extended hybrid adaptive function,which can make use of the advantages of each surrogate and eliminate the effort of selecting the appropriate individual surrogate.To improve ease of use,DADOS provides a user-friendly graphical user interface and employed flow-based programming so that users can conduct design optimization just by dragging,dropping,and connecting algorithm blocks into a workflow instead of writing massive code.In addition,DADOS allows users to visualize the results to gain more insights into the design problems,allows multi-person collaborating on a project at the same time,and supports multi-disciplinary optimization.This paper also details the architecture and the user interface of DADOS.Two examples were employed to demonstrate how to use DADOS to conduct data-driven design optimization.Since DADOS is a cloud-based system,anyone can access DADOS at www.dados.com.cn using their web browser without the need for installation or powerful hardware.
基金financially supported by the National Natural Science Foundation of China(No.61303216,No.61272457,No.U1401251,and No.61373172)the National High Technology Research and Development Program of China(863 Program)(No.2012AA013102)National 111 Program of China B16037 and B08038
文摘With the rapid development of computer technology, cloud-based services have become a hot topic. They not only provide users with convenience, but also bring many security issues, such as data sharing and privacy issue. In this paper, we present an access control system with privilege separation based on privacy protection(PS-ACS). In the PS-ACS scheme, we divide users into private domain(PRD) and public domain(PUD) logically. In PRD, to achieve read access permission and write access permission, we adopt the Key-Aggregate Encryption(KAE) and the Improved Attribute-based Signature(IABS) respectively. In PUD, we construct a new multi-authority ciphertext policy attribute-based encryption(CP-ABE) scheme with efficient decryption to avoid the issues of single point of failure and complicated key distribution, and design an efficient attribute revocation method for it. The analysis and simulation result show that our scheme is feasible and superior to protect users' privacy in cloud-based services.
基金the National Nat-ural Science Foundation of China under Grants 61771163the Natural Science Foundation for Out-standing Young Scholars of Heilongjiang Province un-der Grant YQ2020F001the Science and Technol-ogy on Communication Networks Laboratory under Grants SXX19641X072 and SXX18641X028.(Cor-respondence author:Min Jia)。
文摘Cloud-based satellite and terrestrial spectrum shared networks(CB-STSSN)combines the triple advantages of efficient and flexible net-work management of heterogeneous cloud access(H-CRAN),vast coverage of satellite networks,and good communication quality of terrestrial networks.Thanks to the complementary coverage characteristics,any-time and anywhere high-speed communications can be achieved to meet the various needs of users.The scarcity of spectrum resources is a common prob-lem in both satellite and terrestrial networks.In or-der to improve resource utilization,the spectrum is shared not only within each component but also be-tween satellite beams and terrestrial cells,which intro-duces inter-component interferences.To this end,this paper first proposes an analytical framework which considers the inter-component interferences induced by spectrum sharing(SS).An intelligent SS scheme based on radio map(RM)consisting of LSTM-based beam prediction(BP),transfer learning-based spec-trum prediction(SP)and joint non-preemptive prior-ity and preemptive priority(J-NPAP)-based propor-tional fair spectrum allocation is than proposed.The simulation result shows that the spectrum utilization rate of CB-STSSN is improved and user blocking rate and waiting probability are decreased by the proposed scheme.
文摘An e-tag used on the freeway is a kind of passive sensors composed of sensors and radio- frequency identification (RFID) tags. The principle of the electronic toll collection system is that the sensor emits radio waves touching the e-tag within a certain range, the e-tag will respond to the radio waves by induction, and the sensor will read and write information of the vehicles. Although the RFID technology is popularly used in campus management systems, there is no e-tag technology application used in a campus parking system. In this paper, we use the e-tag technology on a campus parking management system based on the cloud-based construction. By this, it helps to achieve automated and standardized management of the campus parking system, enhance management efficiency, reduce the residence time of the vehicles at the entrances and exits, and improve the efficiency of vehicles parked at the same time.
基金supported by the National Natural Science Foundation of China(Grant No.61309005)the Basic and Frontier Research Program of Chongqing(Grant No.cstc2014jcyj A40015)
文摘Mixed redundancy strategies are generally used in cloud-based systems,with different node switch mechanisms from traditional fault-tolerant strategies.Existing studies often concentrate on optimizing a single strategy in cloud computing environment and ignore the impact of mixed redundancy strategies.Therefore,a model is proposed to evaluate and optimize the reliability and performance of cloud-based degraded systems subject to a mixed active and cold standby redundancy strategy.In this strategy,node switching is triggered by a continual monitoring and detection mechanism when active nodes fail.To evaluate the transient availability and the expected job completion rate of systems with such kind of strategy,a continuous-time Markov chain model is built on the state transition process and a numerical method is used to solve the model.To choose the optimal redundancy for the mixed strategy under system constraints,a greedy search algorithm is proposed after sensitivity analysis.Illustrative examples were presented to explain the process of calculating the transient probability of each system state and in turn,the availability and performance of the whole system.It was shown that the near-optimal redundancy solution could be obtained using the optimizationmethod.The comparison with optimization of the traditional mixed redundancy strategy proved that the system behavior was different using different kinds of mixed strategies and less redundancy was assigned for the new type of mixed strategy under the same system constraint.
基金supported by National Natural Science Foundation of China(62072416)Key Research and Development Special Project of Henan Province(221111210500)Key TechnologiesR&DProgram of Henan rovince(232102211053,242102211071).
文摘The rapid development of short video platforms poses new challenges for traditional recommendation systems.Recommender systems typically depend on two types of user behavior feedback to construct user interest profiles:explicit feedback(interactive behavior),which significantly influences users’short-term interests,and implicit feedback(viewing time),which substantially affects their long-term interests.However,the previous model fails to distinguish between these two feedback methods,leading it to predict only the overall preferences of users based on extensive historical behavior sequences.Consequently,it cannot differentiate between users’long-term and shortterm interests,resulting in low accuracy in describing users’interest states and predicting the evolution of their interests.This paper introduces a video recommendationmodel calledCAT-MFRec(CrossAttention Transformer-Mixed Feedback Recommendation)designed to differentiate between explicit and implicit user feedback within the DIEN(Deep Interest Evolution Network)framework.This study emphasizes the separate learning of the two types of behavioral feedback,effectively integrating them through the cross-attention mechanism.Additionally,it leverages the long sequence dependence capabilities of Transformer technology to accurately construct user interest profiles and predict the evolution of user interests.Experimental results indicate that CAT-MF Rec significantly outperforms existing recommendation methods across various performance indicators.This advancement offers new theoretical and practical insights for the development of video recommendations,particularly in addressing complex and dynamic user behavior patterns.