Unmanned aerial vehicles(UAVs)bring more innovation and attraction to outdoor mobile high-definition(HD)live streaming with its unique perspective.Due to the heavy computational requirements of HD live broadcast tasks...Unmanned aerial vehicles(UAVs)bring more innovation and attraction to outdoor mobile high-definition(HD)live streaming with its unique perspective.Due to the heavy computational requirements of HD live broadcast tasks and the limited hardware performance of UAV equipment,how to reduce the system response delay and improve the energy efficiency of terminal equipment directly affects the secure broadcast of the system.Secure task offloading in this scenario is considered a promising solution and has received academic attention.In this paper,we simulate the UAV-aided outdoor mobile HD live streaming scenarios and optimize the relevant task offloading strategies.First,we design the total cost function of task offloading that jointly optimizes secure time latency and energy consumption.Additionally,we propose a collaborative computing model for multi-UAV task offloading.This model combines the idea of simulated annealing(SA)and introduces the compression factor to enhance the particle swarm optimization(PSO)to realize secure task offloading.The simulation results show that the proposed strategy has better performance in balancing network latency and energy consumption.Compared with the discrete teaching–learning-based optimization(DTLBO)and quantum PSO(QPSO)task offloading strategies,the fitness value of the proposed strategy is decreased by an average of 26.73%and 16.42%,respectively.展开更多
With the continuous advancement of unmanned technology in various application domains,the development and deployment of blind-spot-free panoramic video systems have gained increasing importance.Such systems are partic...With the continuous advancement of unmanned technology in various application domains,the development and deployment of blind-spot-free panoramic video systems have gained increasing importance.Such systems are particularly critical in battlefield environments,where advanced panoramic video processing and wireless communication technologies are essential to enable remote control and autonomous operation of unmanned ground vehicles(UGVs).However,conventional video surveillance systems suffer from several limitations,including limited field of view,high processing latency,low reliability,excessive resource consumption,and significant transmission delays.These shortcomings impede the widespread adoption of UGVs in battlefield settings.To overcome these challenges,this paper proposes a novel multi-channel video capture and stitching system designed for real-time video processing.The system integrates the Speeded-Up Robust Features(SURF)algorithm and the Fast Library for Approximate Nearest Neighbors(FLANN)algorithm to execute essential operations such as feature detection,descriptor computation,image matching,homography estimation,and seamless image fusion.The fused panoramic video is then encoded and assembled to produce a seamless output devoid of stitching artifacts and shadows.Furthermore,H.264 video compression is employed to reduce the data size of the video stream without sacrificing visual quality.Using the Real-Time Streaming Protocol(RTSP),the compressed stream is transmitted efficiently,supporting real-time remote monitoring and control of UGVs in dynamic battlefield environments.Experimental results indicate that the proposed system achieves high stability,flexibility,and low latency.With a wireless link latency of 30 ms,the end-to-end video transmission latency remains around 140 ms,enabling smooth video communication.The system can tolerate packet loss rates(PLR)of up to 20%while maintaining usable video quality(with latency around 200 ms).These properties make it well-suited for mobile communication scenarios demanding high real-time video performance.展开更多
Scalable simulation leveraging real-world data plays an essential role in advancing autonomous driving,owing to its efficiency and applicability in both training and evaluating algorithms.Consequently,there has been i...Scalable simulation leveraging real-world data plays an essential role in advancing autonomous driving,owing to its efficiency and applicability in both training and evaluating algorithms.Consequently,there has been increasing attention on generating highly realistic and consistent driving videos,particularly those involving viewpoint changes guided by the control commands or trajectories of ego vehicles.However,current reconstruction approaches,such as Neural Radiance Fields and 3D Gaussian Splatting,frequently suffer from limited generalization and depend on substantial input data.Meanwhile,2D generative models,though capable of producing unknown scenes,still have room for improvement in terms of coherence and visual realism.To overcome these challenges,we introduce GenScene,a world model that synthesizes front-view driving videos conditioned on trajectories.A new temporal module is presented to improve video consistency by extracting the global context of each frame,calculating relationships of frames using these global representations,and fusing frame contexts accordingly.Moreover,we propose an innovative attention mechanism that computes relations of pixels within each frame and pixels in the corresponding window range of the initial frame.Extensive experiments show that our approach surpasses various state-of-the-art models in driving video generation,and the introduced modules contribute significantly to model performance.This work establishes a new paradigm for goal-oriented video synthesis in autonomous driving,which facilitates on-demand simulation to expedite algorithm development.展开更多
Background:This study aims to investigate the underlying mechanisms between parental marital conflict and adolescent short video dependence by constructing a chain mediation model,focusing on the mediating roles of ex...Background:This study aims to investigate the underlying mechanisms between parental marital conflict and adolescent short video dependence by constructing a chain mediation model,focusing on the mediating roles of experiential avoidance and emotional disturbance(anxiety,depression,and stress).Methods:Conducted in January 2025,the research recruited 4125 adolescents from multiple Chinese provinces through convenience sampling;after data cleaning,3957 valid participants(1959 males,1998 females)were included.Using a cross-sectional design,measures included parental marital conflict,experiential avoidance,anxiety,depression,stress,and short video dependence.Results:Pearson correlation analysis revealed significant positive correlations among all variables.Mediation analysis using the SPSS PROCESS macro showed that parental marital conflict directly predicted short video dependence(β=0.269,p<0.001),and also significantly predicted experiential avoidance(β=0.519,p<0.001),anxiety(β=0.072,p<0.001),depression(β=0.067,p<0.001),and stress(β=0.048,p<0.05).Experiential avoidance further predicted anxiety(β=0.521,p<0.001),depression(β=0.489,p<0.001),stress(β=0.408,p<0.001),and short video dependence(β=0.244,p<0.001).While both anxiety(β=0.050,p<0.05)and depression(β=0.116,p<0.001)positively predicted short video dependence,stress did not(β=0.019,p=0.257).Overall,experiential avoidance,anxiety,depression,and stress significantly mediated the relationship between parental marital conflict and short video dependence.Conclusion:These findings confirm that parental marital conflict not only directly influences adolescent short video dependence but also operates through a chain mediation pathway involving experiential avoidance and emotional disturbance,highlighting central psychological mechanisms and providing theoretical support for integrated mental health and behavioral interventions.展开更多
Background:In the Chinese context,the impact of short video applications on the psychological well-being of older adults is contested.While often examined through a pathological lens of addiction,this perspective may ...Background:In the Chinese context,the impact of short video applications on the psychological well-being of older adults is contested.While often examined through a pathological lens of addiction,this perspective may overlook paradoxical,context-dependent positive outcomes.Therefore,the main objective of this study is to challenge the traditional Compensatory Internet Use Theory by proposing and testing a chained mediation model that explores a paradoxical pathway from social support to life satisfaction via problematic social media use.Methods:Data were collected between July and August 2025 via the Credamo online survey platform,yielding 384 valid responses from Chinese older adults aged 60 and above.Key constructs were assessed using the Social Support Rating Scale(SSRS),Bergen Social Media Addiction Scale(BSMAS),Simplified UCLA Loneliness Scale,and Satisfaction with Life Scale(SWLS).A chained mediation model was tested using stepwise regression and non-parametric bootstrapping(5000 resamples),controlling for age,gender,household income,and health status.Results:The analysis revealed a paradoxical pathway,which was clarified by a key statistical suppression effect.Social support significantly and positively predicted problematic usage(β=0.157,p=0.002).After controlling for the suppressor effect of social support,problematic usage in turn negatively predicted social connectedness(β=−0.177,p<0.001).Finally,reduced social connectedness—reflecting a state of solitude—positively predicted life satisfaction(β=−0.227,p<0.001).Conclusion:The findings suggest that for older adults with sufficient offline social support,these resources may serve a“social empowerment”function.This empowerment allows behaviors measured as“problematic usage”to be theoretically reframed as a form of“deep immersive entertainment”.This immersion appears to occur alongside a state of“high-quality solitude”,which ultimately is associated with higher life satisfaction.This study provides a novel,non-pathological theoretical perspective on the consequences of high engagement with emerging social media,offering empirical grounds for non-abstinence-based intervention strategies.展开更多
AIM: To investigate if high-definition (HD) colonoscope with i-Scan gave a higher detection rate of mucosal le- sions vs standard white-light instruments. METHODS: Data were collected from the computer- ized datab...AIM: To investigate if high-definition (HD) colonoscope with i-Scan gave a higher detection rate of mucosal le- sions vs standard white-light instruments. METHODS: Data were collected from the computer- ized database of the endoscopy unit of our tertiary referral center. We retrospectively analyzed 1101 con- secutive colonoscopies that were performed over 1 year with standard white-light (n = 849) or HD+ with i-Scan (n = 252) instruments by four endoscopists, in an outpatient setting. Colonoscopy records included patients' main details and family history for colorectal cancer, indication for colonoscopy (screening, diagnos- tic or surveillance), type of instrument used (standard white-light or HD+ plus i-Scan), name of endoscopist and bowel preparation. Records for each procedure included whether the cecum was reached or not and the reason for failure, complications during or imme- diately after the procedure, and number, size, location and characteristics of the lesions. Polyps or protruding lesions were defined as sessile or pedunculated, and nonprotruding lesions were defined according to Paris classification. For each lesion, histological diagnosis was recorded. RESULTS: Eight hundred and forty-nine colonosco- pies were carried with the standard white-light video colonoscope and 252 with the HD+ plus i-Scan video colonoscope, The four endoscopists did 264, 300, 276 and 261 procedures, respectively; 21.6%, 24.0%, 21.7% and 24.1% of them with the HD+ plus i-Scan technique. There were no significant differences be- tween the four endoscopists in either the number of procedures done or the proportions of each imaging technique used. Both techniques detected one or more mucosal lesions in 522/1101 procedures (47.4%). The overall number of lesions recognized was 1266; 645 in the right colon and 621 in the left. A significantly higher number of colonoscopies recognized lesions in the HD+ plus i-Scan mode (171/252 = 67.9%) than with the standard white-light technique (408/849 = 48.1%) (P 〈 0.0001). HD+ with i-Scan colonoscopies identified more lesions than standard white-light imag- ing (459/252 and 807/849, P 〈 0.0001), in the right or left colon (mean :1: SD, 1.62±1.36 vs 1.33±0.73, P 〈 0.003 and 1.55±0.98 vs 1.17±0.93, P = 0.033), more lesions 〈 10 mm (P 〈 0.0001) or nonprotruding (P 〈 0.022), and flat polyps (P = 0.04). The cumulative mean number of lesions per procedure detected by the four endoscopists was significantly higher with HD+ with i-Scan than with standard white-light imaging (1.82 ± 2.89 vs 0.95± 1.35, P 〈 0.0001). CONCLUSION: HD imaging with i-Scan during the withdrawal phase of colonoscopy significantly increased the detection of colonic mucosal lesions, particularly small and nonprotruding polyps.展开更多
High-definition(HD)maps are key components that provide rich topologic and semantic information for decision-making in vehicle autonomous driving systems.A complete ground orthophoto is usually used as the base image ...High-definition(HD)maps are key components that provide rich topologic and semantic information for decision-making in vehicle autonomous driving systems.A complete ground orthophoto is usually used as the base image to construct the HD map.The ground orthophoto is obtained through inverse perspective transformation and image mosaicing.During the image mosaicing,multiple consecutive orthophotos are stitched together using pose information and image registration.In this study,wavelet transform is introduced to the image mosaicing process to alleviate the information loss caused by image overlapping.In the orthophoto wavelet transform,high-frequency and low-frequency components are fused using different strategies to form a complete base image with clearer local details.Experimental results show that the accuracy of the orthophotos generated using this method is improved.展开更多
Objective This study aimed to compare the performance of standard-definition white-light endoscopy(SD-WL),high-definition white-light endoscopy(HD-WL),and high-definition narrow-band imaging(HD-NBI)in detecting colore...Objective This study aimed to compare the performance of standard-definition white-light endoscopy(SD-WL),high-definition white-light endoscopy(HD-WL),and high-definition narrow-band imaging(HD-NBI)in detecting colorectal lesions in the Chinese population.Methods This was a multicenter,single-blind,randomized,controlled trial with a non-inferiority design.Patients undergoing endoscopy for physical examination,screening,and surveillance were enrolled from July 2017 to December 2020.The primary outcome measure was the adenoma detection rate(ADR),defined as the proportion of patients with at least one adenoma detected.The associated factors for detecting adenomas were assessed using univariate and multivariate logistic regression.Results Out of 653 eligible patients enrolled,data from 596 patients were analyzed.The ADRs were 34.5%in the SD-WL group,33.5%in the HD-WL group,and 37.5%in the HD-NBI group(P=0.72).The advanced neoplasm detection rates(ANDRs)in the three arms were 17.1%,15.5%,and 10.4%(P=0.17).No significant differences were found between the SD group and HD group regarding ADR or ANDR(ADR:34.5%vs.35.6%,P=0.79;ANDR:17.1%vs.13.0%,P=0.16,respectively).Similar results were observed between the HD-WL group and HD-NBI group(ADR:33.5%vs.37.7%,P=0.45;ANDR:15.5%vs.10.4%,P=0.18,respectively).In the univariate and multivariate logistic regression analyses,neither HD-WL nor HD-NBI led to a significant difference in overall adenoma detection compared to SD-WL(HD-WL:OR 0.91,P=0.69;HD-NBI:OR 1.15,P=0.80).Conclusion HD-NBI and HD-WL are comparable to SD-WL for overall adenoma detection among Chinese outpatients.It can be concluded that HD-NBI or HD-WL is not superior to SD-WL,but more effective instruction may be needed to guide the selection of different endoscopic methods in the future.Our study’s conclusions may aid in the efficient allocation and utilization of limited colonoscopy resources,especially advanced imaging technologies.展开更多
AIM:To examine performances regarding prediction of polyp histology using high-definition (HD) i-scan in a group of endoscopists with varying levels of experience. METHODS:We used a digital library of HD i-scan still ...AIM:To examine performances regarding prediction of polyp histology using high-definition (HD) i-scan in a group of endoscopists with varying levels of experience. METHODS:We used a digital library of HD i-scan still images, comprising twin pictures (surface enhancement and tone enhancement), collected at our university hospital. We defined endoscopic features of adenomatous and non-adenomatous polyps, according to the following parameters:color, surface pattern and vascular pattern. We familiarized the participating endoscopists on optical diagnosis of colorectal polyps using a 20-min didactic training session. All endoscopists were asked to evaluate an image set of 50 colorectal polyps with regard to polyp histology. We classified the diagnoses into high confidence (i.e., cases in which the endoscopist could assign a diagnosis with certainty) and low confidence diagnoses (i.e., cases in which the endoscopist preferred to send the polyp for formal histology). Mean sensitivity, specificity and accuracy per endoscopist/image were computed and differences between groups tested using independent-samples t tests. High vs low confidence diagnoses were compared using the pairedsamples t test. RESULTS:Eleven endoscopists without previous experience on optical diagnosis evaluated a total of 550 images (396 adenomatous, 154 non-adenomatous). Mean sensitivity, specificity and accuracy for diagnosing adenomas were 79.3%, 85.7% and 81.1%, respectively. No significant differences were found between gastroenterologists and trainees regarding performances of optical diagnosis (mean accuracy 78.0%vs 82.9%,P = 0.098). Diminutive lesions were predicted with a lower mean accuracy as compared to non-diminutive lesions (74.2% vs 93.1%, P = 0.008). A total of 446 (81.1%) diagnoses were made with high confidence. High confidence diagnoses corresponded to a significantly higher mean accuracy than low confidence diagnoses (84.0% vs 64.3%, P = 0.008). A total of 319 (58.0%) images were evaluated as having excellent quality. Considering excellent quality images in conjunction with high confidence diagnosis, overall accuracy increased to 92.8%. CONCLUSION:After a single training session, endoscopists with varying levels of experience can already provide optical diagnosis with an accuracy of 84.0%.展开更多
There is an increasing demand for supporting high-quality real-time audiovisual services for the next generation wired and wireless networks. However, due to variety of bandwidths of different networks, it is a great ...There is an increasing demand for supporting high-quality real-time audiovisual services for the next generation wired and wireless networks. However, due to variety of bandwidths of different networks, it is a great challenge for deployment. In this paper, a novel high-definition (HD) video transmission system was proposed which depends upon reliable compound multicast protocols and QoS control over the various kinds of networks. This system detects client's network condition and assigns it to a proper proxy. Each proxy is capable of detecting network parameters and adaptively tuning such transport parameters as bit rate, video resolution, frame rate and QoS mechanisms to this condition. It also provides FEC error recovery under consideration of characteristics of MPEG4 video codec. Our simulation demonstrates that different network clients such as ADSL, CERNET, and CERNET2 can receive more video reliability with less delay.展开更多
Identifying the compound formulae-related xenobiotics in bio-samples is full of challenges.Conventional strategies always exhibit the insufficiencies in overall coverage,analytical efficiency,and degree of automation,...Identifying the compound formulae-related xenobiotics in bio-samples is full of challenges.Conventional strategies always exhibit the insufficiencies in overall coverage,analytical efficiency,and degree of automation,and the results highly rely on the personal knowledge and experience.The goal of this work was to establish a software-aided approach,by integrating ultra-high performance liquid chromatography/ion-mobility quadrupole time-of-flight mass spectrometry(UHPLC/IM-QTOF-MS)and in-house high-definition MS^(2) library,to enhance the identification of prototypes and metabolites of the compound formulae in vivo,taking Sishen formula(SSF)as a template.Seven different MS2 acquisition methods were compared,which demonstrated the potency of a hybrid scan approach(namely high-definition data-independent/data-dependent acquisition(HDDIDDA))in the identification precision,MS1 coverage,and MS^(2) spectra quality.The HDDIDDA data for 55 reference compounds,four component drugs,and SSF,together with the rat bio-samples(e.g.,plasma,urine,feces,liver,and kidney),were acquired.Based on the UNIFI™platform(Waters),the efficient data processing workflows were established by combining mass defect filtering(MDF)-induced classification,diagnostic product ions(DPIs),and neutral loss filtering(NLF)-dominated structural confirmation.The high-definition MS^(2) spectral libraries,dubbed in vitro-SSF and in vivo-SSF,were elaborated,enabling the efficient and automatic identification of SSF-associated xenobiotics in diverse rat bio-samples.Consequently,118 prototypes and 206 metabolites of SSF were identified,with the identification rate reaching 80.51%and 79.61%,respectively.The metabolic pathways mainly involved the oxidation,reduction,hydrolysis,sulfation,methylation,demethylation,acetylation,glucuronidation,and the combined reactions.Conclusively,the proposed strategy can drive the identification of compound formulae-related xenobiotics in vivo in an intelligent manner.展开更多
The rapid development of short video platforms poses new challenges for traditional recommendation systems.Recommender systems typically depend on two types of user behavior feedback to construct user interest profile...The rapid development of short video platforms poses new challenges for traditional recommendation systems.Recommender systems typically depend on two types of user behavior feedback to construct user interest profiles:explicit feedback(interactive behavior),which significantly influences users’short-term interests,and implicit feedback(viewing time),which substantially affects their long-term interests.However,the previous model fails to distinguish between these two feedback methods,leading it to predict only the overall preferences of users based on extensive historical behavior sequences.Consequently,it cannot differentiate between users’long-term and shortterm interests,resulting in low accuracy in describing users’interest states and predicting the evolution of their interests.This paper introduces a video recommendationmodel calledCAT-MFRec(CrossAttention Transformer-Mixed Feedback Recommendation)designed to differentiate between explicit and implicit user feedback within the DIEN(Deep Interest Evolution Network)framework.This study emphasizes the separate learning of the two types of behavioral feedback,effectively integrating them through the cross-attention mechanism.Additionally,it leverages the long sequence dependence capabilities of Transformer technology to accurately construct user interest profiles and predict the evolution of user interests.Experimental results indicate that CAT-MF Rec significantly outperforms existing recommendation methods across various performance indicators.This advancement offers new theoretical and practical insights for the development of video recommendations,particularly in addressing complex and dynamic user behavior patterns.展开更多
Internal learning-based video inpainting methods have shown promising results by exploiting the intrinsic properties of the video to fill in the missing region without external dataset supervision.However,existing int...Internal learning-based video inpainting methods have shown promising results by exploiting the intrinsic properties of the video to fill in the missing region without external dataset supervision.However,existing internal learning-based video inpainting methods would produce inconsistent structures or blurry textures due to the insufficient utilisation of motion priors within the video sequence.In this paper,the authors propose a new internal learning-based video inpainting model called appearance consistency and motion coherence network(ACMC-Net),which can not only learn the recurrence of appearance prior but can also capture motion coherence prior to improve the quality of the inpainting results.In ACMC-Net,a transformer-based appearance network is developed to capture global context information within the video frame for representing appearance consistency accurately.Additionally,a novel motion coherence learning scheme is proposed to learn the motion prior in a video sequence effectively.Finally,the learnt internal appearance consistency and motion coherence are implicitly propagated to the missing regions to achieve inpainting well.Extensive experiments conducted on the DAVIS dataset show that the proposed model obtains the superior performance in terms of quantitative measurements and produces more visually plausible results compared with the state-of-the-art methods.展开更多
Objective: to explore and analyze the effect of applying visual high-definition rigid bronchoscopy in the removal of complicated respiratory foreign body. Methods: from September 2017 to June 2020, 97 patients with co...Objective: to explore and analyze the effect of applying visual high-definition rigid bronchoscopy in the removal of complicated respiratory foreign body. Methods: from September 2017 to June 2020, 97 patients with complex foreign body in respiratory tract with valid data registered in the database of our hospital were selected as the research subjects. The basic data of the patients were statistically analyzed. All the patients in this study had complicated foreign body obstruction in respiratory tract. After admission, the medical staff gave the patients visual high-definition rigid bronchoscopy for auxiliary treatment, and the final treatment effect was evaluated and analyzed. Results: in this study, 97 patients with complex foreign body in respiratory tract were improved after treatment, with a total effective rate of 100%. In this study, there were 2 patients with respiratory tract injury during surgery, with an incidence rate of 2.06%. Conclusion: in the removal of complicated respiratory foreign body, the choice of visual high-definition rigid bronchoscope for treatment can help medical staff to identify respiratory foreign body and promote the outcome of the patient's condition, which has good promotion value and is worthy of further application.展开更多
Airway management plays a crucial role in providing adequate oxygenation and ventilation to patients during various medical procedures and emergencies.When patients have a limited mouth opening due to factors such as ...Airway management plays a crucial role in providing adequate oxygenation and ventilation to patients during various medical procedures and emergencies.When patients have a limited mouth opening due to factors such as trauma,inflammation,or anatomical abnormalities airway management becomes challenging.A commonly utilized method to overcome this challenge is the use of video laryngoscopy(VL),which employs a specialized device equipped with a camera and a light source to allow a clear view of the larynx and vocal cords.VL overcomes the limitations of direct laryngoscopy in patients with limited mouth opening,enabling better visualization and successful intubation.Various types of VL blades are available.We devised a novel flangeless video laryngoscope for use in patients with a limited mouth opening and then tested it on a manikin.展开更多
Semantic segmentation is a core task in computer vision that allows AI models to interact and understand their surrounding environment. Similarly to how humans subconsciously segment scenes, this ability is crucial fo...Semantic segmentation is a core task in computer vision that allows AI models to interact and understand their surrounding environment. Similarly to how humans subconsciously segment scenes, this ability is crucial for scene understanding. However, a challenge many semantic learning models face is the lack of data. Existing video datasets are limited to short, low-resolution videos that are not representative of real-world examples. Thus, one of our key contributions is a customized semantic segmentation version of the Walking Tours Dataset that features hour-long, high-resolution, real-world data from tours of different cities. Additionally, we evaluate the performance of open-vocabulary, semantic model OpenSeeD on our own custom dataset and discuss future implications.展开更多
Objective: The purpose of this study was to evaluate health education using videos and leaflets for preconception care (PCC) awareness among adolescent females up to six months after the health education. Methods: The...Objective: The purpose of this study was to evaluate health education using videos and leaflets for preconception care (PCC) awareness among adolescent females up to six months after the health education. Methods: The subjects were female university students living in the Kinki area. A longitudinal survey was conducted on 67 members in the intervention group, who received the health education, and 52 members in the control group, who did not receive the health education. The primary outcome measures were knowledge of PCC and the subscales of the Health Promotion Lifestyle Profile. Surveys were conducted before, after, and six months after the intervention in the intervention group, and an initial survey and survey six months later were conducted in the control group. Cochran’s Q test, Bonferroni’s multiple comparison test, and McNemar’s test were used to analyze the knowledge of PCC data. The Health Awareness, Nutrition, and Stress Management subscales of the Health Promotion Lifestyle Profile were analyzed by paired t-test, and comparisons between the intervention and control groups were performed using the two-way repeated measures analysis of variance. Results: In the intervention group of 67 people, the number of subjects who answered “correct” for five of the nine items concerning knowledge of PCC increased immediately after the health education (P = 0.006) but decreased for five items from immediately after the health education to six months later (P = 0.043). In addition, the number of respondents who answered “correct” for “low birth weight infants and future lifestyle-related diseases” (P = 0.016) increased after six months compared with before the health education. For the 52 subjects in the control group, there was no change in the number of subjects who answered “correct” for eight out of the nine items after six months. There was also no increase in scores for the Health Promotion Lifestyle Profile after six months for either the intervention or control group. Conclusion: Providing health education about PCC using videos and leaflets to adolescent females was shown to enhance the knowledge of PCC immediately after the education.展开更多
Multimedia semantic communication has been receiving increasing attention due to its significant enhancement of communication efficiency.Semantic coding,which is oriented towards extracting and encoding the key semant...Multimedia semantic communication has been receiving increasing attention due to its significant enhancement of communication efficiency.Semantic coding,which is oriented towards extracting and encoding the key semantics of video for transmission,is a key aspect in the framework of multimedia semantic communication.In this paper,we propose a facial video semantic coding method with low bitrate based on the temporal continuity of video semantics.At the sender’s end,we selectively transmit facial keypoints and deformation information,allocating distinct bitrates to different keypoints across frames.Compressive techniques involving sampling and quantization are employed to reduce the bitrate while retaining facial key semantic information.At the receiver’s end,a GAN-based generative network is utilized for reconstruction,effectively mitigating block artifacts and buffering problems present in traditional codec algorithms under low bitrates.The performance of the proposed approach is validated on multiple datasets,such as VoxCeleb and TalkingHead-1kH,employing metrics such as LPIPS,DISTS,and AKD for assessment.Experimental results demonstrate significant advantages over traditional codec methods,achieving up to approximately 10-fold bitrate reduction in prolonged,stable head pose scenarios across diverse conversational video settings.展开更多
The application of short videos in agricultural scenarios has become a new form of productive force driving agricultural development,injecting new vitality and opportunities into traditional agriculture.These videos l...The application of short videos in agricultural scenarios has become a new form of productive force driving agricultural development,injecting new vitality and opportunities into traditional agriculture.These videos leverage the unique expressive logic of the platform by adopting a small entry point and prioritizing dissemination rate.They are strategically planned in terms of content,visuals,and interaction to cater to users needs for relaxation,knowledge acquisition,social sharing,agricultural product marketing,and talent display.Through careful design,full creativity,rich emotion,and the creation of distinct character personalities,these videos deliver positive,entertaining,informative,and opinion-driven agricultural content.The production and operation of agricultural short videos can be effectively optimized by analyzing the characteristics of both popular and less popular videos,and utilizing smart tools and trending topics.展开更多
基金supported in part by the National Natural Science Foundation of China(Nos.62271454 and 62171119).
文摘Unmanned aerial vehicles(UAVs)bring more innovation and attraction to outdoor mobile high-definition(HD)live streaming with its unique perspective.Due to the heavy computational requirements of HD live broadcast tasks and the limited hardware performance of UAV equipment,how to reduce the system response delay and improve the energy efficiency of terminal equipment directly affects the secure broadcast of the system.Secure task offloading in this scenario is considered a promising solution and has received academic attention.In this paper,we simulate the UAV-aided outdoor mobile HD live streaming scenarios and optimize the relevant task offloading strategies.First,we design the total cost function of task offloading that jointly optimizes secure time latency and energy consumption.Additionally,we propose a collaborative computing model for multi-UAV task offloading.This model combines the idea of simulated annealing(SA)and introduces the compression factor to enhance the particle swarm optimization(PSO)to realize secure task offloading.The simulation results show that the proposed strategy has better performance in balancing network latency and energy consumption.Compared with the discrete teaching–learning-based optimization(DTLBO)and quantum PSO(QPSO)task offloading strategies,the fitness value of the proposed strategy is decreased by an average of 26.73%and 16.42%,respectively.
基金supported by the National Natural Science Foundation of China(Grant No.72334003)the National Key Research and Development Program of China(Grant No.2022YFB2702804)+1 种基金the Shandong Key Research and Development Program(Grant No.2020ZLYS09)the Jinan Program(Grant No.2021GXRC084-2).
文摘With the continuous advancement of unmanned technology in various application domains,the development and deployment of blind-spot-free panoramic video systems have gained increasing importance.Such systems are particularly critical in battlefield environments,where advanced panoramic video processing and wireless communication technologies are essential to enable remote control and autonomous operation of unmanned ground vehicles(UGVs).However,conventional video surveillance systems suffer from several limitations,including limited field of view,high processing latency,low reliability,excessive resource consumption,and significant transmission delays.These shortcomings impede the widespread adoption of UGVs in battlefield settings.To overcome these challenges,this paper proposes a novel multi-channel video capture and stitching system designed for real-time video processing.The system integrates the Speeded-Up Robust Features(SURF)algorithm and the Fast Library for Approximate Nearest Neighbors(FLANN)algorithm to execute essential operations such as feature detection,descriptor computation,image matching,homography estimation,and seamless image fusion.The fused panoramic video is then encoded and assembled to produce a seamless output devoid of stitching artifacts and shadows.Furthermore,H.264 video compression is employed to reduce the data size of the video stream without sacrificing visual quality.Using the Real-Time Streaming Protocol(RTSP),the compressed stream is transmitted efficiently,supporting real-time remote monitoring and control of UGVs in dynamic battlefield environments.Experimental results indicate that the proposed system achieves high stability,flexibility,and low latency.With a wireless link latency of 30 ms,the end-to-end video transmission latency remains around 140 ms,enabling smooth video communication.The system can tolerate packet loss rates(PLR)of up to 20%while maintaining usable video quality(with latency around 200 ms).These properties make it well-suited for mobile communication scenarios demanding high real-time video performance.
基金supported by the Cultivation Program for Major Scientific Research Projects of Harbin Institute of Technology(ZDXMPY20180109).
文摘Scalable simulation leveraging real-world data plays an essential role in advancing autonomous driving,owing to its efficiency and applicability in both training and evaluating algorithms.Consequently,there has been increasing attention on generating highly realistic and consistent driving videos,particularly those involving viewpoint changes guided by the control commands or trajectories of ego vehicles.However,current reconstruction approaches,such as Neural Radiance Fields and 3D Gaussian Splatting,frequently suffer from limited generalization and depend on substantial input data.Meanwhile,2D generative models,though capable of producing unknown scenes,still have room for improvement in terms of coherence and visual realism.To overcome these challenges,we introduce GenScene,a world model that synthesizes front-view driving videos conditioned on trajectories.A new temporal module is presented to improve video consistency by extracting the global context of each frame,calculating relationships of frames using these global representations,and fusing frame contexts accordingly.Moreover,we propose an innovative attention mechanism that computes relations of pixels within each frame and pixels in the corresponding window range of the initial frame.Extensive experiments show that our approach surpasses various state-of-the-art models in driving video generation,and the introduced modules contribute significantly to model performance.This work establishes a new paradigm for goal-oriented video synthesis in autonomous driving,which facilitates on-demand simulation to expedite algorithm development.
文摘Background:This study aims to investigate the underlying mechanisms between parental marital conflict and adolescent short video dependence by constructing a chain mediation model,focusing on the mediating roles of experiential avoidance and emotional disturbance(anxiety,depression,and stress).Methods:Conducted in January 2025,the research recruited 4125 adolescents from multiple Chinese provinces through convenience sampling;after data cleaning,3957 valid participants(1959 males,1998 females)were included.Using a cross-sectional design,measures included parental marital conflict,experiential avoidance,anxiety,depression,stress,and short video dependence.Results:Pearson correlation analysis revealed significant positive correlations among all variables.Mediation analysis using the SPSS PROCESS macro showed that parental marital conflict directly predicted short video dependence(β=0.269,p<0.001),and also significantly predicted experiential avoidance(β=0.519,p<0.001),anxiety(β=0.072,p<0.001),depression(β=0.067,p<0.001),and stress(β=0.048,p<0.05).Experiential avoidance further predicted anxiety(β=0.521,p<0.001),depression(β=0.489,p<0.001),stress(β=0.408,p<0.001),and short video dependence(β=0.244,p<0.001).While both anxiety(β=0.050,p<0.05)and depression(β=0.116,p<0.001)positively predicted short video dependence,stress did not(β=0.019,p=0.257).Overall,experiential avoidance,anxiety,depression,and stress significantly mediated the relationship between parental marital conflict and short video dependence.Conclusion:These findings confirm that parental marital conflict not only directly influences adolescent short video dependence but also operates through a chain mediation pathway involving experiential avoidance and emotional disturbance,highlighting central psychological mechanisms and providing theoretical support for integrated mental health and behavioral interventions.
基金funded by the Guangxi Philosophy and Social Science Research Project,grant number 24XWC002.
文摘Background:In the Chinese context,the impact of short video applications on the psychological well-being of older adults is contested.While often examined through a pathological lens of addiction,this perspective may overlook paradoxical,context-dependent positive outcomes.Therefore,the main objective of this study is to challenge the traditional Compensatory Internet Use Theory by proposing and testing a chained mediation model that explores a paradoxical pathway from social support to life satisfaction via problematic social media use.Methods:Data were collected between July and August 2025 via the Credamo online survey platform,yielding 384 valid responses from Chinese older adults aged 60 and above.Key constructs were assessed using the Social Support Rating Scale(SSRS),Bergen Social Media Addiction Scale(BSMAS),Simplified UCLA Loneliness Scale,and Satisfaction with Life Scale(SWLS).A chained mediation model was tested using stepwise regression and non-parametric bootstrapping(5000 resamples),controlling for age,gender,household income,and health status.Results:The analysis revealed a paradoxical pathway,which was clarified by a key statistical suppression effect.Social support significantly and positively predicted problematic usage(β=0.157,p=0.002).After controlling for the suppressor effect of social support,problematic usage in turn negatively predicted social connectedness(β=−0.177,p<0.001).Finally,reduced social connectedness—reflecting a state of solitude—positively predicted life satisfaction(β=−0.227,p<0.001).Conclusion:The findings suggest that for older adults with sufficient offline social support,these resources may serve a“social empowerment”function.This empowerment allows behaviors measured as“problematic usage”to be theoretically reframed as a form of“deep immersive entertainment”.This immersion appears to occur alongside a state of“high-quality solitude”,which ultimately is associated with higher life satisfaction.This study provides a novel,non-pathological theoretical perspective on the consequences of high engagement with emerging social media,offering empirical grounds for non-abstinence-based intervention strategies.
文摘AIM: To investigate if high-definition (HD) colonoscope with i-Scan gave a higher detection rate of mucosal le- sions vs standard white-light instruments. METHODS: Data were collected from the computer- ized database of the endoscopy unit of our tertiary referral center. We retrospectively analyzed 1101 con- secutive colonoscopies that were performed over 1 year with standard white-light (n = 849) or HD+ with i-Scan (n = 252) instruments by four endoscopists, in an outpatient setting. Colonoscopy records included patients' main details and family history for colorectal cancer, indication for colonoscopy (screening, diagnos- tic or surveillance), type of instrument used (standard white-light or HD+ plus i-Scan), name of endoscopist and bowel preparation. Records for each procedure included whether the cecum was reached or not and the reason for failure, complications during or imme- diately after the procedure, and number, size, location and characteristics of the lesions. Polyps or protruding lesions were defined as sessile or pedunculated, and nonprotruding lesions were defined according to Paris classification. For each lesion, histological diagnosis was recorded. RESULTS: Eight hundred and forty-nine colonosco- pies were carried with the standard white-light video colonoscope and 252 with the HD+ plus i-Scan video colonoscope, The four endoscopists did 264, 300, 276 and 261 procedures, respectively; 21.6%, 24.0%, 21.7% and 24.1% of them with the HD+ plus i-Scan technique. There were no significant differences be- tween the four endoscopists in either the number of procedures done or the proportions of each imaging technique used. Both techniques detected one or more mucosal lesions in 522/1101 procedures (47.4%). The overall number of lesions recognized was 1266; 645 in the right colon and 621 in the left. A significantly higher number of colonoscopies recognized lesions in the HD+ plus i-Scan mode (171/252 = 67.9%) than with the standard white-light technique (408/849 = 48.1%) (P 〈 0.0001). HD+ with i-Scan colonoscopies identified more lesions than standard white-light imag- ing (459/252 and 807/849, P 〈 0.0001), in the right or left colon (mean :1: SD, 1.62±1.36 vs 1.33±0.73, P 〈 0.003 and 1.55±0.98 vs 1.17±0.93, P = 0.033), more lesions 〈 10 mm (P 〈 0.0001) or nonprotruding (P 〈 0.022), and flat polyps (P = 0.04). The cumulative mean number of lesions per procedure detected by the four endoscopists was significantly higher with HD+ with i-Scan than with standard white-light imaging (1.82 ± 2.89 vs 0.95± 1.35, P 〈 0.0001). CONCLUSION: HD imaging with i-Scan during the withdrawal phase of colonoscopy significantly increased the detection of colonic mucosal lesions, particularly small and nonprotruding polyps.
基金the National Natural Science Foundation of China(No.U1764264/61873165)the Shanghai Automotive Industry Science and Technology Development Foundation(No.1807)the Guangxi Key Laboratory of Automobile Components and Vehicle Technology Research Project(No.2020GKLACVTKF02)。
文摘High-definition(HD)maps are key components that provide rich topologic and semantic information for decision-making in vehicle autonomous driving systems.A complete ground orthophoto is usually used as the base image to construct the HD map.The ground orthophoto is obtained through inverse perspective transformation and image mosaicing.During the image mosaicing,multiple consecutive orthophotos are stitched together using pose information and image registration.In this study,wavelet transform is introduced to the image mosaicing process to alleviate the information loss caused by image overlapping.In the orthophoto wavelet transform,high-frequency and low-frequency components are fused using different strategies to form a complete base image with clearer local details.Experimental results show that the accuracy of the orthophotos generated using this method is improved.
基金supported by the Beijing Municipal Science and Technology Commission(BMSTC,No.D171100002617001).
文摘Objective This study aimed to compare the performance of standard-definition white-light endoscopy(SD-WL),high-definition white-light endoscopy(HD-WL),and high-definition narrow-band imaging(HD-NBI)in detecting colorectal lesions in the Chinese population.Methods This was a multicenter,single-blind,randomized,controlled trial with a non-inferiority design.Patients undergoing endoscopy for physical examination,screening,and surveillance were enrolled from July 2017 to December 2020.The primary outcome measure was the adenoma detection rate(ADR),defined as the proportion of patients with at least one adenoma detected.The associated factors for detecting adenomas were assessed using univariate and multivariate logistic regression.Results Out of 653 eligible patients enrolled,data from 596 patients were analyzed.The ADRs were 34.5%in the SD-WL group,33.5%in the HD-WL group,and 37.5%in the HD-NBI group(P=0.72).The advanced neoplasm detection rates(ANDRs)in the three arms were 17.1%,15.5%,and 10.4%(P=0.17).No significant differences were found between the SD group and HD group regarding ADR or ANDR(ADR:34.5%vs.35.6%,P=0.79;ANDR:17.1%vs.13.0%,P=0.16,respectively).Similar results were observed between the HD-WL group and HD-NBI group(ADR:33.5%vs.37.7%,P=0.45;ANDR:15.5%vs.10.4%,P=0.18,respectively).In the univariate and multivariate logistic regression analyses,neither HD-WL nor HD-NBI led to a significant difference in overall adenoma detection compared to SD-WL(HD-WL:OR 0.91,P=0.69;HD-NBI:OR 1.15,P=0.80).Conclusion HD-NBI and HD-WL are comparable to SD-WL for overall adenoma detection among Chinese outpatients.It can be concluded that HD-NBI or HD-WL is not superior to SD-WL,but more effective instruction may be needed to guide the selection of different endoscopic methods in the future.Our study’s conclusions may aid in the efficient allocation and utilization of limited colonoscopy resources,especially advanced imaging technologies.
文摘AIM:To examine performances regarding prediction of polyp histology using high-definition (HD) i-scan in a group of endoscopists with varying levels of experience. METHODS:We used a digital library of HD i-scan still images, comprising twin pictures (surface enhancement and tone enhancement), collected at our university hospital. We defined endoscopic features of adenomatous and non-adenomatous polyps, according to the following parameters:color, surface pattern and vascular pattern. We familiarized the participating endoscopists on optical diagnosis of colorectal polyps using a 20-min didactic training session. All endoscopists were asked to evaluate an image set of 50 colorectal polyps with regard to polyp histology. We classified the diagnoses into high confidence (i.e., cases in which the endoscopist could assign a diagnosis with certainty) and low confidence diagnoses (i.e., cases in which the endoscopist preferred to send the polyp for formal histology). Mean sensitivity, specificity and accuracy per endoscopist/image were computed and differences between groups tested using independent-samples t tests. High vs low confidence diagnoses were compared using the pairedsamples t test. RESULTS:Eleven endoscopists without previous experience on optical diagnosis evaluated a total of 550 images (396 adenomatous, 154 non-adenomatous). Mean sensitivity, specificity and accuracy for diagnosing adenomas were 79.3%, 85.7% and 81.1%, respectively. No significant differences were found between gastroenterologists and trainees regarding performances of optical diagnosis (mean accuracy 78.0%vs 82.9%,P = 0.098). Diminutive lesions were predicted with a lower mean accuracy as compared to non-diminutive lesions (74.2% vs 93.1%, P = 0.008). A total of 446 (81.1%) diagnoses were made with high confidence. High confidence diagnoses corresponded to a significantly higher mean accuracy than low confidence diagnoses (84.0% vs 64.3%, P = 0.008). A total of 319 (58.0%) images were evaluated as having excellent quality. Considering excellent quality images in conjunction with high confidence diagnosis, overall accuracy increased to 92.8%. CONCLUSION:After a single training session, endoscopists with varying levels of experience can already provide optical diagnosis with an accuracy of 84.0%.
文摘There is an increasing demand for supporting high-quality real-time audiovisual services for the next generation wired and wireless networks. However, due to variety of bandwidths of different networks, it is a great challenge for deployment. In this paper, a novel high-definition (HD) video transmission system was proposed which depends upon reliable compound multicast protocols and QoS control over the various kinds of networks. This system detects client's network condition and assigns it to a proper proxy. Each proxy is capable of detecting network parameters and adaptively tuning such transport parameters as bit rate, video resolution, frame rate and QoS mechanisms to this condition. It also provides FEC error recovery under consideration of characteristics of MPEG4 video codec. Our simulation demonstrates that different network clients such as ADSL, CERNET, and CERNET2 can receive more video reliability with less delay.
基金This work was financially supported by National Natural Science Foundation of China(Grant No.:82192914)Tianjin Outstanding Youth Fund(Grant No.:23JCJQJC00030)the Innovation Team and Talents Cultivation Program of National Administration of Traditional Chinese Medicine(Grant No.:ZYYCXTD-C-202009).
文摘Identifying the compound formulae-related xenobiotics in bio-samples is full of challenges.Conventional strategies always exhibit the insufficiencies in overall coverage,analytical efficiency,and degree of automation,and the results highly rely on the personal knowledge and experience.The goal of this work was to establish a software-aided approach,by integrating ultra-high performance liquid chromatography/ion-mobility quadrupole time-of-flight mass spectrometry(UHPLC/IM-QTOF-MS)and in-house high-definition MS^(2) library,to enhance the identification of prototypes and metabolites of the compound formulae in vivo,taking Sishen formula(SSF)as a template.Seven different MS2 acquisition methods were compared,which demonstrated the potency of a hybrid scan approach(namely high-definition data-independent/data-dependent acquisition(HDDIDDA))in the identification precision,MS1 coverage,and MS^(2) spectra quality.The HDDIDDA data for 55 reference compounds,four component drugs,and SSF,together with the rat bio-samples(e.g.,plasma,urine,feces,liver,and kidney),were acquired.Based on the UNIFI™platform(Waters),the efficient data processing workflows were established by combining mass defect filtering(MDF)-induced classification,diagnostic product ions(DPIs),and neutral loss filtering(NLF)-dominated structural confirmation.The high-definition MS^(2) spectral libraries,dubbed in vitro-SSF and in vivo-SSF,were elaborated,enabling the efficient and automatic identification of SSF-associated xenobiotics in diverse rat bio-samples.Consequently,118 prototypes and 206 metabolites of SSF were identified,with the identification rate reaching 80.51%and 79.61%,respectively.The metabolic pathways mainly involved the oxidation,reduction,hydrolysis,sulfation,methylation,demethylation,acetylation,glucuronidation,and the combined reactions.Conclusively,the proposed strategy can drive the identification of compound formulae-related xenobiotics in vivo in an intelligent manner.
基金supported by National Natural Science Foundation of China(62072416)Key Research and Development Special Project of Henan Province(221111210500)Key TechnologiesR&DProgram of Henan rovince(232102211053,242102211071).
文摘The rapid development of short video platforms poses new challenges for traditional recommendation systems.Recommender systems typically depend on two types of user behavior feedback to construct user interest profiles:explicit feedback(interactive behavior),which significantly influences users’short-term interests,and implicit feedback(viewing time),which substantially affects their long-term interests.However,the previous model fails to distinguish between these two feedback methods,leading it to predict only the overall preferences of users based on extensive historical behavior sequences.Consequently,it cannot differentiate between users’long-term and shortterm interests,resulting in low accuracy in describing users’interest states and predicting the evolution of their interests.This paper introduces a video recommendationmodel calledCAT-MFRec(CrossAttention Transformer-Mixed Feedback Recommendation)designed to differentiate between explicit and implicit user feedback within the DIEN(Deep Interest Evolution Network)framework.This study emphasizes the separate learning of the two types of behavioral feedback,effectively integrating them through the cross-attention mechanism.Additionally,it leverages the long sequence dependence capabilities of Transformer technology to accurately construct user interest profiles and predict the evolution of user interests.Experimental results indicate that CAT-MF Rec significantly outperforms existing recommendation methods across various performance indicators.This advancement offers new theoretical and practical insights for the development of video recommendations,particularly in addressing complex and dynamic user behavior patterns.
基金Shenzhen Science and Technology Programme,Grant/Award Number:JCYJ202308071208000012023 Shenzhen sustainable supporting funds for colleges and universities,Grant/Award Number:20231121165240001Guangdong Provincial Key Laboratory of Ultra High Definition Immersive Media Technology,Grant/Award Number:2024B1212010006。
文摘Internal learning-based video inpainting methods have shown promising results by exploiting the intrinsic properties of the video to fill in the missing region without external dataset supervision.However,existing internal learning-based video inpainting methods would produce inconsistent structures or blurry textures due to the insufficient utilisation of motion priors within the video sequence.In this paper,the authors propose a new internal learning-based video inpainting model called appearance consistency and motion coherence network(ACMC-Net),which can not only learn the recurrence of appearance prior but can also capture motion coherence prior to improve the quality of the inpainting results.In ACMC-Net,a transformer-based appearance network is developed to capture global context information within the video frame for representing appearance consistency accurately.Additionally,a novel motion coherence learning scheme is proposed to learn the motion prior in a video sequence effectively.Finally,the learnt internal appearance consistency and motion coherence are implicitly propagated to the missing regions to achieve inpainting well.Extensive experiments conducted on the DAVIS dataset show that the proposed model obtains the superior performance in terms of quantitative measurements and produces more visually plausible results compared with the state-of-the-art methods.
文摘Objective: to explore and analyze the effect of applying visual high-definition rigid bronchoscopy in the removal of complicated respiratory foreign body. Methods: from September 2017 to June 2020, 97 patients with complex foreign body in respiratory tract with valid data registered in the database of our hospital were selected as the research subjects. The basic data of the patients were statistically analyzed. All the patients in this study had complicated foreign body obstruction in respiratory tract. After admission, the medical staff gave the patients visual high-definition rigid bronchoscopy for auxiliary treatment, and the final treatment effect was evaluated and analyzed. Results: in this study, 97 patients with complex foreign body in respiratory tract were improved after treatment, with a total effective rate of 100%. In this study, there were 2 patients with respiratory tract injury during surgery, with an incidence rate of 2.06%. Conclusion: in the removal of complicated respiratory foreign body, the choice of visual high-definition rigid bronchoscope for treatment can help medical staff to identify respiratory foreign body and promote the outcome of the patient's condition, which has good promotion value and is worthy of further application.
文摘Airway management plays a crucial role in providing adequate oxygenation and ventilation to patients during various medical procedures and emergencies.When patients have a limited mouth opening due to factors such as trauma,inflammation,or anatomical abnormalities airway management becomes challenging.A commonly utilized method to overcome this challenge is the use of video laryngoscopy(VL),which employs a specialized device equipped with a camera and a light source to allow a clear view of the larynx and vocal cords.VL overcomes the limitations of direct laryngoscopy in patients with limited mouth opening,enabling better visualization and successful intubation.Various types of VL blades are available.We devised a novel flangeless video laryngoscope for use in patients with a limited mouth opening and then tested it on a manikin.
文摘Semantic segmentation is a core task in computer vision that allows AI models to interact and understand their surrounding environment. Similarly to how humans subconsciously segment scenes, this ability is crucial for scene understanding. However, a challenge many semantic learning models face is the lack of data. Existing video datasets are limited to short, low-resolution videos that are not representative of real-world examples. Thus, one of our key contributions is a customized semantic segmentation version of the Walking Tours Dataset that features hour-long, high-resolution, real-world data from tours of different cities. Additionally, we evaluate the performance of open-vocabulary, semantic model OpenSeeD on our own custom dataset and discuss future implications.
文摘Objective: The purpose of this study was to evaluate health education using videos and leaflets for preconception care (PCC) awareness among adolescent females up to six months after the health education. Methods: The subjects were female university students living in the Kinki area. A longitudinal survey was conducted on 67 members in the intervention group, who received the health education, and 52 members in the control group, who did not receive the health education. The primary outcome measures were knowledge of PCC and the subscales of the Health Promotion Lifestyle Profile. Surveys were conducted before, after, and six months after the intervention in the intervention group, and an initial survey and survey six months later were conducted in the control group. Cochran’s Q test, Bonferroni’s multiple comparison test, and McNemar’s test were used to analyze the knowledge of PCC data. The Health Awareness, Nutrition, and Stress Management subscales of the Health Promotion Lifestyle Profile were analyzed by paired t-test, and comparisons between the intervention and control groups were performed using the two-way repeated measures analysis of variance. Results: In the intervention group of 67 people, the number of subjects who answered “correct” for five of the nine items concerning knowledge of PCC increased immediately after the health education (P = 0.006) but decreased for five items from immediately after the health education to six months later (P = 0.043). In addition, the number of respondents who answered “correct” for “low birth weight infants and future lifestyle-related diseases” (P = 0.016) increased after six months compared with before the health education. For the 52 subjects in the control group, there was no change in the number of subjects who answered “correct” for eight out of the nine items after six months. There was also no increase in scores for the Health Promotion Lifestyle Profile after six months for either the intervention or control group. Conclusion: Providing health education about PCC using videos and leaflets to adolescent females was shown to enhance the knowledge of PCC immediately after the education.
基金supported by the National Natural Science Foundation of China (Nos. NSFC 61925105, 62322109, 62171257 and U22B2001)the Xplorer Prize in Information and Electronics technologiesthe Tsinghua University (Department of Electronic Engineering)-Nantong Research Institute for Advanced Communication Technologies Joint Research Center for Space, Air, Ground and Sea Cooperative Communication Network Technology
文摘Multimedia semantic communication has been receiving increasing attention due to its significant enhancement of communication efficiency.Semantic coding,which is oriented towards extracting and encoding the key semantics of video for transmission,is a key aspect in the framework of multimedia semantic communication.In this paper,we propose a facial video semantic coding method with low bitrate based on the temporal continuity of video semantics.At the sender’s end,we selectively transmit facial keypoints and deformation information,allocating distinct bitrates to different keypoints across frames.Compressive techniques involving sampling and quantization are employed to reduce the bitrate while retaining facial key semantic information.At the receiver’s end,a GAN-based generative network is utilized for reconstruction,effectively mitigating block artifacts and buffering problems present in traditional codec algorithms under low bitrates.The performance of the proposed approach is validated on multiple datasets,such as VoxCeleb and TalkingHead-1kH,employing metrics such as LPIPS,DISTS,and AKD for assessment.Experimental results demonstrate significant advantages over traditional codec methods,achieving up to approximately 10-fold bitrate reduction in prolonged,stable head pose scenarios across diverse conversational video settings.
文摘The application of short videos in agricultural scenarios has become a new form of productive force driving agricultural development,injecting new vitality and opportunities into traditional agriculture.These videos leverage the unique expressive logic of the platform by adopting a small entry point and prioritizing dissemination rate.They are strategically planned in terms of content,visuals,and interaction to cater to users needs for relaxation,knowledge acquisition,social sharing,agricultural product marketing,and talent display.Through careful design,full creativity,rich emotion,and the creation of distinct character personalities,these videos deliver positive,entertaining,informative,and opinion-driven agricultural content.The production and operation of agricultural short videos can be effectively optimized by analyzing the characteristics of both popular and less popular videos,and utilizing smart tools and trending topics.