In low-light image enhancement,prevailing Retinex-based methods often struggle with precise illumina-tion estimation and brightness modulation.This can result in issues such as halo artifacts,blurred edges,and diminis...In low-light image enhancement,prevailing Retinex-based methods often struggle with precise illumina-tion estimation and brightness modulation.This can result in issues such as halo artifacts,blurred edges,and diminished details in bright regions,particularly under non-uniform illumination conditions.We propose an innovative approach that refines low-light images by leveraging an in-depth awareness of local content within the image.By introducing multi-scale effective guided filtering,our method surpasses the limitations of traditional isotropic filters,such as Gaussian filters,in handling non-uniform illumination.It dynamically adjusts regularization parameters in response to local image characteristics and significantly integrates edge perception across different scales.This balanced approach achieves a harmonious blend of smoothing and detail preservation,enabling more accurate illumination estimation.Additionally,we have designed an adaptive gamma correction function that dynamically adjusts the brightness value based on local pixel intensity,further balancing enhancement effects across different brightness levels in the image.Experimental results demonstrate the effectiveness of our proposed method for non-uniform illumination images across various scenarios.It exhibits superior quality and objective evaluation scores compared to existing methods.Our method effectively addresses potential issues that existing methods encounter when processing non-uniform illumination images,producing enhanced images with precise details and natural,vivid colors.展开更多
A content-aware multi-step prediction control(CAMPC)algorithm is proposed to determine the bitrate of 360-degree videos,aim⁃ing to enhance the quality of experience(QoE)of users and reduce the cost of video content pr...A content-aware multi-step prediction control(CAMPC)algorithm is proposed to determine the bitrate of 360-degree videos,aim⁃ing to enhance the quality of experience(QoE)of users and reduce the cost of video content providers(VCP).The CAMPC algorithm first em⁃ploys a neural network to generate the content richness and combines it with the current field of view(FOV)to accurately predict the probability distribution of tiles being viewed.Then,for the tiles in the predicted viewport which directly affect QoE,the CAMPC algorithm utilizes a multi-step prediction for future system states,and accordingly selects the bitrates of multiple subsequent steps,instead of an instantaneous state.Meanwhile,it controls the buffer occupancy to eliminate the impact of prediction errors.We implement CAMPC on players by building a 360-degree video streaming platform and evaluating other advanced adaptive bitrate(ABR)rules through the real network.Experimental results show that CAMPC can save 83.5%of bandwidth resources compared with the scheme that completely transmits the tiles outside the viewport with the Dynamic Adaptive Streaming over HTTP(DASH)protocol.Besides,the proposed method can improve the system utility by 62.7%and 27.6%compared with the DASH official and viewport-based rules,respectively.展开更多
A simple and effective content-aware image resizing method is proposed based on the row / column merging and improved importance diffusion,which preserves the important regions in an image as well as the global visual...A simple and effective content-aware image resizing method is proposed based on the row / column merging and improved importance diffusion,which preserves the important regions in an image as well as the global visual effect. By repeatedly merging two rows / columns into one row / column or inserting a new row /column between two rows / columns, this method realizes image-resolution reduction and expansion. The importance of the merged row / column is promoted and diffused to four rows / columns around the merged one,which is to avoid the unwanted image distortions resulted from excessively merging of un-important regions. In addition,the proposed method introduces the direction of gradient vector in the low-pass filter to reduce the interference caused by complex texture background and protect important content better. Furthermore,according to human mechanics principles,different weights are given to the row and column direction components of gradient vectors which can obtain better global visual effect. Experimented results show that the proposed method satisfied in not only visual effect but also objective evaluation.展开更多
With the rise in drowning accidents in swimming pools,the demand for the precision and speed in artificial intelligence(AI)drowning detection methods has become increasingly crucial.Here,an improved YOLO-based method,...With the rise in drowning accidents in swimming pools,the demand for the precision and speed in artificial intelligence(AI)drowning detection methods has become increasingly crucial.Here,an improved YOLO-based method,named DrownACB-YOLO,for drowning detection in swimming pools is proposed.Since existing methods focus on the drowned state,a transition label is added to the original dataset to provide timely alerts.Following this expanded dataset,two improvements are implemented in the original YOLOv5.Firstly,the spatial pyramid pooling(SPP)module and the default upsampling operator are replaced by the atrous spatial pyramid pooling(ASPP)module and the content-aware reassembly of feature(CARAFE)module,respectively.Secondly,the cross stage partial bottleneck with three convolutions(C3)module at the end of the backbone is replaced with the bottleneck transformer(BotNet)module.The results of comparison experiments demonstrate that DrownACB-YOLO performs better than other models.展开更多
In the area of 3D digital engineering and 3D digital geometry processing, shape simplification is an important task to reduce their requirement of large memory and high time complexity. By incorporating the content-aw...In the area of 3D digital engineering and 3D digital geometry processing, shape simplification is an important task to reduce their requirement of large memory and high time complexity. By incorporating the content-aware visual salience measure of a polygonal mesh into simplification operation, a novel feature-aware shape simplification approach is presented in this paper. Owing to the robust extraction of relief heights on 3D highly detailed meshes, our visual salience measure is defined by a center-surround operator on Gaussian-weighted relief heights in a scale-dependent manner. Guided by our visual salience map, the feature-aware shape simplification algorithm can be performed by weighting the high-dimensional feature space quadric error metric of vertex pair contractions with the weight map derived from our visual salience map. The weighted quadric error metric is calculated in a six-dimensional feature space by combining the position and normal information of mesh vertices. Experimental results demonstrate that our visual salience guided shape simplification scheme can adaptively and effectively re-sample the underlying models in a feature-aware manner, which can account for the visually salient features of the complex shapes and thus yield better visual fidelity.展开更多
文摘In low-light image enhancement,prevailing Retinex-based methods often struggle with precise illumina-tion estimation and brightness modulation.This can result in issues such as halo artifacts,blurred edges,and diminished details in bright regions,particularly under non-uniform illumination conditions.We propose an innovative approach that refines low-light images by leveraging an in-depth awareness of local content within the image.By introducing multi-scale effective guided filtering,our method surpasses the limitations of traditional isotropic filters,such as Gaussian filters,in handling non-uniform illumination.It dynamically adjusts regularization parameters in response to local image characteristics and significantly integrates edge perception across different scales.This balanced approach achieves a harmonious blend of smoothing and detail preservation,enabling more accurate illumination estimation.Additionally,we have designed an adaptive gamma correction function that dynamically adjusts the brightness value based on local pixel intensity,further balancing enhancement effects across different brightness levels in the image.Experimental results demonstrate the effectiveness of our proposed method for non-uniform illumination images across various scenarios.It exhibits superior quality and objective evaluation scores compared to existing methods.Our method effectively addresses potential issues that existing methods encounter when processing non-uniform illumination images,producing enhanced images with precise details and natural,vivid colors.
基金supported in part by ZTE Corporation under Grant No.2021420118000065.
文摘A content-aware multi-step prediction control(CAMPC)algorithm is proposed to determine the bitrate of 360-degree videos,aim⁃ing to enhance the quality of experience(QoE)of users and reduce the cost of video content providers(VCP).The CAMPC algorithm first em⁃ploys a neural network to generate the content richness and combines it with the current field of view(FOV)to accurately predict the probability distribution of tiles being viewed.Then,for the tiles in the predicted viewport which directly affect QoE,the CAMPC algorithm utilizes a multi-step prediction for future system states,and accordingly selects the bitrates of multiple subsequent steps,instead of an instantaneous state.Meanwhile,it controls the buffer occupancy to eliminate the impact of prediction errors.We implement CAMPC on players by building a 360-degree video streaming platform and evaluating other advanced adaptive bitrate(ABR)rules through the real network.Experimental results show that CAMPC can save 83.5%of bandwidth resources compared with the scheme that completely transmits the tiles outside the viewport with the Dynamic Adaptive Streaming over HTTP(DASH)protocol.Besides,the proposed method can improve the system utility by 62.7%and 27.6%compared with the DASH official and viewport-based rules,respectively.
基金Sponsored by the Natural Science Foundation of China(Grant No.61371099)the Heilongjiang Province Programs for Science and Technology Development(Grant No.GC12A305)
文摘A simple and effective content-aware image resizing method is proposed based on the row / column merging and improved importance diffusion,which preserves the important regions in an image as well as the global visual effect. By repeatedly merging two rows / columns into one row / column or inserting a new row /column between two rows / columns, this method realizes image-resolution reduction and expansion. The importance of the merged row / column is promoted and diffused to four rows / columns around the merged one,which is to avoid the unwanted image distortions resulted from excessively merging of un-important regions. In addition,the proposed method introduces the direction of gradient vector in the low-pass filter to reduce the interference caused by complex texture background and protect important content better. Furthermore,according to human mechanics principles,different weights are given to the row and column direction components of gradient vectors which can obtain better global visual effect. Experimented results show that the proposed method satisfied in not only visual effect but also objective evaluation.
文摘With the rise in drowning accidents in swimming pools,the demand for the precision and speed in artificial intelligence(AI)drowning detection methods has become increasingly crucial.Here,an improved YOLO-based method,named DrownACB-YOLO,for drowning detection in swimming pools is proposed.Since existing methods focus on the drowned state,a transition label is added to the original dataset to provide timely alerts.Following this expanded dataset,two improvements are implemented in the original YOLOv5.Firstly,the spatial pyramid pooling(SPP)module and the default upsampling operator are replaced by the atrous spatial pyramid pooling(ASPP)module and the content-aware reassembly of feature(CARAFE)module,respectively.Secondly,the cross stage partial bottleneck with three convolutions(C3)module at the end of the backbone is replaced with the bottleneck transformer(BotNet)module.The results of comparison experiments demonstrate that DrownACB-YOLO performs better than other models.
基金Project supported by the National Natural Science Foundation of China(No.61272309)the Key Laboratory of Visual Media Intelligent Process Technology of Zhejiang Province,China(No.2011E10003)
文摘In the area of 3D digital engineering and 3D digital geometry processing, shape simplification is an important task to reduce their requirement of large memory and high time complexity. By incorporating the content-aware visual salience measure of a polygonal mesh into simplification operation, a novel feature-aware shape simplification approach is presented in this paper. Owing to the robust extraction of relief heights on 3D highly detailed meshes, our visual salience measure is defined by a center-surround operator on Gaussian-weighted relief heights in a scale-dependent manner. Guided by our visual salience map, the feature-aware shape simplification algorithm can be performed by weighting the high-dimensional feature space quadric error metric of vertex pair contractions with the weight map derived from our visual salience map. The weighted quadric error metric is calculated in a six-dimensional feature space by combining the position and normal information of mesh vertices. Experimental results demonstrate that our visual salience guided shape simplification scheme can adaptively and effectively re-sample the underlying models in a feature-aware manner, which can account for the visually salient features of the complex shapes and thus yield better visual fidelity.