The Pingyang ancient woodblock painting“Presenting the Beauty of the Country with the Elegance of the Dynasty”is an important work that cannot be surpassed in the study of the history of Chinese printmaking.This art...The Pingyang ancient woodblock painting“Presenting the Beauty of the Country with the Elegance of the Dynasty”is an important work that cannot be surpassed in the study of the history of Chinese printmaking.This article focuses on the characteristics of the image style of this work,and first combines the process of excavation and discovery of this work to describe its historical background;Secondly,analyze the image information such as the structure and content presented in the work;Finally,the uniqueness of the flat water version production,the distinctive content presented in the work,and the artistic composition of the work were explained,and the importance of studying the work was pointed out.展开更多
Image style transfer is a research hotspot in the field of computer vision.For this job,many approaches have been put forth.These techniques do,however,still have some drawbacks,such as high computing complexity and c...Image style transfer is a research hotspot in the field of computer vision.For this job,many approaches have been put forth.These techniques do,however,still have some drawbacks,such as high computing complexity and content distortion caused by inadequate stylization.To address these problems,PhotoGAN,a new Generative AdversarialNetwork(GAN)model is proposed in this paper.A deeper feature extraction network has been designed to capture global information and local details better.Introducingmulti-scale attention modules helps the generator focus on important feature areas at different scales,further enhancing the effectiveness of feature extraction.Using a semantic discriminator helps the generator learn quickly and better understand image content,improving the consistency and visual quality of the generated images.Finally,qualitative and quantitative experiments were conducted on a self-built dataset.The experimental results indicate that PhotoGAN outperformed the current state-of-the-art techniques.It not only performed excellently on objective metrics but also appeared more visually appealing,particularly excelling in handling complex scenes and details.展开更多
Image conversion refers to converting an image from one style to another and ensuring that the content of the image remains unchanged.Using Generative Adversarial Networks(GAN)for image conversion can achieve good res...Image conversion refers to converting an image from one style to another and ensuring that the content of the image remains unchanged.Using Generative Adversarial Networks(GAN)for image conversion can achieve good results.However,if there are enough samples,any image in the target domain can be mapped to the same set of inputs.On this basis,the Cycle Consistency Generative Adversarial Network(CycleGAN)was developed.This article verifies and discusses the advantages and disadvantages of the CycleGAN model in image style conversion.CycleGAN uses two generator networks and two discriminator networks.The purpose is to learn the mapping relationship and inverse mapping relationship between the source domain and the target domain.It can reduce the mapping and improve the quality of the generated image.Through the idea of loop,the loss of information in image style conversion is reduced.When evaluating the results of the experiment,the degree of retention of the input image content will be judged.Through the experimental results,CycleGAN can understand the artist’s overall artistic style and successfully convert real landscape paintings.The advantage is that most of the content of the original picture can be retained,and only the texture line of the picture is changed to a level similar to the artist’s style.展开更多
The Three Idiots, has a Strong appreciation ,with excellent script, creative actor, sophisticated production, fine plot, the endless stream of jokes. It uses a variety of narrative technique and also gives insight to ...The Three Idiots, has a Strong appreciation ,with excellent script, creative actor, sophisticated production, fine plot, the endless stream of jokes. It uses a variety of narrative technique and also gives insight to audiences on the various time lines clearly, Dance also added a beautiful landscape to the film. We can learn from the achievements on Artistic and box office and it reflects the present situation of Indian society.展开更多
We present a novel framework for audio-guided localized image stylization.Sound often provides information about the specific context of a scene and is closely related to a certain part of the scene or object.However,...We present a novel framework for audio-guided localized image stylization.Sound often provides information about the specific context of a scene and is closely related to a certain part of the scene or object.However,existing image stylization works have focused on stylizing the entire image using an image or text input.Stylizing a particular part of the image based on audio input is natural but challenging.This work proposes a framework in which a user provides an audio input to localize the target in the input image and another to locally stylize the target object or scene.We first produce a fine localization map using an audio-visual localization network leveraging CLIP embedding space.We then utilize an implicit neural representation(INR)along with the predicted localization map to stylize the target based on sound information.The INR manipulates local pixel values to be semantically consistent with the provided audio input.Our experiments show that the proposed framework outperforms other audio-guided stylization methods.Moreover,we observe that our method constructs concise localization maps and naturally manipulates the target object or scene in accordance with the given audio input.展开更多
Digital cartoon production requires extensive manual labor to colorize sketches with visually pleasant color composition and color shading.During colorization,the artist usually takes an existing cartoon image as colo...Digital cartoon production requires extensive manual labor to colorize sketches with visually pleasant color composition and color shading.During colorization,the artist usually takes an existing cartoon image as color guidance,particularly when colorizing related characters or an animation sequence.Reference-guided colorization is more intuitive than colorization with other hints,such as color points or scribbles,or text-based hints.Unfortunately,reference-guided colorization is challenging since the style of the colorized image should match the style of the reference image in terms of both global color composition and local color shading.In this paper,we propose a novel learning-based framework which colorizes a sketch based on a color style feature extracted from a reference color image.Our framework contains a color style extractor to extract the color feature from a color image,a colorization network to generate multi-scale output images by combining a sketch and a color feature,and a multi-scale discriminator to improve the reality of the output image.Extensive qualitative and quantitative evaluations show that our method outperforms existing methods,providing both superior visual quality and style reference consistency in the task of reference-based colorization.展开更多
文摘The Pingyang ancient woodblock painting“Presenting the Beauty of the Country with the Elegance of the Dynasty”is an important work that cannot be surpassed in the study of the history of Chinese printmaking.This article focuses on the characteristics of the image style of this work,and first combines the process of excavation and discovery of this work to describe its historical background;Secondly,analyze the image information such as the structure and content presented in the work;Finally,the uniqueness of the flat water version production,the distinctive content presented in the work,and the artistic composition of the work were explained,and the importance of studying the work was pointed out.
基金funded by the Key R&D and Transformation Projects of Xizang(Tibet)Autonomous Region Science and Technology Program(funder:the Department of Science and Technology of the Xizang(Tibet)Autonomous Region),funding(grant)number:XZ202401ZY0004.
文摘Image style transfer is a research hotspot in the field of computer vision.For this job,many approaches have been put forth.These techniques do,however,still have some drawbacks,such as high computing complexity and content distortion caused by inadequate stylization.To address these problems,PhotoGAN,a new Generative AdversarialNetwork(GAN)model is proposed in this paper.A deeper feature extraction network has been designed to capture global information and local details better.Introducingmulti-scale attention modules helps the generator focus on important feature areas at different scales,further enhancing the effectiveness of feature extraction.Using a semantic discriminator helps the generator learn quickly and better understand image content,improving the consistency and visual quality of the generated images.Finally,qualitative and quantitative experiments were conducted on a self-built dataset.The experimental results indicate that PhotoGAN outperformed the current state-of-the-art techniques.It not only performed excellently on objective metrics but also appeared more visually appealing,particularly excelling in handling complex scenes and details.
基金supported by Scientific Research Starting Project of SWPU[No.0202002131604]Major Science and Technology Project of Sichuan Province[Nos.8ZDZX0143,2019YFG0424]+1 种基金Ministry of Education Collaborative Education Project of China[No.952]Fundamental Research Project[Nos.549,550].
文摘Image conversion refers to converting an image from one style to another and ensuring that the content of the image remains unchanged.Using Generative Adversarial Networks(GAN)for image conversion can achieve good results.However,if there are enough samples,any image in the target domain can be mapped to the same set of inputs.On this basis,the Cycle Consistency Generative Adversarial Network(CycleGAN)was developed.This article verifies and discusses the advantages and disadvantages of the CycleGAN model in image style conversion.CycleGAN uses two generator networks and two discriminator networks.The purpose is to learn the mapping relationship and inverse mapping relationship between the source domain and the target domain.It can reduce the mapping and improve the quality of the generated image.Through the idea of loop,the loss of information in image style conversion is reduced.When evaluating the results of the experiment,the degree of retention of the input image content will be judged.Through the experimental results,CycleGAN can understand the artist’s overall artistic style and successfully convert real landscape paintings.The advantage is that most of the content of the original picture can be retained,and only the texture line of the picture is changed to a level similar to the artist’s style.
文摘The Three Idiots, has a Strong appreciation ,with excellent script, creative actor, sophisticated production, fine plot, the endless stream of jokes. It uses a variety of narrative technique and also gives insight to audiences on the various time lines clearly, Dance also added a beautiful landscape to the film. We can learn from the achievements on Artistic and box office and it reflects the present situation of Indian society.
基金supported by the Culture,Sports and Tourism R&D Program through the Korea Creative Content Agency grant funded by the Ministry of Culture,Sports and Tourism in 2022-(4D Content Generation and Copyright Protection with Artificial Intelligence,R2022020068,30%Research on neural watermark technology for copyright protection of generative AI 3D content,RS-2024-00348469,40%+1 种基金International Collaborative Research and Global Talent Development for the Development of Copyright Management and Protection Technologies for Generative AI,RS-2024-00345025,10%)the Institute of Information&Communications Technology Planning&Evaluation(IITP)grant funded by the Korea government(MSIT)(RS-2019-II190079,10%,No.2017-0-00417,10%).
文摘We present a novel framework for audio-guided localized image stylization.Sound often provides information about the specific context of a scene and is closely related to a certain part of the scene or object.However,existing image stylization works have focused on stylizing the entire image using an image or text input.Stylizing a particular part of the image based on audio input is natural but challenging.This work proposes a framework in which a user provides an audio input to localize the target in the input image and another to locally stylize the target object or scene.We first produce a fine localization map using an audio-visual localization network leveraging CLIP embedding space.We then utilize an implicit neural representation(INR)along with the predicted localization map to stylize the target based on sound information.The INR manipulates local pixel values to be semantically consistent with the provided audio input.Our experiments show that the proposed framework outperforms other audio-guided stylization methods.Moreover,we observe that our method constructs concise localization maps and naturally manipulates the target object or scene in accordance with the given audio input.
基金supported in part by a CIHE Institutional Development Grant No.IDG200107the National Natural Science Foundation of China under Grant No.61973221the Natural Science Foundation of Guangdong Province of China under Grant Nos.2018A030313381 and 2019A1515011165.
文摘Digital cartoon production requires extensive manual labor to colorize sketches with visually pleasant color composition and color shading.During colorization,the artist usually takes an existing cartoon image as color guidance,particularly when colorizing related characters or an animation sequence.Reference-guided colorization is more intuitive than colorization with other hints,such as color points or scribbles,or text-based hints.Unfortunately,reference-guided colorization is challenging since the style of the colorized image should match the style of the reference image in terms of both global color composition and local color shading.In this paper,we propose a novel learning-based framework which colorizes a sketch based on a color style feature extracted from a reference color image.Our framework contains a color style extractor to extract the color feature from a color image,a colorization network to generate multi-scale output images by combining a sketch and a color feature,and a multi-scale discriminator to improve the reality of the output image.Extensive qualitative and quantitative evaluations show that our method outperforms existing methods,providing both superior visual quality and style reference consistency in the task of reference-based colorization.