Cartoonizing portrait images is a stylish and eye-catching application in both computer vision and graphics.We aimed to train a face cartoonization model using very few(e.g.,5–10)style examples.The main difficulty in...Cartoonizing portrait images is a stylish and eye-catching application in both computer vision and graphics.We aimed to train a face cartoonization model using very few(e.g.,5–10)style examples.The main difficulty in this challenging task lies in producing stylizations of high quality while preserving the identity of the input,particularly when the style examples contain strong exaggerations.To address this,we propose a novel cross-domain center loss for few-shot generative adversarial network(GAN)adaptation,which forces the distribution of the target domain to be similar to that of the source.We then employ it to solve this few-shot problem along with a two-stage strategy.Stage I generates an intermediate cartoonization for the input,where we first stylize the individual facial components locally and then deform them to mimic the desired exaggeration under the guidance of landmarks.Stage II focuses on global refinement of the intermediate image.First,we adapt a pretrained StyleGAN model using the proposed crossdomain center loss to the target domain defined by a few examples.Subsequently,the intermediate cartoonization from Stage I can be holistically refined through GAN inversion.The generative power of StyleGAN guarantees high image quality,while the local translation and landmark-guided deformation applied to facial components provide high identity fidelity.Experiments show that the proposed method outperforms state-of-theart few-shot stylization approaches both qualitatively and quantitatively.展开更多
The contemporary era is characterized by rapid technological advancements,particularly in the fields of communication and multimedia.Digital media has significantly influenced the daily lives of individuals of all age...The contemporary era is characterized by rapid technological advancements,particularly in the fields of communication and multimedia.Digital media has significantly influenced the daily lives of individuals of all ages.One of the emerging domains in digital media is the creation of cartoons and animated videos.The accessibility of the internet has led to a surge in the consumption of cartoons among young children,presenting challenges in monitoring and controlling the content they view.The prevalence of cartoon videos containing potentially violent scenes has raised concerns regarding their impact,especially on young and impressionableminds.This article contributes to the growing concerns about the impact of animated media on children’s mental health and offers solutions to help mitigate these effects.To address this issue,an intelligent,multi-CNN fusion framework is proposed for detecting and predicting violent content in upcoming frames of animated videos.The framework integrates probabilistic and deep learning methodologies by leveraging a combination of visual and temporal features for violence prediction in future scenes.Two specific convolutional neural network classifiers i.e.,VGG16 and ResNet18 are employed to classify scenes from animated content as violent or non-violent.To enhance decision robustness,this study introduces a fusion strategy based on weighted averaging,combining the outputs of both Convolutional Neural Networks(CNNs)into a single decision stream.The resulting classifications are subsequently fed into Naive Bayes classifier,which analyzes sequential patterns to forecast violence in future scenes.The experimental findings demonstrate that the proposed framework achieved predictive accuracy of 92.84%,highlighting its effectiveness for intelligent content moderation.These results underscore the potential of intelligent data fusion techniques in enhancing the reliability and robustness of automated violence detection systems in animated content.This framework offers a promising solution for safeguarding young audiences by enabling proactive and accurate moderation of animated videos.展开更多
Do you like cartoons?I am crazy about cartoons because they are very interesting and cartoon music listens really great.I have a great time when I watch the cartoon films.
Applying visual grammar theory,this study examines representational,interactive,and compositional meanings of the giant panda in Western media cartoons related to China from 1999 to the present.Distinct phases in the ...Applying visual grammar theory,this study examines representational,interactive,and compositional meanings of the giant panda in Western media cartoons related to China from 1999 to the present.Distinct phases in the panda’s representation were identified and illustrated by cases of cartoons in major Western media.These phases trace shift of panda cartoon image from a symbol of peace and friendliness to a politicized emblem of China’s international stance.Key visual trends,such as transitivity,color symbolism,scale enlargement,and increasing compositional complexity,embody the panda’s role in shaping China’s global image and its function in international discourse.These trends reflect the panda’s transformation into a contested symbol,which mediates between China’s self-representation and Western perceptions of its geopolitical rise.By situating the analysis within the context of China’s growing global influence,this study contributes to visual and media studies,demonstrating how cultural symbols are recontextualized to reflect and shape geopolitical narratives.展开更多
Automated cartoon character recognition is crucial for applications in content indexing,filtering,and copyright protection,yet it faces a significant challenge in animated media due to high intra-class visual variabil...Automated cartoon character recognition is crucial for applications in content indexing,filtering,and copyright protection,yet it faces a significant challenge in animated media due to high intra-class visual variability,where characters frequently alter their appearance.To address this problem,we introduce the novel Kral Sakir dataset,a public benchmark of 16,725 images specifically curated for the task of multi-label cartoon character classification under these varied conditions.This paper conducts a comprehensive benchmark study,evaluating the performance of state-of-the-art pretrained Convolutional Neural Networks(CNNs),including DenseNet,ResNet,and VGG,against a custom baseline model trained from scratch.Our experiments,evaluated using metrics of F1-Score,accuracy,and Area Under the ROC Curve(AUC),demonstrate that fine-tuning pretrained models is a highly effective strategy.The best-performing model,DenseNet121,achieved an F1-Score of 0.9890 and an accuracy of 0.9898,significantly outperforming our baseline CNN(F1-Score of 0.9545).The findings validate the power of transfer learning for this domain and establish a strong performance benchmark.The introduced dataset provides a valuable resource for future research into developing robust and accurate character recognition systems.展开更多
This article aims to explore the feasibility of English animation-assisted children's English teaching based on the theoryof children's language acquisition, discusses its beneficial effects, and puts forward ...This article aims to explore the feasibility of English animation-assisted children's English teaching based on the theoryof children's language acquisition, discusses its beneficial effects, and puts forward certain reference suggestions based on this.展开更多
Animation creates a vivid, virtual world and expands the scope of human imagination. In this study, we investigated the time-courses of brain responses related to the evaluation of the attractiveness of cartoon faces ...Animation creates a vivid, virtual world and expands the scope of human imagination. In this study, we investigated the time-courses of brain responses related to the evaluation of the attractiveness of cartoon faces using the event-related potential (ERP) technique. The results demonstrated that N170 amplitude was higher for attractive than for unattractive cartoon faces in males, while the opposite was found in females. Facial attractiveness notably modulated the late positive component (LPC), which might reflect the task-related process of aesthetic appraisal of beauty. The mean LPC amplitude in males was significantly higher for attractive cartoon faces than for unattractive faces, while the LPC amplitude in females did not significantly differ between attractive and unattractive cartoon faces. Moreover, the paint mode (computer graphics, gouache, and stick figure) modulated the early encoding of facial structures and the late evaluative process. The early modulation effect by paint mode may be related to the spatial frequency of the pictures. The processing speed and intensity in females were both higher than those in males. In conclusion, our study, for the first time, reported ERP modulation based on the assessment of cartoon facial attractiveness, suggesting the facilitated selection of attractiveness information at the early stage, and that the attentional enhancement of attractive faces at the late stage only exists in males. This suggests that men's brains are hard-wired to be sensitive to facial beauty, even in cartoons.展开更多
BACKGROUND Preschoolers become anxious when they are about to undergo anesthesia and surgery,warranting the development of more appropriate and effective interventions.AIM To explore the effect of static cartoons comb...BACKGROUND Preschoolers become anxious when they are about to undergo anesthesia and surgery,warranting the development of more appropriate and effective interventions.AIM To explore the effect of static cartoons combined with dynamic virtual environments on preoperative anxiety and anesthesia induction compliance in preschool-aged children undergoing surgery.METHODS One hundred and sixteen preschool-aged children were selected and assigned to the drug(n=37),intervention(n=40),and control(n=39)groups.All the children received routine preoperative checkups and nursing before being transferred to the preoperative preparation room on the day of the operation.The drug group received 0.5 mg/kg midazolam and the intervention group treatment consisting of static cartoons combined with dynamic virtual environments.The control group received no intervention.The modified Yale Preoperative Anxiety Scale was used to evaluate the children’s anxiety level on the day before surgery(T0),before leaving the preoperative preparation room(T1),when entering the operating room(T2),and at anesthesia induction(T3).Compliance during anesthesia induction(T3)was evaluated using the Induction Compliance Checklist(ICC).Changes in mean arterial pressure(MAP),heart rate(HR),and respiratory rate(RR)were also recorded at each time point.RESULTS The anxiety scores of the three groups increased variously at T1 and T2.At T3,both the drug and intervention groups had similar anxiety scores,both of which were lower than those in the control group.At T1 and T2,MAP,HR,and RR of the three groups increased.The drug and control groups had significantly higher MAP and RR than the intervention group at T2.At T3,the MAP,HR,and RR of the drug group decreased and were significantly lower than those in the control group but were comparable to those in the intervention group.Both the drug and intervention groups had similar ICC scores and duration of anesthesia induction(T3),both of which were higher than those of the control group.CONCLUSION Combining static cartoons with dynamic virtual environments as effective as medication,specifically midazolam,in reducing preoperative anxiety and fear in preschool-aged children.This approach also improve their compliance during anesthesia induction and helped maintain their stable vital signs.展开更多
'The Secret Life of Walter Mitty'is written by James Thurber, who is the best known humorist in modern times. Thurber is famous both for his writing and his cartoon.Many critics consider him the finest America...'The Secret Life of Walter Mitty'is written by James Thurber, who is the best known humorist in modern times. Thurber is famous both for his writing and his cartoon.Many critics consider him the finest American humorist since Mark Twain and like Twain, his humor takes the form of ironic satire.展开更多
The aim of this study was to investigate the temporal cortical activation patterns underlying different stages of humor comprehension (e.g., detection of incongruity stage, resolution of incongruity stage, and affecti...The aim of this study was to investigate the temporal cortical activation patterns underlying different stages of humor comprehension (e.g., detection of incongruity stage, resolution of incongruity stage, and affective stage). Event-related potentials (ERPs) were measured when 16 subjects were apprehending cartoon pictures including humorous, non-humorous and unrelated items. Results showed that both humorous and unrelated items elicited a more negative ERP deflection (N500-800) than non-humorous ones between 500 - 800 ms, which might reflect detection to incongruent element during humor apprehension. Then, both humorous and non-humorous items elicited a more positive ERP deflection (P800-1000) than unrelated ones between 800 - 1000 ms, which might reflect a classification process preliminarily evaluating whether there were attainable cues in the pictures used to form possible association between context and picture (we named it “association evaluation” stage). Furthermore, humorous items elicited a more positive slow wave than non-humorous items which also elicited a more positive wave than unrelated items between 1000 - 1600 ms, during which this component might be involved in the forming of novel associations (resolution of incongruity). Lastly, between 1600 - 2000 ms, humorous items elicited a more positive ERP deflection (P1600-2000) than both non-humorous and unrelated items, which might be related to emotion processing during humor apprehension. Based on these results, we deeply subdivided the second stage (resolution of incongruity) into two stages: association evaluation and incongruity resolution.展开更多
Holography is an interesting tool in creating real objects and scenes which can be projected anywhere with accurate details and depth impression. It is also found to be more attractive to the artists than other altern...Holography is an interesting tool in creating real objects and scenes which can be projected anywhere with accurate details and depth impression. It is also found to be more attractive to the artists than other alternatives. For that reason, digital holography is being used as a display technology in cartoon movies. Since this application is dependent on the performance and the simplicity of the available display technology, it becomes very useful to improve the display technique in order to become fast, simple, and attractive by being combined with computer graphical effects. This paper discusses a simulation of a digital holographic model as a three dimensional (3D) display system and its application in making cartoon holography.展开更多
基金supported in parts by National Natural Science Foundation of China(U21B2023)DEGP Innovation Team Program(2022KCXTD025)+1 种基金Shenzhen Science and Technology Program(KQTD20210811090044003,RCJC20200714114435012)Scientific Development Funds from Shenzhen University.
文摘Cartoonizing portrait images is a stylish and eye-catching application in both computer vision and graphics.We aimed to train a face cartoonization model using very few(e.g.,5–10)style examples.The main difficulty in this challenging task lies in producing stylizations of high quality while preserving the identity of the input,particularly when the style examples contain strong exaggerations.To address this,we propose a novel cross-domain center loss for few-shot generative adversarial network(GAN)adaptation,which forces the distribution of the target domain to be similar to that of the source.We then employ it to solve this few-shot problem along with a two-stage strategy.Stage I generates an intermediate cartoonization for the input,where we first stylize the individual facial components locally and then deform them to mimic the desired exaggeration under the guidance of landmarks.Stage II focuses on global refinement of the intermediate image.First,we adapt a pretrained StyleGAN model using the proposed crossdomain center loss to the target domain defined by a few examples.Subsequently,the intermediate cartoonization from Stage I can be holistically refined through GAN inversion.The generative power of StyleGAN guarantees high image quality,while the local translation and landmark-guided deformation applied to facial components provide high identity fidelity.Experiments show that the proposed method outperforms state-of-theart few-shot stylization approaches both qualitatively and quantitatively.
基金funded by Princess Nourah bint Abdulrahman University Researchers Supporting Project number(PNURSP2025R138),Princess Nourah bint Abdulrahman University,Riyadh,Saudi Arabia.
文摘The contemporary era is characterized by rapid technological advancements,particularly in the fields of communication and multimedia.Digital media has significantly influenced the daily lives of individuals of all ages.One of the emerging domains in digital media is the creation of cartoons and animated videos.The accessibility of the internet has led to a surge in the consumption of cartoons among young children,presenting challenges in monitoring and controlling the content they view.The prevalence of cartoon videos containing potentially violent scenes has raised concerns regarding their impact,especially on young and impressionableminds.This article contributes to the growing concerns about the impact of animated media on children’s mental health and offers solutions to help mitigate these effects.To address this issue,an intelligent,multi-CNN fusion framework is proposed for detecting and predicting violent content in upcoming frames of animated videos.The framework integrates probabilistic and deep learning methodologies by leveraging a combination of visual and temporal features for violence prediction in future scenes.Two specific convolutional neural network classifiers i.e.,VGG16 and ResNet18 are employed to classify scenes from animated content as violent or non-violent.To enhance decision robustness,this study introduces a fusion strategy based on weighted averaging,combining the outputs of both Convolutional Neural Networks(CNNs)into a single decision stream.The resulting classifications are subsequently fed into Naive Bayes classifier,which analyzes sequential patterns to forecast violence in future scenes.The experimental findings demonstrate that the proposed framework achieved predictive accuracy of 92.84%,highlighting its effectiveness for intelligent content moderation.These results underscore the potential of intelligent data fusion techniques in enhancing the reliability and robustness of automated violence detection systems in animated content.This framework offers a promising solution for safeguarding young audiences by enabling proactive and accurate moderation of animated videos.
文摘Do you like cartoons?I am crazy about cartoons because they are very interesting and cartoon music listens really great.I have a great time when I watch the cartoon films.
基金supported by the Wuhan University Undergraduate Project of Innovation and Entrepreneurship Training“The Evolution of Cartoon Images of Pandas in Western Media’s China-Related News From the Perspective of Multimodal Theory”(Project Number:S202410486013).
文摘Applying visual grammar theory,this study examines representational,interactive,and compositional meanings of the giant panda in Western media cartoons related to China from 1999 to the present.Distinct phases in the panda’s representation were identified and illustrated by cases of cartoons in major Western media.These phases trace shift of panda cartoon image from a symbol of peace and friendliness to a politicized emblem of China’s international stance.Key visual trends,such as transitivity,color symbolism,scale enlargement,and increasing compositional complexity,embody the panda’s role in shaping China’s global image and its function in international discourse.These trends reflect the panda’s transformation into a contested symbol,which mediates between China’s self-representation and Western perceptions of its geopolitical rise.By situating the analysis within the context of China’s growing global influence,this study contributes to visual and media studies,demonstrating how cultural symbols are recontextualized to reflect and shape geopolitical narratives.
文摘Automated cartoon character recognition is crucial for applications in content indexing,filtering,and copyright protection,yet it faces a significant challenge in animated media due to high intra-class visual variability,where characters frequently alter their appearance.To address this problem,we introduce the novel Kral Sakir dataset,a public benchmark of 16,725 images specifically curated for the task of multi-label cartoon character classification under these varied conditions.This paper conducts a comprehensive benchmark study,evaluating the performance of state-of-the-art pretrained Convolutional Neural Networks(CNNs),including DenseNet,ResNet,and VGG,against a custom baseline model trained from scratch.Our experiments,evaluated using metrics of F1-Score,accuracy,and Area Under the ROC Curve(AUC),demonstrate that fine-tuning pretrained models is a highly effective strategy.The best-performing model,DenseNet121,achieved an F1-Score of 0.9890 and an accuracy of 0.9898,significantly outperforming our baseline CNN(F1-Score of 0.9545).The findings validate the power of transfer learning for this domain and establish a strong performance benchmark.The introduced dataset provides a valuable resource for future research into developing robust and accurate character recognition systems.
文摘This article aims to explore the feasibility of English animation-assisted children's English teaching based on the theoryof children's language acquisition, discusses its beneficial effects, and puts forward certain reference suggestions based on this.
基金supported by the New Talent Program of Zhejiang Province, China (2013R404046)
文摘Animation creates a vivid, virtual world and expands the scope of human imagination. In this study, we investigated the time-courses of brain responses related to the evaluation of the attractiveness of cartoon faces using the event-related potential (ERP) technique. The results demonstrated that N170 amplitude was higher for attractive than for unattractive cartoon faces in males, while the opposite was found in females. Facial attractiveness notably modulated the late positive component (LPC), which might reflect the task-related process of aesthetic appraisal of beauty. The mean LPC amplitude in males was significantly higher for attractive cartoon faces than for unattractive faces, while the LPC amplitude in females did not significantly differ between attractive and unattractive cartoon faces. Moreover, the paint mode (computer graphics, gouache, and stick figure) modulated the early encoding of facial structures and the late evaluative process. The early modulation effect by paint mode may be related to the spatial frequency of the pictures. The processing speed and intensity in females were both higher than those in males. In conclusion, our study, for the first time, reported ERP modulation based on the assessment of cartoon facial attractiveness, suggesting the facilitated selection of attractiveness information at the early stage, and that the attentional enhancement of attractive faces at the late stage only exists in males. This suggests that men's brains are hard-wired to be sensitive to facial beauty, even in cartoons.
基金Supported by Hangzhou Medical and Health Technology Project,No.OO20191141。
文摘BACKGROUND Preschoolers become anxious when they are about to undergo anesthesia and surgery,warranting the development of more appropriate and effective interventions.AIM To explore the effect of static cartoons combined with dynamic virtual environments on preoperative anxiety and anesthesia induction compliance in preschool-aged children undergoing surgery.METHODS One hundred and sixteen preschool-aged children were selected and assigned to the drug(n=37),intervention(n=40),and control(n=39)groups.All the children received routine preoperative checkups and nursing before being transferred to the preoperative preparation room on the day of the operation.The drug group received 0.5 mg/kg midazolam and the intervention group treatment consisting of static cartoons combined with dynamic virtual environments.The control group received no intervention.The modified Yale Preoperative Anxiety Scale was used to evaluate the children’s anxiety level on the day before surgery(T0),before leaving the preoperative preparation room(T1),when entering the operating room(T2),and at anesthesia induction(T3).Compliance during anesthesia induction(T3)was evaluated using the Induction Compliance Checklist(ICC).Changes in mean arterial pressure(MAP),heart rate(HR),and respiratory rate(RR)were also recorded at each time point.RESULTS The anxiety scores of the three groups increased variously at T1 and T2.At T3,both the drug and intervention groups had similar anxiety scores,both of which were lower than those in the control group.At T1 and T2,MAP,HR,and RR of the three groups increased.The drug and control groups had significantly higher MAP and RR than the intervention group at T2.At T3,the MAP,HR,and RR of the drug group decreased and were significantly lower than those in the control group but were comparable to those in the intervention group.Both the drug and intervention groups had similar ICC scores and duration of anesthesia induction(T3),both of which were higher than those of the control group.CONCLUSION Combining static cartoons with dynamic virtual environments as effective as medication,specifically midazolam,in reducing preoperative anxiety and fear in preschool-aged children.This approach also improve their compliance during anesthesia induction and helped maintain their stable vital signs.
文摘'The Secret Life of Walter Mitty'is written by James Thurber, who is the best known humorist in modern times. Thurber is famous both for his writing and his cartoon.Many critics consider him the finest American humorist since Mark Twain and like Twain, his humor takes the form of ironic satire.
文摘The aim of this study was to investigate the temporal cortical activation patterns underlying different stages of humor comprehension (e.g., detection of incongruity stage, resolution of incongruity stage, and affective stage). Event-related potentials (ERPs) were measured when 16 subjects were apprehending cartoon pictures including humorous, non-humorous and unrelated items. Results showed that both humorous and unrelated items elicited a more negative ERP deflection (N500-800) than non-humorous ones between 500 - 800 ms, which might reflect detection to incongruent element during humor apprehension. Then, both humorous and non-humorous items elicited a more positive ERP deflection (P800-1000) than unrelated ones between 800 - 1000 ms, which might reflect a classification process preliminarily evaluating whether there were attainable cues in the pictures used to form possible association between context and picture (we named it “association evaluation” stage). Furthermore, humorous items elicited a more positive slow wave than non-humorous items which also elicited a more positive wave than unrelated items between 1000 - 1600 ms, during which this component might be involved in the forming of novel associations (resolution of incongruity). Lastly, between 1600 - 2000 ms, humorous items elicited a more positive ERP deflection (P1600-2000) than both non-humorous and unrelated items, which might be related to emotion processing during humor apprehension. Based on these results, we deeply subdivided the second stage (resolution of incongruity) into two stages: association evaluation and incongruity resolution.
文摘Holography is an interesting tool in creating real objects and scenes which can be projected anywhere with accurate details and depth impression. It is also found to be more attractive to the artists than other alternatives. For that reason, digital holography is being used as a display technology in cartoon movies. Since this application is dependent on the performance and the simplicity of the available display technology, it becomes very useful to improve the display technique in order to become fast, simple, and attractive by being combined with computer graphical effects. This paper discusses a simulation of a digital holographic model as a three dimensional (3D) display system and its application in making cartoon holography.