Using the Shenlong Gorge Scenic Area in Nanchuan as a case study, this research adopts a network text analysis approach to examine the current state of tourism service management within the scenic area. Online tourist reviews from the Dianping platform were collected with Python and analyzed with ROST CM 6 software, focusing on high-frequency words, social semantic networks, and tourist sentiment. The findings illuminate the present state of tourism service management in the Shenlong Gorge Scenic Area and provide theoretical support and practical guidance for the scenic area’s management authorities. Based on the analysis, an optimized pathway for tourism service management is proposed to facilitate the sustainable development of the Shenlong Gorge Scenic Area in Nanchuan, improve tourism service management, and enhance the quality of tourists’ service experiences.
This paper explores advanced English teaching from the perspective of text analysis. It covers the introduction of cultural background, the application of a genre-based approach, the appreciation of writing style, and the analysis of textual structure through sample studies.
Due to the rapid increase in the exchange of text information via internet networks, the security and reliability of digital content have become a major research issue. The main challenges faced by researchers are authentication, integrity verification, and tampering detection of digital content. In this paper, a text zero-watermarking and text feature-based approach is proposed to improve the tampering detection accuracy of English text. The proposed approach embeds and detects the watermark logically without altering the original English text document. Based on a hidden Markov model (HMM), the fourth-level order of the word mechanism is used to analyze the contents of the given English text and find the interrelationship between the contexts. The extracted features are used as watermark information and integrated with digital zero-watermarking techniques. To detect eventual tampering, the proposed approach has been implemented and validated with attacked English text. Experiments were performed using four standard datasets of varying lengths under multiple random locations of insertion, reorder, and deletion attacks. The experimental and simulation results demonstrate the tampering detection accuracy of the method against all three kinds of attack. Comparison results show that the proposed approach outperforms all the other baseline approaches in terms of tampering detection accuracy.
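As a rough illustration of the zero-watermarking idea described in the abstract above, the following Python sketch derives a watermark from word-level Markov transitions without modifying the text, then checks a received copy against it. It is not the authors' implementation: the function names (`transition_pairs`, `make_watermark`, `match_ratio`), the whitespace tokenization, and the use of a plain set of transitions as the watermark are all assumptions made for illustration.

```python
def transition_pairs(text, order=4):
    """Return the set of (state, next_word) pairs, where a state is
    `order` consecutive lowercase words (a word-level Markov transition)."""
    words = text.lower().split()
    return {
        (tuple(words[i:i + order]), words[i + order])
        for i in range(len(words) - order)
    }


def make_watermark(text, order=4):
    """Build the watermark from the text's transitions.
    The text itself is never altered; the watermark is stored separately."""
    return frozenset(transition_pairs(text, order))


def match_ratio(received_text, watermark, order=4):
    """Fraction of watermark transitions still present in the received text.
    A value below 1.0 suggests insertion, reordering, or deletion."""
    if not watermark:
        return 1.0  # text too short to yield any transitions
    found = transition_pairs(received_text, order)
    return len(watermark & found) / len(watermark)


original = "the quick brown fox jumps over the lazy dog near the river bank"
wm = make_watermark(original)
attacked = original.replace("lazy ", "")  # simulated deletion attack
print(match_ratio(original, wm))
print(match_ratio(attacked, wm))
```

The `order` parameter stands in for the "level order" the abstracts refer to; a real system would also need a secure way to register and store the watermark, which this sketch leaves out.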
Purpose: Changes in the world show that the role, importance, and coherence of SSH (social sciences and the humanities) will increase significantly in the coming years. This paper aims to monitor and analyze the evolution (or overlapping) of the SSH thematic pattern through three funding instruments since 2007. Design/methodology/approach: The goal of the paper is to check to what extent the EU Framework Program (FP) affects research at the national level, and to highlight hot topics from a given period with the help of text analysis. Funded project titles and abstracts derived from the EU FP, Slovenian, and Estonian RIS were used. The final analysis and comparisons between different datasets were based on the 200 most frequent words. After removing punctuation marks, numeric values, articles, prepositions, conjunctions, and auxiliary verbs, 4,854 unique words in ETIS, 4,421 unique words in the Slovenian Research Information System (SICRIS), and 3,950 unique words in FP were identified. Findings: Across all funding instruments, about a quarter of the top words constitute half of the word occurrences. The text analysis results show that in the majority of cases words do not overlap between FP and nationally funded projects. In some cases, this may be due to the use of different vocabulary. There is more overlap between words in the case of Slovenia (SL) and Estonia (EE) and less in the case of Estonia and EU Framework Programmes (FP). At the same time, overlapping words indicate a wider reach (culture, education, social, history, human, innovation, etc.). In nationally funded projects (bottom-up), it was relatively difficult to observe the change in thematic trends over time. More specific results emerged from the comparison of the different programs throughout FP (top-down). Research limitations: Only projects with English titles and abstracts were analyzed. Practical implications: The specifics of SSH have to be taken into account: the one-to-one meaning of terms/words is not as important as, for example, in the exact sciences. Thus, even in co-word analysis, the final content may go unnoticed. Originality/value: This was the first attempt to monitor the trends of SSH projects using text analysis. The text analysis of the SSH projects of the two new EU Member States used in the study showed that SSH’s thematic coverage is not much affected by the EU Framework Program. Whether this result is field-specific or country-specific should be shown in a follow-up study targeting SSH projects in the so-called old Member States.
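The word-frequency comparison described in the abstract above (strip punctuation and numerals, drop function words, keep the most frequent words, compare datasets) can be sketched in a few lines of Python. This is a minimal sketch under stated assumptions: the stop-word list below is a small illustrative sample rather than the list used in the study, and the function names `top_words` and `overlap` are hypothetical.

```python
import re
from collections import Counter

# Illustrative stop list only; the study's actual list of articles,
# prepositions, conjunctions, and auxiliary verbs is not given.
STOP_WORDS = {
    "a", "an", "the", "of", "in", "on", "at", "for", "to", "by", "with",
    "and", "or", "but", "is", "are", "was", "were", "be", "been", "has", "have",
}


def top_words(texts, n=200):
    """Return the n most frequent content words across project titles/abstracts."""
    counts = Counter()
    for text in texts:
        # Keeping only runs of letters drops punctuation and numeric values.
        tokens = re.findall(r"[a-z]+", text.lower())
        counts.update(t for t in tokens if t not in STOP_WORDS)
    return [word for word, _ in counts.most_common(n)]


def overlap(words_a, words_b):
    """Words that appear in both top-word lists, sorted alphabetically."""
    return sorted(set(words_a) & set(words_b))


national = ["Education and culture in the history of Estonia", "Education policy 2007"]
framework = ["Innovation in education", "Social innovation and human capital"]
print(overlap(top_words(national), top_words(framework)))
```

Comparing the top-200 lists pairwise (for example ETIS against SICRIS, or either against FP) would give the kind of overlap the abstract reports; differences in vocabulary between datasets will lower the overlap even when topics are similar.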
The historical and cultural districts of a city serve as important cultural heritage and tourism resources. This paper focuses on four such districts in Yangzhou and performs semantic analysis on online public comments using ROST CM6 software. Based on the high-frequency words, the attention preferences for district site elements, activities, and feelings in Yangzhou's historical and cultural districts are analyzed. Through analysis of the semantic network and public emotional tendency, the relationship between the protection and utilization of these districts and the perceptions and demands of users is discussed, and suggestions are put forward for the protection, utilization, and renewal of historical and cultural districts.
This paper explores how cohesion is realized by means of reference in text analysis. By analyzing some aspects of reference, especially personal reference and demonstrative reference, it shows that reference is a text characteristic that operates beyond the sentence. It contributes to the development of a text and makes the text more cohesive, communicative, and accurate. Reference is not used in isolation: it is closely related to other aspects of text analysis, and these aspects cooperate with and constrain each other and perform functions together. Translators therefore need the competence to understand and apply reference, ellipsis, and other cohesive devices at the level of the text, in combination with other aspects of text analysis, so that texts can be more cohesive, coherent, and acceptable.
Unlike other studies, this paper attempts to combine text analysis with developing students' ability to think in English. Based on the view that English thinking ability is connected with improving overall language abilities such as listening, speaking, reading, writing, and translating, the paper proposes approaches to develop English thinking ability as well as syntactic fluency. The paper presents a hypothesis: the development of a text is the author's train of thought. If one wants to acquire the way of thinking used by an author, the indirect but efficient approach is to analyze the texts written by that author. If we want to develop students' thinking ability, the way is to analyze various texts rather than to give a list of text-development principles. The key question, therefore, is how to analyze texts so as to improve students' thinking ability. Michael Stubbs, in his Discourse Analysis, wrote, "Predictability may be the single most important feature of human communication, precisely since it is central not only to all level of language; but also central to meaning and to thinking in general. Predictions are possible because language is structured. Such intuitive predictions are in turn a crucial part of our data in setting up language structures." Following this view, the paper offers three approaches to analyzing texts.
This paper discusses the relationship between genre and thematic progression. Through qualitative analysis of Maupassant's short story The Necklace, it finds that narrative texts always use the linear theme pattern and the constant theme pattern, and sometimes also use the derived theme pattern and other patterns to support text coherence and plot development. Thematic progression has been studied extensively in various fields, but short stories have received less attention than text coherence and other aspects. The findings have implications for teaching, writing, reading, and translation.
The Great Gatsby is universally recognized as one of the masterpieces of world literature. This thesis adopts the methods of comparison, contrast, and quotation to explore why Gatsby is great. It analyzes the text from three aspects: the greatness of Gatsby's economic success, the greatness of his perseverance in love, and the greatness of his personality. In this way, readers can understand Gatsby's greatness.
Purpose: Policies have often, albeit inadvertently, overlooked certain scientific insights, especially in the handling of complex events. This study aims to systematically uncover and evaluate pivotal scientific insights that have been underrepresented in policy documents by leveraging extensive datasets from policy texts and scholarly publications. Design/methodology/approach: This article introduces a research framework for excavating scientific insights that have been overlooked by policy. The framework has four parts: data acquisition and preprocessing, the identification of overlooked content through thematic analysis, the discovery of overlooked content via keyword analysis, and a comprehensive analysis and discussion of the overlooked content. Using this framework, the research conducts an in-depth exploration of the scientific content overlooked by policies during the COVID-19 pandemic. Findings: During the COVID-19 pandemic, scientific information in four domains was overlooked by policy: the psychological state of the populace, environmental issues, the role of computer technology, and public relations. These findings indicate a systematic underrepresentation of important scientific insights in policy. Research limitations: This study is subject to two key limitations. First, the text analysis method, which relies on pre-extracted keywords and thematic structures, may not fully capture the nuanced context and complexity of scientific insights in policy documents. Second, the focus on a limited set of case studies restricts the broader applicability of the conclusions across diverse situations. Practical implications: The study introduces a quantitative framework using text analysis to identify overlooked scientific content in policy, bridging the gap between science and policy. It also highlights overlooked scientific information during COVID-19, promoting more evidence-based and robust policies through improved science-policy integration. Originality/value: This paper provides new ideas and methods for excavating scientific information that has been overlooked by policy, further deepens the understanding of the interaction between policy and science during the COVID-19 period, and lays the foundation for the more rational use of scientific information in policy-making.
This article focuses on the Hainan Wenbifeng Pangu Cultural Tourist Area in Ding’an County, Hainan Province. Network text analysis was used to collect internet promotion information about the Wenbifeng Scenic Area. Data from five platforms (Xiaohongshu, Tiktok, WeChat Official Accounts, Headlines Today, and Baidu) were gathered to understand the current situation and existing problems in the tourism promotion of the Wenbifeng Scenic Area. The article summarizes and analyzes these issues. Finally, combined with on-site research, targeted suggestions are proposed for tourism promotion in the Wenbifeng Scenic Area.
Green consumption (GC) is crucial for achieving the Sustainable Development Goals (SDGs). However, few studies have explored public attitudes toward GC using social media data, missing potential public concerns captured through big data. To address this gap, this study collects and analyzes public attention toward GC using web crawler technology. Based on data from Sina Weibo, we applied RoBERTa, an advanced NLP model based on the transformer architecture, to conduct fine-grained sentiment analysis of the public’s attention, attitudes, and hot topics on GC, demonstrating the potential of deep learning methods in capturing dynamic and contextual emotional shifts across time and regions. Among the sample (N=188,509), 53.91% expressed a positive attitude, with variation across different times and regions. Temporally, public interest in GC has shown an annual growth rate of 30.23%, gradually shifting from fulfilling basic needs to prioritizing entertainment consumption. Spatially, GC is most prevalent in the southeast coastal regions of China, with Beijing ranking first across five evaluated domains. Individuals and government-affiliated accounts play a key role in public discussions on social networks, accounting for 45.89% and 30.01% of user reviews, respectively. A significant positive correlation exists between economic development and public attention to GC, as indicated by a Pearson correlation coefficient of 0.55. Companies, in particular, exhibit cautious behavior in the early stages of green product adoption, prioritizing profitability before making substantial investments. These findings provide valuable insights into the evolving public perception of GC, contributing to the development of more effective environmental policies in China.
Discourse pragmatics interprets the context-specific meaning of communicative events underlying texts in use. It is a cross-disciplinary study based on pragmatics, sociolinguistics, and discourse analysis. This paper uses the theories and dynamic descriptions of conversation analysis and thematic structure analysis to examine two real texts for their textual and interpersonal meanings, in order to illustrate the necessity as well as the feasibility of applying modern linguistic theories in transforming traditional methods of language teaching.
This study introduces the Orbit Weighting Scheme (OWS), a novel approach aimed at enhancing the precision and efficiency of Vector Space information retrieval (IR) models, which have traditionally relied on weighting schemes like tf-idf and BM25. These conventional methods often struggle to capture document relevance accurately, leading to inefficiencies in both retrieval performance and index size management. OWS proposes a dynamic weighting mechanism that evaluates the significance of terms based on their orbital position within the vector space, emphasizing term relationships and distribution patterns overlooked by existing models. Our research focuses on evaluating OWS’s impact on model accuracy using information retrieval metrics such as Recall, Precision, Interpolated Average Precision (IAP), and Mean Average Precision (MAP). Additionally, we assess OWS’s effectiveness in reducing the inverted index size, which is crucial for model efficiency. We compare OWS-based retrieval models against others using different schemes, including tf-idf variations and BM25Delta. Results reveal OWS’s superiority, achieving 54% Recall and 81% MAP, along with a notable 38% reduction in inverted index size. This highlights OWS’s potential in optimizing retrieval processes and underscores the need for further research in this underrepresented area to fully leverage OWS’s capabilities in information retrieval methodologies.
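The evaluation metrics named in the abstract above have standard definitions, which the following Python sketch illustrates for set-based Precision and Recall and for rank-based Average Precision and MAP. It does not implement OWS itself, whose weighting formula is not given here; the function names and the toy document identifiers are assumptions, and each ranked list is assumed to contain no duplicate documents.

```python
def precision_recall(retrieved, relevant):
    """Set-based precision and recall for a single query."""
    retrieved_set, relevant_set = set(retrieved), set(relevant)
    hits = len(retrieved_set & relevant_set)
    precision = hits / len(retrieved_set) if retrieved_set else 0.0
    recall = hits / len(relevant_set) if relevant_set else 0.0
    return precision, recall


def average_precision(ranked, relevant):
    """Mean of precision@k taken at each rank k where a relevant document appears,
    divided by the total number of relevant documents."""
    relevant_set = set(relevant)
    if not relevant_set:
        return 0.0
    hits, total = 0, 0.0
    for rank, doc in enumerate(ranked, start=1):
        if doc in relevant_set:
            hits += 1
            total += hits / rank
    return total / len(relevant_set)


def mean_average_precision(runs):
    """MAP over a list of (ranked_results, relevant_docs) pairs, one per query."""
    if not runs:
        return 0.0
    return sum(average_precision(r, rel) for r, rel in runs) / len(runs)


ranked = ["d1", "d2", "d3", "d4"]
relevant = {"d1", "d3"}
print(precision_recall(ranked, relevant))
print(average_precision(ranked, relevant))
```

Because Recall is set-based while MAP rewards placing relevant documents early in the ranking, a scheme can score quite differently on the two, which is one reason the abstract reports both.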
This paper provides a systematic literature review of text analysis methodologies used in blockchain-related research to comprehend and synthesize existing studies across disciplines and define future research directions. We summarize the research scope, text data, and methodologies of 124 papers and identify the two most common combinations of these dimensions: (1) papers that focus on specific cryptocurrencies tend to apply sentiment analysis to instant user-generated content or news articles to discover the correlations between public opinion and market behavior, and (2) studies that examine the broad concept of blockchain with text data from documents published by companies tend to apply topic modeling techniques to explore classifications and trends in blockchain development. We discover five major research topics in the academic literature: relationship discovery, cryptocurrency performance prediction, classification and trend, crime and regulation, and perception of blockchain. Based on these findings, we highlight three potential research directions for researchers to select topics and implement suitable methodologies for text analysis.
With the remarkable growth of textual data sources in recent years, easy, fast, and accurate text processing has become a challenge with significant payoffs. Automatic text summarization is the process of compressing text documents into shorter summaries for easier review of their core contents, which must be done without losing important features and information. This paper introduces a new hybrid method for extractive text summarization with feature selection based on text structure. The major advantage of the proposed summarization method over previous systems is the modeling of text structure and the relationships between entities in the input text, which improves the sentence feature selection process and leads to the generation of unambiguous, concise, consistent, and coherent summaries. The paper also presents the results of the evaluation of the proposed method based on precision and recall criteria. It is shown that the method produces summaries consisting of chains of sentences with the aforementioned characteristics from the original text.
The Sustainable Development Goals (SDGs) are crucial in tackling the sustainability challenges and emerging issues faced by humanity, with government attention being a significant factor in promoting their successful achievement. However, there is limited quantitative research systematically examining the impacts of government attention on SDG progress. This study employs text analysis and a panel regression model to analyze the impacts of government attention intensity, text similarity, and tone on the achievement of the SDGs, using data extracted from China’s Government Work Reports spanning the decade from 2010 to 2020. The findings reveal that the Chinese government's attention to the SDGs has generally increased over time. The heightened focus has notably bolstered the achievement of the SDGs, with the most significant impact observed post-2015. Government attention intensity was identified as the most impactful factor. Moreover, government attention intensity, text similarity, and tone have positively influenced the coupling coordination relationship between the 17 SDGs, as measured by the coupling coordination degree, leading to a more harmonious and balanced achievement of socioeconomic and environmental goals in China. Financial investment served as a moderating factor, enhancing the positive impacts of attention intensity, text similarity, and tone on SDG attainment. The effects of government attention on SDG progress were notably positive in the eastern region, and more significant in areas with stronger governance capacity than in those with weaker governance capacity. This study provides insightful information for enhancing the modernization and efficiency of China’s national governance system, promoting the SDGs at local and global scales, and fostering sustainable transformation.
In this paper, a hybrid intelligent text zero-watermarking approach is proposed that integrates text zero-watermarking and a hidden Markov model as natural language processing techniques for the content authentication and tampering detection of Arabic text. The proposed approach is known as the Second order of Alphanumeric Mechanism of Markov model and Zero-Watermarking Approach (SAMMZWA). The second-level order of the alphanumeric mechanism based on the hidden Markov model is integrated with text zero-watermarking techniques to improve the overall performance and tampering detection accuracy of the proposed approach. The SAMMZWA approach embeds and detects the watermark logically without altering the original text document. The extracted features are used as watermark information and integrated with digital zero-watermarking techniques. To detect eventual tampering, SAMMZWA has been implemented and validated with attacked Arabic text. Experiments were performed on four datasets of varying lengths under multiple random locations of insertion, reorder, and deletion attacks. The experimental results show that the method is more sensitive to all three kinds of tampering attack, with higher tampering detection accuracy, than the compared methods.
In this paper, a combined approach, CAZWNLP (a combined approach of zero-watermarking and natural language processing), has been developed for the tampering detection of English text exchanged through the Internet. The third gram of the alphanumeric mechanism of the Markov model is used with text-watermarking technologies to improve the performance and accuracy of tampering detection, which are limited in the existing works reviewed in the literature of this study. The third-grade level of the Markov model is used in this method as a natural language processing technology to analyze an English text and extract the textual characteristics of the given contexts. The extracted features are then used as watermark information and validated against the attacked English text to detect any suspected tampering. The embedding mechanism of CAZWNLP works logically, embedding a watermark key without affecting or modifying the original text document. CAZWNLP has been implemented using VS Code IDE with PHP. The experimental and simulation results using standard datasets of varying lengths show that the proposed approach achieves high robustness and better tampering detection accuracy under common random insertion, reorder, and deletion attacks. Comparison results with baseline approaches also show the advantages of the proposed approach.
Content authentication, integrity verification, and tampering detection of digital content exchanged via the internet address a major concern in information and communication technology. In this paper, a text zero-watermarking approach known as the Smart-Fragile Approach based on Soft Computing and Digital Watermarking (SFASCDW) is proposed for content authentication and tampering detection of English text. A first-level order of the alphanumeric mechanism, based on a hidden Markov model, is integrated with digital zero-watermarking techniques to improve the watermark robustness of the proposed approach. The researcher uses the first-level order and alphanumeric mechanism of the Markov model as a soft computing technique to analyze English text. The approach extracts the features of the interrelationship among the contexts of the text, uses the extracted features as watermark information, and later validates them against the studied English text to detect any tampering. SFASCDW has been implemented using PHP with VS Code IDE. The robustness, effectiveness, and applicability of SFASCDW are demonstrated with experiments involving four datasets of various lengths, with the three common attacks (insertion, reorder, and deletion) applied at random locations. SFASCDW was found to be effective and applicable in detecting possible tampering.
Funding: The author extends his appreciation to the Deanship of Scientific Research at King Khalid University for funding this work under grant number (R.G.P.2/55/40/2019), received by Fahd N. Al-Wesabi. www.kku.edu.sa.
文摘Due to the rapid increase in the exchange of text information via internet networks,the security and the reliability of digital content have become a major research issue.The main challenges faced by researchers are authentication,integrity verication,and tampering detection of the digital contents.In this paper,text zero-watermarking and text feature-based approach is proposed to improve the tampering detection accuracy of English text contents.The proposed approach embeds and detects the watermark logically without altering the original English text document.Based on hidden Markov model(HMM),the fourth level order of the word mechanism is used to analyze the contents of the given English text to nd the interrelationship between the contexts.The extracted features are used as watermark information and integrated with digital zero-watermarking techniques.To detect eventual tampering,the proposed approach has been implemented and validated with attacked English text.Experiments were performed using four standard datasets of varying lengths under multiple random locations of insertion,reorder,and deletion attacks.The experimental and simulation results prove the tampering detection accuracy of our method against all kinds of tampering attacks.Comparison results show that our proposed approach outperforms all the other baseline approaches in terms of tampering detection accuracy.
Abstract: Purpose: Changes in the world show that the role, importance, and coherence of SSH (social sciences and the humanities) will increase significantly in the coming years. This paper aims to monitor and analyze the evolution (or overlapping) of the SSH thematic pattern through three funding instruments since 2007. Design/methodology/approach: The goal of the paper is to check to what extent the EU Framework Program (FP) does or does not affect research at the national level, and to highlight hot topics from a given period with the help of text analysis. Funded project titles and abstracts derived from the EU FP and from the Slovenian and Estonian RIS were used. The final analysis and comparisons between datasets were based on the 200 most frequent words. After removing punctuation marks, numeric values, articles, prepositions, conjunctions, and auxiliary verbs, 4,854 unique words in ETIS, 4,421 unique words in the Slovenian Research Information System (SICRIS), and 3,950 unique words in FP were identified. Findings: Across all funding instruments, about a quarter of the top words account for half of the word occurrences. The text analysis results show that in the majority of cases words do not overlap between FP and nationally funded projects. In some cases, this may be due to the use of different vocabulary. There is more overlap between words in the case of Slovenia (SL) and Estonia (EE) and less in the case of Estonia and the EU Framework Program (FP). At the same time, the overlapping words indicate a wider reach (culture, education, social, history, human, innovation, etc.). In nationally funded projects (bottom-up), it was relatively difficult to observe changes in thematic trends over time. More specific results emerged from the comparison of the different programs throughout FP (top-down). Research limitations: Only projects with English titles and abstracts were analyzed. Practical implications: The specifics of SSH have to be taken into account: the one-to-one meaning of terms/words is not as important as, for example, in the exact sciences. Thus, even in co-word analysis, the final content may go unnoticed. Originality/value: This was the first attempt to monitor the trends of SSH projects using text analysis. The text analysis of the SSH projects of the two new EU Member States used in the study showed that SSH's thematic coverage is not much affected by the EU Framework Program. Whether this result is field-specific or country-specific should be shown in a follow-up study that targets SSH projects in the so-called old Member States.
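The comparison of top-word lists described in this abstract can be illustrated with a minimal sketch. It assumes a toy stop-word list and a Jaccard-style overlap measure; the function names `top_words` and `overlap_share` are hypothetical, and the study's actual preprocessing and overlap measure are not given in the abstract.

```python
import re
from collections import Counter

# Illustrative stop list only (assumption); the study's real list is not given.
STOPWORDS = {"the", "a", "an", "of", "in", "on", "and", "or", "is", "are", "to", "for"}


def top_words(texts, k=200):
    """Return the set of the k most frequent non-stop-word tokens in texts."""
    counts = Counter()
    for text in texts:
        # Keep alphabetic tokens only, which drops punctuation and numbers.
        tokens = re.findall(r"[a-z]+", text.lower())
        counts.update(t for t in tokens if t not in STOPWORDS)
    return {word for word, _ in counts.most_common(k)}


def overlap_share(words_a, words_b):
    """Jaccard overlap between two word sets (one possible overlap measure)."""
    union = words_a | words_b
    return len(words_a & words_b) / len(union) if union else 0.0
```

For example, `overlap_share(top_words(fp_texts), top_words(national_texts))` would give a single overlap score for two collections of titles and abstracts, where `fp_texts` and `national_texts` are placeholder lists of strings.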
Funding: the Open Project of China Grand Canal Research Institute, Yangzhou University (DYH202211); Jiangsu Provincial Social Science Applied Research Excellent Project (22SYB-053).
Abstract: The historical and cultural districts of a city serve as important cultural heritage and tourism resources. This paper focuses on four such districts in Yangzhou and performs semantic analysis on online public comments using ROST CM6 software. Based on the high-frequency words, users' attention preferences regarding site elements, activities, and feelings in Yangzhou's historical and cultural districts are analyzed. Through analysis of the semantic network and of public emotional tendency, the relationship between the protection and utilization of these districts and the perceptions and demands of their users is discussed, and suggestions for the protection, utilization, and renewal of historical and cultural districts are put forward.
Abstract: This paper explores how cohesion is realized by means of reference in text analysis. An analysis of several aspects of reference, especially personal reference and demonstrative reference, shows that reference is a text characteristic that operates beyond the sentence. It contributes to the development of a text and makes the text more cohesive, communicative, and accurate. Reference is not used in isolation: it is closely related to other aspects of text analysis, and these aspects cooperate with, constrain, and function together with one another. Translators are therefore required to understand and apply reference, ellipsis, and other cohesive devices at the level of the text, in combination with other aspects of text analysis, so that texts become more cohesive, coherent, and acceptable.
Abstract: Unlike other studies, this paper attempts to combine text analysis with the development of students' English thinking ability. Based on the view that English thinking ability is connected with improving overall language abilities such as listening, speaking, reading, writing, and translating, the paper proposes approaches to developing English thinking ability as well as syntactic fluency. The paper presents a hypothesis: the development of a text is the author's train of thought. If one wants to acquire the way of thinking used by an author, the indirect but efficient approach is to analyze the texts that author has written. If we want to develop students' thinking ability, the way is to analyze various texts rather than to give a list of principles of text development. The key question is therefore how to analyze texts so as to improve students' thinking ability. Michael Stubbs wrote in his Discourse Analysis, "Predictability may be the single most important feature of human communication, precisely since it is central not only to all levels of language, but also central to meaning and to thinking in general. Predictions are possible because language is structured. Such intuitive predictions are in turn a crucial part of our data in setting up language structures." Following this view, the paper offers three approaches to analyzing texts.
Abstract: This paper discusses the relationship between genre and thematic progression. A qualitative analysis of Maupassant's short story The Necklace finds that narrative texts consistently use the linear theme pattern and the constant theme pattern, and sometimes also use the derived theme pattern and other patterns to support text coherence and plot development. Thematic progression has been studied extensively in various fields, but analyses of short stories are fewer than those addressing text coherence and other aspects. The findings have implications for teaching, writing, reading, and translation.
Abstract: The Great Gatsby is universally recognized as one of the masterpieces of world literature. This thesis adopts the methods of comparison, contrast, and quotation to explore why Gatsby is great. It analyzes the text from three aspects: the greatness of Gatsby's economic success, the greatness of his perseverance in love, and the greatness of his personality. In this way, readers can understand Gatsby's greatness.
Funding: financially supported by the Ningbo University of Technology New Faculty Research Fund; the 2023 Interdisciplinary Innovation Research Cultivation Program of School of Interdisciplinary Studies, RUC; Key Project of the National Social Science Foundation of China (21ATQ008).
Abstract: Purpose: Policies have often, albeit inadvertently, overlooked certain scientific insights, especially in the handling of complex events. This study aims to systematically uncover and evaluate pivotal scientific insights that have been underrepresented in policy documents by leveraging extensive datasets from policy texts and scholarly publications. Design/methodology/approach: This article introduces a research framework for excavating scientific insights that have been overlooked by policy. The framework has four parts: data acquisition and preprocessing, the identification of overlooked content through thematic analysis, the discovery of overlooked content via keyword analysis, and a comprehensive analysis and discussion of the overlooked content. Using this framework, the research conducts an in-depth exploration of the scientific content overlooked by policies during the COVID-19 pandemic. Findings: During the COVID-19 pandemic, scientific information in four domains was overlooked by policy: the psychological state of the populace, environmental issues, the role of computer technology, and public relations. These findings indicate a systematic underrepresentation of important scientific insights in policy. Research limitations: This study is subject to two key limitations. First, the text analysis method, which relies on pre-extracted keywords and thematic structures, may not fully capture the nuanced context and complexity of scientific insights in policy documents. Second, the focus on a limited set of case studies restricts the broader applicability of the conclusions across diverse situations. Practical implications: The study introduces a quantitative framework using text analysis to identify overlooked scientific content in policy, bridging the gap between science and policy. It also highlights overlooked scientific information during COVID-19, promoting more evidence-based and robust policies through improved science-policy integration. Originality/value: This paper provides new ideas and methods for excavating scientific information that has been overlooked by policy, deepens the understanding of the interaction between policy and science during the COVID-19 period, and lays the foundation for more rational use of scientific information in policy-making.
Funding: supported by Sanya Science and Technology Special Fund, project number 2019YD23.
Abstract: This article focuses on the Hainan Wenbifeng Pangu Cultural Tourist Area in Ding'an County, Hainan Province. Network text analysis was used to collect internet promotion information about the Wenbifeng Scenic Area. Data from five platforms (Xiaohongshu, TikTok, WeChat Official Accounts, Headlines Today, and Baidu) were gathered to understand the current situation and existing problems in the tourism promotion of the Wenbifeng Scenic Area. The article summarizes and analyzes these issues. Finally, drawing also on on-site research, targeted suggestions are proposed for tourism promotion in the Wenbifeng Scenic Area.
Funding: supported by the National Nature Foundation of China under Grant No. 72104108; the College Students' Innovation and Entrepreneurship Training Program (No. 202410298155Y).
Abstract: Green consumption (GC) is crucial for achieving the Sustainable Development Goals (SDGs). However, few studies have explored public attitudes toward GC using social media data, missing potential public concerns captured through big data. To address this gap, this study collects and analyzes public attention toward GC using web crawler technology. Based on data from Sina Weibo, we applied RoBERTa, an advanced NLP model based on the transformer architecture, to conduct fine-grained sentiment analysis of the public's attention, attitudes, and hot topics on GC, demonstrating the potential of deep learning methods in capturing dynamic and contextual emotional shifts across time and regions. Among the sample (N = 188,509), 53.91% expressed a positive attitude, with variation across different times and regions. Temporally, public interest in GC has shown an annual growth rate of 30.23%, gradually shifting from fulfilling basic needs to prioritizing entertainment consumption. Spatially, GC is most prevalent in the southeast coastal regions of China, with Beijing ranking first across five evaluated domains. Individuals and government-affiliated accounts play a key role in public discussions on social networks, accounting for 45.89% and 30.01% of user reviews, respectively. A significant positive correlation exists between economic development and public attention to GC, as indicated by a Pearson correlation coefficient of 0.55. Companies, in particular, exhibit cautious behavior in the early stages of green product adoption, prioritizing profitability before making substantial investments. These findings provide valuable insights into the evolving public perception of GC, contributing to the development of more effective environmental policies in China.
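The Pearson correlation coefficient cited in this abstract can be computed as in the following minimal sketch. The function name `pearson_r` is hypothetical and the inputs are assumed to be two equal-length lists of numbers, for example a regional economic indicator and a regional attention count; the study's own data and tooling are not given in the abstract.

```python
import math


def pearson_r(xs, ys):
    """Pearson correlation coefficient of two equal-length numeric sequences."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    # Covariance and variance sums (the common 1/n factor cancels out).
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for a constant sequence")
    return cov / math.sqrt(var_x * var_y)
```

A value near 1 indicates a strong positive linear relationship, a value near -1 a strong negative one, and a value near 0 no linear relationship.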
Abstract: Discourse pragmatics interprets the context-specific meaning of the communicative events underlying texts in use. It is a cross-disciplinary field based on pragmatics, sociolinguistics, and discourse analysis. This paper uses the theories and dynamic descriptions of conversation analysis and thematic structure analysis to examine two real texts for their textual and interpersonal meanings, in order to illustrate both the necessity and the feasibility of applying modern linguistic theories to transform traditional methods of language teaching.
Abstract: This study introduces the Orbit Weighting Scheme (OWS), a novel approach aimed at enhancing the precision and efficiency of Vector Space information retrieval (IR) models, which have traditionally relied on weighting schemes like tf-idf and BM25. These conventional methods often struggle to capture document relevance accurately, leading to inefficiencies in both retrieval performance and index size management. OWS proposes a dynamic weighting mechanism that evaluates the significance of terms based on their orbital position within the vector space, emphasizing term relationships and distribution patterns overlooked by existing models. Our research focuses on evaluating OWS's impact on model accuracy using information retrieval metrics such as Recall, Precision, Interpolated Average Precision (IAP), and Mean Average Precision (MAP). Additionally, we assess OWS's effectiveness in reducing the inverted index size, which is crucial for model efficiency. We compare OWS-based retrieval models against others using different schemes, including tf-idf variations and BM25Delta. Results reveal OWS's superiority, achieving 54% Recall and 81% MAP, and a notable 38% reduction in the inverted index size. This highlights OWS's potential in optimizing retrieval processes and underscores the need for further research in this underrepresented area to fully leverage OWS's capabilities in information retrieval methodologies.
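Mean Average Precision, one of the evaluation metrics named in this abstract, can be sketched as follows. This is the standard textbook definition, not the paper's evaluation code; the function names and the document identifiers in the usage note are placeholders.

```python
def average_precision(ranked_ids, relevant_ids):
    """Average precision of one ranked result list against a set of relevant ids."""
    relevant = set(relevant_ids)
    if not relevant:
        return 0.0
    hits = 0
    precision_sum = 0.0
    for rank, doc_id in enumerate(ranked_ids, start=1):
        if doc_id in relevant:
            hits += 1
            # Precision at this rank, counted only where a relevant doc appears.
            precision_sum += hits / rank
    # Relevant documents that were never retrieved contribute zero.
    return precision_sum / len(relevant)


def mean_average_precision(runs):
    """Mean of average precision over (ranked_ids, relevant_ids) pairs, one per query."""
    runs = list(runs)
    if not runs:
        return 0.0
    return sum(average_precision(ranked, rel) for ranked, rel in runs) / len(runs)
```

For a single query with ranking `["d1", "d2", "d3"]` and relevant set `{"d1", "d3"}`, the average precision is (1/1 + 2/3) / 2, so a weighting scheme that moves `d3` up to rank 2 would raise the score to 1.0.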
Funding: supported by the Manchot Graduate School "Competitiveness of Young Enterprises" at the Heinrich-Heine-University of Düsseldorf. Funding was provided by the Jürgen Manchot Stiftung.
Abstract: This paper provides a systematic literature review of text analysis methodologies used in blockchain-related research to comprehend and synthesize existing studies across disciplines and define future research directions. We summarize the research scope, text data, and methodologies of 124 papers and identify the two most common combinations of these dimensions: (1) papers that focus on specific cryptocurrencies tend to apply sentiment analysis to instant user-generated content or news articles to discover the correlations between public opinion and market behavior, and (2) studies that examine the broad concept of blockchain with text data from documents published by companies tend to apply topic modeling techniques to explore classifications and trends in blockchain development. We discover five major research topics in the academic literature: relationship discovery, cryptocurrency performance prediction, classification and trend, crime and regulation, and perception of blockchain. Based on these findings, we highlight three potential research directions for researchers to select topics and implement suitable methodologies for text analysis.
Abstract: With the remarkable growth of textual data sources in recent years, easy, fast, and accurate text processing has become a challenge with significant payoffs. Automatic text summarization is the process of compressing text documents into shorter summaries for easier review of their core contents, which must be done without losing important features and information. This paper introduces a new hybrid method for extractive text summarization with feature selection based on text structure. The major advantage of the proposed summarization method over previous systems is the modeling of text structure and of the relationships between entities in the input text, which improves the sentence feature selection process and leads to the generation of unambiguous, concise, consistent, and coherent summaries. The paper also presents the results of evaluating the proposed method on precision and recall criteria. It is shown that the method produces summaries consisting of chains of sentences from the original text that have the aforementioned characteristics.
Funding: supported by Guizhou Province Major Science and Technology Achievement Transformation Project (QKHCG[2024]ZD016); the Excellent Young Scientists Fund from the National Natural Science Foundation of China (Grant No. 42422105); Guizhou Province Natural Science Research Project (Qian Jiao Ji[2023]No.033); Provincial Science and Technology Program of Guizhou Province (Grant No. 20201Y288).
Abstract: The Sustainable Development Goals (SDGs) are crucial in tackling the sustainability challenges and emerging issues faced by humanity, with government attention being a significant factor in promoting their successful achievement. However, there is limited quantitative research systematically examining the impacts of government attention on SDGs progress. This study employs text analysis and a panel regression model to analyze the impacts of government attention intensity, text similarity, and tone on the achievement of SDGs, utilizing data extracted from China's Government Work Reports spanning the decade from 2010 to 2020. The findings reveal that the Chinese government's attention to the SDGs has generally increased over time. The heightened focus has notably bolstered the achievement of the SDGs, with the most significant impact observed post-2015. Government attention intensity was identified as the most impactful factor. Moreover, government attention intensity, text similarity, and tone have positively influenced the coupling coordination relationship among the 17 SDGs, as measured by the coupling coordination degree, leading to a more harmonious and balanced achievement of socioeconomic and environmental goals in China. Financial investment served as a moderating factor, enhancing the positive impacts of attention intensity, text similarity, and tone on the promotion of SDGs attainment. The effects of government attention on SDGs progress were notably positive in the eastern region, and were more significant in areas with stronger governance capacity than in those with weaker governance capacity. This study provides insightful information for enhancing the modernization and efficiency of China's national governance system, promoting SDGs at local and global scales, and fostering sustainable transformation.
Funding: the Deanship of Scientific Research at King Khalid University funded this work under grant number (R.G.P.2/55/40/2019), received by Fahd N. Al-Wesabi. www.kku.edu.sa
Abstract: In this paper, a hybrid intelligent text zero-watermarking approach is proposed that integrates text zero-watermarking and a hidden Markov model as natural language processing techniques for the content authentication and tampering detection of Arabic text. The proposed approach is known as the Second order of Alphanumeric Mechanism of Markov model and Zero-Watermarking Approach (SAMMZWA). A second-level-order alphanumeric mechanism based on a hidden Markov model is integrated with text zero-watermarking techniques to improve the overall performance and tampering detection accuracy of the proposed approach. The SAMMZWA approach embeds and detects the watermark logically without altering the original text document. The extracted features are used as watermark information and integrated with digital zero-watermarking techniques. To detect eventual tampering, SAMMZWA has been implemented and validated with attacked Arabic text. Experiments were performed on four datasets of varying lengths under multiple random locations of insertion, reorder, and deletion attacks. The experimental results show that the method is more sensitive to all of these tampering attacks, and detects tampering more accurately, than the compared methods.
Funding: The authors extend their appreciation to the Deanship of Scientific Research at King Khalid University for funding this work under grant number (R.G.P.2/55/40/2019), received by Fahd N. Al-Wesabi. www.kku.edu.sa
Abstract: In this paper, a combined approach, CAZWNLP (a combined approach of zero-watermarking and natural language processing), has been developed for the tampering detection of English text exchanged through the Internet. The third-gram alphanumeric mechanism of the Markov model is used with text-watermarking technologies to improve the performance and accuracy of tampering detection, which are limited in the existing works reviewed in the literature of this study. The third-order level of the Markov model is used in this method as a natural language processing technology to analyze an English text and extract the textual characteristics of the given contexts. The extracted features are then utilized as watermark information and validated against the attacked English text to detect any suspected tampering. The embedding mechanism of CAZWNLP works logically, without modifying the original text document to embed a watermark key. CAZWNLP has been implemented in PHP using the VS Code IDE. The experimental and simulation results on standard datasets of varying lengths show that the proposed approach achieves high robustness and better detection accuracy against common random insertion, reorder, and deletion attacks. Comparison results with baseline approaches also show the advantages of the proposed approach.
Funding: The authors extend their appreciation to the Deanship of Scientific Research at King Khalid University for funding this work under Grant Number (RGP.1/147/42), received by Fahd N. Al-Wesabi. www.kku.edu.sa
Abstract: Content authentication, integrity verification, and tampering detection of digital content exchanged via the internet are a major concern in information and communication technology. In this paper, a text zero-watermarking approach known as the Smart-Fragile Approach based on Soft Computing and Digital Watermarking (SFASCDW) is proposed for content authentication and tampering detection of English text. A first-level-order alphanumeric mechanism, based on a hidden Markov model, is integrated with digital zero-watermarking techniques to improve the watermark robustness of the proposed approach. The first-level-order alphanumeric mechanism of the Markov model serves as a soft computing technique to analyze English text. The approach extracts features of the interrelationships among the contexts of the text, uses the extracted features as watermark information, and later validates that information against the studied English text to detect any tampering. SFASCDW has been implemented in PHP using the VS Code IDE. The robustness, effectiveness, and applicability of SFASCDW are demonstrated in experiments involving four datasets of various lengths, with the three common attacks (insertion, reorder, and deletion) applied at random locations. SFASCDW was found to be effective and applicable in detecting possible tampering.
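The general idea behind the zero-watermarking abstracts in this listing, deriving a watermark from first-order Markov transitions without modifying the text, can be illustrated with a minimal sketch. This is not the SFASCDW algorithm or any of the authors' implementations (which the abstracts say are written in PHP). It assumes word-level states, keeps only the most frequent successor of each word, and hashes the result with SHA-256; all function names are hypothetical.

```python
import hashlib
import re
from collections import Counter, defaultdict


def transition_features(text):
    """Map each word to its most frequent successor (first-order transitions)."""
    tokens = re.findall(r"\w+", text.lower())
    successors = defaultdict(Counter)
    for current, following in zip(tokens, tokens[1:]):
        successors[current][following] += 1
    # Ties are broken alphabetically so the features are deterministic.
    return {
        state: min(counts, key=lambda w: (-counts[w], w))
        for state, counts in successors.items()
    }


def zero_watermark(text):
    """Derive a watermark key from the text; the text itself is never modified."""
    features = transition_features(text)
    payload = "|".join(f"{s}>{t}" for s, t in sorted(features.items()))
    return hashlib.sha256(payload.encode("utf-8")).hexdigest()


def is_tampered(text, stored_watermark):
    """Regenerate the watermark from the received text and compare it to the stored key."""
    return zero_watermark(text) != stored_watermark
```

In use, the owner would store `zero_watermark(original_text)` with a trusted party and later call `is_tampered(received_text, stored_key)`. Because this sketch keeps only the dominant successor of each word, an edit that does not change any dominant transition would go undetected, and a single hash reports only whether tampering occurred, not where.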