Funding: Supported by the Innovation Platform Construction of Qinghai Province (No. 2016-ZJ-Y04) and the Basic Research Program of Qinghai Province (No. 2016-ZJ-740).
Abstract: Text extraction is an important first step in digitizing historical documents. This paper presents a text extraction method for historical Tibetan document images based on block projections. Text extraction is treated as a text area detection and localization problem. Each image is divided into equal blocks, and the blocks are filtered using the categories of their connected components and their corner point density. By analyzing the projections of the filtered blocks, the approximate text areas are located and the text regions are extracted. Experiments on a dataset of historical Tibetan documents demonstrate the effectiveness of the proposed method.
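The block-projection idea can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract filters blocks by connected-component category and corner point density, whereas the sketch below substitutes a simple ink-density threshold as a stand-in. The names `block_density` and `locate_text_area` and the parameters `min_density` and `max_density` are hypothetical.

```python
from typing import List, Optional, Tuple

Image = List[List[int]]  # binary image: 1 = ink (foreground), 0 = background


def block_density(img: Image, top: int, left: int, size: int) -> float:
    """Fraction of foreground pixels in one size x size block."""
    ink = sum(img[r][c]
              for r in range(top, top + size)
              for c in range(left, left + size))
    return ink / (size * size)


def locate_text_area(img: Image, size: int, min_density: float,
                     max_density: float) -> Optional[Tuple[int, int, int, int]]:
    """Return (top, left, bottom, right) of the approximate text area in pixels.

    bottom and right are exclusive. Returns None if no block passes the filter.
    """
    n_rows, n_cols = len(img) // size, len(img[0]) // size
    # Step 1: keep blocks whose ink density looks like text.
    # ASSUMPTION: a density band replaces the paper's connected-component
    # and corner-density filter.
    keep = [[min_density <= block_density(img, br * size, bc * size, size) <= max_density
             for bc in range(n_cols)]
            for br in range(n_rows)]
    # Step 2: project the kept blocks onto the vertical and horizontal axes.
    row_proj = [sum(row) for row in keep]
    col_proj = [sum(keep[br][bc] for br in range(n_rows)) for bc in range(n_cols)]
    # Step 3: the text area spans the first to last non-empty projection bins.
    rows = [i for i, v in enumerate(row_proj) if v > 0]
    cols = [j for j, v in enumerate(col_proj) if v > 0]
    if not rows or not cols:
        return None
    return (rows[0] * size, cols[0] * size,
            (rows[-1] + 1) * size, (cols[-1] + 1) * size)
```

Blocks that are nearly solid ink (for example borders or stains) fall above `max_density` and are dropped before projection, which is the role the filtering step plays in the description above.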
Abstract: Aim: To explore and analyze the feasibility of establishing a complex intervention program in Traditional Chinese Medicine (TCM) based on text mining and interviewing methods. Methods: According to MRC, constructing a complex intervention program in TCM by text mining and interviewing should include four steps: 1) establishing the interview framework through normalized extraction of ancient documents and effective collection of modern periodical literature; 2) fleshing out the interview outline through focus group interviews; 3) preliminary construction of the complex intervention program through semi-structured interviews; 4) evaluating the curative effect of the complex intervention. Conclusions: It is feasible and meaningful to establish a complex intervention program in TCM based on text mining and interviewing methods.
Abstract: With growing interest in e-commerce shopping, customer reviews have become one of the most important factors in determining customer satisfaction with products, which underlines the value of text mining. This study is based on the Women's Clothing E-Commerce Reviews database, which consists of reviews written by real customers. The aim of this paper is to apply a text mining approach to a set of customer reviews. Each review was classified as either positive or negative. Four tree-based methods were applied to the classification problem: Classification Tree, Random Forest, Gradient Boosting, and XGBoost. The dataset was split into training and test sets. The results indicate that Random Forest overfits, XGBoost overfits if the number of trees is too high, Classification Tree is good at detecting negative reviews but poor at detecting positive ones, and Gradient Boosting shows stable values, with quality measures above 77% on the test dataset. The methods agree on which terms are important for classification.
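All four methods named above are built from splits on individual features. As a rough illustration only, the sketch below finds the single best word-presence split (a decision stump) on a bag-of-words representation. The choice of Gini impurity, the whitespace tokenization, and the function names `gini` and `best_term_split` are assumptions; the abstract does not state the study's splitting criterion or preprocessing.

```python
from typing import List, Tuple


def gini(labels: List[int]) -> float:
    """Gini impurity of a list of 0/1 labels (0.0 for an empty or pure list)."""
    if not labels:
        return 0.0
    p = sum(labels) / len(labels)
    return 2 * p * (1 - p)


def best_term_split(reviews: List[str], labels: List[int]) -> Tuple[str, float]:
    """Find the word whose presence/absence split gives the lowest weighted Gini.

    labels use 1 for a positive review and 0 for a negative one.
    Ties are broken in favor of the alphabetically first word.
    """
    # ASSUMPTION: lowercase whitespace tokenization stands in for real preprocessing.
    docs = [set(r.lower().split()) for r in reviews]
    vocab = sorted(set().union(*docs))
    best_term, best_score = "", float("inf")
    for term in vocab:
        with_term = [y for d, y in zip(docs, labels) if term in d]
        without = [y for d, y in zip(docs, labels) if term not in d]
        score = (len(with_term) * gini(with_term)
                 + len(without) * gini(without)) / len(labels)
        if score < best_score:
            best_term, best_score = term, score
    return best_term, best_score
```

A full classification tree repeats this search recursively on each side of the split, and the ensemble methods combine many such trees. Terms chosen near the top of the trees are one common notion of "important classification terms".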
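Abstract: To address the lack of a systematic evaluation benchmark for Chinese-developed large language models (LLMs) in geographic information science (GIS), a customized GIS evaluation framework is built on the Geo-Text-700 test set, and 10 mainstream Chinese-developed models are evaluated along multiple dimensions using an analytic hierarchy approach based on the technique for order preference by similarity to ideal solution (TOPSIS). The results show that model performance differs markedly by question type: the mean score on objective questions is 68.4 (standard deviation ±5.2), which is 21.7% lower than on subjective questions (P<0.05); Doubao-pro-32k achieves the best overall score (87.3), with a clear advantage on objective questions (single-choice 86, fill-in-the-blank 77); hunyuan-turbo shows potential for higher-order tasks on subjective questions (short answer 88.1, programming 90.83); and domain knowledge blind spots are prominent, for example a 43.6% error rate on GIS topology rule questions.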
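For readers unfamiliar with TOPSIS, the following is a minimal sketch of the standard procedure, not the study's evaluation code. It assumes the criterion weights are already given (the abstract suggests they come from a hierarchy analysis, which is not reproduced here), uses vector normalization, and takes hypothetical scores as input. The function name `topsis` and its parameters are illustrative.

```python
import math
from typing import List, Sequence


def topsis(scores: Sequence[Sequence[float]],
           weights: Sequence[float],
           benefit: Sequence[bool]) -> List[float]:
    """Return the TOPSIS closeness coefficient (0..1, higher is better) per row.

    scores[i][j] is the score of alternative i on criterion j.
    benefit[j] is True if larger values of criterion j are better.
    ASSUMPTION: no criterion column is all zeros and the alternatives
    are not all identical, so no division by zero occurs.
    """
    n_crit = len(weights)
    # Vector-normalize each criterion column, then apply the weights.
    norms = [math.sqrt(sum(row[j] ** 2 for row in scores)) for j in range(n_crit)]
    v = [[weights[j] * row[j] / norms[j] for j in range(n_crit)] for row in scores]
    # Ideal and anti-ideal points, one coordinate per criterion.
    columns = list(zip(*v))
    ideal = [max(col) if benefit[j] else min(col) for j, col in enumerate(columns)]
    anti = [min(col) if benefit[j] else max(col) for j, col in enumerate(columns)]
    closeness = []
    for row in v:
        d_pos = math.dist(row, ideal)  # distance to the ideal solution
        d_neg = math.dist(row, anti)   # distance to the anti-ideal solution
        closeness.append(d_neg / (d_pos + d_neg))
    return closeness
```

With two criteria such as objective and subjective scores, an alternative that is best on both gets a closeness of 1.0 and one that is worst on both gets 0.0; the rest fall in between and can be ranked by that value.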
Funding: Peak Discipline Construction Project of Education at East China Normal University; National Natural Science Foundation of China, No. 41571138, No. 41471135, No. 41871143.
Abstract: Undergraduates majoring in geography need to learn the history of geographic thought. Although China and the West differ in cultural and educational background, teaching methods such as text teaching, student presentations, and group learning suit most teachers and students, even across countries and regions. The blended method helps to popularize the history of geographic thought and to improve the level of teaching and learning. Because countries such as China lack classes on the history of geographic thought, the authors explore a blended method for first-year geography undergraduates and assess the effects of this teaching through questionnaires. Students benefit from and respond to the class in different ways. A special group consisting of one teacher and several undergraduates conducted the research and coauthored the paper by designing questionnaires, conducting interviews, and analyzing materials from 67 freshmen majoring in human geography and geography science (teacher-training) in China. For undergraduates, especially in countries such as China, it is well worth making the history of geographic thought a required and interesting class.
Abstract: Aim: To clarify how participants' consciousness of rebuilding the community changed, and the factors behind the change, from the content of discussions held in an action for elderly men in Town A in Fukushima Prefecture. Design: This study was action research. Method: The author transcribed the discussions from the action conducted in 2018-2019 and analyzed them year by year using text mining. Results: In both years, the most frequent words were "Person" followed by "Town A". One large word network formed in 2018, and its topic was how the participants feel about their life in Town A. Two large word networks formed in 2019, and their topic was community participation, including the difficulty of motivating others, such as how people who do not participate could come to feel like joining.
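The two quantities reported above, word frequency and word networks, can be sketched as follows. This is an illustration under stated assumptions, not the study's tooling: the input is assumed to be already tokenized utterances, a network edge is assumed to mean co-occurrence within one utterance, and the function name `word_stats` is hypothetical.

```python
from collections import Counter
from itertools import combinations
from typing import List, Tuple


def word_stats(utterances: List[List[str]]) -> Tuple[Counter, Counter]:
    """Count word frequencies and within-utterance co-occurrences.

    Returns (freq, pairs): freq maps word -> total occurrences, and pairs maps
    an alphabetically ordered (word_a, word_b) tuple -> number of utterances
    in which both words appear.
    """
    freq = Counter(w for u in utterances for w in u)
    pairs: Counter = Counter()
    for u in utterances:
        # Use the set of words so a repeated word counts once per utterance.
        for a, b in combinations(sorted(set(u)), 2):
            pairs[(a, b)] += 1
    return freq, pairs
```

Pairs whose count exceeds a chosen threshold can be treated as edges; connected groups of such edges correspond to the "word networks" in the results, and `freq.most_common()` gives the frequency ranking.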
Abstract: China's tourism industry has developed vigorously and has contributed greatly to the country's economic growth. To attract more foreign tourists and convey the unique charm and cultural heritage of Chinese landscapes, translators should use appropriate translation methods to guarantee translation quality. Taking the translation of Hubei scenic spots as a case, the thesis analyzes the guiding role of Skopos Theory in tourism texts through numerous examples, which offers important guidance for translators.
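Abstract: Against the backdrop of green development, this study takes corporate sustainability reports (Environmental, Social and Governance Report, ESG) as its object and analyzes the textual features of Chinese-language ESG reports and the methods for translating them. In terms of text type, ESG reports are typical informative texts, whose main textual feature is a preponderance of informational content such as figures, charts, and headings. When translating Chinese ESG reports, methods such as linear translation (following the source order), conversion, and imitation can be used.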