期刊文献+
共找到82篇文章
< 1 2 5 >
每页显示 20 50 100
Visual analytics tool for the interpretation of hidden states in recurrent neural networks 被引量:1
1
作者 Rafael Garcia Tanja Munz Daniel Weiskopf 《Visual Computing for Industry,Biomedicine,and Art》 EI 2021年第1期233-245,共13页
In this paper,we introduce a visual analytics approach aimed at helping machine learning experts analyze the hidden states of layers in recurrent neural networks.Our technique allows the user to interactively inspect ... In this paper,we introduce a visual analytics approach aimed at helping machine learning experts analyze the hidden states of layers in recurrent neural networks.Our technique allows the user to interactively inspect how hidden states store and process information throughout the feeding of an input sequence into the network.The technique can help answer questions,such as which parts of the input data have a higher impact on the prediction and how the model correlates each hidden state configuration with a certain output.Our visual analytics approach comprises several components:First,our input visualization shows the input sequence and how it relates to the output(using color coding).In addition,hidden states are visualized through a nonlinear projection into a 2-D visualization space using t-distributed stochastic neighbor embedding to understand the shape of the space of the hidden states.Trajectories are also employed to show the details of the evolution of the hidden state configurations.Finally,a time-multi-class heatmap matrix visualizes the evolution of the expected predictions for multi-class classifiers,and a histogram indicates the distances between the hidden states within the original space.The different visualizations are shown simultaneously in multiple views and support brushing-and-linking to facilitate the analysis of the classifications and debugging for misclassified input sequences.To demonstrate the capability of our approach,we discuss two typical use cases for long short-term memory models applied to two widely used natural language processing datasets. 展开更多
关键词 visual analytics visualIZATION Machine learning Classification Recurrent neural networks Long shortterm memory Hidden states INTERPRETABILITY Natural language processing Nonlinear projection
在线阅读 下载PDF
Exploring the Association between Climate Change and Human Development: A Visual Analytics Study
2
作者 Dongli Zhang Wullianallur Raghupathi Viju Raghupathi 《Atmospheric and Climate Sciences》 2024年第4期368-395,共28页
This study explores the complex relationship between climate change and human development. The aim is to understand how climate change affects human development across countries, regions, and the global population. Vi... This study explores the complex relationship between climate change and human development. The aim is to understand how climate change affects human development across countries, regions, and the global population. Visual analytics were used to examine the impact of various climate change indicators on different aspects of human development. The study highlights the urgent need for climate change action and encourages policymakers to make decisive moves. Climate change adversely affects numerous aspects of daily life, leading to significant consequences that must be addressed through policy changes and global governance recommendations. Key findings include that regions with higher CO2 emissions experience a significantly higher incidence of life-threatening diseases compared to regions with lower emissions. Additionally, higher CO2 emissions correlate with consistent death rates. Increased pollution exposure is associated with a higher prevalence of life-threatening diseases and higher rates of malnutrition. Moreover, greater mineral depletion is linked to more frequent life-threatening diseases, suggesting that industrialization contributes to adverse health effects. These results provide valuable insights for policy and decision-making aimed at mitigating the impact of climate change on human development. 展开更多
关键词 Air Pollution Climate Change CO2 Emissions Death Rate GDP Human Development visual analytics
在线阅读 下载PDF
Contextualized visual analytics for multivariate events
3
作者 Lei Peng Ziyue Lin +2 位作者 Natalia Andrienko Gennady Andrienko Siming Chen 《Visual Informatics》 2025年第2期14-29,共16页
For event analysis,the information from both before and after the event can be crucial in certain scenarios.By incorporating a contextualized perspective in event analysis,analysts can gain deeper insights from the ev... For event analysis,the information from both before and after the event can be crucial in certain scenarios.By incorporating a contextualized perspective in event analysis,analysts can gain deeper insights from the events.We propose a contextualized visual analysis framework which enables the identification and interpretation of temporal patterns within and across multivariate events.The framework consists of a design of visual representation for multivariate event contexts,a data processing workflow to support the visualization,and a context-centered visual analysis system to facilitate the interactive exploration of temporal patterns.To demonstrate the applicability and effectiveness of our framework,we present case studies using real-world datasets from two different domains and an expert study conducted with experienced data analysts. 展开更多
关键词 visual analytics Event analysis Contextualized analysis Interactive exploration visualization design
原文传递
A methodological approach towards human-centered visual analytics
4
作者 Emmanouil Adamakis George Margetis +1 位作者 Stavroula Ntoa Constantine Stephanidis 《Visual Informatics》 2025年第4期72-83,共12页
Visual analytics focuses on amplifying users’reasoning and understanding by enhancing data analysis procedures with the efficient incorporation of information visualization and data processing techniques.In this stud... Visual analytics focuses on amplifying users’reasoning and understanding by enhancing data analysis procedures with the efficient incorporation of information visualization and data processing techniques.In this study,we conduct an overview of this multidisciplinary field,focusing on both the process that formalizes its primary concepts and the affiliated research areas.We identify key developments in each area,as well as the challenges that arise when these areas are interconnected under the visual analytics process.We consider that to address the identified challenges,an appropriate representation of key user needs is essential.Therefore,inspired by human-centered design and its principles,we propose a novel methodological approach comprising a human-centered definition of visual analytics that expands on models of the field and quantifies the intermediate states of a data analysis.In addition to the theoretical aspects of the definition,we also provide a set of directions that align the process with technical aspects of the development cycle.In this respect,our research endeavor aims to transform the visual analytics process into an essential method for both conceptualizing data analysis systems capable of anticipating user needs and for streamlining their technical implementation. 展开更多
关键词 visual analytics Analytical reasoning Human-centered design
原文传递
Recent progress and trends in predictive visual analytics 被引量:1
5
作者 Junhua LU Wei CHEN +4 位作者 Yuxin MA Junming KE Zongzhuang LI Fan ZHANG Ross MACIEJEWSKI 《Frontiers of Computer Science》 SCIE EI CSCD 2017年第2期192-207,共16页
A wide variety of predictive analytics techniques have been developed in statistics, machine learning and data mining; however, many of these algorithms take a black-box approach in which data is input and future pred... A wide variety of predictive analytics techniques have been developed in statistics, machine learning and data mining; however, many of these algorithms take a black-box approach in which data is input and future predictions are output with no insight into what goes on during the process. Unfortunately, such a closed system approach often leaves little room for injecting domain expertise and can result in frustration from analysts when results seem snurious or confusing. In order to allow for more human-centric approaches, the visualization community has begun developing methods to enable users to incorporate expert knowledge into the pre- diction process at all stages, including data cleaning, feature selection, model building and model validation. This paper surveys current progress and trends in predictive visual ana- lytics, identifies the common framework in which predictive visual analytics systems operate, and develops a summariza- tion of the predictive analytics workfiow. 展开更多
关键词 predictive visual analytics visualIZATION visual analytics data mining predictive analysis
原文传递
A survey of visual analytics techniques for machine learning 被引量:13
6
作者 Jun Yuan Changjian Chen +3 位作者 Weikai Yang Mengchen Liu Jiazhi Xia Shixia Liu 《Computational Visual Media》 EI CSCD 2021年第1期3-36,共34页
Visual analytics for machine learning has recently evolved as one of the most exciting areas in the field of visualization.To better identify which research topics are promising and to learn how to apply relevant tech... Visual analytics for machine learning has recently evolved as one of the most exciting areas in the field of visualization.To better identify which research topics are promising and to learn how to apply relevant techniques in visual analytics,we systematically review259 papers published in the last ten years together with representative works before 2010.We build a taxonomy,which includes three first-level categories:techniques before model building,techniques during modeling building,and techniques after model building.Each category is further characterized by representative analysis tasks,and each task is exemplified by a set of recent influential works.We also discuss and highlight research challenges and promising potential future research opportunities useful for visual analytics researchers. 展开更多
关键词 visual analytics machine learning data quality feature selection model understanding content analysis
原文传递
A Survey of Visual Analytics Techniques and Applications: State-of-the-Art Research and Future Challenges 被引量:8
7
作者 孙国道 巫英才 +1 位作者 梁荣华 刘世霞 《Journal of Computer Science & Technology》 SCIE EI CSCD 2013年第5期852-867,共16页
Visual analytics employs interactive visualizations to integrate users' knowledge and inference capability into numerical/algorithmic data analysis processes. It is an active research field that has applications in m... Visual analytics employs interactive visualizations to integrate users' knowledge and inference capability into numerical/algorithmic data analysis processes. It is an active research field that has applications in many sectors, such as security, finance, and business. The growing popularity of visual analytics in recent years creates the need for a broad survey that reviews and assesses the recent developments in the field. This report reviews and classifies recent work into a set of application categories including space and time, multivariate, text, graph and network, and other applications. More importantly, this report presents analytics space, inspired by design space, which relates each application category to the key steps in visual analytics, including visual mapping, model-based analysis, and user interactions. We explore and discuss the analytics space to acld the current understanding and better understand research trends in the field. 展开更多
关键词 visual analytics information visualization data analysis user interaction
原文传递
A survey of urban visual analytics:Advances and future directions 被引量:3
8
作者 Zikun Deng Di Weng +3 位作者 Shuhan Liu Yuan Tian Mingliang Xu Yingcai Wu 《Computational Visual Media》 SCIE EI CSCD 2023年第1期3-39,共37页
Developing effective visual analytics systems demands care in characterization of domain problems and integration of visualization techniques and computational models.Urban visual analytics has already achieved remark... Developing effective visual analytics systems demands care in characterization of domain problems and integration of visualization techniques and computational models.Urban visual analytics has already achieved remarkable success in tackling urban problems and providing fundamental services for smart cities.To promote further academic research and assist the development of industrial urban analytics systems,we comprehensively review urban visual analytics studies from four perspectives.In particular,we identify 8 urban domains and 22 types of popular visualization,analyze 7 types of computational method,and categorize existing systems into 4 types based on their integration of visualization techniques and computational models.We conclude with potential research directions and opportunities. 展开更多
关键词 visual analytics smart city spatiotemporal data analysis urban analytics
原文传递
Steering data quality with visual analytics:The complexity challenge 被引量:6
9
作者 Shixia Liu Gennady Andrienko +5 位作者 Yingcai Wu Nan Cao Liu Jiang Conglei Shi Yu-Shuen Wang Seokhee Hong 《Visual Informatics》 EI 2018年第4期191-197,共7页
Data quality management,especially data cleansing,has been extensively studied for many years in the areas of data management and visual analytics.In the paper,we first review and explore the relevant work from the re... Data quality management,especially data cleansing,has been extensively studied for many years in the areas of data management and visual analytics.In the paper,we first review and explore the relevant work from the research areas of data management,visual analytics and human-computer interaction.Then for different types of data such as multimedia data,textual data,trajectory data,and graph data,we summarize the common methods for improving data quality by leveraging data cleansing techniques at different analysis stages.Based on a thorough analysis,we propose a general visual analytics framework for interactively cleansing data.Finally,the challenges and opportunities are analyzed and discussed in the context of data and humans. 展开更多
关键词 Data quality management visual analytics Data cleansing
原文传递
Visual analytics of taxi trajectory data via topical sub-trajectories 被引量:4
10
作者 Huan Liu Sichen Jin +2 位作者 Yuyu Yan Yubo Tao Hai Lin 《Visual Informatics》 EI 2019年第3期140-149,共10页
GPS-based taxi trajectories contain valuable knowledge about movement patterns for transportation and urban planning.Topic modeling is an effective tool to extract semantic information from taxi trajectory data.Howeve... GPS-based taxi trajectories contain valuable knowledge about movement patterns for transportation and urban planning.Topic modeling is an effective tool to extract semantic information from taxi trajectory data.However,previous methods generally ignore trajectory directions that are important in the analysis of movement patterns.In this paper,we employ the bigram topic model rather than traditional topic models to analyze textualized trajectories and consider the direction information of trajectories.We further propose a modified Apriori algorithm to extract topical sub-trajectories and use them to represent each topic.Finally,we design a visual analytics system with several linked views to facilitate users to interactively explore movement patterns from topics and topical sub-trajectories.The case studies with Chengdu taxi trajectory data demonstrate the effectiveness of the proposed system. 展开更多
关键词 Trajectory pattern mining Trajectory visualization visual analytics Topic model
原文传递
MessageLens:A Visual Analytics System to Support Multifaceted Exploration of MOOC Forum Discussions 被引量:3
11
作者 Jian-Syuan Wong Xiaolong"Luke”Zhang 《Visual Informatics》 EI 2018年第1期37-49,共13页
Massive Open Online Courses(MOOCs)often provide online discussion forum tools to facilitate learner interaction and communication.Having massive forum messages posted by learners everyday,MOOC forums are regarded as a... Massive Open Online Courses(MOOCs)often provide online discussion forum tools to facilitate learner interaction and communication.Having massive forum messages posted by learners everyday,MOOC forums are regarded as an important source for understanding learners activities and opinions.However,the high volume and heterogeneity of MOOC forum contents make it challenging to analyze forum data effectively from different perspectives of discussions and to integrate diverse information into a coherent understanding of issues of concern.In this paper,we report a study on the design of a visual analytics tool to facilitate the multifaceted analysis of online discussion forums.This tool,called MessageLens,aims at helping MOOC instructors to gain a better understanding of forum discussions from three facets:discussion topic,learner attitude,and communication among learners.With various visualization tools,instructors can investigate learner activities from different perspectives.We report a case study with real-world MOOC forum data to present the features of MessageLens and a preliminary evaluation study on the benefits and areas of improvement of the system.Our research suggests an approach to analyzing rich communication contents as well as dynamic social interactions among people. 展开更多
关键词 Multifaceted analysis MOOC forum visual analytics
原文传递
A theoretical model for pattern discovery in visual analytics 被引量:2
12
作者 Natalia Andrienko Gennady Andrienko +2 位作者 Silvia Miksch Heidrun Schumann Stefan Wrobel 《Visual Informatics》 EI 2021年第1期23-42,共20页
The word‘pattern’frequently appears in the visualisation and visual analytics literature,but what do we mean when we talk about patterns?We propose a practicable definition of the concept of a pattern in a data dist... The word‘pattern’frequently appears in the visualisation and visual analytics literature,but what do we mean when we talk about patterns?We propose a practicable definition of the concept of a pattern in a data distribution as a combination of multiple interrelated elements of two or more data components that can be represented and treated as a unified whole.Our theoretical model describes how patterns are made by relationships existing between data elements.Knowing the types of these relationships,it is possible to predict what kinds of patterns may exist.We demonstrate how our model underpins and refines the established fundamental principles of visualisation.The model also suggests a range of interactive analytical operations that can support visual analytics workflows where patterns,once discovered,are explicitly involved in further data analysis. 展开更多
关键词 visual analytics Data distribution PATTERN ABSTRACTION Data organisation Data arrangement Data variation Pattern discovery
原文传递
An Interactive Visual Analytics Framework for Multi-Field Data in a Geo-Spatial Context 被引量:1
13
作者 Zhiyuan Zhang Xiaonan Tong +3 位作者 Kevin T. McDonnell Alla Zelenyuk Dan Imre Klaus Mueller 《Tsinghua Science and Technology》 SCIE EI CAS 2013年第2期111-124,共14页
Climate research produces a wealth of multivariate data. These data often have a geospatial reference and so it is of interest to show them within their geospatial context. One can consider this configuration as a mul... Climate research produces a wealth of multivariate data. These data often have a geospatial reference and so it is of interest to show them within their geospatial context. One can consider this configuration as a multifield visualization problem, where the geo-space provides the expanse of the field. However, there is a limit on the amount of multivariate information that can be fit within a certain spatial location, and the use of linked multivariate information displays has previously been devised to bridge this gap. In this paper we focus on the interactions in the geographical display, present an implementation that uses Google Earth, and demonstrate it within a tightly linked parallel coordinates display. Several other visual representations, such as pie and bar charts are integrated into the Google Earth display and can be interactively manipulated. Further, we also demonstrate new brushing and visualization techniques for parallel coordinates, such as fixed-window brushing and correlation-enhanced display. We conceived our system with a team of climate researchers, who already made a few important discoveries using it. This demonstrates our system's great potential to enable scientific discoveries, possibly also in other domains where data have a geospatial reference. 展开更多
关键词 geospatial visualization visual analytics information visualization multivariate visualization parallel coordinates coordinated displays linking and brushing
原文传递
Graph-based visual analytics for cyber threat intelligence 被引量:2
14
作者 Fabian Bohm Florian Menges Gunther Pernul 《Cybersecurity》 2018年第1期279-297,共19页
The ever-increasing amount of major security incidents has led to an emerging interest in cooperative approaches to encounter cyber threats.To enable cooperation in detecting and preventing attacks it is an inevitable... The ever-increasing amount of major security incidents has led to an emerging interest in cooperative approaches to encounter cyber threats.To enable cooperation in detecting and preventing attacks it is an inevitable necessity to have structured and standardized formats to describe an incident.Corresponding formats are complex and of an extensive nature as they are often designed for automated processing and exchange.These characteristics hamper the readability and,therefore,prevent humans from understanding the documented incident.This is a major problem since the success and effectiveness of any security measure rely heavily on the contribution of security experts.To meet these shortcomings we propose a visual analytics concept enabling security experts to analyze and enrich semi-structured cyber threat intelligence information.Our approach combines an innovative way of persisting this data with an interactive visualization component to analyze and edit the threat information.We demonstrate the feasibility of our concept using the Structured Threat Information eXpression,the state-ofthe-art format for reporting cyber security issues. 展开更多
关键词 Cyber threat intelligence visual analytics Usable cybersecurity STIX
原文传递
AVA:An automated and AI-driven intelligent visual analytics framework 被引量:1
15
作者 Jiazhe Wang Xi Li +11 位作者 Chenlu Li Di Peng Arran Zeyu Wang Yuhui Gu Xingui Lai Haifeng Zhang Xinyue Xu Xiaoqing Dong Zhifeng Lin Jiehui Zhou Xingyu Liu Wei Chen 《Visual Informatics》 EI 2024年第2期106-114,共9页
With the incredible growth of the scale and complexity of datasets,creating proper visualizations for users becomes more and more challenging in large datasets.Though several visualization recommendation systems have ... With the incredible growth of the scale and complexity of datasets,creating proper visualizations for users becomes more and more challenging in large datasets.Though several visualization recommendation systems have been proposed,so far,the lack of practical engineering inputs is still a major concern regarding the usage of visualization recommendations in the industry.In this paper,we proposed AVA,an open-sourced web-based framework for Automated Visual Analytics.AVA contains both empiric-driven and insight-driven visualization recommendation methods to meet the demands of creating aesthetic visualizations and understanding expressible insights respectively.The code is available at https://github.com/antvis/AVA. 展开更多
关键词 Automated visual analytics visualization recommendation Insight mining
原文传递
Comparative visual analytics for assessing medical records with sequence embedding 被引量:1
16
作者 Rongchen Guo Takanori Fujiwara +4 位作者 Yiran Li Kelly M.Lima Soman Sen Nam K.Tran Kwan-Liu Ma 《Visual Informatics》 EI 2020年第2期72-85,共14页
Machine learning for data-driven diagnosis has been actively studied in medicine to provide better healthcare.Supporting analysis of a patient cohort similar to a patient under treatment is a key task for clinicians t... Machine learning for data-driven diagnosis has been actively studied in medicine to provide better healthcare.Supporting analysis of a patient cohort similar to a patient under treatment is a key task for clinicians to make decisions with high confidence.However,such analysis is not straightforward due to the characteristics of medical records:high dimensionality,irregularity in time,and sparsity.To address this challenge,we introduce a method for similarity calculation of medical records.Our method employs event and sequence embeddings.While we use an autoencoder for the event embedding,we apply its variant with the self-attention mechanism for the sequence embedding.Moreover,in order to better handle the irregularity of data,we enhance the self-attention mechanism with consideration of different time intervals.We have developed a visual analytics system to support comparative studies of patient records.To make a comparison of sequences with different lengths easier,our system incorporates a sequence alignment method.Through its interactive interface,the user can quickly identify patients of interest and conveniently review both the temporal and multivariate aspects of the patient records.We demonstrate the effectiveness of our design and system with case studies using a real-world dataset from the neonatal intensive care unit of UC Davis. 展开更多
关键词 Electronic medical records Event sequence data Autoencoder Self-attention Sequence similarity visual analytics
原文传递
LongLine:Visual Analytics System for Large-scale Audit Logs 被引量:1
17
作者 Seunghoon Yoo Jaemin Jo +1 位作者 Bohyoung Kim Jinwook Seo 《Visual Informatics》 EI 2018年第1期82-97,共16页
Audit logs are different from other software logs in that they record the most primitive events(i.e.,system calls)in modem operating systems.Audit logs contain a detailed trace of an operating system,and thus have rec... Audit logs are different from other software logs in that they record the most primitive events(i.e.,system calls)in modem operating systems.Audit logs contain a detailed trace of an operating system,and thus have received great attention from security experts and system administrators.However,the complexity and size of audit logs,which increase in real time,have hindered analysts from understanding and analyzing them.In this paper,we present a novel visual analytics system,LongLine,which enables interactive visual analyses of large-scale audit logs.LongLine lowers the interpretation barrier of audit logs by employing human-understandable representations(e.g.,file paths and commands)instead of abstract indicators of operating systems(e.g.,file descriptors)as well as revealing the temporal patterns of the logs in a multi-scale fashion with meaningful granularity of time in mind(e.g.,hourly,daily,and weekly).LongLine also streamlines comparative analysis between interesting subsets of logs,which is essential in detecting anomalous behaviors of systems.In addition,LongLine allows analysts to monitor the system state in a streaming fashion,keeping the latency between log creation and visualization less than one minute.Finally,we evaluate our system through a case study and a scenario analysis with security experts. 展开更多
关键词 visual analytics Log visualization Multidimensional Data
原文传递
VISTopic:A visual analytics system for making sense of large document collections using hierarchical topic modeling 被引量:1
18
作者 Yi Yang Quanming Yao Huamin Qu 《Visual Informatics》 EI 2017年第1期40-47,共8页
Effective analysis of large text collections remains a challenging problem given the growing volume of available text data.Recently,text mining techniques have been rapidly developed for automatically extracting key i... Effective analysis of large text collections remains a challenging problem given the growing volume of available text data.Recently,text mining techniques have been rapidly developed for automatically extracting key information from massive text data.Topic modeling,as one of the novel techniques that extracts a thematic structure from documents,is widely used to generate text summarization and foster an overall understanding of the corpus content.Although powerful,this technique may not be directly applicable for general analytics scenarios since the topics and topic-document relationship are often presented probabilistically in models.Moreover,information that plays an important role in knowledge discovery,for example,times and authors,is hardly reflected in topic modeling for comprehensive analysis.In this paper,we address this issue by presenting a visual analytics system,VISTopic,to help users make sense of large document collections based on topic modeling.VISTopic first extracts a set of hierarchical topics using a novel hierarchical latent tree model(HLTM)(Liu et al.,2014).In specific,a topic view accounting for the model features is designed for overall understanding and interactive exploration of the topic organization.To leverage multi-perspective information for visual analytics,VISTopic further provides an evolution view to reveal the trend of topics and a document view to show details of topical documents.Three case studies based on the dataset of IEEE VIS conference demonstrate the effectiveness of our system in gaining insights from large document collections. 展开更多
关键词 Topic-modeling Text visualization visual analytics
原文传递
rafPS:A shapley-based visual analytics approach to interpret traffic 被引量:1
19
作者 Zezheng Feng Yifan Jiang +5 位作者 Hongjun Wang Zipei Fan Yuxin Ma Shuang-Hua Yang Huamin Qu Xuan Song 《Computational Visual Media》 CSCD 2024年第6期1101-1119,共19页
Recent achievements in deep learning(DL)have demonstrated its potential in predicting traffic flows.Such predictions are beneficial for understanding the situation and making traffic control decisions.However,most sta... Recent achievements in deep learning(DL)have demonstrated its potential in predicting traffic flows.Such predictions are beneficial for understanding the situation and making traffic control decisions.However,most state-of-the-art DL models are consi-dered“black boxes”with little to no transparency of the underlying mechanisms for end users.Some previous studies attempted to“open the black box”and increase the interpretability of generated predictions.However,handling complex models on large-scale spatiotemporal data and discovering salient spatial and temporal patterns that significantly influence traffic flow remain challenging.To overcome these challenges,we present TrafPS,a visual analytics approach for interpreting traffic prediction outcomes to support decision-making in traffic management and urban planning.The measurements region SHAP and trajectory SHAP are proposed to quantify the impact of flow patterns on urban traffic at different levels.Based on the task requirements from domain experts,we employed an interactive visual interface for the multi-aspect exploration and analysis of significant flow patterns.Two real-world case studies demonstrate the effectiveness of TrafPS in identifying key routes and providing decision-making support for urban planning. 展开更多
关键词 data visualization model interpretation urban planning urban visual analytics
原文传递
Visual Analytics of Large-scale E-government Text Data via Simplified Word Cloud 被引量:1
20
作者 Yanan Liu Fang He +2 位作者 Jin Wen Zhiguang Zhou Jinchang Li 《Data Science and Informetrics》 2021年第1期29-51,共23页
With the rapid development of Internet technology,a rich set of e-government data are collected by the government departments.For example,a variety of feedback text data can be obtained quickly and efficiently through... With the rapid development of Internet technology,a rich set of e-government data are collected by the government departments.For example,a variety of feedback text data can be obtained quickly and efficiently through various channels such as the mayor’s mailbox.It is an effective way to improve the working efficiency of the government to extract hot topics from large-scale e-government text data,establish the correlation between topics and geographic space,and interactively explore the sources of public feedback problems.However,it is a difficult task to explore the large-scale e-government text data with traditional visualization methods such as word cloud,because too many words are hardly distributed in a limited space which will largely disturb the visual perception.In this paper,we propose a visual analytics system for large-scale e-government data exploration by means of simplified word cloud.Firstly,a representation learning model is used to embed the text data into high-dimensional space to quantitatively represent the semantic structure features of e-government text data.Then,the high-dimensional vectors are projected into a two-dimensional space where the coordinate distribution of points effectively expresses the semantic similarity of original words,which also presents geographic features that can be quantized by means of a similarity computing model.In order to simplify the understanding of large-scale e-government data and improve the cognitive efficiency of word could,we adopt the adaptive blue noise method to sample the topic words,which can simplify the visual expression of word cloud and improve the understanding efficiency of e-government data without losing the semantic structure features.Furthermore,an abstraction and visual analysis system for large-scale e-government text data is designed and implemented by integrating the above representation learning model,sampling-based abstraction model of word cloud,and topic and geographic correlation analysis model.This system provides convenient human-computer interaction modes and supports users to explore the analysis and extraction of the characteristics hidden in large-scale e-government data.It also helps government departments quickly locate the hot topics of public concern and their related regional distribution,and provides decision support to further improve the work efficiency of the government.Case studies based on real-world datasets further verify the effectiveness and practicability of our system. 展开更多
关键词 E-GOVERNMENT Text mining Text visualization visual analytics
原文传递
上一页 1 2 5 下一页 到第
使用帮助 返回顶部