Abstract
With the volume of data on the web growing rapidly, excerpting the useful data and organizing it as an offline Web application makes browsing on mobile devices more convenient, reduces mobile data traffic and its cost, and improves security. To this end, this paper discusses the HTML5 application cache mechanism, the definition of data mining rules and data extraction, data cleaning and storage in a database, and the implementation of the offline Web application. During data mining, some dynamic pages can be accessed only after login authentication. Several authentication methods exist, such as HTTPS, HTTP Digest, HTTP Basic, and web form authentication; to reduce complexity, the paper adopts the Firefox browser's security authentication. Data cleaning and storage, and the generation of the offline Web application, are implemented with PHP scripts and HTML5. Experiments show that the method is feasible and works well.
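As a rough illustration of the HTML5 application cache mechanism the abstract refers to, the sketch below generates the text of a cache manifest listing the pages an offline Web application should keep locally. It is not the paper's implementation (the paper uses PHP); the function name `build_manifest`, its parameters, and the file names are hypothetical and chosen only for this example.

```python
def build_manifest(cached, network=("*",), fallback=None, version="1"):
    """Return the text of an HTML5 application cache manifest.

    cached   -- resources to store for offline use (CACHE: section)
    network  -- resources that always require a connection (NETWORK: section)
    fallback -- optional mapping of online URL -> offline substitute
    version  -- changing this comment line makes browsers re-fetch the cache
    """
    # The first line of a manifest must be exactly "CACHE MANIFEST".
    lines = ["CACHE MANIFEST", f"# version {version}", "", "CACHE:"]
    lines.extend(cached)
    lines += ["", "NETWORK:"]
    lines.extend(network)
    if fallback:
        lines += ["", "FALLBACK:"]
        lines.extend(f"{src} {dst}" for src, dst in fallback.items())
    return "\n".join(lines) + "\n"


if __name__ == "__main__":
    # Hypothetical file names for an excerpted, offline-readable site.
    print(build_manifest(["index.html", "css/site.css", "data/articles.html"],
                         version="2014-02"))
```

A page opts in by naming the manifest in its root element, for example `<html manifest="offline.appcache">` (file name assumed here), and the server delivers the file with the MIME type `text/cache-manifest`.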
Source
Information Technology (《信息技术》)
2014, No. 2, pp. 163-166 (4 pages)
Keywords
information excerpting (summary of information)
offline Web applications
data extraction