期刊文献+

基于熵减和马尔科夫链的中小企业客户数据治理技术

Customer Data Governance Technology of Small and Medium Enterprises Based on Entropy Decrease and Markov Chain
在线阅读 下载PDF
导出
摘要 针对传统中小企业客户数据呈现杂乱无序状态且缺乏标准化的现状,提出一种创新的数据治理技术。该技术整合多源异构数据,该技术汇聚多源异构数据,融合光学字符识别(Optical Character Recognition,OCR)等多种方法,构建标准化的中小企业基础信息数据湖,从源头提升数据质量。引入“熵减”理念,利用智能算法对数据质量进行量化评估,能够及时定位并解决数据质量问题。同时,搭建时序数据库并构建基于熵减的马尔科夫链模型,以此预测未来数据质量趋势,精准治理潜在问题区域。该技术不仅实现了数据价值的最大化,还显著降低了治理成本,提高了数据治理的效率与准确性,为企业降本增效提供了有力支撑。 Aiming at the current situation that the customer data of traditional small and medium enterprises is disorderly and lacks standardization,an innovative data governance technology is proposed.This technology integrates multi-source heterogeneous data,fuses Optical Character Recognition(OCR)and other methods,and constructs a standardized basic information data lake of small and medium enterprises,to improve data quality from the source.By introducing the concept of“entropy decrease”and using intelligent algorithms to quantitatively evaluate data quality,data quality problems can be located and solved in time.At the same time,a time series database is built and a Markov Chain model based on entropy decrease is constructed to predict future data quality trends and accurately govern potential problem areas.This technology not only maximizes the value of data,but also significantly reduces the cost of governance.It improves the efficiency and accuracy of data governance and provides strong support for enterprises to decrease costs and increase efficiency.
作者 刘敏 黄倚霄 陈智扬 张湛梅 LIU Min;HUANG Yixiao;CHEN Zhiyang;ZHANG Zhanmei(China Mobile Communications Group Guangdong Co.,Ltd.,Guangzhou 510623,China)
出处 《现代信息科技》 2025年第3期140-145,152,共7页 Modern Information Technology
关键词 熵减 数据治理 马尔科夫链 中小企数据湖 时序数据库 entropy decrease data governance Markov Chain data lake of small and medium enterprises time series database
  • 相关文献

参考文献10

二级参考文献64

共引文献251

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部