期刊文献+
共找到492篇文章
< 1 2 25 >
每页显示 20 50 100
Hybrid Scalable Researcher Recommendation System Using Azure Data Lake Analytics
1
作者 Dinesh Kalla Nathan Smith +1 位作者 Fnu Samaah Kiran Polimetla 《Journal of Data Analysis and Information Processing》 2024年第1期76-88,共13页
This research paper has provided the methodology and design for implementing the hybrid author recommender system using Azure Data Lake Analytics and Power BI. It offers a recommendation for the top 1000 Authors of co... This research paper has provided the methodology and design for implementing the hybrid author recommender system using Azure Data Lake Analytics and Power BI. It offers a recommendation for the top 1000 Authors of computer science in different fields of study. The technique used in this paper is handling the inadequate Information for citation;it removes the problem of cold start, which is encountered by very many other recommender systems. In this paper, abstracts, the titles, and the Microsoft academic graphs have been used in coming up with the recommendation list for every document, which is used to combine the content-based approaches and the co-citations. Prioritization and the blending of every technique have been allowed by the tuning system parameters, allowing for the authority in results of recommendation versus the paper novelty. In the end, we do observe that there is a direct correlation between the similarity rankings that have been produced by the system and the scores of the participant. The results coming from the associated scrips of analysis and the user survey have been made available through the recommendation system. Managers must gain the required expertise to fully utilize the benefits that come with business intelligence systems [1]. Data mining has become an important tool for managers that provides insights about their daily operations and leverage the information provided by decision support systems to improve customer relationships [2]. Additionally, managers require business intelligence systems that can rank the output in the order of priority. Ranking algorithm can replace the traditional data mining algorithms that will be discussed in-depth in the literature review [3]. 展开更多
关键词 Azure data lake U-SQL Author Recommendation System Power BI Microsoft Academic Big data Word Embedding
在线阅读 下载PDF
Optimizing Multimodal Data Queries in Data Lakes
2
作者 Runqun Xiong Shiyuan Zhao +1 位作者 Ciyuan Chen Zhuqing Xu 《Tsinghua Science and Technology》 2025年第6期2625-2637,共13页
This paper addresses the challenge of efficiently querying multimodal related data in data lakes,a large-scale storage and management system that supports heterogeneous data formats,including structured,semi-structure... This paper addresses the challenge of efficiently querying multimodal related data in data lakes,a large-scale storage and management system that supports heterogeneous data formats,including structured,semi-structured,and unstructured data.Multimodal data queries are crucial because they enable seamless retrieval of related data across modalities,such as tables,images,and text,which has applications in fields like e-commerce,healthcare,and education.However,existing methods primarily focus on single-modality queries,such as joinable or unionable table discovery,and struggle to handle the heterogeneity and lack of metadata in data lakes while balancing accuracy and efficiency.To tackle these challenges,we propose a Multimodal data Query mechanism for Data Lakes(MQDL),which employs a modality-adaptive indexing mechanism raleted and contrastive learning based embeddings to unify representations across modalities.Additionally,we introduce product quantization to optimize candidate verification during queries,reducing computational overhead while maintaining precision.We evaluate MQDL using a table-image dataset across multiple business scenarios,measuring metrics such as precision,recall,and F1-score.Results show that MQDL achieves an accuracy rate of approximately 90%,while demonstrating strong scalability and reduced query response time compared to traditional methods.These findings highlight MQDL's potential to enhance multimodal data retrieval in complex data lake environments. 展开更多
关键词 multimodal data query data lake contrastive learning related data query
原文传递
Data Lakes as a Centralized Integration Layer in Enterprise Environments:Approaches and Benefits for Scalability and Performance
3
作者 Carlos Diego Cavalcanti Pereira 《Journal of Data Analysis and Information Processing》 2025年第4期467-486,共20页
Enterprise application integration encounters substantial hurdles,particularly in intricate contexts that require elevated scalability and speed.Transactional applications directly accessed by many systems frequently ... Enterprise application integration encounters substantial hurdles,particularly in intricate contexts that require elevated scalability and speed.Transactional applications directly accessed by many systems frequently overload databases,undermining process efficiency.This paper examines the utilization of data lakes-historically used for data analysis-as a centralized integration layer that accommodates various temporalities and consumption modalities.The sug-gested method diminishes system interdependence and the burden on transac-tional databases,enhancing scalability and data governance in both mono-lithic and distributed frameworks. 展开更多
关键词 Application Integration data lakes data Governance
在线阅读 下载PDF
Realising Data-Centric Scientific Workflows with Provenance-Capturing on Data Lakes
4
作者 Hendrik Noltet Philipp Wieder 《Data Intelligence》 EI 2022年第2期426-438,共13页
Since their introduction by James Dixon in 2010,data lakes get more and more attention,driven by the promise of high reusability of the stored data due to the schema-on-read semantics.Building on this idea,several add... Since their introduction by James Dixon in 2010,data lakes get more and more attention,driven by the promise of high reusability of the stored data due to the schema-on-read semantics.Building on this idea,several additional requirements were discussed in literature to improve the general usability of the concept,like a central metadata catalog including all provenance information,an overarching data governance,or the integration with(high-performance)processing capabilities.Although the necessity for a logical and a physical organisation of data lakes in order to meet those requirements is widely recognized,no concrete guidelines are yet provided.The most common architecture implementing this conceptual organisation is the zone architecture,where data is assigned to a certain zone depending on the degree of processing.This paper discusses how FAIR Digital Objects can be used in a novel approach to organize a data lake based on data types instead of zones,how they can be used to abstract the physical implementation,and how they empower generic and portable processing capabilities based on a provenance-based approach. 展开更多
关键词 data lake PROVENANCE WORKFLOWS FAIRDigital Objects CWFR
原文传递
Wetland vegetation biomass estimation and mapping from Landsat ETM data: a case study of Poyang Lake 被引量:3
5
作者 LI Ren-dong1, 2, LIU Ji-yuan2 (1. Institute of Geodesy and Geophysics, CAS, Wuhan 430077, China 2. Institute of Geographic Sciences and Natural Resources Research, CAS, Beijing 100101 China) 《Journal of Geographical Sciences》 SCIE CSCD 2002年第1期35-41,共7页
Poyang Lake is the largest freshwater lake in China. This paper conducted a digital and rapid investigation of the lake’s wetland vegetation biomass using Landsat ETM data acquired on April 16, 2000. First, utilizing... Poyang Lake is the largest freshwater lake in China. This paper conducted a digital and rapid investigation of the lake’s wetland vegetation biomass using Landsat ETM data acquired on April 16, 2000. First, utilizing the false color composite derived from the ETM data as one of the main references, the authors designed a reasonable sampling route for field measurement of the biomass, and carried it out on April 18–28, 2000. Then after both the sampling data and the ETM data were geometrically corrected to an equal-area projection of Albers, linear relationships among the sampling data and some transformed data derived from the ETM data and the ETM 4 were calculated. The results show that the sampling data is best relative to the band 4 data with a high correlation coefficient of 0.86, followed by the DVI and NDVI data with 0.83 and 0.80 respectively. Therefore, a linear regression model, which was based on the field data and band 4 data, was used to estimate the total biomass of entire Poyang Lake, and then the map of the biomass distribution was compiled. 展开更多
关键词 Poyang lake BIOMASS wetland vegetation Landsat ETM data
在线阅读 下载PDF
Mapping of moraine dammed glacial lakes and assessment of their areal changes in the central and eastern Himalayas using satellite data 被引量:3
6
作者 Sazeda BEGAM Dhrubajyoti SEN 《Journal of Mountain Science》 SCIE CSCD 2019年第1期77-94,共18页
The relatively rapid recession of glaciers in the Himalayas and formation of moraine dammed glacial lakes(MDGLs) in the recent past have increased the risk of glacier lake outburst floods(GLOF) in the countries of Nep... The relatively rapid recession of glaciers in the Himalayas and formation of moraine dammed glacial lakes(MDGLs) in the recent past have increased the risk of glacier lake outburst floods(GLOF) in the countries of Nepal and Bhutan and in the mountainous territory of Sikkim in India. As a product of climate change and global warming, such a risk has not only raised the level of threats to the habitation and infrastructure of the region, but has also contributed to the worsening of the balance of the unique ecosystem that exists in this domain that sustains several of the highest mountain peaks of the world. This study attempts to present an up to date mapping of the MDGLs in the central and eastern Himalayan regions using remote sensing data, with an objective to analyse their surface area variations with time from 1990 through 2015, disaggregated over six episodes. The study also includes the evaluation for susceptibility of MDGLs to GLOF with the least criteria decision analysis(LCDA). Forty two major MDGLs, each having a lake surface area greater than 0.2 km2, that were identified in the Himalayan ranges of Nepal, Bhutan, and Sikkim, have been categorized according to their surface area expansion rates in space and time. The lakes have been identified as located within the elevation range of 3800 m and6800 m above mean sea level(a msl). With a total surface area of 37.9 km2, these MDGLs as a whole were observed to have expanded by an astonishing 43.6% in area over the 25 year period of this study. A factor is introduced to numerically sort the lakes in terms of their relative yearly expansion rates, based on their interpretation of their surface area extents from satellite imageries. Verification of predicted GLOF events in the past using this factor with the limited field data as reported in literature indicates that the present analysis may be considered a sufficiently reliable and rapid technique for assessing the potential bursting susceptibility of the MDGLs. The analysis also indicates that, as of now, there are eight MDGLs in the region which appear to be in highly vulnerable states and have high chances in causing potential GLOF events anytime in the recent future. 展开更多
关键词 GLACIER RETREAT lakeS MAPPING MORAINE dammed GLACIAL lake(MDGL) Surface area change of lakeS Landsat imagery data Least criteria decision analysis(LCDA)
原文传递
Study on the Applicability of ERA5 Reanalysis Data at Lake Taihu
7
作者 Bo Wang Dongmei Chen Meiqi Song 《Journal of Geoscience and Environment Protection》 2022年第12期1-16,共16页
Lakes are an important component of the earth climate system. They play an important role in the study of basin weather forecasting, air quality forecasting, and regional climate research. The accuracy of driving vari... Lakes are an important component of the earth climate system. They play an important role in the study of basin weather forecasting, air quality forecasting, and regional climate research. The accuracy of driving variables is the basic premise to ensure the rationality of lake mode simulation. Based on the in-situ observations at Bifenggang site of the Lake Taihu Eddy flux Network from 2012 to 2017, this paper investigated temporal variations in temperature, relative humidity, wind speed, radiation components at different time scales (hourly, seasonal and interannual). ERA5 reanalysis data were compared with in-situ observation to quantify the error and evaluate the performance of reanalysis data. The results show that: 1) On the hourly scale, the ERA5 reanalysis data described air temperature, and downward long-wave radiation more accurately. 2) On the seasonal variation scale, the ERA5 reanalysis data described air temperature, and downward long-wave radiation more accurately. However, the descriptions of wind speed, relative humidity and downward short-wave have large deviations. 3) On the interannual scale, the ERA5 reanalysis data show a good performance for temperature, followed by downward longwave radiation, downward shortwave radiation and relative humidity. 展开更多
关键词 lake Taihu ERA5 Reanalysis data Meteorological Variables COMPARISON APPLICABILITY
在线阅读 下载PDF
塔里木油田数据生态建设实践
8
作者 尚新璐 银宏亮 +9 位作者 钱芸芸 季伟哲 陶嘉伟 刘滨源 钟泽义 冯宇 李晓林 那日松 唐超 阿丽娅·阿不都热依木 《现代信息科技》 2026年第3期133-139,共7页
塔里木油田自石油会战以来积累了海量勘探开发与生产经营数据,且数据量逐年呈爆炸性增长。随着新一代信息技术快速发展,数据已成为企业核心资产,传统建设模式导致的系统壁垒、数据共享困难等问题,制约了数据价值转化,因此油田数据生态... 塔里木油田自石油会战以来积累了海量勘探开发与生产经营数据,且数据量逐年呈爆炸性增长。随着新一代信息技术快速发展,数据已成为企业核心资产,传统建设模式导致的系统壁垒、数据共享困难等问题,制约了数据价值转化,因此油田数据生态建设势在必行。该研究立足中国石油“梦想云连环湖”战略,结合企业数字化现状,以构建高效数据生态、助力数智化转型为目标,搭建了以区域数据湖为核心的“六全”数据生态体系。通过统一数据模型与存储标准、搭建数据湖基础环境,打破专业与部门壁垒,同步建立生态运行机制,开展全流程数据治理与定制化数据服务。该体系有效实现了数据互联互通、各领域“金”数据集中管理及数据质量持续提升,达成数据资产价值最大化与“六全”生态良性运转,为油气企业从“非数字原生”向数智化转型提供了实践支撑。 展开更多
关键词 数字化转型 数据生态体系 数据湖 数据治理 数据服务
在线阅读 下载PDF
鄱阳湖洪泛系统水文干旱对水文连通性的影响
9
作者 岳恩馨 赵林 +4 位作者 钱品瑜 刘意 李相虎 赵华琼 叶许春 《湖泊科学》 北大核心 2026年第1期353-366,共14页
洪泛湖泊水文干旱及其对水文连通性的影响对当地水资源管理和湿地生态保护至关重要。基于ESTARFM模型重构的鄱阳湖区2000-2023年高时空分辨率淹水数据集,采用标准化淹水面积指数定量表征鄱阳湖洪泛系统水文干旱,并结合地统计水文连通性... 洪泛湖泊水文干旱及其对水文连通性的影响对当地水资源管理和湿地生态保护至关重要。基于ESTARFM模型重构的鄱阳湖区2000-2023年高时空分辨率淹水数据集,采用标准化淹水面积指数定量表征鄱阳湖洪泛系统水文干旱,并结合地统计水文连通性函数,分析了鄱阳湖区水文干旱和水文连通性的演变特征;在此基础上,采用STL时间序列分解及多元线性函数拟合法,明确了水文连通性对水文干旱变化的响应规律。研究表明,鄱阳湖区水文干旱的年内和年际变化较为复杂,干旱事件的发生频率较高,且整体上呈现旱情加重趋势。鄱阳湖区南北方向水文连通性强于东西方向水文连通性,近年来湖区水文连通性呈现出波动下降趋势,与湖区水文干旱强度变化有关。定量分析发现,随着水文干旱程度的增强,鄱阳湖区水文连通性呈现下降趋势。其中,东西方向上,鄱阳湖区轻旱、中旱、重旱、极旱较无旱情境下年均水文连通性分别降低45.2%、50.0%、54.6%和70.7%;南北方向上,年均水文连通性分别降低32.1%、35.6%、39.0%和50.7%。鄱阳湖区水文干旱引起的水文连通性变化将对湿地植被的生长分布产生重要影响,研究结果可为极端水情下的湖泊生态系统管理实践提供依据。 展开更多
关键词 洪泛系统 水文干旱 水文连通性 数据融合 鄱阳湖
在线阅读 下载PDF
A Systematic Review of Automated Classification for Simple and Complex Query SQL on NoSQL Database
10
作者 Nurhadi Rabiah Abdul Kadir +1 位作者 Ely Salwana Mat Surin Mahidur R.Sarker 《Computer Systems Science & Engineering》 2024年第6期1405-1435,共31页
A data lake(DL),abbreviated as DL,denotes a vast reservoir or repository of data.It accumulates substantial volumes of data and employs advanced analytics to correlate data from diverse origins containing various form... A data lake(DL),abbreviated as DL,denotes a vast reservoir or repository of data.It accumulates substantial volumes of data and employs advanced analytics to correlate data from diverse origins containing various forms of semi-structured,structured,and unstructured information.These systems use a flat architecture and run different types of data analytics.NoSQL databases are nontabular and store data in a different manner than the relational table.NoSQL databases come in various forms,including key-value pairs,documents,wide columns,and graphs,each based on its data model.They offer simpler scalability and generally outperform traditional relational databases.While NoSQL databases can store diverse data types,they lack full support for atomicity,consistency,isolation,and durability features found in relational databases.Consequently,employing machine learning approaches becomes necessary to categorize complex structured query language(SQL)queries.Results indicate that the most frequently used automatic classification technique in processing SQL queries on NoSQL databases is machine learning-based classification.Overall,this study provides an overview of the automatic classification techniques used in processing SQL queries on NoSQL databases.Understanding these techniques can aid in the development of effective and efficient NoSQL database applications. 展开更多
关键词 NoSQL database data lake machine learning ACID complex query smart city
在线阅读 下载PDF
直升机大数据治理系统架构设计研究
11
作者 鲁兴 孟浩 +1 位作者 吕少杰 张京亮 《网络安全与数据治理》 2026年第2期60-64,共5页
高质量数据是模型算法迭代更新的先决条件,是实现直升机高质量发展的基础。直升机研制和使用过程中产生了大量技术与运用数据,但存在“有数据、无治理”、数据缺乏统一管理、价值挖掘不足等问题。结合直升机领域机械化、信息化、智能化... 高质量数据是模型算法迭代更新的先决条件,是实现直升机高质量发展的基础。直升机研制和使用过程中产生了大量技术与运用数据,但存在“有数据、无治理”、数据缺乏统一管理、价值挖掘不足等问题。结合直升机领域机械化、信息化、智能化转型发展需求,对直升机大数据治理体系进行了探讨,提出了基于云边端协同的直升机大数据治理架构,并详细分析了直升机湖仓一体数据云总体建设思路,为构建直升机领域数据治理体系提供技术支撑,奠定了直升机“+智能”的数据高质量基础。 展开更多
关键词 直升机 大数据治理系统 云边端协同 湖仓一体数据云
在线阅读 下载PDF
基于Flink的流批一体数据集成平台构建
12
作者 金鑫会 《计算机应用文摘》 2026年第2期122-124,共3页
以医院数据中台建设为背景,探讨基于Apache Flink构建流批一体数据集成平台的技术方案与实施路径。针对传统医疗数据架构中存在的实时性差、“数据孤岛”、口径不一致等问题,提出基于Flink SQL与数据湖(Iceberg)的流批一体架构,实现电... 以医院数据中台建设为背景,探讨基于Apache Flink构建流批一体数据集成平台的技术方案与实施路径。针对传统医疗数据架构中存在的实时性差、“数据孤岛”、口径不一致等问题,提出基于Flink SQL与数据湖(Iceberg)的流批一体架构,实现电子病历、影像系统、实验室数据等异构数据源的实时整合与批量分析。 展开更多
关键词 Flink 医疗数据 数据湖 电子病历 影像系统
在线阅读 下载PDF
云原生实时数据湖平台设计与实现
13
作者 郭炜 傅正斌 +5 位作者 王新东 张世富 丛新法 李张体 张亚威 李昌盛 《计算机技术与发展》 2026年第2期188-194,共7页
随着大数据和云计算技术的快速发展,实时流数据处理在业务决策和运营中的重要性日益凸显。传统实时计算引擎部署方式存在资源利用率低、运维成本高等问题。为此,业界开始将Flink计算引擎与云原生技术融合,充分利用容器技术的高可用、可... 随着大数据和云计算技术的快速发展,实时流数据处理在业务决策和运营中的重要性日益凸显。传统实时计算引擎部署方式存在资源利用率低、运维成本高等问题。为此,业界开始将Flink计算引擎与云原生技术融合,充分利用容器技术的高可用、可移植性等特点。但是Flink社区与云原生适配存在多个问题,例如Flink Jar任务不支持远程作业提交、部分模式不支持SQL作业、缺少Flink云原生作业统一管理工具等问题,在此基础上,该文提出一种基于云原生架构的实时数据湖平台设计方法。一方面,自研Flink集群模式,解决原生Flink云原生Application模式不支持SQL作业提交的问题,并支持远程Jar作业提交;另一方面,研发作业全生命周期管理和作业大状态优化等能力,实现实时任务的高效管理,降低用户干预率和运维负担,提升作业稳定性;此外,研发弹性调度功能提升实时作业的资源利用率。云原生实时数据湖平台为实时流处理场景提供了高效、稳定的解决方案,实验结果表明通过云原生实时数据湖平台提交的实时任务相比Flink on Yarn的传统架构资源利用率提升30%以上。 展开更多
关键词 实时数据湖 云原生技术 弹性调度 作业全生命周期管理 大状态优化
在线阅读 下载PDF
基于多源异构评论的扬州瘦西湖景区旅游形象演化特征与影响因素研究
14
作者 肖洁 董双茹 侯国林 《南京师大学报(自然科学版)》 北大核心 2026年第1期15-23,共9页
游客互联网评论数据涵盖了旅游吸引物、服务与情感体验等信息,是分析游客对景区形象认知的重要依据.本文基于“认知-情感-整体”三维模型,以扬州瘦西湖景区为研究案例地,采集多平台游客评论数据,运用LDA提炼形象主题,通过SnowNLP自然语... 游客互联网评论数据涵盖了旅游吸引物、服务与情感体验等信息,是分析游客对景区形象认知的重要依据.本文基于“认知-情感-整体”三维模型,以扬州瘦西湖景区为研究案例地,采集多平台游客评论数据,运用LDA提炼形象主题,通过SnowNLP自然语言处理库分析情感,并基于Gephi构建共现关系网络图谱,以此分析评论数据内含的认知形象、情感形象和整体形象的演化特征及动因.研究表明:(1)在认知形象上,景点观光、意境氛围始终是瘦西湖景区的核心部分,历史文化、活动和体验权重呈现显著上升趋势.(2)在情感形象上,瘦西湖景区以积极情感为主,2016—2025年情感形象呈小幅度波动态势,积极情感占比始终突出.(3)在整体形象上,瘦西湖景区呈现层次化结构,情感与认知的深度互动展示了瘦西湖景区古今交织、文旅融合的整体旅游形象特征.本文为利用大规模多源异构数据分析旅游目的地形象、推进景区运营管理与旅游营销工作提供了参考与借鉴. 展开更多
关键词 旅游形象 多源数据 LDA 情感分析 瘦西湖景区
在线阅读 下载PDF
河湖长制信息系统在秦淮河流域的应用效果分析
15
作者 缪璐 王宗辉 吴奇刚 《黄河水利职业技术大学学报》 2026年第1期28-33,共6页
针对传统河湖长制实施中存在的采集数据复杂多样、部门调度难及公众参与度低等问题,研发了一套融合多源异构数据的河湖长制信息系统。该系统通过嵌入数据标准化预处理、时空对齐、智能融合技术与河湖健康指数评价体系,可快速整合结构化... 针对传统河湖长制实施中存在的采集数据复杂多样、部门调度难及公众参与度低等问题,研发了一套融合多源异构数据的河湖长制信息系统。该系统通过嵌入数据标准化预处理、时空对齐、智能融合技术与河湖健康指数评价体系,可快速整合结构化数据、半结构化数据、非结构化数据等河湖管理数据,实现“河湖长制一张图”、动态巡查追踪、幸福河湖智能评价等核心功能。将其应用于南京市秦淮河流域,强化了三级协同与责任落实,提升了河湖管理的精准性和效率。 展开更多
关键词 河湖长制信息系统 多源异构数据 标准化处理 数据融合 时空对齐 河湖健康指数 秦淮河流域
在线阅读 下载PDF
基于淹水面积构建的鄱阳湖水文干旱定量表征及变化特征 被引量:1
16
作者 叶许春 岳恩馨 +1 位作者 李相虎 李传哲 《水科学进展》 北大核心 2025年第2期320-331,共12页
研究探讨洪泛湖泊淹水动态的时空异质性特征及其影响下的水文干旱定量表征,对提高洪泛湖泊生态系统管理实践和洪旱灾害防御能力具有重要意义。采用多源遥感数据和图像融合技术构建了鄱阳湖区2000—2023年间连续的高时空分辨率淹水面积数... 研究探讨洪泛湖泊淹水动态的时空异质性特征及其影响下的水文干旱定量表征,对提高洪泛湖泊生态系统管理实践和洪旱灾害防御能力具有重要意义。采用多源遥感数据和图像融合技术构建了鄱阳湖区2000—2023年间连续的高时空分辨率淹水面积数据,揭示了鄱阳湖淹水动态的时空异质性特征;借助标准化降水指数(SPI)原理提出了基于淹水面积的标准化水文干旱指数,并据此分析了鄱阳湖水文干旱的变化特征。结果表明:(1)鄱阳湖淹水动态时空异质性特征明显,主湖区和碟形湖区淹水面积的年内波动存在差异,在年际变化上呈现出相反趋势;(2)在定量反映鄱阳湖整体水文干旱时,基于站点的标准化水位指数存在较大的不确定性,相对而言,标准化淹水面积指数具有更好的科学性;(3)鄱阳湖水文干旱在时空分布上具有一定的复杂性,极端干旱主要发生在年内的4—10月,且更容易发生在主湖区。遥感大数据和图像融合技术结合可实现对大型洪泛湖泊水文干旱的精细定量研究,促进湖泊资源保护利用和洪旱灾害防治等工作的开展。 展开更多
关键词 水文干旱 淹水面积 洪泛湖泊 数据融合 遥感
在线阅读 下载PDF
基于熵减和马尔科夫链的中小企业客户数据治理技术
17
作者 刘敏 黄倚霄 +1 位作者 陈智扬 张湛梅 《现代信息科技》 2025年第3期140-145,152,共7页
针对传统中小企业客户数据呈现杂乱无序状态且缺乏标准化的现状,提出一种创新的数据治理技术。该技术整合多源异构数据,该技术汇聚多源异构数据,融合光学字符识别(Optical Character Recognition,OCR)等多种方法,构建标准化的中小企业... 针对传统中小企业客户数据呈现杂乱无序状态且缺乏标准化的现状,提出一种创新的数据治理技术。该技术整合多源异构数据,该技术汇聚多源异构数据,融合光学字符识别(Optical Character Recognition,OCR)等多种方法,构建标准化的中小企业基础信息数据湖,从源头提升数据质量。引入“熵减”理念,利用智能算法对数据质量进行量化评估,能够及时定位并解决数据质量问题。同时,搭建时序数据库并构建基于熵减的马尔科夫链模型,以此预测未来数据质量趋势,精准治理潜在问题区域。该技术不仅实现了数据价值的最大化,还显著降低了治理成本,提高了数据治理的效率与准确性,为企业降本增效提供了有力支撑。 展开更多
关键词 熵减 数据治理 马尔科夫链 中小企数据湖 时序数据库
在线阅读 下载PDF
Reconstruction of the Lacustrine Delta and Lake Level Change Analyzing Subsurface Geology and Geomorphology: Changes That Occurred during the Holocene in the Oguraike Reclaimed Land Area, Southern Kyoto, Japan
18
作者 Yuka Ito Fujio Masuda 《Open Journal of Geology》 2012年第3期203-211,共9页
A paleo-lacustrine delta in Kyoto, Japan was reconstructed on the basis of subsurface geological and geomorphological analysis, and paleo-lake level changes were estimated from the structure of the delta. These analys... A paleo-lacustrine delta in Kyoto, Japan was reconstructed on the basis of subsurface geological and geomorphological analysis, and paleo-lake level changes were estimated from the structure of the delta. These analyses of the study region, i.e., the Oguraike reclaimed land area provided evidence that Lake Ogura existed until about 60 years ago in southern Kyoto, Japan. The Uji river delta was provided influents to this lake until ca. 400 years ago, as is indicated by an upward-coarsening delta succession of about 2 - 4 m thickness. The lake level could also have changed in the past as a result of a change in altitude of the delta-front (foreset) and delta-plain boundary, which probably reflects the lake surface elevation. About 400 years ago, the Paleo-Uji River was separated from Ogura Lake because a levee was constructed along the river for building a castle and for constructing a waterway for transportation. As a result of this construction, the lake level that was more than 13.0 m in elevation was reduced by 1.5 m. In a more ancient times, the lake level experienced two stages—one in which the elevation was more than 13.5 m, and one in which the elevation was reduced to less than 10 m. These changes in the lake level are represented by a flat surface with four steps and small cliff of height ca. 0.5 - 2 m (relative elevation) separating them, recognized at the southern lakeshore. The observation of strata along with the archaeological survey in the north of Ogura Lake reveals that the lake level was decreased ca. 800 - 680 years ago. The lake level was at its highest during two periods, the first from before the 8th century to the end of the 8th century and the second from the 14th century to 400 years ago. 展开更多
关键词 lake Level lake Ogura DELTA LACUSTRINE Deposit BOREHOLE data
暂未订购
鄱阳湖洪泛系统水文连通性演变特征及对湿地植被生长的影响 被引量:4
19
作者 岳恩馨 刘意 +2 位作者 李相虎 赵华琼 叶许春 《生态学报》 北大核心 2025年第4期1938-1949,共12页
水文连通性是影响洪泛区水文过程及生态系统结构和功能的关键要素,对湿地植被的生长与分布尤为重要。基于ESTARFM(Enhanced Spatial and Temporal Adaptive Reflectance Fusion Model)模型重构了2000—2022年鄱阳湖洪泛系统高时空分辨... 水文连通性是影响洪泛区水文过程及生态系统结构和功能的关键要素,对湿地植被的生长与分布尤为重要。基于ESTARFM(Enhanced Spatial and Temporal Adaptive Reflectance Fusion Model)模型重构了2000—2022年鄱阳湖洪泛系统高时空分辨率水体指数NDWI(Normalized Difference Water Index)(8d,30m)和增强型植被指数EVI(Enhanced Vegetation Index)(16 d,30 m)数据集,并结合地统计水文连通性函数,系统研究了鄱阳湖区多维水文连通性的演变特征及其对湿地植被生长的影响规律。结果表明:1)鄱阳湖区不同水文期内东西和南北方向的水文连通性随距离增加均呈现高度动态变化特征,水文连通性函数曲线的变化速率为:枯水期>退水期>涨水期>丰水期;2)研究时段内,鄱阳湖南北水文连通性明显高于东西水文连通性,但就不同区域而言,主湖区和南矶保护区的主导连通性随时间发生变化,碟形湖区及鄱阳湖保护区以南北水文连通性为主导;不同区域东西水文连通性呈现较为一致的波动下降趋势,南北水文连通性演变趋势差异较大;3)鄱阳湖湿地植被EVI与水文连通性之间呈现显著的负相关关系,其中,主湖区植被EVI主要受东西水文连通性控制,碟形湖区及鄱阳湖保护区植被EVI受东西和南北水文连通性的共同作用,南矶保护区植被EVI更多的受南北水文连通性影响。加强变化环境下水文连通性对湿地生态系统“结构-过程-功能”的影响规律研究,对促进湖泊系统水资源管理和湿地生态保护至关重要。 展开更多
关键词 水文连通性 长时间序列 鄱阳湖 湿地植被
在线阅读 下载PDF
河湖底泥治理科技创新研究与产业化发展
20
作者 唐彤芝 吴志强 +3 位作者 徐锴 黄英豪 关云飞 陈海波 《水利水运工程学报》 北大核心 2025年第5期42-53,共12页
中国江河湖库淤积状况日趋恶化,严重危害工程安全与效益、水质与生态环境,底泥治理已成为国家江河战略的重要内容。当前迫切需要构建“精准探测-高效脱水-安全利用”的技术与产业链,加强多技术融合创新,推动底泥从“安全隐患、治理负担... 中国江河湖库淤积状况日趋恶化,严重危害工程安全与效益、水质与生态环境,底泥治理已成为国家江河战略的重要内容。当前迫切需要构建“精准探测-高效脱水-安全利用”的技术与产业链,加强多技术融合创新,推动底泥从“安全隐患、治理负担”向“战略资源、产业化新质生产力”转变。科学谋划和开展底泥探测、底泥快速固结硬化以及资源化利用技术的科研攻关,有利于提升国家水安全保障能力,推动新时期水利水运建设高质量发展,促进底泥资源产业化发展。总结提出了底泥原位探测与治理利用技术研发的总体任务,分析了需要解决的关键科学问题,构建了创新性与实用性突出、具有自主知识产权与技术特色的淤积智能探测技术与一体化装备、“天空地水工”大数据系统、分布式多功能快速排水固结技术与资源化利用设备技术框架。可为满足国家和行业公益性重大需求、推进底泥产业化形成新质生产力提供科技支撑。 展开更多
关键词 河湖库淤积 智能探测 大数据 固结硬化 产业化
在线阅读 下载PDF
上一页 1 2 25 下一页 到第
使用帮助 返回顶部