Funding: Supported by the National Natural Science Foundation of China (Project No. 51767018) and the Natural Science Foundation of Gansu Province (Project No. 23JRRA836).
Abstract: Current methods for cleaning wind power anomaly data have limited ability to identify abnormal data in large datasets and struggle to accommodate the considerable variability and complexity of wind farm data. Consequently, a method for cleaning wind power anomaly data by combining image processing with community detection algorithms (CWPAD-IPCDA) is proposed. To precisely identify and initially clean anomalous data, wind power curve (WPC) images are converted into graph structures, on which the Louvain community detection algorithm and graph-theoretic methods perform community detection and segmentation. A mathematical morphology operation (MMO) then determines the main part of the initially cleaned WPC images, which is mapped back to the normal wind power points to complete the final cleaning. To validate its feasibility, the CWPAD-IPCDA method was applied to datasets from 25 wind turbines (WTs) in two wind farms in northwest China. It was compared with the density-based spatial clustering of applications with noise (DBSCAN) algorithm, an improved isolation forest algorithm, and an image-based (IB) algorithm. The experimental results show that the CWPAD-IPCDA method outperforms the other three algorithms, achieving an average data cleaning rate approximately 7.23% higher. The mean sum of squared errors (SSE) of the cleaned datasets is approximately 6.887 lower than that of the other algorithms. Moreover, the mean overall accuracy, as measured by the F1-score, exceeds that of the other methods by approximately 10.49%. This indicates that the CWPAD-IPCDA method is more conducive to improving the accuracy and reliability of wind power curve modeling and wind farm power forecasting.
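The image-to-graph step described above can be illustrated with a rough sketch. This is not the CWPAD-IPCDA implementation: the grid size, the 8-neighbour connectivity, and all function names are assumptions made for illustration, and plain connected-component search is swapped in as a much simpler stand-in for both Louvain community detection and the morphology step. It shows only the general idea of rasterizing (wind speed, power) points, linking occupied pixels into a graph, keeping the dominant group, and mapping it back to the original points.

```python
from collections import deque

def rasterize(points, n_cols, n_rows, v_max, p_max):
    """Map (wind_speed, power) pairs to grid cells; return {cell: [point indices]}."""
    cells = {}
    for i, (v, p) in enumerate(points):
        col = max(0, min(int(v / v_max * n_cols), n_cols - 1))
        row = max(0, min(int(p / p_max * n_rows), n_rows - 1))
        cells.setdefault((col, row), []).append(i)
    return cells

def pixel_graph(cells):
    """Link every occupied cell to its occupied 8-neighbours (assumed connectivity)."""
    graph = {cell: [] for cell in cells}
    for col, row in cells:
        for dc in (-1, 0, 1):
            for dr in (-1, 0, 1):
                neighbour = (col + dc, row + dr)
                if (dc or dr) and neighbour in cells:
                    graph[(col, row)].append(neighbour)
    return graph

def components(graph):
    """Connected components by breadth-first search (stand-in for Louvain)."""
    seen, found = set(), []
    for start in graph:
        if start in seen:
            continue
        seen.add(start)
        queue, comp = deque([start]), []
        while queue:
            node = queue.popleft()
            comp.append(node)
            for neighbour in graph[node]:
                if neighbour not in seen:
                    seen.add(neighbour)
                    queue.append(neighbour)
        found.append(comp)
    return found

def keep_main_body(points, n_cols=20, n_rows=20, v_max=25.0, p_max=2000.0):
    """Return indices of points whose pixels lie in the most populated component."""
    cells = rasterize(points, n_cols, n_rows, v_max, p_max)
    comps = components(pixel_graph(cells))
    main = max(comps, key=lambda comp: sum(len(cells[c]) for c in comp))
    return sorted(i for c in main for i in cells[c])

# Invented toy points, not data from the study: five points along a rising
# curve and one isolated high-wind, low-power point.
toy_points = [(3.1, 150), (4.1, 250), (5.1, 350), (6.1, 450), (7.1, 550), (20.3, 50)]
print(keep_main_body(toy_points))  # prints [0, 1, 2, 3, 4]
```

In a fuller version one would presumably replace `components` with a real community detection routine and add morphological opening or closing on the binary image, but the abstract gives no parameters for either step.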
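Abstract: In the era of big data, the number of class labels in data has surged, posing a major challenge to existing classification methods. To address this problem, the similarity between classes is exploited to organize class labels hierarchically. However, existing inter-class similarity measures all rely on Euclidean distance, which cannot handle high-dimensional data effectively. Inspired by the Tanimoto coefficient, a new class similarity measure is therefore proposed, and the Louvain algorithm is used to build a tree structure (TaniVT). Taking the data distribution into account, a fuzzy rough hierarchical classifier based on intra-class divergence (IDFRHC) is then designed. The proposed method is compared with existing methods, and experiments verify its effectiveness.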
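For reference, the standard Tanimoto coefficient for real-valued vectors is T(a, b) = a·b / (|a|² + |b|² − a·b). The abstract does not say how the coefficient is applied to classes, so the sketch below assumes, purely for illustration, that each class is summarized by its mean feature vector; the function names are hypothetical and this is not the TaniVT measure itself.

```python
def tanimoto(a, b):
    """Tanimoto coefficient of two equal-length real vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    denom = sum(x * x for x in a) + sum(y * y for y in b) - dot
    if denom == 0:
        # Only happens when both vectors are all zeros.
        raise ValueError("Tanimoto coefficient is undefined for two zero vectors")
    return dot / denom

def class_centroid(samples):
    """Mean feature vector of one class (assumed class summary)."""
    n = len(samples)
    return [sum(column) / n for column in zip(*samples)]

def class_similarity(class_a, class_b):
    """Similarity of two classes as the Tanimoto coefficient of their centroids."""
    return tanimoto(class_centroid(class_a), class_centroid(class_b))
```

A pairwise matrix of such similarities could then serve as the edge weights of a class graph for community detection, which is one plausible reading of how a Louvain-built tree would use it.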
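Abstract: Objective: To analyze real-world combination regimens of Shengmai injection in the treatment of myocardial infarction, and to provide a reference for its better use in treating myocardial infarction. Methods: Medication information was extracted for 1208 patients with myocardial infarction treated with Shengmai injection from the hospital information system (HIS) databases of 24 tertiary grade-A hospitals across China. The Louvain algorithm was used to perform complex network analysis of the data, and combined Chinese and Western medicine regimens targeting the etiology, symptoms, and complications of myocardial infarction were identified. Results: For treatment targeting etiology, "isosorbide mononitrate + acetylsalicylic acid + Danshen injection + Shexiang Baoxin pill" can be combined to dilate the coronary arteries and prevent thrombosis, and "human insulin + acarbose + atorvastatin + benazepril" to lower blood glucose, lipids, and blood pressure. For treatment targeting symptoms, "morphine + diazepam + Danhong injection" can be combined to relieve pain, and "clopidogrel + omeprazole + pantoprazole + metoclopramide" to improve gastrointestinal symptoms. For treatment targeting complications, "digoxin + Shenfu injection + furosemide + spironolactone" can be combined to strengthen cardiac function and promote diuresis, relieving heart failure. Conclusion: Shengmai injection combined with other Chinese patent medicines and Western medicines in the treatment of myocardial infarction reflects the advantages of integrated Chinese and Western medicine and supports better individualized, comprehensive treatment of myocardial infarction in the future. Extensive clinical research is still needed to explore further combination regimens with significant efficacy.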
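The abstract does not describe how the medication network was constructed. A common way to prepare such data for community detection, assumed here for illustration only, is to count how often two drugs appear in the same patient's record and use the counts as edge weights. The function name and the placeholder drug labels below are hypothetical, and the records are invented, not taken from the study.

```python
from collections import Counter
from itertools import combinations

def co_prescription_edges(records):
    """Count, for every unordered drug pair, the records in which both appear."""
    edges = Counter()
    for drugs in records:
        # Deduplicate and sort so each pair has one canonical (a, b) key.
        for a, b in combinations(sorted(set(drugs)), 2):
            edges[(a, b)] += 1
    return edges

# Invented placeholder records, one list of drugs per patient.
toy_records = [
    ["drug_a", "drug_b", "drug_c"],
    ["drug_a", "drug_b"],
    ["drug_c", "drug_d"],
]
for pair, weight in sorted(co_prescription_edges(toy_records).items()):
    print(pair, weight)
```

The resulting weighted edge list is the kind of input a Louvain implementation typically accepts; densely co-prescribed drugs would then tend to fall into the same community.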