Funding: Supported by State Grid Shanghai Economic Research Institute under Grant No. SGTYHT/23-JS-004.
Abstract: Intelligent power grid engineering is challenging because of its complex structural hierarchy, with deeply nested associative relations between entities such as equipment, specifications, and business processes. Meanwhile, because the data are fragmented and contextual information is lost, generated reports are prone to problems such as content redundancy and omission of critical information, and fail to meet the demands of efficient decision-making and accurate management in modern power systems. To address these issues, this paper proposes a knowledge graph (KG)-enhanced framework for automatically generating electric power engineering reports. In the KG construction phase, a feature-fused entity recognition model, BERT-BiLSTM-CRF, is adopted to improve entity recognition accuracy in scenarios involving professional power engineering terminology, thereby addressing the ambiguous entity boundaries produced by traditional models; a BERT-attention relation extraction model is then proposed to extract complex hierarchical and implicit relations in power grid data more completely. In the report generation phase, an improved Transformer architecture is adopted to transform structured knowledge into natural language reports that comply with engineering specifications, addressing the semantic inconsistency caused by the loss of structural information in existing models. Validation on real-world projects shows that the proposed framework significantly outperforms existing baseline models in entity recognition, confirming its superiority and applicability in practical engineering.
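To make the entity-recognition step concrete: sequence taggers such as BERT-BiLSTM-CRF usually emit one label per token, and entity boundaries are recovered by grouping those labels. The sketch below assumes a BIO tagging scheme and uses hypothetical entity types (`EQUIP`, `SPEC`) and a hypothetical helper name `decode_bio`; the abstract does not state which scheme or label set the authors use.

```python
from typing import List, Optional, Tuple


def decode_bio(tokens: List[str], tags: List[str]) -> List[Tuple[str, str]]:
    """Group per-token BIO tags into (entity_text, entity_type) pairs.

    Assumes the BIO scheme; a stray I- tag that does not continue the
    current entity closes any open entity and is itself ignored.
    """
    entities: List[Tuple[str, str]] = []
    current_tokens: List[str] = []
    current_type: Optional[str] = None

    def flush() -> None:
        # Emit the entity collected so far, if any.
        if current_tokens and current_type is not None:
            entities.append((" ".join(current_tokens), current_type))

    for token, tag in zip(tokens, tags):
        if tag.startswith("B-"):
            # A B- tag always starts a new entity.
            flush()
            current_tokens = [token]
            current_type = tag[2:]
        elif tag.startswith("I-") and current_type == tag[2:]:
            # An I- tag of the same type extends the open entity.
            current_tokens.append(token)
        else:
            # "O" or an inconsistent I- tag closes the open entity.
            flush()
            current_tokens = []
            current_type = None
    flush()
    return entities


# Hypothetical example sentence and labels.
tokens = ["110kV", "main", "transformer", "meets", "GB", "standard"]
tags = ["B-EQUIP", "I-EQUIP", "I-EQUIP", "O", "B-SPEC", "I-SPEC"]
spans = decode_bio(tokens, tags)
```

In a pipeline of this kind, the CRF layer is what makes the tag sequence consistent (for example, discouraging an `I-` tag directly after `O`), which is one way such models reduce ambiguous entity boundaries.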
Funding: Supported by the National Natural Science Foundation of China (No. 82174276 and 82074580), the Key Research and Development Program of Jiangsu Province (No. BE2022712), China Postdoctoral Foundation (No. 2021M701674), Postdoctoral Research Program of Jiangsu Province (No. 2021K457C), and Qinglan Project of Jiangsu Universities 2021.
Abstract: Making Chinese medicine (CM) diagnosis intelligent is one of the hotspots in research on CM modernization. Traditional CM intelligent diagnosis models treat diagnosis as a classification problem; however, this approach struggles when the categories are excessive in number or similar to one another. With the development of natural language processing, text generation techniques have become increasingly mature. In this study, we aimed to establish a CM diagnosis generation model by recasting CM diagnosis as a text generation problem. With Transformer as the backbone network, a Bidirectional Long Short-Term Memory (BiLSTM) component was used to strengthen the learning of semantic context features. Meanwhile, the CM diagnosis generation model Knowledge Graph Enhanced Transformer (KGET) was established by introducing medical domain knowledge to enhance inferential capability. The KGET model was built on 566 CM case texts and compared with classic text generation models, including Long Short-Term Memory sequence-to-sequence (LSTM-seq2seq), Bidirectional and Auto-Regressive Transformer (BART), and Chinese Pre-trained Unbalanced Transformer (CPT), to analyze model performance. Finally, ablation experiments were performed to explore the influence of the optimized components on the KGET model. The Bilingual Evaluation Understudy (BLEU), Recall-Oriented Understudy for Gisting Evaluation 1 (ROUGE1), ROUGE2, and Edit distance results of the KGET model were 45.85, 73.93, 54.59, and 7.12, respectively. Compared with the LSTM-seq2seq, BART, and CPT models, the KGET model was higher in BLEU, ROUGE1, and ROUGE2 by 6.00–17.09, 1.65–9.39, and 0.51–17.62, respectively, and lower in Edit distance by 0.47–3.21. The ablation experiments revealed that introducing the BiLSTM model and prior knowledge significantly increased model performance. Additionally, manual assessment indicated that the CM diagnosis results of the KGET model were highly consistent with the practical diagnosis results. In conclusion, text generation technology can be effectively applied to CM diagnostic modeling, avoiding the poor diagnostic performance caused by excessive and similar categories in traditional CM diagnostic classification models. CM diagnostic text generation technology has broad application prospects.
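For readers unfamiliar with the metrics reported above, the following is a minimal sketch of two of them: Levenshtein edit distance (lower is better) and ROUGE-N (higher is better). It assumes character- or token-level sequences and shows the recall form of ROUGE-N; the abstract does not say which tokenization or ROUGE variant the authors used, and the function names here are illustrative only.

```python
from collections import Counter
from typing import Sequence


def edit_distance(a: Sequence[str], b: Sequence[str]) -> int:
    """Levenshtein distance: minimum insertions, deletions, substitutions."""
    prev = list(range(len(b) + 1))  # distances from the empty prefix of a
    for i, x in enumerate(a, start=1):
        curr = [i]
        for j, y in enumerate(b, start=1):
            cost = 0 if x == y else 1
            curr.append(min(prev[j] + 1,          # delete x
                            curr[j - 1] + 1,      # insert y
                            prev[j - 1] + cost))  # substitute or match
        prev = curr
    return prev[-1]


def rouge_n_recall(candidate: Sequence[str], reference: Sequence[str],
                   n: int = 1) -> float:
    """Fraction of reference n-grams that also occur in the candidate."""
    def ngrams(seq: Sequence[str]) -> Counter:
        return Counter(tuple(seq[i:i + n]) for i in range(len(seq) - n + 1))

    cand, ref = ngrams(candidate), ngrams(reference)
    total = sum(ref.values())
    if total == 0:
        return 0.0
    # Clip each match by how often the n-gram appears in the candidate.
    overlap = sum(min(count, cand[gram]) for gram, count in ref.items())
    return overlap / total


# Toy example on character sequences.
generated = list("abcd")
gold = list("abce")
distance = edit_distance(generated, gold)
r1 = rouge_n_recall(generated, gold, n=1)
r2 = rouge_n_recall(generated, gold, n=2)
```

Because a diagnosis text can be correct while differing in wording from the reference, overlap metrics like these are usually read alongside manual assessment, as the study does.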