Since Google introduced the concept of Knowledge Graphs(KGs)in 2012,their construction technologies have evolved into a comprehensive methodological framework encompassing knowledge acquisition,extraction,representati...Since Google introduced the concept of Knowledge Graphs(KGs)in 2012,their construction technologies have evolved into a comprehensive methodological framework encompassing knowledge acquisition,extraction,representation,modeling,fusion,computation,and storage.Within this framework,knowledge extraction,as the core component,directly determines KG quality.In military domains,traditional manual curation models face efficiency constraints due to data fragmentation,complex knowledge architectures,and confidentiality protocols.Meanwhile,crowdsourced ontology construction approaches from general domains prove non-transferable,while human-crafted ontologies struggle with generalization deficiencies.To address these challenges,this study proposes an OntologyAware LLM Methodology for Military Domain Knowledge Extraction(LLM-KE).This approach leverages the deep semantic comprehension capabilities of Large Language Models(LLMs)to simulate human experts’cognitive processes in crowdsourced ontology construction,enabling automated extraction of military textual knowledge.It concurrently enhances knowledge processing efficiency and improves KG completeness.Empirical analysis demonstrates that this method effectively resolves scalability and dynamic adaptation challenges in military KG construction,establishing a novel technological pathway for advancing military intelligence development.展开更多
Accurately identifying building distribution from remote sensing images with complex background information is challenging.The emergence of diffusion models has prompted the innovative idea of employing the reverse de...Accurately identifying building distribution from remote sensing images with complex background information is challenging.The emergence of diffusion models has prompted the innovative idea of employing the reverse denoising process to distill building distribution from these complex backgrounds.Building on this concept,we propose a novel framework,building extraction diffusion model(BEDiff),which meticulously refines the extraction of building footprints from remote sensing images in a stepwise fashion.Our approach begins with the design of booster guidance,a mechanism that extracts structural and semantic features from remote sensing images to serve as priors,thereby providing targeted guidance for the diffusion process.Additionally,we introduce a cross-feature fusion module(CFM)that bridges the semantic gap between different types of features,facilitating the integration of the attributes extracted by booster guidance into the diffusion process more effectively.Our proposed BEDiff marks the first application of diffusion models to the task of building extraction.Empirical evidence from extensive experiments on the Beijing building dataset demonstrates the superior performance of BEDiff,affirming its effectiveness and potential for enhancing the accuracy of building extraction in complex urban landscapes.展开更多
Processing police incident data in public security involves complex natural language processing(NLP)tasks,including information extraction.This data contains extensive entity information—such as people,locations,and ...Processing police incident data in public security involves complex natural language processing(NLP)tasks,including information extraction.This data contains extensive entity information—such as people,locations,and events—while also involving reasoning tasks like personnel classification,relationship judgment,and implicit inference.Moreover,utilizing models for extracting information from police incident data poses a significant challenge—data scarcity,which limits the effectiveness of traditional rule-based and machine-learning methods.To address these,we propose TIPS.In collaboration with public security experts,we used de-identified police incident data to create templates that enable large language models(LLMs)to populate data slots and generate simulated data,enhancing data density and diversity.We then designed schemas to efficiently manage complex extraction and reasoning tasks,constructing a high-quality dataset and fine-tuning multiple open-source LLMs.Experiments showed that the fine-tuned ChatGLM-4-9B model achieved an F1 score of 87.14%,nearly 30%higher than the base model,significantly reducing error rates.Manual corrections further improved performance by 9.39%.This study demonstrates that combining largescale pre-trained models with limited high-quality domain-specific data can greatly enhance information extraction in low-resource environments,offering a new approach for intelligent public security applications.展开更多
Detecting cyber attacks in networks connected to the Internet of Things(IoT)is of utmost importance because of the growing vulnerabilities in the smart environment.Conventional models,such as Naive Bayes and support v...Detecting cyber attacks in networks connected to the Internet of Things(IoT)is of utmost importance because of the growing vulnerabilities in the smart environment.Conventional models,such as Naive Bayes and support vector machine(SVM),as well as ensemble methods,such as Gradient Boosting and eXtreme gradient boosting(XGBoost),are often plagued by high computational costs,which makes it challenging for them to perform real-time detection.In this regard,we suggested an attack detection approach that integrates Visual Geometry Group 16(VGG16),Artificial Rabbits Optimizer(ARO),and Random Forest Model to increase detection accuracy and operational efficiency in Internet of Things(IoT)networks.In the suggested model,the extraction of features from malware pictures was accomplished with the help of VGG16.The prediction process is carried out by the random forest model using the extracted features from the VGG16.Additionally,ARO is used to improve the hyper-parameters of the random forest model of the random forest.With an accuracy of 96.36%,the suggested model outperforms the standard models in terms of accuracy,F1-score,precision,and recall.The comparative research highlights our strategy’s success,which improves performance while maintaining a lower computational cost.This method is ideal for real-time applications,but it is effective.展开更多
Accurate vector extraction from design drawings is required first to automatically create 3D models from pixel-level engineering design drawings. However, this task faces the challenges of complicated design shapes as...Accurate vector extraction from design drawings is required first to automatically create 3D models from pixel-level engineering design drawings. However, this task faces the challenges of complicated design shapes as well as cumbersome and cluttered annotations on drawings, which interfere with the vector extraction heavily. In this article, the transmission tower containing the most complex structure is taken as the research object, and a semantic segmentation network is constructed to first segment the shape masks from the pixel-level drawings. Preprocessing and postprocessing are also proposed to ensure the stability and accuracy of the shape mask segmentation. Then, based on the obtained shape masks, a vector extraction network guided by heatmaps is designed to extract structural vectors by fusing the features from node heatmap and skeleton heatmap, respectively. Compared with the state-of-the-art methods, experiment results illustrate that the proposed semantic segmentation method can effectively eliminate the interference of many elements on drawings to segment the shape masks effectively, meanwhile, the model trained by the proposed vector extraction network can accurately extract the vectors such as nodes and line connections, avoiding redundant vector detection. The proposed method lays a solid foundation for automatic 3D model reconstruction and contributes to technological advancements in relevant fields.展开更多
Background:Acquiring relevant information about procurement targets is fundamental to procuring medical devices.Although traditional Natural Language Processing(NLP)and Machine Learning(ML)methods have improved inform...Background:Acquiring relevant information about procurement targets is fundamental to procuring medical devices.Although traditional Natural Language Processing(NLP)and Machine Learning(ML)methods have improved information retrieval efficiency to a certain extent,they exhibit significant limitations in adaptability and accuracy when dealing with procurement documents characterized by diverse formats and a high degree of unstructured content.The emergence of Large Language Models(LLMs)offers new possibilities for efficient procurement information processing and extraction.Methods:This study collected procurement transaction documents from public procurement websites,and proposed a procurement Information Extraction(IE)method based on LLMs.Unlike traditional approaches,this study systematically explores the applicability of LLMs in both structured and unstructured entities in procurement documents,addressing the challenges posed by format variability and content complexity.Furthermore,an optimized prompt framework tailored for procurement document extraction tasks is developed to enhance the accuracy and robustness of IE.The aim is to process and extract key information from medical device procurement quickly and accurately,meeting stakeholders'demands for precision and timeliness in information retrieval.Results:Experimental results demonstrate that,compared to traditional methods,the proposed approach achieves an F1 Score of 0.9698,representing a 4.85%improvement over the best baseline model.Moreover,both recall and precision rates are close to 97%,significantly outperforming other models and exhibiting exceptional overall recognition capabilities.Notably,further analysis reveals that the proposed method consistently maintains high performance across both structured and unstructured entities in procurement documents while balancing recall and precision effectively,demonstrating its adaptability in handling varying document formats.The results of ablation experiments validate the effectiveness of the proposed prompting strategy.Conclusion:Additionally,this study explores the challenges and potential improvements of the proposed method in IE tasks and provides insights into its feasibility for real-world deployment and application directions,further clarifying its adaptability and value.This method not only exhibits significant advantages in medical device procurement but also holds promise for providing new approaches to information processing and decision support in various domains.展开更多
Boundary extraction of watershed is an important step in forest landscape research. The boundary of the upriver wa-tershed of the Hunhe River in the sub-alpine Qingyuan County of eastern Liaoning Province, China was e...Boundary extraction of watershed is an important step in forest landscape research. The boundary of the upriver wa-tershed of the Hunhe River in the sub-alpine Qingyuan County of eastern Liaoning Province, China was extracted by digital elevation modeling (DEM) data in ArcInfo8.1. Remote sensing image of the corresponding region was applied to help modify its copy according to Enhanced Thematic Mapper (ETM) image抯 profuse geomorphological structure information. Both the DEM-dependent boundary and modified copy were overlapped with county map and drainage network map to visually check the effects of result. Overlap of county map suggested a nice extraction of the boundary line since the two layers matched precisely, which indicated the DEM-dependent boundary by program was effective and precise. Further upload of drainage network showed discrepancies between the boundary and the drainage network. Altogether, there were three sections of the extraction result that needed to correct. Compared with this extraction boundary, the modified boundary had a better match to the drainage network as well as to the county map. Comprehensive analysis demonstrated that the program extraction has generally fine precision in position and excels the digitized result by hand. The errors of the DEM-dependant extraction are due to the fact that it is difficult for program to recognize sections of complex landform especially altered by human activities, but these errors are discernable and adjustable because the spatial resolution of ETM image is less than that of DEM. This study result proved that application of remote sensing information could help obtain better result when DEM method is used in extraction of watershed boundary.展开更多
A novel parameter extraction method with rational functions is presented for the 2-πequivalent circuit model of RF CMOS spiral inductors. The final S-parameters simulated by the circuit model closely match experiment...A novel parameter extraction method with rational functions is presented for the 2-πequivalent circuit model of RF CMOS spiral inductors. The final S-parameters simulated by the circuit model closely match experimental data. The extraction strategy is straightforward and can be easily implemented as a CAD tool to model spiral inductors. The resulting circuit models will be very useful for RF circuit designers.展开更多
It is well known that the human auditory system possesses remarkable capabilities to analyze and identify signals. Therefore, it would be significant to build an auditory model based on the mechanism of human auditory...It is well known that the human auditory system possesses remarkable capabilities to analyze and identify signals. Therefore, it would be significant to build an auditory model based on the mechanism of human auditory systems, which may improve the effects of mechanical signal analysis and enrich the methods of mechanical faults features extraction. However the existing methods are all based on explicit senses of mathematics or physics, and have some shortages on distinguishing different faults, stability, and suppressing the disturbance noise, etc. For the purpose of improving the performances of the work of feature extraction, an auditory model, early auditory(EA) model, is introduced for the first time. This auditory model transforms time domain signal into auditory spectrum via bandpass filtering, nonlinear compressing, and lateral inhibiting by simulating the principle of the human auditory system. The EA model is developed with the Gammatone filterbank as the basilar membrane. According to the characteristics of vibration signals, a method is proposed for determining the parameter of inner hair cells model of EA model. The performance of EA model is evaluated through experiments on four rotor faults, including misalignment, rotor-to-stator rubbing, oil film whirl, and pedestal looseness. The results show that the auditory spectrum, output of EA model, can effectively distinguish different faults with satisfactory stability and has the ability to suppress the disturbance noise. Then, it is feasible to apply auditory model, as a new method, to the feature extraction for mechanical faults diagnosis with effect.展开更多
This paper presents an accurate small-signal model for multi-gate GaAs pHEMTs in switching-mode.The extraction method for the proposed model is developed.A 2-gate switch structure is fabricated on a commercial 0.5μm ...This paper presents an accurate small-signal model for multi-gate GaAs pHEMTs in switching-mode.The extraction method for the proposed model is developed.A 2-gate switch structure is fabricated on a commercial 0.5μm AlGaAs/GaAs pHEMT technology to verify the proposed model.Excellent agreement has been obtained between the measured and simulated results over a wide frequency range.展开更多
Ultrasonically assisted extraction of isoflavones from the stem of Pueraria lobata (Willd.) Ohwi has been carried out with an ultrasonic extracting apparatus (20kHz, electrical power input to the transducer in 0-6...Ultrasonically assisted extraction of isoflavones from the stem of Pueraria lobata (Willd.) Ohwi has been carried out with an ultrasonic extracting apparatus (20kHz, electrical power input to the transducer in 0-650W). The influence of the electrical power input and extraction time on the'extraction yield is investigated in water, n-butanol, and 95% (by volume) and 50% (by volume) ethanol aqueous solution. The experimental results indicate that the yields of total isoflavones are higher in ultrasonically assisted extraction than those obtained from con-ventional extraction.Moreover,a mathematical model is proposed,by introducing the electrical power input to index the ultrsound intensity,to describe the behavior of ultrasonically assisted extraction.It is found that the model calcuations are in good agreement with the experimental data.展开更多
In China′s Loess Plateau area, gully head is the most active zone of a drainage system in gully areas. The differentiation of loess gully head follows geospatial patterns and reflects the process of the loess landfor...In China′s Loess Plateau area, gully head is the most active zone of a drainage system in gully areas. The differentiation of loess gully head follows geospatial patterns and reflects the process of the loess landform development and evolution of its drainage system to some extent. In this study, the geomorphic meaning, basic characteristics, morphological structure and the basic types of loess gully heads were systematically analysed. Then, the loess gully head′s conceptual model was established, and an extraction method based on Digital Elevation Model(DEM) for loess gully head features and elements was proposed. Through analysing the achieved statistics of loess gully head features, loess gully heads have apparently similar and different characteristics depending on the different loess landforms where they are found. The loess head characteristics reflect their growth period and evolution tendency to a certain degree, and they indirectly represent evolutionary mechanisms. In addition, the loess gully developmental stages and the evolutionary processes can be deduced by using loess gully head characteristics. This study is of great significance for development and improvement of the theoretical system for describing loess gully landforms.展开更多
The supercritical carbon dioxide extraction was applied to obtain essential oil from Pogostemon cablin in this work.Effect of extraction parameters including temperature,pressure,extraction time and particle size on e...The supercritical carbon dioxide extraction was applied to obtain essential oil from Pogostemon cablin in this work.Effect of extraction parameters including temperature,pressure,extraction time and particle size on extraction yield was investigated,and the response surface methodology with a Box–Behnken Design was used to achieve the optimized extraction conditions.The maximum yield of essential oil was 2.4356%under the conditions of extraction temperature 47°C,pressure 24.5 MPa and extraction time 119 min.Moreover,based on the Brunauer–Emmett–Teller theory of adsorption,a mathematical modeling was performed to correlate the measured data.The model shows a function relationship between extraction yield and time by a simple equation with three significantly adjustable parameters.These model parameters have been optimized through simulated annealing algorithm.The predicted data from the mathematical model show a good agreement with the experimental data of the different extraction parameters.展开更多
A sub circuit model for VDMOS is built according to its physical structure.Parameters and formulas describing the device are also derived from this model.Comparing to former results,this model avoids too many technic...A sub circuit model for VDMOS is built according to its physical structure.Parameters and formulas describing the device are also derived from this model.Comparing to former results,this model avoids too many technical parameters and simplify the sub circuit efficiently.As a result of numeric computation,this simple model with clear physical conception demonstrates excellent agreements between measured and modeled response (DC error within 5%,AC error within 10%).Such a model is now available for circuit simulation and parameter extraction.展开更多
In consideration of the online measurement of the component content in rare earth countercurrent extraction separation process, the soft sensor method based on hybrid modeling was proposed to measure the rare earth co...In consideration of the online measurement of the component content in rare earth countercurrent extraction separation process, the soft sensor method based on hybrid modeling was proposed to measure the rare earth component content. The hybrid models were composed of the extraction equilibrium calculation model and the Radial Basis Function (RBF) Neural Network (NN) error compensation model; the parameters of compensation model were optimized by the hierarchical genetic algorithms (HGA). In addition, application experiment research of this proposed method was carried out in the rare earth separation production process of a corporation. The result shows that this method is effective and can realize online measurement for the component content of rare earth in the countercurrent extraction.展开更多
Geological knowledge can provide support for knowledge discovery, knowledge inference and mineralization predictions of geological big data. Entity identification and relationship extraction from geological data descr...Geological knowledge can provide support for knowledge discovery, knowledge inference and mineralization predictions of geological big data. Entity identification and relationship extraction from geological data description text are the key links for constructing knowledge graphs. Given the lack of publicly annotated datasets in the geology domain, this paper illustrates the construction process of geological entity datasets, defines the types of entities and interconceptual relationships by using the geological entity concept system, and completes the construction of the geological corpus. To address the shortcomings of existing language models(such as Word2vec and Glove) that cannot solve polysemous words and have a poor ability to fuse contexts, we propose a geological named entity recognition and relationship extraction model jointly with Bidirectional Encoder Representation from Transformers(BERT) pretrained language model. To effectively represent the text features, we construct a BERT-bidirectional gated recurrent unit network(BiGRU)-conditional random field(CRF)-based architecture to extract the named entities and the BERT-BiGRU-Attention-based architecture to extract the entity relations. The results show that the F1-score of the BERT-BiGRU-CRF named entity recognition model is 0.91 and the F1-score of the BERT-BiGRU-Attention relationship extraction model is 0.84, which are significant performance improvements when compared to classic language models(e.g., word2vec and Embedding from Language Models(ELMo)).展开更多
A model GM (grey model) (1,1) for forecasting the rate of copper extraction during the bioleaching of primary sulphide ore was established on the basis of the mathematical theory and the modeling process of grey s...A model GM (grey model) (1,1) for forecasting the rate of copper extraction during the bioleaching of primary sulphide ore was established on the basis of the mathematical theory and the modeling process of grey system theory. It was used for forecasting the rate of copper extraction from the primary sulfide ore during a laboratory microbial column leaching experiment. The precision of the forecasted results were examined and modified via "posterior variance examination". The results show that the forecasted values coincide with the experimental values. GM (1,1) model has high forecast accuracy; and it is suitable for simulation control and prediction analysis of the original data series of the processes that have grey characteristics, such as mining, metallurgical and mineral processing, etc. The leaching rate of such copper sulphide ore is low. The grey forecasting result indicates that the rate of copper extraction is approximately 20% even after leaching for six months.展开更多
Microdissection testicular sperm extraction(micro-TESE)is widely used to treat nonobstructive azoospermia.However,a good prediction model is required to anticipate a successful sperm retrieval rate before performing m...Microdissection testicular sperm extraction(micro-TESE)is widely used to treat nonobstructive azoospermia.However,a good prediction model is required to anticipate a successful sperm retrieval rate before performing micro-TESE.This retrospective study analyzed the clinical records of 200 nonobstructive azoospermia patients between January 2021 and December 2021.The backward method was used to perform binary logistic regression analysis and identify factors that predicted a successful micro-TESE sperm retrieval.The prediction model was constructed using acquired regression coefficients,and its predictive performance was assessed using the receiver operating characteristic curve.In all,67 patients(sperm retrieval rate:33.5%)underwent successful micro-TESE.Follicle-stimulating hormone,anti-Miillerian hormone,and inhibin B levels varied significantly between patients who underwent successful and unsuccessful micro-TESE.Binary logistic regression analysis yielded the following six predictors:anti-Mullerian hormone(odds ratio[OR]=0.902,95%confidence interval[Cl]:0.821-0.990),inhibin B(OR=1.012,95%Cl:1.001-1.024),Klinefelter’s syndrome(OR=0.022,95%Cl:0.002-0.243),Y chromosome microdeletion(OR=0.050,95%Cl:0.005-0.504),cryptorchidism with orchiopexy(OR=0.085,95%Cl:0.008-0.929),and idiopathic nonobstructive azoospermia(OR=0.031,95%Cl:0.003-0.277).The prediction model had an area under the curve of 0.720(95%Cl:0.645-0.794),sensitivity of 65.7%,specificity of 72.2%,Youden index of 0.379,and cut-off value of 0.305 overall,indicating good predictive value and accuracy.This model can assist clinicians and nonobstructive azoospermia patients in decision-making and avoiding negative micro-TESE results.展开更多
In order to accurately describe the dynamic characteristics of flight vehicles through aerodynamic modeling, an adaptive wavelet neural network (AWNN) aerodynamic modeling method is proposed, based on subset kernel pr...In order to accurately describe the dynamic characteristics of flight vehicles through aerodynamic modeling, an adaptive wavelet neural network (AWNN) aerodynamic modeling method is proposed, based on subset kernel principal components analysis (SKPCA) feature extraction. Firstly, by fuzzy C-means clustering, some samples are selected from the training sample set to constitute a sample subset. Then, the obtained samples subset is used to execute SKPCA for extracting basic features of the training samples. Finally, using the extracted basic features, the AWNN aerodynamic model is established. The experimental results show that, in 50 times repetitive modeling, the modeling ability of the method proposed is better than that of other six methods. It only needs about half the modeling time of KPCA-AWNN under a close prediction accuracy, and can easily determine the model parameters. This enables it to be effective and feasible to construct the aerodynamic modeling for flight vehicles.展开更多
The flow field of liquid phase (water) of agitated extraction columns is simulated with the help of computational fluid dynamics (CFD). Four kinds of Reynolds-averaged turbulence models, i.e. the standard k-ε model, ...The flow field of liquid phase (water) of agitated extraction columns is simulated with the help of computational fluid dynamics (CFD). Four kinds of Reynolds-averaged turbulence models, i.e. the standard k-ε model, the RNG (renormalization group) k-s model, the realizable k-ε model and the Reynolds stress model, are compared in detail in order to judge which is the best model in terms of the accuracy, less CPU time and memory required. The performance of the realizable k-s model is obviously improved by reducing the model constant from C2 = 1.90 to C2 = 1.61. It is concluded that the improved realizable k-e model is the optimal model.展开更多
文摘Since Google introduced the concept of Knowledge Graphs(KGs)in 2012,their construction technologies have evolved into a comprehensive methodological framework encompassing knowledge acquisition,extraction,representation,modeling,fusion,computation,and storage.Within this framework,knowledge extraction,as the core component,directly determines KG quality.In military domains,traditional manual curation models face efficiency constraints due to data fragmentation,complex knowledge architectures,and confidentiality protocols.Meanwhile,crowdsourced ontology construction approaches from general domains prove non-transferable,while human-crafted ontologies struggle with generalization deficiencies.To address these challenges,this study proposes an OntologyAware LLM Methodology for Military Domain Knowledge Extraction(LLM-KE).This approach leverages the deep semantic comprehension capabilities of Large Language Models(LLMs)to simulate human experts’cognitive processes in crowdsourced ontology construction,enabling automated extraction of military textual knowledge.It concurrently enhances knowledge processing efficiency and improves KG completeness.Empirical analysis demonstrates that this method effectively resolves scalability and dynamic adaptation challenges in military KG construction,establishing a novel technological pathway for advancing military intelligence development.
基金supported by the National Natural Science Foundation of China(Nos.61906168,62202429 and 62272267)the Zhejiang Provincial Natural Science Foundation of China(No.LY23F020023)the Construction of Hubei Provincial Key Laboratory for Intelligent Visual Monitoring of Hydropower Projects(No.2022SDSJ01)。
文摘Accurately identifying building distribution from remote sensing images with complex background information is challenging.The emergence of diffusion models has prompted the innovative idea of employing the reverse denoising process to distill building distribution from these complex backgrounds.Building on this concept,we propose a novel framework,building extraction diffusion model(BEDiff),which meticulously refines the extraction of building footprints from remote sensing images in a stepwise fashion.Our approach begins with the design of booster guidance,a mechanism that extracts structural and semantic features from remote sensing images to serve as priors,thereby providing targeted guidance for the diffusion process.Additionally,we introduce a cross-feature fusion module(CFM)that bridges the semantic gap between different types of features,facilitating the integration of the attributes extracted by booster guidance into the diffusion process more effectively.Our proposed BEDiff marks the first application of diffusion models to the task of building extraction.Empirical evidence from extensive experiments on the Beijing building dataset demonstrates the superior performance of BEDiff,affirming its effectiveness and potential for enhancing the accuracy of building extraction in complex urban landscapes.
文摘Processing police incident data in public security involves complex natural language processing(NLP)tasks,including information extraction.This data contains extensive entity information—such as people,locations,and events—while also involving reasoning tasks like personnel classification,relationship judgment,and implicit inference.Moreover,utilizing models for extracting information from police incident data poses a significant challenge—data scarcity,which limits the effectiveness of traditional rule-based and machine-learning methods.To address these,we propose TIPS.In collaboration with public security experts,we used de-identified police incident data to create templates that enable large language models(LLMs)to populate data slots and generate simulated data,enhancing data density and diversity.We then designed schemas to efficiently manage complex extraction and reasoning tasks,constructing a high-quality dataset and fine-tuning multiple open-source LLMs.Experiments showed that the fine-tuned ChatGLM-4-9B model achieved an F1 score of 87.14%,nearly 30%higher than the base model,significantly reducing error rates.Manual corrections further improved performance by 9.39%.This study demonstrates that combining largescale pre-trained models with limited high-quality domain-specific data can greatly enhance information extraction in low-resource environments,offering a new approach for intelligent public security applications.
基金funded by Institutional Fund Projects under grant no.(IFPDP-261-22)。
文摘Detecting cyber attacks in networks connected to the Internet of Things(IoT)is of utmost importance because of the growing vulnerabilities in the smart environment.Conventional models,such as Naive Bayes and support vector machine(SVM),as well as ensemble methods,such as Gradient Boosting and eXtreme gradient boosting(XGBoost),are often plagued by high computational costs,which makes it challenging for them to perform real-time detection.In this regard,we suggested an attack detection approach that integrates Visual Geometry Group 16(VGG16),Artificial Rabbits Optimizer(ARO),and Random Forest Model to increase detection accuracy and operational efficiency in Internet of Things(IoT)networks.In the suggested model,the extraction of features from malware pictures was accomplished with the help of VGG16.The prediction process is carried out by the random forest model using the extracted features from the VGG16.Additionally,ARO is used to improve the hyper-parameters of the random forest model of the random forest.With an accuracy of 96.36%,the suggested model outperforms the standard models in terms of accuracy,F1-score,precision,and recall.The comparative research highlights our strategy’s success,which improves performance while maintaining a lower computational cost.This method is ideal for real-time applications,but it is effective.
基金funded by the Chinese State Grid Jiangsu Electric Power Co.,Ltd.Science and Technology Project Funding,Grant Number J2023031.
文摘Accurate vector extraction from design drawings is required first to automatically create 3D models from pixel-level engineering design drawings. However, this task faces the challenges of complicated design shapes as well as cumbersome and cluttered annotations on drawings, which interfere with the vector extraction heavily. In this article, the transmission tower containing the most complex structure is taken as the research object, and a semantic segmentation network is constructed to first segment the shape masks from the pixel-level drawings. Preprocessing and postprocessing are also proposed to ensure the stability and accuracy of the shape mask segmentation. Then, based on the obtained shape masks, a vector extraction network guided by heatmaps is designed to extract structural vectors by fusing the features from node heatmap and skeleton heatmap, respectively. Compared with the state-of-the-art methods, experiment results illustrate that the proposed semantic segmentation method can effectively eliminate the interference of many elements on drawings to segment the shape masks effectively, meanwhile, the model trained by the proposed vector extraction network can accurately extract the vectors such as nodes and line connections, avoiding redundant vector detection. The proposed method lays a solid foundation for automatic 3D model reconstruction and contributes to technological advancements in relevant fields.
文摘Background:Acquiring relevant information about procurement targets is fundamental to procuring medical devices.Although traditional Natural Language Processing(NLP)and Machine Learning(ML)methods have improved information retrieval efficiency to a certain extent,they exhibit significant limitations in adaptability and accuracy when dealing with procurement documents characterized by diverse formats and a high degree of unstructured content.The emergence of Large Language Models(LLMs)offers new possibilities for efficient procurement information processing and extraction.Methods:This study collected procurement transaction documents from public procurement websites,and proposed a procurement Information Extraction(IE)method based on LLMs.Unlike traditional approaches,this study systematically explores the applicability of LLMs in both structured and unstructured entities in procurement documents,addressing the challenges posed by format variability and content complexity.Furthermore,an optimized prompt framework tailored for procurement document extraction tasks is developed to enhance the accuracy and robustness of IE.The aim is to process and extract key information from medical device procurement quickly and accurately,meeting stakeholders'demands for precision and timeliness in information retrieval.Results:Experimental results demonstrate that,compared to traditional methods,the proposed approach achieves an F1 Score of 0.9698,representing a 4.85%improvement over the best baseline model.Moreover,both recall and precision rates are close to 97%,significantly outperforming other models and exhibiting exceptional overall recognition capabilities.Notably,further analysis reveals that the proposed method consistently maintains high performance across both structured and unstructured entities in procurement documents while balancing recall and precision effectively,demonstrating its adaptability in handling varying document formats.The results of ablation experiments validate the effectiveness of the proposed prompting strategy.Conclusion:Additionally,this study explores the challenges and potential improvements of the proposed method in IE tasks and provides insights into its feasibility for real-world deployment and application directions,further clarifying its adaptability and value.This method not only exhibits significant advantages in medical device procurement but also holds promise for providing new approaches to information processing and decision support in various domains.
基金This work was supported by Knowledge Innovation Pro-gram Chinese Academy of Sciences (No. KZCX2-SW-320-3 & KZCX3-SW-425).
文摘Boundary extraction of watershed is an important step in forest landscape research. The boundary of the upriver wa-tershed of the Hunhe River in the sub-alpine Qingyuan County of eastern Liaoning Province, China was extracted by digital elevation modeling (DEM) data in ArcInfo8.1. Remote sensing image of the corresponding region was applied to help modify its copy according to Enhanced Thematic Mapper (ETM) image抯 profuse geomorphological structure information. Both the DEM-dependent boundary and modified copy were overlapped with county map and drainage network map to visually check the effects of result. Overlap of county map suggested a nice extraction of the boundary line since the two layers matched precisely, which indicated the DEM-dependent boundary by program was effective and precise. Further upload of drainage network showed discrepancies between the boundary and the drainage network. Altogether, there were three sections of the extraction result that needed to correct. Compared with this extraction boundary, the modified boundary had a better match to the drainage network as well as to the county map. Comprehensive analysis demonstrated that the program extraction has generally fine precision in position and excels the digitized result by hand. The errors of the DEM-dependant extraction are due to the fact that it is difficult for program to recognize sections of complex landform especially altered by human activities, but these errors are discernable and adjustable because the spatial resolution of ETM image is less than that of DEM. This study result proved that application of remote sensing information could help obtain better result when DEM method is used in extraction of watershed boundary.
文摘A novel parameter extraction method with rational functions is presented for the 2-πequivalent circuit model of RF CMOS spiral inductors. The final S-parameters simulated by the circuit model closely match experimental data. The extraction strategy is straightforward and can be easily implemented as a CAD tool to model spiral inductors. The resulting circuit models will be very useful for RF circuit designers.
基金supported by National Natural Science Foundation of China (Grant No. 50805021)
文摘It is well known that the human auditory system possesses remarkable capabilities to analyze and identify signals. Therefore, it would be significant to build an auditory model based on the mechanism of human auditory systems, which may improve the effects of mechanical signal analysis and enrich the methods of mechanical faults features extraction. However the existing methods are all based on explicit senses of mathematics or physics, and have some shortages on distinguishing different faults, stability, and suppressing the disturbance noise, etc. For the purpose of improving the performances of the work of feature extraction, an auditory model, early auditory(EA) model, is introduced for the first time. This auditory model transforms time domain signal into auditory spectrum via bandpass filtering, nonlinear compressing, and lateral inhibiting by simulating the principle of the human auditory system. The EA model is developed with the Gammatone filterbank as the basilar membrane. According to the characteristics of vibration signals, a method is proposed for determining the parameter of inner hair cells model of EA model. The performance of EA model is evaluated through experiments on four rotor faults, including misalignment, rotor-to-stator rubbing, oil film whirl, and pedestal looseness. The results show that the auditory spectrum, output of EA model, can effectively distinguish different faults with satisfactory stability and has the ability to suppress the disturbance noise. Then, it is feasible to apply auditory model, as a new method, to the feature extraction for mechanical faults diagnosis with effect.
文摘This paper presents an accurate small-signal model for multi-gate GaAs pHEMTs in switching-mode.The extraction method for the proposed model is developed.A 2-gate switch structure is fabricated on a commercial 0.5μm AlGaAs/GaAs pHEMT technology to verify the proposed model.Excellent agreement has been obtained between the measured and simulated results over a wide frequency range.
基金Supported by the National Natural Science Foundation of China(Nos.20176049 and 20576113)
文摘Ultrasonically assisted extraction of isoflavones from the stem of Pueraria lobata (Willd.) Ohwi has been carried out with an ultrasonic extracting apparatus (20kHz, electrical power input to the transducer in 0-650W). The influence of the electrical power input and extraction time on the'extraction yield is investigated in water, n-butanol, and 95% (by volume) and 50% (by volume) ethanol aqueous solution. The experimental results indicate that the yields of total isoflavones are higher in ultrasonically assisted extraction than those obtained from con-ventional extraction.Moreover,a mathematical model is proposed,by introducing the electrical power input to index the ultrsound intensity,to describe the behavior of ultrasonically assisted extraction.It is found that the model calcuations are in good agreement with the experimental data.
基金Under the auspices of National Youth Science Foundation of China(No.41001294)Key Project of National Natural Science Foundation of China(No.40930531)Research Fund of State Key Laboratory Resources and Environment Information System(No.2010KF0002SA)
文摘In China′s Loess Plateau area, gully head is the most active zone of a drainage system in gully areas. The differentiation of loess gully head follows geospatial patterns and reflects the process of the loess landform development and evolution of its drainage system to some extent. In this study, the geomorphic meaning, basic characteristics, morphological structure and the basic types of loess gully heads were systematically analysed. Then, the loess gully head′s conceptual model was established, and an extraction method based on Digital Elevation Model(DEM) for loess gully head features and elements was proposed. Through analysing the achieved statistics of loess gully head features, loess gully heads have apparently similar and different characteristics depending on the different loess landforms where they are found. The loess head characteristics reflect their growth period and evolution tendency to a certain degree, and they indirectly represent evolutionary mechanisms. In addition, the loess gully developmental stages and the evolutionary processes can be deduced by using loess gully head characteristics. This study is of great significance for development and improvement of the theoretical system for describing loess gully landforms.
文摘The supercritical carbon dioxide extraction was applied to obtain essential oil from Pogostemon cablin in this work.Effect of extraction parameters including temperature,pressure,extraction time and particle size on extraction yield was investigated,and the response surface methodology with a Box–Behnken Design was used to achieve the optimized extraction conditions.The maximum yield of essential oil was 2.4356%under the conditions of extraction temperature 47°C,pressure 24.5 MPa and extraction time 119 min.Moreover,based on the Brunauer–Emmett–Teller theory of adsorption,a mathematical modeling was performed to correlate the measured data.The model shows a function relationship between extraction yield and time by a simple equation with three significantly adjustable parameters.These model parameters have been optimized through simulated annealing algorithm.The predicted data from the mathematical model show a good agreement with the experimental data of the different extraction parameters.
文摘A sub circuit model for VDMOS is built according to its physical structure.Parameters and formulas describing the device are also derived from this model.Comparing to former results,this model avoids too many technical parameters and simplify the sub circuit efficiently.As a result of numeric computation,this simple model with clear physical conception demonstrates excellent agreements between measured and modeled response (DC error within 5%,AC error within 10%).Such a model is now available for circuit simulation and parameter extraction.
文摘In consideration of the online measurement of the component content in rare earth countercurrent extraction separation process, the soft sensor method based on hybrid modeling was proposed to measure the rare earth component content. The hybrid models were composed of the extraction equilibrium calculation model and the Radial Basis Function (RBF) Neural Network (NN) error compensation model; the parameters of compensation model were optimized by the hierarchical genetic algorithms (HGA). In addition, application experiment research of this proposed method was carried out in the rare earth separation production process of a corporation. The result shows that this method is effective and can realize online measurement for the component content of rare earth in the countercurrent extraction.
基金financially supported by the National Key R&D Program of China (No.2022YFF0711601)the Natural Science Foundation of Hubei Province of China (No.2022CFB640)+2 种基金the Open Fund of Key Laboratory of Urban Land Resources Monitoring and Simulation,Ministry of Natural Resources (No.KF-2022-07-014)the Opening Fund of Hubei Key Laboratory of Intelligent Vision-Based Monitoring for Hydroelectric Engineering (No.2022SDSJ04)the Beijing Key Laboratory of Urban Spatial Information Engineering (No.20220108)。
文摘Geological knowledge can provide support for knowledge discovery, knowledge inference and mineralization predictions of geological big data. Entity identification and relationship extraction from geological data description text are the key links for constructing knowledge graphs. Given the lack of publicly annotated datasets in the geology domain, this paper illustrates the construction process of geological entity datasets, defines the types of entities and interconceptual relationships by using the geological entity concept system, and completes the construction of the geological corpus. To address the shortcomings of existing language models(such as Word2vec and Glove) that cannot solve polysemous words and have a poor ability to fuse contexts, we propose a geological named entity recognition and relationship extraction model jointly with Bidirectional Encoder Representation from Transformers(BERT) pretrained language model. To effectively represent the text features, we construct a BERT-bidirectional gated recurrent unit network(BiGRU)-conditional random field(CRF)-based architecture to extract the named entities and the BERT-BiGRU-Attention-based architecture to extract the entity relations. The results show that the F1-score of the BERT-BiGRU-CRF named entity recognition model is 0.91 and the F1-score of the BERT-BiGRU-Attention relationship extraction model is 0.84, which are significant performance improvements when compared to classic language models(e.g., word2vec and Embedding from Language Models(ELMo)).
基金supported by the National Key Basic Research and Development Programme of China(No.2004CB619200)the National Science Foundation for Distinguished Young Scholars of China(No.50325415)the National Natural Science Foundation of China(No.50321402).
文摘A model GM (grey model) (1,1) for forecasting the rate of copper extraction during the bioleaching of primary sulphide ore was established on the basis of the mathematical theory and the modeling process of grey system theory. It was used for forecasting the rate of copper extraction from the primary sulfide ore during a laboratory microbial column leaching experiment. The precision of the forecasted results were examined and modified via "posterior variance examination". The results show that the forecasted values coincide with the experimental values. GM (1,1) model has high forecast accuracy; and it is suitable for simulation control and prediction analysis of the original data series of the processes that have grey characteristics, such as mining, metallurgical and mineral processing, etc. The leaching rate of such copper sulphide ore is low. The grey forecasting result indicates that the rate of copper extraction is approximately 20% even after leaching for six months.
基金supported by a grant from the Peking University Clinical Medicine Youth Special Fund(PKU20222LCXQ042).
文摘Microdissection testicular sperm extraction(micro-TESE)is widely used to treat nonobstructive azoospermia.However,a good prediction model is required to anticipate a successful sperm retrieval rate before performing micro-TESE.This retrospective study analyzed the clinical records of 200 nonobstructive azoospermia patients between January 2021 and December 2021.The backward method was used to perform binary logistic regression analysis and identify factors that predicted a successful micro-TESE sperm retrieval.The prediction model was constructed using acquired regression coefficients,and its predictive performance was assessed using the receiver operating characteristic curve.In all,67 patients(sperm retrieval rate:33.5%)underwent successful micro-TESE.Follicle-stimulating hormone,anti-Miillerian hormone,and inhibin B levels varied significantly between patients who underwent successful and unsuccessful micro-TESE.Binary logistic regression analysis yielded the following six predictors:anti-Mullerian hormone(odds ratio[OR]=0.902,95%confidence interval[Cl]:0.821-0.990),inhibin B(OR=1.012,95%Cl:1.001-1.024),Klinefelter’s syndrome(OR=0.022,95%Cl:0.002-0.243),Y chromosome microdeletion(OR=0.050,95%Cl:0.005-0.504),cryptorchidism with orchiopexy(OR=0.085,95%Cl:0.008-0.929),and idiopathic nonobstructive azoospermia(OR=0.031,95%Cl:0.003-0.277).The prediction model had an area under the curve of 0.720(95%Cl:0.645-0.794),sensitivity of 65.7%,specificity of 72.2%,Youden index of 0.379,and cut-off value of 0.305 overall,indicating good predictive value and accuracy.This model can assist clinicians and nonobstructive azoospermia patients in decision-making and avoiding negative micro-TESE results.
基金Project(51209167) supported by Youth Project of the National Natural Science Foundation of ChinaProject(2012JM8026) supported by Shaanxi Provincial Natural Science Foundation, China
文摘In order to accurately describe the dynamic characteristics of flight vehicles through aerodynamic modeling, an adaptive wavelet neural network (AWNN) aerodynamic modeling method is proposed, based on subset kernel principal components analysis (SKPCA) feature extraction. Firstly, by fuzzy C-means clustering, some samples are selected from the training sample set to constitute a sample subset. Then, the obtained samples subset is used to execute SKPCA for extracting basic features of the training samples. Finally, using the extracted basic features, the AWNN aerodynamic model is established. The experimental results show that, in 50 times repetitive modeling, the modeling ability of the method proposed is better than that of other six methods. It only needs about half the modeling time of KPCA-AWNN under a close prediction accuracy, and can easily determine the model parameters. This enables it to be effective and feasible to construct the aerodynamic modeling for flight vehicles.
文摘The flow field of liquid phase (water) of agitated extraction columns is simulated with the help of computational fluid dynamics (CFD). Four kinds of Reynolds-averaged turbulence models, i.e. the standard k-ε model, the RNG (renormalization group) k-s model, the realizable k-ε model and the Reynolds stress model, are compared in detail in order to judge which is the best model in terms of the accuracy, less CPU time and memory required. The performance of the realizable k-s model is obviously improved by reducing the model constant from C2 = 1.90 to C2 = 1.61. It is concluded that the improved realizable k-e model is the optimal model.