Accurate retrieval of casting 3D models is crucial for process reuse.Current methods primarily focus on shape similarity,neglecting process design features,which compromises reusability.In this study,a novel deep lear...Accurate retrieval of casting 3D models is crucial for process reuse.Current methods primarily focus on shape similarity,neglecting process design features,which compromises reusability.In this study,a novel deep learning retrieval method for process reuse was proposed,which integrates process design features into the retrieval of casting 3D models.This method leverages the comparative language-image pretraining(CLIP)model to extract shape features from the three views and sectional views of the casting model and combines them with process design features such as modulus,main wall thickness,symmetry,and length-to-height ratio to enhance process reusability.A database of 230 production casting models was established for model validation.Results indicate that incorporating process design features improves model accuracy by 6.09%,reaching 97.82%,and increases process similarity by 30.25%.The reusability of the process was further verified using the casting simulation software EasyCast.The results show that the process retrieved after integrating process design features produces the least shrinkage in the target model,demonstrating this method’s superior ability for process reuse.This approach does not require a large dataset for training and optimization,making it highly applicable to casting process design and related manufacturing processes.展开更多
Recent advancements in large language models(LLMs)have driven remarkable progress in text process-ing,opening new avenues for medical knowledge discovery.In this study,we present ERQA,a mEdical knowledge Retrieval and...Recent advancements in large language models(LLMs)have driven remarkable progress in text process-ing,opening new avenues for medical knowledge discovery.In this study,we present ERQA,a mEdical knowledge Retrieval and Question-Answering framework powered by an enhanced LLM that integrates a semantic vector database and a curated literature repository.The ERQA framework leverages domain-specific incremental pretraining and conducts supervised fine-tuning on medical literature,enabling retrieval and question-answering(QA)tasks to be completed with high precision.Performance evaluations implemented on the coronavirus disease 2019(COVID-19)and TripClick data-sets demonstrate the robust capabilities of ERQA across multiple tasks.On the COVID-19 dataset,ERQA-13B achieves state-of-the-art retrieval metrics,with normalized discounted cumulative gain at top 10(NDCG@10)0.297,recall values at top 10(Recall@10)0.347,and mean reciprocal rank(MRR)=0.370;it also attains strong abstract summarization performance,with a recall-oriented understudy for gisting evaluation(ROUGE)-1 score of 0.434,and QA performance,with a bilingual evaluation understudy(BLEU)-1 score of 7.851.The comparable performance achieved on the TripClick dataset further under-scores the adaptability of ERQA across diverse medical topics.These findings suggest that ERQA repre-sents a significant step toward efficient biomedical knowledge retrieval and QA.展开更多
With increasing the number of wind power generators,the consumption time of electromagnetic simulation of the wind farm explodes.To reduce the simulation time while meeting the accuracy requirement,a genetic clusterin...With increasing the number of wind power generators,the consumption time of electromagnetic simulation of the wind farm explodes.To reduce the simulation time while meeting the accuracy requirement,a genetic clustering-based equivalent model is proposed for the wind farm with numerous doubly fed induction generators.In the proposed model,active power together with the reactive power and the wind speed are selected to form the set of clustering indicators.A normalization technique is utilized to cope with the multiple orders of magnitude in these factors.An exponential fitness value is formulated as a function of the sorting number of the primary fitness value,and the fitness-based selection probability is constructed to overcome the property of premature and slow convergence of the genetic clustering algorithm.The sum of squares due to error is used to determine the optimal clustering number.In addition,a decoupled parameter equivalence method is adopted to obtain the equivalent parameters of the collection network.Simulation results and comparisons with various methods under different voltage scenarios show the feasibility and effectiveness of the proposed model.展开更多
The retrieval of the biomass parameters from active/passive microwave remote sensing data (10.2 GHz) is performed based on an iterative inversion of BP neural network model with fuzzy optimization. The BP neural net...The retrieval of the biomass parameters from active/passive microwave remote sensing data (10.2 GHz) is performed based on an iterative inversion of BP neural network model with fuzzy optimization. The BP neural network is trained by a set of the measurements of active and passive remote sensing and the ground truth data versus Day of Year during growth. Once the network training is complete, the model can be used to retrieve the temporal variations of the biomass parameters from another set of observation data. The model was used in weights and microware observation data of wheat growth in 1989 to retrieve biomass parameters change of wheat growth this year. The retrieved biomass parameters correspond well with the real data of the growth, which shows that the BP model is scientific and sound.展开更多
Content-based 3D model retrieval is of great help to facilitate the reuse of existing designs and to inspire designers during conceptual design. However, there is still a gap to apply it in industry due to the low tim...Content-based 3D model retrieval is of great help to facilitate the reuse of existing designs and to inspire designers during conceptual design. However, there is still a gap to apply it in industry due to the low time efficiency. This paper presents two new methods with high efficiency to build a Content-based 3D model retrieval system. First, an improvement is made on the "Shape Distribution (D2)" algorithm, and a new algorithm named "Quick D2" is proposed. Four sample 3D mechanical models are used in an experiment to compare the time cost of the two algorithms. The result indicates that the time cost of Quick D2 is much lower than that of D2, while the descriptors extracted by the two algorithms are almost the same. Second, an expandable 3D model repository index method with high performance, namely, RBK index, is presented. On the basis of RBK index, the search space is pruned effectively during the search process, leading to a speed up of the whole system. The factors that influence the values of the key parameters of RBK index are discussed and an experimental method to find the optimal values of the key parameters is given. Finally, "3D Searcher", a content-based 3D model retrieval system is developed. By using the methods proposed, the time cost for the system to respond one query online is reduced by 75% on average. The system has been implemented in a manufacturing enterprise, and practical query examples during a case of the automobile rear axle design are also shown. The research method presented shows a new research perspective and can effectively improve the content-based 3D model retrieval efficiency.展开更多
An algorithm for retrieving polarimetric variables from numerical model fields is developed. By using this technique, radar reflectivity at horizontal polarization~ differential reflectivity, specific differential pha...An algorithm for retrieving polarimetric variables from numerical model fields is developed. By using this technique, radar reflectivity at horizontal polarization~ differential reflectivity, specific differential phase shift and correlation coefficients between the horizontal and vertical polarization signals at zero lag can be derived from rain, snow and hail contents of numerical model outputs. Effects of environmental temperature and the melting process on polarimetric variables are considered in the algorithm. The algorithm is applied to the Advanced Regional Prediction System (ARPS) model simulation results for a hail storm. The spatial distributions of the derived parameters are reasonable when compared with observational knowledge. This work provides a forward model for assimilation of dual linear polarization radar data into a mesoscale model.展开更多
A hybrid model that is based on the Combination of keywords and concept was put forward. The hybrid model is built on vector space model and probabilistic reasoning network. It not only can exert the advantages of key...A hybrid model that is based on the Combination of keywords and concept was put forward. The hybrid model is built on vector space model and probabilistic reasoning network. It not only can exert the advantages of keywords retrieval and concept retrieval but also can compensate for their shortcomings. Their parameters can be adjusted according to different usage in order to accept the best information retrieval result, and it has been proved by our experiments.展开更多
Microdissection testicular sperm extraction(micro-TESE)is widely used to treat nonobstructive azoospermia.However,a good prediction model is required to anticipate a successful sperm retrieval rate before performing m...Microdissection testicular sperm extraction(micro-TESE)is widely used to treat nonobstructive azoospermia.However,a good prediction model is required to anticipate a successful sperm retrieval rate before performing micro-TESE.This retrospective study analyzed the clinical records of 200 nonobstructive azoospermia patients between January 2021 and December 2021.The backward method was used to perform binary logistic regression analysis and identify factors that predicted a successful micro-TESE sperm retrieval.The prediction model was constructed using acquired regression coefficients,and its predictive performance was assessed using the receiver operating characteristic curve.In all,67 patients(sperm retrieval rate:33.5%)underwent successful micro-TESE.Follicle-stimulating hormone,anti-Miillerian hormone,and inhibin B levels varied significantly between patients who underwent successful and unsuccessful micro-TESE.Binary logistic regression analysis yielded the following six predictors:anti-Mullerian hormone(odds ratio[OR]=0.902,95%confidence interval[Cl]:0.821-0.990),inhibin B(OR=1.012,95%Cl:1.001-1.024),Klinefelter’s syndrome(OR=0.022,95%Cl:0.002-0.243),Y chromosome microdeletion(OR=0.050,95%Cl:0.005-0.504),cryptorchidism with orchiopexy(OR=0.085,95%Cl:0.008-0.929),and idiopathic nonobstructive azoospermia(OR=0.031,95%Cl:0.003-0.277).The prediction model had an area under the curve of 0.720(95%Cl:0.645-0.794),sensitivity of 65.7%,specificity of 72.2%,Youden index of 0.379,and cut-off value of 0.305 overall,indicating good predictive value and accuracy.This model can assist clinicians and nonobstructive azoospermia patients in decision-making and avoiding negative micro-TESE results.展开更多
In this paper, we present a novel Support Vector Machine active learning algorithm for effective 3D model retrieval using the concept of relevance feedback. The proposed method learns from the most informative objects...In this paper, we present a novel Support Vector Machine active learning algorithm for effective 3D model retrieval using the concept of relevance feedback. The proposed method learns from the most informative objects which are marked by the user, and then creates a boundary separating the relevant models from irrelevant ones. What it needs is only a small number of 3D models labelled by the user. It can grasp the user's semantic knowledge rapidly and accurately. Experimental results showed that the proposed algorithm significantly improves the retrieval effectiveness. Compared with four state-of-the-art query refinement schemes for 3D model retrieval, it provides superior retrieval performance after no more than two rounds of relevance feedback.展开更多
A large number of 3D models are created on computers and available for networks. Some content-based retrieval technologies are indispensable to find out particular data from such anonymous datasets. Though several sha...A large number of 3D models are created on computers and available for networks. Some content-based retrieval technologies are indispensable to find out particular data from such anonymous datasets. Though several shape retrieval technologies have been developed, little attention has been given to the points on human's sense and impression (as known as Kansei) in the conventional techniques, In this paper, the authors propose a novel method of shape retrieval based on shape impression of human's Kansei. The key to the method is the Gaussian curvature distribution from 3D models as features for shape retrieval. Then it classifies the 3D models by extracted feature and measures similarity among models in storage.展开更多
In order to improve the accuracy and efficiency of 3D model retrieval,the method based on affinity propagation clustering algorithm is proposed. Firstly,projection ray-based method is proposed to improve the feature e...In order to improve the accuracy and efficiency of 3D model retrieval,the method based on affinity propagation clustering algorithm is proposed. Firstly,projection ray-based method is proposed to improve the feature extraction efficiency of 3D models. Based on the relationship between model and its projection,the intersection in 3D space is transformed into intersection in 2D space,which reduces the number of intersection and improves the efficiency of the extraction algorithm. In feature extraction,multi-layer spheres method is analyzed. The two-layer spheres method makes the feature vector more accurate and improves retrieval precision. Secondly,Semi-supervised Affinity Propagation ( S-AP) clustering is utilized because it can be applied to different cluster structures. The S-AP algorithm is adopted to find the center models and then the center model collection is built. During retrieval process,the collection is utilized to classify the query model into corresponding model base and then the most similar model is retrieved in the model base. Finally,75 sample models from Princeton library are selected to do the experiment and then 36 models are used for retrieval test. The results validate that the proposed method outperforms the original method and the retrieval precision and recall ratios are improved effectively.展开更多
The geophysical model function (GMF) describes the relationship between a backscattering and a sea surface wind, and enables a wind vector retrieval from backscattering measurements. It is clear that the GMF plays a...The geophysical model function (GMF) describes the relationship between a backscattering and a sea surface wind, and enables a wind vector retrieval from backscattering measurements. It is clear that the GMF plays an important role in an ocean wind vector retrieval. The performance of the existing Ku-band model function QSCAT-1 is considered to be effective at low and moderate wind speed ranges. However, in the conditions of higher wind speeds, the existing algorithms diverge alarmingly, owing to the lack of in situ data required for developing the GMF for the high wind conditions, the QSCAT-1 appears to overestimate the a0, which results in underestimating the wind speeds. Several match-up QuikSCAT and special sensor microwave/imager (SSM/I) wind speed measurements of the typhoons occurring in the west Pacific Ocean are analyzed. The results show that the SSM/I wind exhibits better agreement with the "best track" analysis wind speed than the QuikSCAT wind retrieved using QSCAT-1. On the basis of this evaluation, a correction of the QSCAT-1 model function for wind speed above 16 m/s is proposed, which uses the collocated SSM/I and QuikSCAT measurements as a training set, and a neural network approach as a multiple nonlinear regression technologytechnology.In order to validate the revised GMF for high winds, the modified GMF was applied to the QuikSCAT observations of Hurricane IOKE. The wind estimated by the QuikSCAT for Typhoon IOKE in 2006 was improved with the maximum wind speed reaching 55 m/s. An error analysis was performed using the wind fields from the Holland model as the surface truth. The results show an improved agreement with the Holland model wind when compared with the wind estimated using the QSCAT-1. However, large bias still existed, indicating that the effects of rain must be considered for further improvement.展开更多
CAD model retrieval based on functional semantics is more significant than content-based 3D model retrieval during the mechanical conceptual design phase. However, relevant research is still not fully discussed. There...CAD model retrieval based on functional semantics is more significant than content-based 3D model retrieval during the mechanical conceptual design phase. However, relevant research is still not fully discussed. Therefore, a functional semantic-based CAD model annotation and retrieval method is proposed to support mechanical conceptual design and design reuse, inspire designer creativity through existing CAD models, shorten design cycle, and reduce costs. Firstly, the CAD model functional semantic ontology is constructed to formally represent the functional semantics of CAD models and describe the mechanical conceptual design space comprehensively and consistently. Secondly, an approach to represent CAD models as attributed adjacency graphs(AAG) is proposed. In this method, the geometry and topology data are extracted from STEP models. On the basis of AAG, the functional semantics of CAD models are annotated semi-automatically by matching CAD models that contain the partial features of which functional semantics have been annotated manually, thereby constructing CAD Model Repository that supports model retrieval based on functional semantics. Thirdly, a CAD model retrieval algorithm that supports multi-function extended retrieval is proposed to explore more potential creative design knowledge in the semantic level. Finally, a prototype system, called Functional Semantic-based CAD Model Annotation and Retrieval System(FSMARS), is implemented. A case demonstrates that FSMARS can successfully botain multiple potential CAD models that conform to the desired function. The proposed research addresses actual needs and presents a new way to acquire CAD models in the mechanical conceptual design phase.展开更多
This paper explores the application of term dependency in information retrieval (IR) and proposes a novel dependency retrieval model. This retrieval model suggests an extension to the existing language modeling (LM) a...This paper explores the application of term dependency in information retrieval (IR) and proposes a novel dependency retrieval model. This retrieval model suggests an extension to the existing language modeling (LM) approach to IR by introducing dependency models for both query and document. Relevance between document and query is then evaluated by reference to the Kullback-Leibler divergence between their dependency models. This paper introduces a novel hybrid dependency structure, which allows integration of various forms of dependency within a single framework. A pseudo relevance feedback based method is also introduced for constructing query dependency model. The basic idea is to use query-relevant top-ranking sentences extracted from the top documents at retrieval time as the augmented representation of query, from which the relationships between query terms are identified. A Markov Random Field (MRF) based approach is presented to ensure the relevance of the extracted sentences, which utilizes the association features between query terms within a sentence to evaluate the relevance of each sentence. This dependency retrieval model was compared with other traditional retrieval models. Experiments indicated that it produces significant improvements in retrieval effectiveness.展开更多
To reuse and share the valuable knowledge embedded in repositories of engineering models for accelerating the design process, improving product quality, and reducing costs, it is crucial to devise search engines capab...To reuse and share the valuable knowledge embedded in repositories of engineering models for accelerating the design process, improving product quality, and reducing costs, it is crucial to devise search engines capable of matching 3D models efficiently and effectively. In this paper, an enhanced shape distributions-based technique of using geometrical and topological information to search 3D engineering models represented by polygonal meshes was presented. A simplification method of polygonal meshes was used to simplify engineering model as the pretreatment for generation of sample points. The method of sampling points was improved and a pair of functions that was more sensitive to shape was employed to construct a 2D shape distribution. Experiments were conducted to evaluate the proposed algorithm utilizing the Engineering Shape Benchmark (ESB) database. The experiential results suggest that the search effectiveness is significantly improved by enforcing the simplification and enhanced shape distributions to engineering model retrieval.展开更多
Blog opinion retrieval aims to find blogs with opinionated information related to a given topic.Its main problem is to compute the opinion score,which balances topic relevance and opinion relevance.To deal with this p...Blog opinion retrieval aims to find blogs with opinionated information related to a given topic.Its main problem is to compute the opinion score,which balances topic relevance and opinion relevance.To deal with this problem a generative model deduced by a Bayesian approach is pro-posed,and an improved mixture model is proposed to estimate the opinion relevance between a blog and a given topic in our retrieval framework.Moreover,pointwise mutual information is used to expand sentiment words for different topics based on a general sentimental lexicon.The correlation between topic and candidate words is applied in the process of both expanding sentiment words and estimating sentence opinion scores.Experimental results show that the proposed approaches improve upon the state-of-the-art opinion retrieval method on TREC2010 dataset.展开更多
Aiming at the difficulty of accurately constructing the dynamic model of subtropical high, based on the potential height field time series over 500 hPa layer of T106 numerical forecast products, by using EOF(empirica...Aiming at the difficulty of accurately constructing the dynamic model of subtropical high, based on the potential height field time series over 500 hPa layer of T106 numerical forecast products, by using EOF(empirical orthogonal function) temporal-spatial separation technique, the disassembled EOF time coefficients series were regarded as dynamical model variables, and dynamic system retrieval idea as well as genetic algorithm were introduced to make dynamical model parameters optimization search, then, a reasonable non-linear dynamic model of EOF time-coefficients was established. By dynamic model integral and EOF temporal-spatial components assembly, a mid-/long-term forecast of subtropical high was carried out. The experimental results show that the forecast results of dynamic model are superior to that of general numerical model forecast results. A new modeling idea and forecast technique is presented for diagnosing and forecasting such complicated weathers as subtropical high.展开更多
In the context of power generation companies, vast amounts of specialized data and expert knowledge have been accumulated. However, challenges such as data silos and fragmented knowledge hinder the effective utilizati...In the context of power generation companies, vast amounts of specialized data and expert knowledge have been accumulated. However, challenges such as data silos and fragmented knowledge hinder the effective utilization of this information. This study proposes a novel framework for intelligent Question-and-Answer (Q&A) systems based on Retrieval-Augmented Generation (RAG) to address these issues. The system efficiently acquires domain-specific knowledge by leveraging external databases, including Relational Databases (RDBs) and graph databases, without additional fine-tuning for Large Language Models (LLMs). Crucially, the framework integrates a Dynamic Knowledge Base Updating Mechanism (DKBUM) and a Weighted Context-Aware Similarity (WCAS) method to enhance retrieval accuracy and mitigate inherent limitations of LLMs, such as hallucinations and lack of specialization. Additionally, the proposed DKBUM dynamically adjusts knowledge weights within the database, ensuring that the most recent and relevant information is utilized, while WCAS refines the alignment between queries and knowledge items by enhanced context understanding. Experimental validation demonstrates that the system can generate timely, accurate, and context-sensitive responses, making it a robust solution for managing complex business logic in specialized industries.展开更多
In this paper, we propose a dynamic multi-descriptor fusion (DMDF) approach to improving the retrieval accuracy of 3-dimensional (3D) model retrieval systems. First, an independent retrieval list is generated by u...In this paper, we propose a dynamic multi-descriptor fusion (DMDF) approach to improving the retrieval accuracy of 3-dimensional (3D) model retrieval systems. First, an independent retrieval list is generated by using each individual descriptor. Second, we propose an automatic relevant/irrelevant models selection (ARMS) approach to selecting the relevant and irrelevant 3D models automatically without any user interaction. A weighted distance, in which the weight associated with each individual descriptor is learnt by using the selected relevant and irrelevant models, is used to measure the similarity between two 3D models. Furthermore, a descriptor-dependent adaptive query point movement (AQPM) approach is employed to update every feature vector. This set of new feature vectors is used to index 3D models in the next search process. Four 3D model databases are used to compare the retrieval accuracy of our proposed DMDF approach with several descriptors as well as some well-known information fusion methods. Experimental results have shown that our proposed DMDF approach provides a promising retrieval result and always yields the best retrieval accuracy.展开更多
A language model for information retrieval is built by using a query language model to generate queries and a document language model to generate documents. The documents are ranked according to the relative entropies...A language model for information retrieval is built by using a query language model to generate queries and a document language model to generate documents. The documents are ranked according to the relative entropies of estimated document language models with respect to the estimated query language model. Two popular and relatively efficient smoothing methods, the Jelinek- Mercer method and the absolute discounting method, are used to smooth the document language model in estimation of the document language, A combined model composed of the feedback document language model and the collection language model is used to estimate the query model. A performacne comparison between the new retrieval method and the existing method with feedback is made, and the retrieval performances of the proposed method with the two different smoothing techniques are evaluated on three Text Retrieval Conference (TREC) data sets. Experimental results show that the method is effective and performs better than the basic language modeling approach; moreover, the method using the Jelinek-Mercer technique performs better than that using the absolute discounting technique, and the perfomance is sensitive to the smoothing peramters.展开更多
基金supported by the National Natural Science Foundation of China(Nos.52074246,52275390,52375394)the National Defense Basic Scientific Research Program of China(No.JCKY2020408B002)the Key R&D Program of Shanxi Province(No.202102050201011).
文摘Accurate retrieval of casting 3D models is crucial for process reuse.Current methods primarily focus on shape similarity,neglecting process design features,which compromises reusability.In this study,a novel deep learning retrieval method for process reuse was proposed,which integrates process design features into the retrieval of casting 3D models.This method leverages the comparative language-image pretraining(CLIP)model to extract shape features from the three views and sectional views of the casting model and combines them with process design features such as modulus,main wall thickness,symmetry,and length-to-height ratio to enhance process reusability.A database of 230 production casting models was established for model validation.Results indicate that incorporating process design features improves model accuracy by 6.09%,reaching 97.82%,and increases process similarity by 30.25%.The reusability of the process was further verified using the casting simulation software EasyCast.The results show that the process retrieved after integrating process design features produces the least shrinkage in the target model,demonstrating this method’s superior ability for process reuse.This approach does not require a large dataset for training and optimization,making it highly applicable to casting process design and related manufacturing processes.
基金supported by the Innovation Fund for Medical Sciences of the Chinese Academy of Medical Sciences(2021-I2M-1-033)the National Key Research and Development Program of China(2022YFF0711900).
文摘Recent advancements in large language models(LLMs)have driven remarkable progress in text process-ing,opening new avenues for medical knowledge discovery.In this study,we present ERQA,a mEdical knowledge Retrieval and Question-Answering framework powered by an enhanced LLM that integrates a semantic vector database and a curated literature repository.The ERQA framework leverages domain-specific incremental pretraining and conducts supervised fine-tuning on medical literature,enabling retrieval and question-answering(QA)tasks to be completed with high precision.Performance evaluations implemented on the coronavirus disease 2019(COVID-19)and TripClick data-sets demonstrate the robust capabilities of ERQA across multiple tasks.On the COVID-19 dataset,ERQA-13B achieves state-of-the-art retrieval metrics,with normalized discounted cumulative gain at top 10(NDCG@10)0.297,recall values at top 10(Recall@10)0.347,and mean reciprocal rank(MRR)=0.370;it also attains strong abstract summarization performance,with a recall-oriented understudy for gisting evaluation(ROUGE)-1 score of 0.434,and QA performance,with a bilingual evaluation understudy(BLEU)-1 score of 7.851.The comparable performance achieved on the TripClick dataset further under-scores the adaptability of ERQA across diverse medical topics.These findings suggest that ERQA repre-sents a significant step toward efficient biomedical knowledge retrieval and QA.
基金the National Key R&D Program of China(No.2019YFE0114700)the Key R&D Program in Hunan Province of China(No.2021GK2020)+1 种基金the Natural Science Foundation of Hunan Province of China(No.2021JJ30079)the Project of Philosophy and Social Science Research in Yiyang City(No.2022YS191)。
文摘With increasing the number of wind power generators,the consumption time of electromagnetic simulation of the wind farm explodes.To reduce the simulation time while meeting the accuracy requirement,a genetic clustering-based equivalent model is proposed for the wind farm with numerous doubly fed induction generators.In the proposed model,active power together with the reactive power and the wind speed are selected to form the set of clustering indicators.A normalization technique is utilized to cope with the multiple orders of magnitude in these factors.An exponential fitness value is formulated as a function of the sorting number of the primary fitness value,and the fitness-based selection probability is constructed to overcome the property of premature and slow convergence of the genetic clustering algorithm.The sum of squares due to error is used to determine the optimal clustering number.In addition,a decoupled parameter equivalence method is adopted to obtain the equivalent parameters of the collection network.Simulation results and comparisons with various methods under different voltage scenarios show the feasibility and effectiveness of the proposed model.
文摘The retrieval of the biomass parameters from active/passive microwave remote sensing data (10.2 GHz) is performed based on an iterative inversion of BP neural network model with fuzzy optimization. The BP neural network is trained by a set of the measurements of active and passive remote sensing and the ground truth data versus Day of Year during growth. Once the network training is complete, the model can be used to retrieve the temporal variations of the biomass parameters from another set of observation data. The model was used in weights and microware observation data of wheat growth in 1989 to retrieve biomass parameters change of wheat growth this year. The retrieved biomass parameters correspond well with the real data of the growth, which shows that the BP model is scientific and sound.
基金supported by National Natural Science Foundation of China(Grant No. 51175287)National Science and Technology Major Project(Grant No. 2011ZX02403)
文摘Content-based 3D model retrieval is of great help to facilitate the reuse of existing designs and to inspire designers during conceptual design. However, there is still a gap to apply it in industry due to the low time efficiency. This paper presents two new methods with high efficiency to build a Content-based 3D model retrieval system. First, an improvement is made on the "Shape Distribution (D2)" algorithm, and a new algorithm named "Quick D2" is proposed. Four sample 3D mechanical models are used in an experiment to compare the time cost of the two algorithms. The result indicates that the time cost of Quick D2 is much lower than that of D2, while the descriptors extracted by the two algorithms are almost the same. Second, an expandable 3D model repository index method with high performance, namely, RBK index, is presented. On the basis of RBK index, the search space is pruned effectively during the search process, leading to a speed up of the whole system. The factors that influence the values of the key parameters of RBK index are discussed and an experimental method to find the optimal values of the key parameters is given. Finally, "3D Searcher", a content-based 3D model retrieval system is developed. By using the methods proposed, the time cost for the system to respond one query online is reduced by 75% on average. The system has been implemented in a manufacturing enterprise, and practical query examples during a case of the automobile rear axle design are also shown. The research method presented shows a new research perspective and can effectively improve the content-based 3D model retrieval efficiency.
文摘An algorithm for retrieving polarimetric variables from numerical model fields is developed. By using this technique, radar reflectivity at horizontal polarization~ differential reflectivity, specific differential phase shift and correlation coefficients between the horizontal and vertical polarization signals at zero lag can be derived from rain, snow and hail contents of numerical model outputs. Effects of environmental temperature and the melting process on polarimetric variables are considered in the algorithm. The algorithm is applied to the Advanced Regional Prediction System (ARPS) model simulation results for a hail storm. The spatial distributions of the derived parameters are reasonable when compared with observational knowledge. This work provides a forward model for assimilation of dual linear polarization radar data into a mesoscale model.
文摘A hybrid model that is based on the Combination of keywords and concept was put forward. The hybrid model is built on vector space model and probabilistic reasoning network. It not only can exert the advantages of keywords retrieval and concept retrieval but also can compensate for their shortcomings. Their parameters can be adjusted according to different usage in order to accept the best information retrieval result, and it has been proved by our experiments.
基金supported by a grant from the Peking University Clinical Medicine Youth Special Fund(PKU20222LCXQ042).
文摘Microdissection testicular sperm extraction(micro-TESE)is widely used to treat nonobstructive azoospermia.However,a good prediction model is required to anticipate a successful sperm retrieval rate before performing micro-TESE.This retrospective study analyzed the clinical records of 200 nonobstructive azoospermia patients between January 2021 and December 2021.The backward method was used to perform binary logistic regression analysis and identify factors that predicted a successful micro-TESE sperm retrieval.The prediction model was constructed using acquired regression coefficients,and its predictive performance was assessed using the receiver operating characteristic curve.In all,67 patients(sperm retrieval rate:33.5%)underwent successful micro-TESE.Follicle-stimulating hormone,anti-Miillerian hormone,and inhibin B levels varied significantly between patients who underwent successful and unsuccessful micro-TESE.Binary logistic regression analysis yielded the following six predictors:anti-Mullerian hormone(odds ratio[OR]=0.902,95%confidence interval[Cl]:0.821-0.990),inhibin B(OR=1.012,95%Cl:1.001-1.024),Klinefelter’s syndrome(OR=0.022,95%Cl:0.002-0.243),Y chromosome microdeletion(OR=0.050,95%Cl:0.005-0.504),cryptorchidism with orchiopexy(OR=0.085,95%Cl:0.008-0.929),and idiopathic nonobstructive azoospermia(OR=0.031,95%Cl:0.003-0.277).The prediction model had an area under the curve of 0.720(95%Cl:0.645-0.794),sensitivity of 65.7%,specificity of 72.2%,Youden index of 0.379,and cut-off value of 0.305 overall,indicating good predictive value and accuracy.This model can assist clinicians and nonobstructive azoospermia patients in decision-making and avoiding negative micro-TESE results.
基金the National Basic Research Program (973) of China (No. 2004CB719401)the National Research Foundation for the Doctoral Program of Higher Education of China (No.20060003060)
文摘In this paper, we present a novel Support Vector Machine active learning algorithm for effective 3D model retrieval using the concept of relevance feedback. The proposed method learns from the most informative objects which are marked by the user, and then creates a boundary separating the relevant models from irrelevant ones. What it needs is only a small number of 3D models labelled by the user. It can grasp the user's semantic knowledge rapidly and accurately. Experimental results showed that the proposed algorithm significantly improves the retrieval effectiveness. Compared with four state-of-the-art query refinement schemes for 3D model retrieval, it provides superior retrieval performance after no more than two rounds of relevance feedback.
文摘A large number of 3D models are created on computers and available for networks. Some content-based retrieval technologies are indispensable to find out particular data from such anonymous datasets. Though several shape retrieval technologies have been developed, little attention has been given to the points on human's sense and impression (as known as Kansei) in the conventional techniques, In this paper, the authors propose a novel method of shape retrieval based on shape impression of human's Kansei. The key to the method is the Gaussian curvature distribution from 3D models as features for shape retrieval. Then it classifies the 3D models by extracted feature and measures similarity among models in storage.
基金Sponsored by the National Natural Science Foundation of China (Grant No. 51075083)
文摘In order to improve the accuracy and efficiency of 3D model retrieval,the method based on affinity propagation clustering algorithm is proposed. Firstly,projection ray-based method is proposed to improve the feature extraction efficiency of 3D models. Based on the relationship between model and its projection,the intersection in 3D space is transformed into intersection in 2D space,which reduces the number of intersection and improves the efficiency of the extraction algorithm. In feature extraction,multi-layer spheres method is analyzed. The two-layer spheres method makes the feature vector more accurate and improves retrieval precision. Secondly,Semi-supervised Affinity Propagation ( S-AP) clustering is utilized because it can be applied to different cluster structures. The S-AP algorithm is adopted to find the center models and then the center model collection is built. During retrieval process,the collection is utilized to classify the query model into corresponding model base and then the most similar model is retrieved in the model base. Finally,75 sample models from Princeton library are selected to do the experiment and then 36 models are used for retrieval test. The results validate that the proposed method outperforms the original method and the retrieval precision and recall ratios are improved effectively.
基金The National Natural Science Foundation of China under contract No.41106152the National Science and Technology Support Program under contract No.2013BAD13B01+3 种基金the National High Technology Research and Development Program(863 Program)of China under contract No.2013AA09A505the International Science and Technology Cooperation Program of China under contract No.2011DFA22260the National High Technology Industrialization Project under contract No.[2012]2083the Marine Public Projects of China under contract Nos 201105032,201305032 and 201105002-07
文摘The geophysical model function (GMF) describes the relationship between a backscattering and a sea surface wind, and enables a wind vector retrieval from backscattering measurements. It is clear that the GMF plays an important role in an ocean wind vector retrieval. The performance of the existing Ku-band model function QSCAT-1 is considered to be effective at low and moderate wind speed ranges. However, in the conditions of higher wind speeds, the existing algorithms diverge alarmingly, owing to the lack of in situ data required for developing the GMF for the high wind conditions, the QSCAT-1 appears to overestimate the a0, which results in underestimating the wind speeds. Several match-up QuikSCAT and special sensor microwave/imager (SSM/I) wind speed measurements of the typhoons occurring in the west Pacific Ocean are analyzed. The results show that the SSM/I wind exhibits better agreement with the "best track" analysis wind speed than the QuikSCAT wind retrieved using QSCAT-1. On the basis of this evaluation, a correction of the QSCAT-1 model function for wind speed above 16 m/s is proposed, which uses the collocated SSM/I and QuikSCAT measurements as a training set, and a neural network approach as a multiple nonlinear regression technologytechnology.In order to validate the revised GMF for high winds, the modified GMF was applied to the QuikSCAT observations of Hurricane IOKE. The wind estimated by the QuikSCAT for Typhoon IOKE in 2006 was improved with the maximum wind speed reaching 55 m/s. An error analysis was performed using the wind fields from the Holland model as the surface truth. The results show an improved agreement with the Holland model wind when compared with the wind estimated using the QSCAT-1. However, large bias still existed, indicating that the effects of rain must be considered for further improvement.
基金Supported by National Natural Science Foundation of China (Grant No.51175287)National Science and Technology Major Project of China (Grant No.2011ZX02403)
文摘CAD model retrieval based on functional semantics is more significant than content-based 3D model retrieval during the mechanical conceptual design phase. However, relevant research is still not fully discussed. Therefore, a functional semantic-based CAD model annotation and retrieval method is proposed to support mechanical conceptual design and design reuse, inspire designer creativity through existing CAD models, shorten design cycle, and reduce costs. Firstly, the CAD model functional semantic ontology is constructed to formally represent the functional semantics of CAD models and describe the mechanical conceptual design space comprehensively and consistently. Secondly, an approach to represent CAD models as attributed adjacency graphs(AAG) is proposed. In this method, the geometry and topology data are extracted from STEP models. On the basis of AAG, the functional semantics of CAD models are annotated semi-automatically by matching CAD models that contain the partial features of which functional semantics have been annotated manually, thereby constructing CAD Model Repository that supports model retrieval based on functional semantics. Thirdly, a CAD model retrieval algorithm that supports multi-function extended retrieval is proposed to explore more potential creative design knowledge in the semantic level. Finally, a prototype system, called Functional Semantic-based CAD Model Annotation and Retrieval System(FSMARS), is implemented. A case demonstrates that FSMARS can successfully botain multiple potential CAD models that conform to the desired function. The proposed research addresses actual needs and presents a new way to acquire CAD models in the mechanical conceptual design phase.
基金Project (No. 2006CB303000) supported in part by the National Basic Research Program (973) of China
文摘This paper explores the application of term dependency in information retrieval (IR) and proposes a novel dependency retrieval model. This retrieval model suggests an extension to the existing language modeling (LM) approach to IR by introducing dependency models for both query and document. Relevance between document and query is then evaluated by reference to the Kullback-Leibler divergence between their dependency models. This paper introduces a novel hybrid dependency structure, which allows integration of various forms of dependency within a single framework. A pseudo relevance feedback based method is also introduced for constructing query dependency model. The basic idea is to use query-relevant top-ranking sentences extracted from the top documents at retrieval time as the augmented representation of query, from which the relationships between query terms are identified. A Markov Random Field (MRF) based approach is presented to ensure the relevance of the extracted sentences, which utilizes the association features between query terms within a sentence to evaluate the relevance of each sentence. This dependency retrieval model was compared with other traditional retrieval models. Experiments indicated that it produces significant improvements in retrieval effectiveness.
基金The Basic Research of COSTIND,China (No.D0420060521)
文摘To reuse and share the valuable knowledge embedded in repositories of engineering models for accelerating the design process, improving product quality, and reducing costs, it is crucial to devise search engines capable of matching 3D models efficiently and effectively. In this paper, an enhanced shape distributions-based technique of using geometrical and topological information to search 3D engineering models represented by polygonal meshes was presented. A simplification method of polygonal meshes was used to simplify engineering model as the pretreatment for generation of sample points. The method of sampling points was improved and a pair of functions that was more sensitive to shape was employed to construct a 2D shape distribution. Experiments were conducted to evaluate the proposed algorithm utilizing the Engineering Shape Benchmark (ESB) database. The experiential results suggest that the search effectiveness is significantly improved by enforcing the simplification and enhanced shape distributions to engineering model retrieval.
基金Supported by the National Natural Science Foundation of China(61370137,61672098,61272361)the Ministry of Education-China Mobile Research Foundation Project(2015/5-9,2016/2-7)
文摘Blog opinion retrieval aims to find blogs with opinionated information related to a given topic.Its main problem is to compute the opinion score,which balances topic relevance and opinion relevance.To deal with this problem a generative model deduced by a Bayesian approach is pro-posed,and an improved mixture model is proposed to estimate the opinion relevance between a blog and a given topic in our retrieval framework.Moreover,pointwise mutual information is used to expand sentiment words for different topics based on a general sentimental lexicon.The correlation between topic and candidate words is applied in the process of both expanding sentiment words and estimating sentence opinion scores.Experimental results show that the proposed approaches improve upon the state-of-the-art opinion retrieval method on TREC2010 dataset.
基金Project supported by the National Natural Science Foundation of China (No.40375019) the Tropical Marine and Meteorology Science Foundation (No.200609) the Jiangsu Key Laboratory of Meteorological Disaster Foundation (No.KLME0507)
文摘Aiming at the difficulty of accurately constructing the dynamic model of subtropical high, based on the potential height field time series over 500 hPa layer of T106 numerical forecast products, by using EOF(empirical orthogonal function) temporal-spatial separation technique, the disassembled EOF time coefficients series were regarded as dynamical model variables, and dynamic system retrieval idea as well as genetic algorithm were introduced to make dynamical model parameters optimization search, then, a reasonable non-linear dynamic model of EOF time-coefficients was established. By dynamic model integral and EOF temporal-spatial components assembly, a mid-/long-term forecast of subtropical high was carried out. The experimental results show that the forecast results of dynamic model are superior to that of general numerical model forecast results. A new modeling idea and forecast technique is presented for diagnosing and forecasting such complicated weathers as subtropical high.
文摘In the context of power generation companies, vast amounts of specialized data and expert knowledge have been accumulated. However, challenges such as data silos and fragmented knowledge hinder the effective utilization of this information. This study proposes a novel framework for intelligent Question-and-Answer (Q&A) systems based on Retrieval-Augmented Generation (RAG) to address these issues. The system efficiently acquires domain-specific knowledge by leveraging external databases, including Relational Databases (RDBs) and graph databases, without additional fine-tuning for Large Language Models (LLMs). Crucially, the framework integrates a Dynamic Knowledge Base Updating Mechanism (DKBUM) and a Weighted Context-Aware Similarity (WCAS) method to enhance retrieval accuracy and mitigate inherent limitations of LLMs, such as hallucinations and lack of specialization. Additionally, the proposed DKBUM dynamically adjusts knowledge weights within the database, ensuring that the most recent and relevant information is utilized, while WCAS refines the alignment between queries and knowledge items by enhanced context understanding. Experimental validation demonstrates that the system can generate timely, accurate, and context-sensitive responses, making it a robust solution for managing complex business logic in specialized industries.
基金supported in part by“MOST”under Grants No.102-2632-E-216-001-MY3 and No.104-2221-E-216-010-MY2
文摘In this paper, we propose a dynamic multi-descriptor fusion (DMDF) approach to improving the retrieval accuracy of 3-dimensional (3D) model retrieval systems. First, an independent retrieval list is generated by using each individual descriptor. Second, we propose an automatic relevant/irrelevant models selection (ARMS) approach to selecting the relevant and irrelevant 3D models automatically without any user interaction. A weighted distance, in which the weight associated with each individual descriptor is learnt by using the selected relevant and irrelevant models, is used to measure the similarity between two 3D models. Furthermore, a descriptor-dependent adaptive query point movement (AQPM) approach is employed to update every feature vector. This set of new feature vectors is used to index 3D models in the next search process. Four 3D model databases are used to compare the retrieval accuracy of our proposed DMDF approach with several descriptors as well as some well-known information fusion methods. Experimental results have shown that our proposed DMDF approach provides a promising retrieval result and always yields the best retrieval accuracy.
基金The National Natural Science Founda-tion of China ( No. 60473004)the Science and ResearchFoundation Program of Henan University of Science and Tech-nology (No.2004ZY041)the Natural and Science FoundationProgram of the Education Department of Henan Province (No.200410464004)
文摘A language model for information retrieval is built by using a query language model to generate queries and a document language model to generate documents. The documents are ranked according to the relative entropies of estimated document language models with respect to the estimated query language model. Two popular and relatively efficient smoothing methods, the Jelinek- Mercer method and the absolute discounting method, are used to smooth the document language model in estimation of the document language, A combined model composed of the feedback document language model and the collection language model is used to estimate the query model. A performacne comparison between the new retrieval method and the existing method with feedback is made, and the retrieval performances of the proposed method with the two different smoothing techniques are evaluated on three Text Retrieval Conference (TREC) data sets. Experimental results show that the method is effective and performs better than the basic language modeling approach; moreover, the method using the Jelinek-Mercer technique performs better than that using the absolute discounting technique, and the perfomance is sensitive to the smoothing peramters.