期刊文献+
共找到8篇文章
< 1 >
每页显示 20 50 100
Heteroscedastic Laplace mixture of experts regression models and applications
1
作者 WU Liu-cang ZHANG Shu-yu LI Shuang-shuang 《Applied Mathematics(A Journal of Chinese Universities)》 SCIE CSCD 2021年第1期60-69,共10页
Mixture of Experts(MoE)regression models are widely studied in statistics and machine learning for modeling heterogeneity in data for regression,clustering and classification.Laplace distribution is one of the most im... Mixture of Experts(MoE)regression models are widely studied in statistics and machine learning for modeling heterogeneity in data for regression,clustering and classification.Laplace distribution is one of the most important statistical tools to analyze thick and tail data.Laplace Mixture of Linear Experts(LMoLE)regression models are based on the Laplace distribution which is more robust.Similar to modelling variance parameter in a homogeneous population,we propose and study a new novel class of models:heteroscedastic Laplace mixture of experts regression models to analyze the heteroscedastic data coming from a heterogeneous population in this paper.The issues of maximum likelihood estimation are addressed.In particular,Minorization-Maximization(MM)algorithm for estimating the regression parameters is developed.Properties of the estimators of the regression coefficients are evaluated through Monte Carlo simulations.Results from the analysis of two real data sets are presented. 展开更多
关键词 mixture of experts regression models heteroscedastic mixture of experts regression models Laplace distribution MM algorithm
在线阅读 下载PDF
GoM-ICD:Automatic ICD Coding with Gap Schemes and Mixture of Experts
2
作者 Yifan Wu Weiyan Qiu +3 位作者 Min Zeng Xi Chen Min Li Hongtao Zhu 《Big Data Mining and Analytics》 2025年第6期1211-1224,共14页
Assigning standardized International Classification of Disease(ICD)codes to Electronic Medical Records(EMR)is crucial for enhancing the efficiency and accuracy of medical coding processes.However,existing methods face... Assigning standardized International Classification of Disease(ICD)codes to Electronic Medical Records(EMR)is crucial for enhancing the efficiency and accuracy of medical coding processes.However,existing methods face challenges in effectively capturing,integrating,and amalgamating specialized medical knowledge from complex textual data.In this study,we propose GoM-ICD,an advanced automatic ICD coding framework that integrates multiple gap schemes with a Mixture of Experts(MoE)architecture.GoM-ICD is designed to address the extreme multilabel text classification in ICD coding.It segments and reassembles text to facilitate seamless information exchange across different chunks,employing various segmentation methods derived from different gap schemes.Then the model-level MoE consolidates the predictions of these methods to enhance the prediction performance.Specifically,the segmented text is input to a Pretrained Language Model(PLM)to extract textual features.Next,a Bidirectional Long Short-Term Memory network(BiLSTM)is employed to capture long-term contextual information from the textual features.Finally,a text-level MoE,followed by a label-level MoE,enables precise attention matching between text and labels,thereby improving the fidelity of the coding process.The three levels of MoE leverage the collective insights of diverse expert models,effectively processing multi-dimensional text features and unifying model-level insights from various gap schemes.Extensive experimental results demonstrate that GoM-ICD achieves the state-of-the-art performance in automatic ICD coding tasks,reaching micro-F1 of 0.617,0.620,and 0.613 on datasets MIMIC III full,MIMIC-III clean,and MIMIC-IV ICD-10,respectively.The source code can be obtained from https://github.com/CSUBioGroup/GoM-ICD. 展开更多
关键词 automatic International Classification of Disease(ICD)coding mixture of experts(MoE) multi-label text classification Electronic Medical Record(EMR)
原文传递
A Latent Entity-Document Class Mixture of Experts Model for Cumulative Citation Recommendation 被引量:2
3
作者 Lerong Ma Lejian Liao +1 位作者 DANDan Song Jingang Wang 《Tsinghua Science and Technology》 SCIE EI CAS CSCD 2018年第6期660-670,共11页
Knowledge Bases (KBs) are valuable resources of human knowledge which contribute to many applications. However, since they are manually maintained, there is a big lag between their contents and the upto-date informa... Knowledge Bases (KBs) are valuable resources of human knowledge which contribute to many applications. However, since they are manually maintained, there is a big lag between their contents and the upto-date information of entities. Considering a target entity in KBs, this paper investigates how Cumulative Citation Recommendation (CCR) can be used to effectively detect its worthy-citation documents in large volumes of stream data. Most global relevant models only consider semantic and temporat features of entity-document instances, which does not sufficiently exploit prior knowledge underlying entity-document instances. To tackle this problem, we present a Mixture of Experts (ME) model by introducing a latent layer to capture relationships between the entity-document instances and their latent class information. An extensive set of experiments was conducted on TREC-KBA-2013 dataset. The results show that the model can significantly achieve a better performance gain compared to state-of-the-art models in CCR. 展开更多
关键词 knowledge base acceleration cumulative citation recommendation mixture of experts (ME) LatentEntity-Document Classes (LEDCs)
原文传递
Rapid optimal control law generation: an MoE based method 被引量:1
4
作者 ZHANG Tengfei SU Hua +2 位作者 GONG Chunlin YANG Sizhi BAI Shaobo 《Journal of Systems Engineering and Electronics》 2025年第1期280-291,共12页
To better complete various missions, it is necessary to plan an optimal trajectory or provide the optimal control law for the multirole missile according to the actual situation, including launch conditions and target... To better complete various missions, it is necessary to plan an optimal trajectory or provide the optimal control law for the multirole missile according to the actual situation, including launch conditions and target location. Since trajectory optimization struggles to meet real-time requirements, the emergence of data-based generation methods has become a significant focus in contemporary research. However, due to the large differences in the characteristics of the optimal control laws caused by the diversity of tasks, it is difficult to achieve good prediction results by modeling all data with one single model.Therefore, the modeling idea of the mixture of experts(MoE) is adopted. Firstly, the K-means clustering algorithm is used to partition the sample data set, and the corresponding neural network classification model is established as the gate switch of MoE. Then, the expert models, i.e., the mappings from the generation conditions to the optimal control law represented by the results of principal component analysis(PCA), are represented by Kriging models. Finally, multiple rounds of accuracy evaluation, sample supplementation, and model updating are conducted to improve the generation accuracy. The Monte Carlo simulation shows that the accuracy of the proposed model reaches 96% and the generation efficiency meets the real-time requirement. 展开更多
关键词 optimal control mixture of experts(MoE) K-MEANS Kriging model neural network classification principal component analysis(PCA)
在线阅读 下载PDF
Large language models in clinical psychiatry:Applications and optimization strategies
5
作者 Yi-Fan Wang Ming-Da Li +4 位作者 Su-Hong Wang Yin Fang Jie Sun Lin Lu Wei Yan 《World Journal of Psychiatry》 2025年第11期90-100,共11页
Psychiatric disorders constitute a complex health issue,primarily manifesting as significant disturbances in cognition,emotional regulation,and behavior.However,due to limited resources within health care systems,only... Psychiatric disorders constitute a complex health issue,primarily manifesting as significant disturbances in cognition,emotional regulation,and behavior.However,due to limited resources within health care systems,only a minority of patients can access effective treatment and care services,highlighting an urgent need for improvement.large language models(LLMs),with their natural language understanding and generation capabilities,are gradually penetrating the entire process of psychiatric diagnosis and treatment,including outpatient reception,diagnosis and therapy,clinical nursing,medication safety,and prognosis follow-up.They hold promise for improving the current severe shortage of health system resources and promoting equal access to mental health care.This article reviews the application scenarios and research progress of LLMs.It explores optimization methods for LLMs in psychiatry.Based on the research findings,we propose a clinical LLM for mental health using the Mixture of Experts framework to improve the accuracy of psychiatric diagnosis and therapeutic interventions. 展开更多
关键词 Large language models Clinical psychiatry mixture of experts Mental health Research progress
在线阅读 下载PDF
Adaptive Multi-modal Fusion Instance Segmentation for CAEVs in Complex Conditions:Dataset,Framework and Verifications 被引量:3
6
作者 Pai Peng Keke Geng +3 位作者 Guodong Yin Yanbo Lu Weichao Zhuang Shuaipeng Liu 《Chinese Journal of Mechanical Engineering》 SCIE EI CAS CSCD 2021年第5期96-106,共11页
Current works of environmental perception for connected autonomous electrified vehicles(CAEVs)mainly focus on the object detection task in good weather and illumination conditions,they often perform poorly in adverse ... Current works of environmental perception for connected autonomous electrified vehicles(CAEVs)mainly focus on the object detection task in good weather and illumination conditions,they often perform poorly in adverse scenarios and have a vague scene parsing ability.This paper aims to develop an end-to-end sharpening mixture of experts(SMoE)fusion framework to improve the robustness and accuracy of the perception systems for CAEVs in complex illumination and weather conditions.Three original contributions make our work distinctive from the existing relevant literature.The Complex KITTI dataset is introduced which consists of 7481 pairs of modified KITTI RGB images and the generated LiDAR dense depth maps,and this dataset is fine annotated in instance-level with the proposed semi-automatic annotation method.The SMoE fusion approach is devised to adaptively learn the robust kernels from complementary modalities.Comprehensive comparative experiments are implemented,and the results show that the proposed SMoE framework yield significant improvements over the other fusion techniques in adverse environmental conditions.This research proposes a SMoE fusion framework to improve the scene parsing ability of the perception systems for CAEVs in adverse conditions. 展开更多
关键词 Connected autonomous electrified vehicles Multi-modal fusion Semi-automatic annotation Sharpening mixture of experts Comparative experiments
在线阅读 下载PDF
Improving Multi-task GNNs for Molecular Property Prediction via Missing Label Imputation
7
作者 Fenyu Hu Dingshuo Chen +1 位作者 Qiang Liu Shu Wu 《Machine Intelligence Research》 2025年第1期131-144,共14页
The prediction of molecular properties is a fundamental task in the field of drug discovery.Recently,graph neural networks(GNNs)have been gaining prominence in this area.Since a molecule tends to have multiple correla... The prediction of molecular properties is a fundamental task in the field of drug discovery.Recently,graph neural networks(GNNs)have been gaining prominence in this area.Since a molecule tends to have multiple correlated properties,there is a great need to develop the multi-task learning ability of GNNs.However,limited by expensive and time-consuming human annotations,collecting complete labels for each task is difficult.As a result,most existing benchmarks involve many missing labels in training data,and the performance of GNNs is impaired due to the lack of sufficient supervision information.To overcome this obstacle,we propose to improve multi-task molecular property prediction by missing label imputation.Specifically,a bipartite graph is first introduced to model the molecule-task co-occurrence relationships.Then,the imputation of missing labels is transformed into predicting missing edges on this bipartite graph.To predict the missing edges,a graph neural network is devised,which can learn the complex molecule-task co-occurrence relationships.After that,we select reliable pseudo labels according to the uncertainty of the prediction results.Boosting with enough and reliable supervision information,our approach achieves state-of-the-art performance on a variety of real-world datasets. 展开更多
关键词 Graph classification imbalance learning prediction bias mixture of experts multiview representations
原文传递
Space-time video super-resolution using long-term temporal feature aggregation
8
作者 Kuanhao Chen Zijie Yue Miaojing Shi 《Autonomous Intelligent Systems》 EI 2023年第1期75-83,共9页
Space-time video super-resolution(STVSR)serves the purpose to reconstruct high-resolution high-frame-rate videos from their low-resolution low-frame-rate counterparts.Recent approaches utilize end-to-end deep learning... Space-time video super-resolution(STVSR)serves the purpose to reconstruct high-resolution high-frame-rate videos from their low-resolution low-frame-rate counterparts.Recent approaches utilize end-to-end deep learning models to achieve STVSR.They first interpolate intermediate frame features between given frames,then perform local and global refinement among the feature sequence,and finally increase the spatial resolutions of these features.However,in the most important feature interpolation phase,they only capture spatial-temporal information from the most adjacent frame features,ignoring modelling long-term spatial-temporal correlations between multiple neighbouring frames to restore variable-speed object movements and maintain long-term motion continuity.In this paper,we propose a novel long-term temporal feature aggregation network(LTFA-Net)for STVSR.Specifically,we design a long-term mixture of experts(LTMoE)module for feature interpolation.LTMoE contains multiple experts to extract mutual and complementary spatial-temporal information from multiple consecutive adjacent frame features,which are then combined with different weights to obtain interpolation results using several gating nets.Next,we perform local and global feature refinement using the Locally-temporal Feature Comparison(LFC)module and bidirectional deformable ConvLSTM layer,respectively.Experimental results on two standard benchmarks,Adobe240 and GoPro,indicate the effectiveness and superiority of our approach over state of the art. 展开更多
关键词 Space-time video super-resolution mixture of experts Deformable convolutional layer Long-term temporal feature aggregation
原文传递
上一页 1 下一页 到第
使用帮助 返回顶部