We present an approach to classify medical text at a sentence level automatically.Given the inherent complexity of medical text classification,we employ adapters based on pre-trained language models to extract informa...We present an approach to classify medical text at a sentence level automatically.Given the inherent complexity of medical text classification,we employ adapters based on pre-trained language models to extract information from medical text,facilitating more accurate classification while minimizing the number of trainable parameters.Extensive experiments conducted on various datasets demonstrate the effectiveness of our approach.展开更多
Medical text representation is crucial for medical natural language processing(NLP)applications.Bidirectional encoder representations from transformers(BERT)has achieved the state-of-the-art performance in general dom...Medical text representation is crucial for medical natural language processing(NLP)applications.Bidirectional encoder representations from transformers(BERT)has achieved the state-of-the-art performance in general domain text representation.However,limited by the design of the pretraining task and the frequency of knowledge occurrence,it lacks understanding of medical knowledge.To overcome these problems,we proposed a selective knowledge extraction and fusion framework to enhance medical text representation.In the knowledge extraction phase,we first designed a semantic importance evaluation metric to extract internal knowledge.We then used large language models(LLMs)to extract external knowledge from systematized nomenclature of medicine clinical term(SNOMED CT).In the knowledge fusion phase,we utilized an attention mechanism and Siamese network to integrate internal knowledge and external knowledge.Extracting knowledge through large language models(LLMs)and integrating it into five different types of BERT models,we achieved significant improvements in the task of pulmonary disease text classification.展开更多
文摘We present an approach to classify medical text at a sentence level automatically.Given the inherent complexity of medical text classification,we employ adapters based on pre-trained language models to extract information from medical text,facilitating more accurate classification while minimizing the number of trainable parameters.Extensive experiments conducted on various datasets demonstrate the effectiveness of our approach.
基金supported by the R&D Program of Beijing Municipal Education Commission(No.KM202310025020)the Project of Cultivation for Young Top-notch Talents of Beijing Municipal Institutions(No.BPHR202203113)the National Natural Science Foundation of China(Nos.62103397 and 82100265).
文摘Medical text representation is crucial for medical natural language processing(NLP)applications.Bidirectional encoder representations from transformers(BERT)has achieved the state-of-the-art performance in general domain text representation.However,limited by the design of the pretraining task and the frequency of knowledge occurrence,it lacks understanding of medical knowledge.To overcome these problems,we proposed a selective knowledge extraction and fusion framework to enhance medical text representation.In the knowledge extraction phase,we first designed a semantic importance evaluation metric to extract internal knowledge.We then used large language models(LLMs)to extract external knowledge from systematized nomenclature of medicine clinical term(SNOMED CT).In the knowledge fusion phase,we utilized an attention mechanism and Siamese network to integrate internal knowledge and external knowledge.Extracting knowledge through large language models(LLMs)and integrating it into five different types of BERT models,we achieved significant improvements in the task of pulmonary disease text classification.