期刊文献+
共找到143篇文章
< 1 2 8 >
每页显示 20 50 100
MORLEY CATEGORICITY THEOREM FOR LATTICE-VALUED MODEL THEORY
1
作者 沈云付 《Chinese Science Bulletin》 SCIE EI CAS 1988年第22期1841-1844,共4页
This report is a continuation of (2—5)We introduce several notions such as Skolem functions and sets of indiscernibles, saturated and atomic models, and stable theories in power in lattice-valued version. On the basi... This report is a continuation of (2—5)We introduce several notions such as Skolem functions and sets of indiscernibles, saturated and atomic models, and stable theories in power in lattice-valued version. On the basis of [2—5] Morley categoricity theorem for finite valued lattice is deduced. 展开更多
关键词 lattice-valued model Skolem EXPANSION STABILITY categoricity
在线阅读 下载PDF
Neural Basis of Categorical Representations of Animal Body Silhouettes
2
作者 Yue Pu Shihui Han 《Neuroscience Bulletin》 2025年第2期211-223,共13页
Neural activities differentiating bodies versus non-body stimuli have been identified in the occipitotemporal cortex of both humans and nonhuman primates.However,the neural mechanisms of coding the similarity of diffe... Neural activities differentiating bodies versus non-body stimuli have been identified in the occipitotemporal cortex of both humans and nonhuman primates.However,the neural mechanisms of coding the similarity of different individuals’bodies of the same species to support their categorical representations remain unclear.Using electroencephalography(EEG)and magnetoencephalography(MEG),we investigated the temporal and spatial characteristics of neural processes shared by different individual body silhouettes of the same species by quantifying the repetition suppression of neural responses to human and animal(chimpanzee,dog,and bird)body silhouettes showing different postures.Our EEG results revealed significant repetition suppression of the amplitudes of early frontal/central activity at 180–220 ms(P2)and late occipitoparietal activity at 220–320 ms(P270)in response to animal(but not human)body silhouettes of the same species.Our MEG results further localized the repetition suppression effect related to animal body silhouettes in the left supramarginal gyrus and left frontal cortex at 200–440 ms after stimulus onset.Our findings suggest two neural processes that are involved in spontaneous categorical representations of animal body silhouettes as a cognitive basis of human-animal interactions. 展开更多
关键词 Body silhouette CATEGORIZATION Repression suppression EEG MEG
原文传递
Surprisal-based algorithm for detecting anomalies in categorical data
3
作者 Ossama Cherkaoui Houda Anoun Abderrahim Maizate 《Data Science and Management》 2025年第2期185-195,共11页
Anomaly detection is an important research area in a diverse range of real-world applications.Although many algorithms have been proposed to address anomaly detection for numerical datasets,categorical and mixed datas... Anomaly detection is an important research area in a diverse range of real-world applications.Although many algorithms have been proposed to address anomaly detection for numerical datasets,categorical and mixed datasets remain a significant challenge,primarily because a natural distance metric is lacking.Consequently,the methods proposed in the literature implement entirely different assumptions regarding the definition of cate-gorical anomalies.This paper presents a novel categorical anomaly detection approach,offering two key con-tributions to existing methods.First,a novel surprisal-based anomaly score is introduced,which provides a more accurate assessment of anomalies by considering the full distribution of categorical values.Second,the proposed method considers complex correlations in the data beyond the pairwise interactions of features.This study proposed and tested the novel categorical surprisal anomaly detection algorithm(CSAD)by comparing and evaluating it against six competitors.The experimental results indicate that CSAD produced the best overall performance,achieving the highest average ROC-AUC and PR-AUC values of 0.8 and 0.443,respectively.Furthermore,CSAD's execution time is satisfactory even when processing large,high-dimensional datasets. 展开更多
关键词 Unsupervised learning Anomaly detection Categorical data Surprisal anomaly score
在线阅读 下载PDF
CARE:Comprehensive Artificial Intelligence Techniques for Reliable Autism Evaluation in Pediatric Care
4
作者 Jihoon Moon Jiyoung Woo 《Computers, Materials & Continua》 2025年第10期1383-1425,共43页
Improving early diagnosis of autism spectrum disorder(ASD)in children increasingly relies on predictive models that are reliable and accessible to non-experts.This study aims to develop such models using Python-based ... Improving early diagnosis of autism spectrum disorder(ASD)in children increasingly relies on predictive models that are reliable and accessible to non-experts.This study aims to develop such models using Python-based tools to improve ASD diagnosis in clinical settings.We performed exploratory data analysis to ensure data quality and identify key patterns in pediatric ASD data.We selected the categorical boosting(CatBoost)algorithm to effectively handle the large number of categorical variables.We used the PyCaret automated machine learning(AutoML)tool to make the models user-friendly for clinicians without extensive machine learning expertise.In addition,we applied Shapley additive explanations(SHAP),an explainable artificial intelligence(XAI)technique,to improve the interpretability of the models.Models developed using CatBoost and other AI algorithms showed high accuracy in diagnosing ASD in children.SHAP provided clear insights into the influence of each variable on diagnostic outcomes,making model decisions transparent and understandable to healthcare professionals.By integrating robust machine learning methods with user-friendly tools such as PyCaret and leveraging XAI techniques such as SHAP,this study contributes to the development of reliable,interpretable,and accessible diagnostic tools for ASD.These advances hold great promise for supporting informed decision-making in clinical settings,ultimately improving early identification and intervention strategies for ASD in the pediatric population.However,the study is limited by the dataset’s demographic imbalance and the lack of external clinical validation,which should be addressed in future research. 展开更多
关键词 Autism spectrum disorder pediatric care exploratory data analysis categorical boosting automated machine learning explainable artificial intelligence Shapley additive explanations
在线阅读 下载PDF
The statusquo and advances in categorization of congenital cataract 被引量:2
5
作者 邹颖诗 李芸倩 刘臻臻 《Eye Science》 2024年第1期56-66,共11页
Congenital cataract(CC)is one of the most common causes of pediatric visual impairment.As our understanding of CC's etiology,clinical manifestations,and pathogenic genes deepens,various CC categorization systems b... Congenital cataract(CC)is one of the most common causes of pediatric visual impairment.As our understanding of CC's etiology,clinical manifestations,and pathogenic genes deepens,various CC categorization systems based on different classification criteria have been proposed.Regrettably,the application of the CC category in clinical practice and scientific research is limited.It is challenging to obtain precise information that could guide the timely treatment decision-making for pediatric cataract patients or predict their prognosis from a specific CC classification.This review aims to discuss the status quo of CC categorization systems and the potential directions for future research in this field,focusing on categorization principles and scientific application in clinical practice.Additionally,it aims to propose the potential directions for future research in this domain. 展开更多
关键词 CONGENITAL CATARACT CATEGORIZATION morphology ETIOLOGY genotype-phenoty
暂未订购
Automatic Rule Discovery for Data Transformation Using Fusion of Diversified Feature Formats
6
作者 G.Sunil Santhosh Kumar M.Rudra Kumar 《Computers, Materials & Continua》 SCIE EI 2024年第7期695-713,共19页
This article presents an innovative approach to automatic rule discovery for data transformation tasks leveraging XGBoost,a machine learning algorithm renowned for its efficiency and performance.The framework proposed... This article presents an innovative approach to automatic rule discovery for data transformation tasks leveraging XGBoost,a machine learning algorithm renowned for its efficiency and performance.The framework proposed herein utilizes the fusion of diversified feature formats,specifically,metadata,textual,and pattern features.The goal is to enhance the system’s ability to discern and generalize transformation rules fromsource to destination formats in varied contexts.Firstly,the article delves into the methodology for extracting these distinct features from raw data and the pre-processing steps undertaken to prepare the data for the model.Subsequent sections expound on the mechanism of feature optimization using Recursive Feature Elimination(RFE)with linear regression,aiming to retain the most contributive features and eliminate redundant or less significant ones.The core of the research revolves around the deployment of the XGBoostmodel for training,using the prepared and optimized feature sets.The article presents a detailed overview of the mathematical model and algorithmic steps behind this procedure.Finally,the process of rule discovery(prediction phase)by the trained XGBoost model is explained,underscoring its role in real-time,automated data transformations.By employingmachine learning and particularly,the XGBoost model in the context of Business Rule Engine(BRE)data transformation,the article underscores a paradigm shift towardsmore scalable,efficient,and less human-dependent data transformation systems.This research opens doors for further exploration into automated rule discovery systems and their applications in various sectors. 展开更多
关键词 XGBoost business rule engine machine learning categorical query language humanitarian computing environment
在线阅读 下载PDF
Multimodal Deep Neural Networks for Digitized Document Classification
7
作者 Aigerim Baimakhanova Ainur Zhumadillayeva +4 位作者 Bigul Mukhametzhanova Natalya Glazyrina Rozamgul Niyazova Nurseit Zhunissov Aizhan Sambetbayeva 《Computer Systems Science & Engineering》 2024年第3期793-811,共19页
As digital technologies have advanced more rapidly,the number of paper documents recently converted into a digital format has exponentially increased.To respond to the urgent need to categorize the growing number of d... As digital technologies have advanced more rapidly,the number of paper documents recently converted into a digital format has exponentially increased.To respond to the urgent need to categorize the growing number of digitized documents,the classification of digitized documents in real time has been identified as the primary goal of our study.A paper classification is the first stage in automating document control and efficient knowledge discovery with no or little human involvement.Artificial intelligence methods such as Deep Learning are now combined with segmentation to study and interpret those traits,which were not conceivable ten years ago.Deep learning aids in comprehending input patterns so that object classes may be predicted.The segmentation process divides the input image into separate segments for a more thorough image study.This study proposes a deep learning-enabled framework for automated document classification,which can be implemented in higher education.To further this goal,a dataset was developed that includes seven categories:Diplomas,Personal documents,Journal of Accounting of higher education diplomas,Service letters,Orders,Production orders,and Student orders.Subsequently,a deep learning model based on Conv2D layers is proposed for the document classification process.In the final part of this research,the proposed model is evaluated and compared with other machine-learning techniques.The results demonstrate that the proposed deep learning model shows high results in document categorization overtaking the other machine learning models by reaching 94.84%,94.79%,94.62%,94.43%,94.07%in accuracy,precision,recall,F-score,and AUC-ROC,respectively.The achieved results prove that the proposed deep model is acceptable to use in practice as an assistant to an office worker. 展开更多
关键词 Document categorization deep learning machine learning CLASSIFICATION DIGITIZATION
在线阅读 下载PDF
A survey of fine-grained visual categorization based on deep learning
8
作者 XIE Yuxiang GONG Quanzhi +2 位作者 LUAN Xidao YAN Jie ZHANG Jiahui 《Journal of Systems Engineering and Electronics》 CSCD 2024年第6期1337-1356,共20页
Deep learning has achieved excellent results in various tasks in the field of computer vision,especially in fine-grained visual categorization.It aims to distinguish the subordinate categories of the label-level categ... Deep learning has achieved excellent results in various tasks in the field of computer vision,especially in fine-grained visual categorization.It aims to distinguish the subordinate categories of the label-level categories.Due to high intra-class variances and high inter-class similarity,the fine-grained visual categorization is extremely challenging.This paper first briefly introduces and analyzes the related public datasets.After that,some of the latest methods are reviewed.Based on the feature types,the feature processing methods,and the overall structure used in the model,we divide them into three types of methods:methods based on general convolutional neural network(CNN)and strong supervision of parts,methods based on single feature processing,and meth-ods based on multiple feature processing.Most methods of the first type have a relatively simple structure,which is the result of the initial research.The methods of the other two types include models that have special structures and training processes,which are helpful to obtain discriminative features.We conduct a specific analysis on several methods with high accuracy on pub-lic datasets.In addition,we support that the focus of the future research is to solve the demand of existing methods for the large amount of the data and the computing power.In terms of tech-nology,the extraction of the subtle feature information with the burgeoning vision transformer(ViT)network is also an important research direction. 展开更多
关键词 deep learning fine-grained visual categorization convolutional neural network(CNN) visual attention
在线阅读 下载PDF
Ableism and (Neo)Racism in School Placement Processes in Quebec: School Personnel Interpretations of Immigrant Student Difficulties - A Secondary Publication
9
作者 Tya Collins Corina Borri-Anadon 《Journal of Contemporary Educational Research》 2024年第3期148-160,共13页
The school placement processes of students from immigrant backgrounds considered to be in“difficulty”is an international concern at the intersection of works relating to special education and those concerning the sc... The school placement processes of students from immigrant backgrounds considered to be in“difficulty”is an international concern at the intersection of works relating to special education and those concerning the school experiences of students from immigrant backgrounds or racialized groups.The research problem of this article concerns the identification of these students as disabled or as having adjustment or learning difficulties.From a perspective anchored in Disability Critical Race Studies,this ethnographic study documents different interpretations of perceived difficulties made by school actors with regard to seven primary school students from immigrant backgrounds.Five interpretation types are presented:(1)medicalization by dismissal of cultural markers,(2)medicalization by professional constraint,(3)medicalization by cultural deficit,(4)precautionary wait,and(5)cultural differentialism.Our results help to shed light on the special education overrepresentation phenomenon regarding these students and to understand how ableism and(neo)racism contribute to it. 展开更多
关键词 Categorization in education Learning difficulties and students in difficulty Immigration and ethnicity Educational inclusion and exclusion Canada
在线阅读 下载PDF
Smart Approaches to Efficient Text Mining for Categorizing Sexual Reproductive Health Short Messages into Key Themes
10
作者 Tobias Makai Mayumbo Nyirenda 《Open Journal of Applied Sciences》 2024年第2期511-532,共22页
To promote behavioral change among adolescents in Zambia, the National HIV/AIDS/STI/TB Council, in collaboration with UNICEF, developed the Zambia U-Report platform. This platform provides young people with improved a... To promote behavioral change among adolescents in Zambia, the National HIV/AIDS/STI/TB Council, in collaboration with UNICEF, developed the Zambia U-Report platform. This platform provides young people with improved access to information on various Sexual Reproductive Health topics through Short Messaging Service (SMS) messages. Over the years, the platform has accumulated millions of incoming and outgoing messages, which need to be categorized into key thematic areas for better tracking of sexual reproductive health knowledge gaps among young people. The current manual categorization process of these text messages is inefficient and time-consuming and this study aims to automate the process for improved analysis using text-mining techniques. Firstly, the study investigates the current text message categorization process and identifies a list of categories adopted by counselors over time which are then used to build and train a categorization model. Secondly, the study presents a proof of concept tool that automates the categorization of U-report messages into key thematic areas using the developed categorization model. Finally, it compares the performance and effectiveness of the developed proof of concept tool against the manual system. The study used a dataset comprising 206,625 text messages. The current process would take roughly 2.82 years to categorise this dataset whereas the trained SVM model would require only 6.4 minutes while achieving an accuracy of 70.4% demonstrating that the automated method is significantly faster, more scalable, and consistent when compared to the current manual categorization. These advantages make the SVM model a more efficient and effective tool for categorizing large unstructured text datasets. These results and the proof-of-concept tool developed demonstrate the potential for enhancing the efficiency and accuracy of message categorization on the Zambia U-report platform and other similar text messages-based platforms. 展开更多
关键词 Knowledge Discovery in Text (KDT) Sexual Reproductive Health (SRH) Text Categorization Text Classification Text Extraction Text Mining Feature Extraction Automated Classification Process Performance Stemming and Lemmatization Natural Language Processing (NLP)
在线阅读 下载PDF
某三级甲等专科医院推进互联网分级诊疗的思考 被引量:13
11
作者 谢诗蓉 叶卿云 +4 位作者 王晨颖 陈琼洲 史晓诞 丁晓璟 叶正强 《中国卫生资源》 CSCD 北大核心 2023年第4期393-396,403,共5页
复旦大学附属眼耳鼻喉科医院在发展互联网医院的基础上,积极探索建立互联网分级诊疗。与全国6省(市)11个地区的16家医院达成互联网分级诊疗战略合作,利用专科医院优势,通过远程会诊、远程教学、科普直播等多项举措共同推进互联网分级诊... 复旦大学附属眼耳鼻喉科医院在发展互联网医院的基础上,积极探索建立互联网分级诊疗。与全国6省(市)11个地区的16家医院达成互联网分级诊疗战略合作,利用专科医院优势,通过远程会诊、远程教学、科普直播等多项举措共同推进互联网分级诊疗。在推进互联网分级诊疗的过程中遇到了远程需求较少、药品配送困难、精准转诊率低、费用结算不明、利益分配不均等难点。应积极尝试从加强医疗联合体合作关系、增加医疗联合体药品目录、加强基层培训、完善医疗联合体利益分配和构建当地眼健康档案等方面进一步完善互联网分级诊疗。 展开更多
关键词 专科医院specialist hospital 互联网医院internet hospital 分级诊疗categorized treatment
暂未订购
Text categorization based on fuzzy classification rules tree 被引量:2
12
作者 郭玉琴 袁方 刘海博 《Journal of Southeast University(English Edition)》 EI CAS 2008年第3期339-342,共4页
To deal with the problem that arises when the conventional fuzzy class-association method applies repetitive scans of the classifier to classify new texts,which has low efficiency, a new approach based on the FCR-tree... To deal with the problem that arises when the conventional fuzzy class-association method applies repetitive scans of the classifier to classify new texts,which has low efficiency, a new approach based on the FCR-tree(fuzzy classification rules tree)for text categorization is proposed.The compactness of the FCR-tree saves significant space in storing a large set of rules when there are many repeated words in the rules.In comparison with classification rules,the fuzzy classification rules contain not only words,but also the fuzzy sets corresponding to the frequencies of words appearing in texts.Therefore,the construction of an FCR-tree and its structure are different from a CR-tree.To debase the difficulty of FCR-tree construction and rules retrieval,more k-FCR-trees are built.When classifying a new text,it is not necessary to search the paths of the sub-trees led by those words not appearing in this text,thus reducing the number of traveling rules.Experimental results show that the proposed approach obviously outperforms the conventional method in efficiency. 展开更多
关键词 text categorization fuzzy classification association rule classification rules tree fuzzy classification rules tree
在线阅读 下载PDF
认知过程的翻译理论研究 被引量:6
13
作者 李平 《上海翻译》 CSSCI 北大核心 1999年第4期14-15,共2页
The recent developments of cognitive theories may provide a better interpretation for studies of translation rather than a description.The paper tries to put categorization and metaphor into the process of translating... The recent developments of cognitive theories may provide a better interpretation for studies of translation rather than a description.The paper tries to put categorization and metaphor into the process of translating and translators’ psychology so as to produce a more powerful interpretation. [ 展开更多
关键词 COGNITION INTERPRETATION CATEGORIZATION METAPHOR
在线阅读 下载PDF
Underdetermination, Multiplicity, and Mathematical Logic
14
作者 Salim Rashid 《Journal of Philosophy Study》 2013年第2期113-122,共10页
Whether a collection of scientific data can be explained only by a unique theory or whether such data can be equally explained by multiple theories is one of the more contested issues in the history and philosophy of ... Whether a collection of scientific data can be explained only by a unique theory or whether such data can be equally explained by multiple theories is one of the more contested issues in the history and philosophy of science. This paper argues that the case for multiple explanations is strengthened by the widespread failure of models in mathematical logic to be unique, i.e., categorical. Science is taken to require replicable and explicit public knowledge; this necessitates an unambiguous language for its transmission. Mathematics has been chosen as the vehicle to transmit scientific knowledge, both because of its "unreasonable effectiveness" and because of its unambiguous nature, hence the vogue of axiomatic systems. But mathematical logic tells us that axiomatic systems need not refer to uniquely defined real structures. Hence what is accepted as science may be only one of several possibilities. 展开更多
关键词 UNDERDETERMINATION LOGIC categoricity "saving the appearances"
在线阅读 下载PDF
A Novel Active Learning Method Using SVM for Text Classification 被引量:26
15
作者 Mohamed Goudjil Mouloud Koudil +1 位作者 Mouldi Bedda Noureddine Ghoggali 《International Journal of Automation and computing》 EI CSCD 2018年第3期290-298,共9页
Support vector machines(SVMs) are a popular class of supervised learning algorithms, and are particularly applicable to large and high-dimensional classification problems. Like most machine learning methods for data... Support vector machines(SVMs) are a popular class of supervised learning algorithms, and are particularly applicable to large and high-dimensional classification problems. Like most machine learning methods for data classification and information retrieval, they require manually labeled data samples in the training stage. However, manual labeling is a time consuming and errorprone task. One possible solution to this issue is to exploit the large number of unlabeled samples that are easily accessible via the internet. This paper presents a novel active learning method for text categorization. The main objective of active learning is to reduce the labeling effort, without compromising the accuracy of classification, by intelligently selecting which samples should be labeled.The proposed method selects a batch of informative samples using the posterior probabilities provided by a set of multi-class SVM classifiers, and these samples are then manually labeled by an expert. Experimental results indicate that the proposed active learning method significantly reduces the labeling effort, while simultaneously enhancing the classification accuracy. 展开更多
关键词 Text categorization active learning support vector machine (SVM) pool-based active learning pairwise coupling.
原文传递
Pseudogenes:Pseudo or Real Functional Elements? 被引量:8
16
作者 Wen Li Wei Yang Xiu-Jie Wang 《Journal of Genetics and Genomics》 SCIE CAS CSCD 2013年第4期171-177,共7页
Pseudogenes are genomic remnants of ancient protein-coding genes which have lost their coding potentials through evolution.Although broadly existed,pseudogenes used to be considered as junk or relics of genomes which ... Pseudogenes are genomic remnants of ancient protein-coding genes which have lost their coding potentials through evolution.Although broadly existed,pseudogenes used to be considered as junk or relics of genomes which have not drawn enough attentions of biologists until recent years.With the broad applications of high-throughput experimental techniques,growing lines of evidence have strongly suggested that some pseudogenes possess special functions,including regulating parental gene expression and participating in the regulation of many biological processes.In this review,we summarize some basic features of pseudogenes and their functions in regulating development and diseases.All of these observations indicate that pseudogenes are not purely dead fossils of genomes,but warrant further exploration in their distribution,expression regulation and functions.A new nomenclature is desirable for the currently called 'pseudogenes' to better describe their functions. 展开更多
关键词 PSEUDOGENE CATEGORIZATION ORIGINATION Function
原文传递
A New Approach of Feature Selection for Text Categorization 被引量:6
17
作者 CUI Zifeng XU Baowen +1 位作者 ZHANG Weifeng XU Junling 《Wuhan University Journal of Natural Sciences》 CAS 2006年第5期1335-1339,共5页
This paper proposes a new approach of feature selection based on the independent measure between features for text categorization. A fundamental hypothesis that occurrence of the terms in documents is independent of e... This paper proposes a new approach of feature selection based on the independent measure between features for text categorization. A fundamental hypothesis that occurrence of the terms in documents is independent of each other, widely used in the probabilistic models for text categorization (TC), is discussed. However, the basic hypothesis is incom plete for independence of feature set. From the view of feature selection, a new independent measure between features is designed, by which a feature selection algorithm is given to ob rain a feature subset. The selected subset is high in relevance with category and strong in independence between features, satisfies the basic hypothesis at maximum degree. Compared with other traditional feature selection method in TC (which is only taken into the relevance account), the performance of feature subset selected by our method is prior to others with experiments on the benchmark dataset of 20 Newsgroups. 展开更多
关键词 feature selection independency CHI square test text categorization
在线阅读 下载PDF
Comparison of Text Categorization Algorithms 被引量:4
18
作者 SHIYong-feng ZHAOYan-ping 《Wuhan University Journal of Natural Sciences》 EI CAS 2004年第5期798-804,共7页
This paper summarizes several automatic text categorization algorithms in common use recently, analyzes and compares their advantages and disadvantages. It provides clues for making use of appropriate automatic classi... This paper summarizes several automatic text categorization algorithms in common use recently, analyzes and compares their advantages and disadvantages. It provides clues for making use of appropriate automatic classifying algorithms in different fields. Finally some evaluations and summaries of these algorithms are discussed, and directions to further research have been pointed out. Key words text categorization - naive bayes - KNN - SVM - neural network CLC number TP 391 Foundation item: Supported by the National Natural Science Foundation of China (70031010) and the Research Foundation of Beijing Institute of TechnologyBiography: SHI Yong-feng (1980-), male, Master candidate, research direction: web information mining. 展开更多
关键词 text categorization naive bayes KNN SVM neural network
在线阅读 下载PDF
Protocol Format Extraction Based on an Improved CFSM Algorithm 被引量:4
19
作者 Peihong Lin Zheng Hong +2 位作者 Lifa Wu Yihao Li Zhenji Zhou 《China Communications》 SCIE CSCD 2020年第11期156-180,共25页
As the information technology rapidly develops,many network applications appear and their communication protocols are unknown.Although many protocol keyword recognition based protocol reverse engineering methods have ... As the information technology rapidly develops,many network applications appear and their communication protocols are unknown.Although many protocol keyword recognition based protocol reverse engineering methods have been proposed,most of the keyword recognition algorithms are time consuming.This paper firstly uses the traffic clustering method F-DBSCAN to cluster the unknown protocol traffic.Then an improved CFSM(Closed Frequent Sequence Mining)algorithm is used to mine closed frequent sequences from the messages and identify protocol keywords.Finally,CFGM(Closed Frequent Group Mining)algorithm is proposed to explore the parallel,sequential and hierarchical relations between the protocol keywords and obtain accurate protocol message formats.Experimental results show that the proposed protocol formats extraction method is better than Apriori algorithm and Sequence alignment algorithm in terms of time complexity and it can achieve high keyword recognition accuracy.Additionally,based on the relations between the keywords,the method can obtain accurate protocol formats.Compared with the protocol formats obtained from the existing methods,our protocol format can better grasp the overall structure of target protocols and the results perform better in the application of protocol reverse engineering such as fuzzing test. 展开更多
关键词 flow clustering CFSM algorithm closed frequent sequences keyword recognition CFGM algorithm keyword relations format categorization
在线阅读 下载PDF
上一页 1 2 8 下一页 到第
使用帮助 返回顶部