In the field of electronic record management,especially in the current big data environment,data continuity has become a new topic that is as important as security and needs to be studied.This paper decomposes the dat...In the field of electronic record management,especially in the current big data environment,data continuity has become a new topic that is as important as security and needs to be studied.This paper decomposes the data continuity guarantee of electronic record into a set of data protection requirements consisting of data relevance,traceability and comprehensibility,and proposes to use the associated data technology to provide an integrated guarantee mechanism to meet the above three requirements.展开更多
With the explosive growth of data available, there is an urgent need to develop continuous data mining which reduces manual interaction evidently. A novel model for data mining is proposed in evolving environment. Fir...With the explosive growth of data available, there is an urgent need to develop continuous data mining which reduces manual interaction evidently. A novel model for data mining is proposed in evolving environment. First, some valid mining task schedules are generated, and then au tonomous and local mining are executed periodically, finally, previous results are merged and refined. The framework based on the model creates a communication mechanism to in corporate domain knowledge into continuous process through ontology service. The local and merge mining are transparent to the end user and heterogeneous data ,source by ontology. Experiments suggest that the framework should be useful in guiding the continuous mining process.展开更多
Detecting sophisticated cyberattacks,mainly Distributed Denial of Service(DDoS)attacks,with unexpected patterns remains challenging in modern networks.Traditional detection systems often struggle to mitigate such atta...Detecting sophisticated cyberattacks,mainly Distributed Denial of Service(DDoS)attacks,with unexpected patterns remains challenging in modern networks.Traditional detection systems often struggle to mitigate such attacks in conventional and software-defined networking(SDN)environments.While Machine Learning(ML)models can distinguish between benign and malicious traffic,their limited feature scope hinders the detection of new zero-day or low-rate DDoS attacks requiring frequent retraining.In this paper,we propose a novel DDoS detection framework that combines Machine Learning(ML)and Ensemble Learning(EL)techniques to improve DDoS attack detection and mitigation in SDN environments.Our model leverages the“DDoS SDN”dataset for training and evaluation and employs a dynamic feature selection mechanism that enhances detection accuracy by focusing on the most relevant features.This adaptive approach addresses the limitations of conventional ML models and provides more accurate detection of various DDoS attack scenarios.Our proposed ensemble model introduces an additional layer of detection,increasing reliability through the innovative application of ensemble techniques.The proposed solution significantly enhances the model’s ability to identify and respond to dynamic threats in SDNs.It provides a strong foundation for proactive DDoS detection and mitigation,enhancing network defenses against evolving threats.Our comprehensive runtime analysis of Simultaneous Multi-Threading(SMT)on identical configurations shows superior accuracy and efficiency,with significantly reduced computational time,making it ideal for real-time DDoS detection in dynamic,rapidly changing SDNs.Experimental results demonstrate that our model achieves outstanding performance,outperforming traditional algorithms with 99%accuracy using Random Forest(RF)and K-Nearest Neighbors(KNN)and 98%accuracy using XGBoost.展开更多
We study continuous data assimilation(CDA)applied to projection and penalty methods for the Navier-Stokes(NS)equations.Penalty and projection methods are more efficient than consistent Ns discretizations,however are l...We study continuous data assimilation(CDA)applied to projection and penalty methods for the Navier-Stokes(NS)equations.Penalty and projection methods are more efficient than consistent Ns discretizations,however are less accurate due to modeling error(penalty)and splitting error(projection).We show analytically and numerically that with measurement data and properly chosen parameters,CDA can effectively remove these splitting and modeling errors and provide long time optimally accurate solutions.展开更多
This paper is dedicated to the expansion of the framework of general interpolant observables introduced by Azouani,Olson,and Titi for continuous data assimilation of nonlinear partial differential equations.The main f...This paper is dedicated to the expansion of the framework of general interpolant observables introduced by Azouani,Olson,and Titi for continuous data assimilation of nonlinear partial differential equations.The main feature of this expanded framework is its mesh-free aspect,which allows the observational data itself to dictate the subdivision of the domain via partition of unity in the spirit of the so-called Partition of Unity Method by Babuska and Melenk.As an application of this framework,we consider a nudging-based scheme for data assimilation applied to the context of the two-dimensional Navier-Stokes equations as a paradigmatic example and establish convergence to the reference solution in all higher-order Sobolev topologies in a periodic,mean-free setting.The convergence analysis also makes use of absorbing ball bounds in higherorder Sobolev norms,for which explicit bounds appear to be available in the literature only up to H^(2);such bounds are additionally proved for all integer levels of Sobolev regularity above H^(2).展开更多
This paper focuses on the problem of detecting the geographical cluster with the most severe status in multiple groups of population given limited medical resources.Populations are grouped based on characteristics suc...This paper focuses on the problem of detecting the geographical cluster with the most severe status in multiple groups of population given limited medical resources.Populations are grouped based on characteristics such as age,gender,and race.In the early stages of a disease,an outbreak may only present in specific population groups.Therefore,to efficiently detect the outbreak,we are particularly interested in monitoring and evaluating such groups.We define the objective of detection as the most severe cluster(MSC).Taking into account the interactions between population groups,a multivariate normal scan statistic is proposed to simultaneously determine the location and size of a significant MSC,as well as the specific population groups in which the MSC is located.The proposed method is applied to an example of lung cancer in New York State,where the MSC with the highest mortality rate at the aggregate level is detected.Further,the detection capacity of this method is evaluated using a simulation study based on the lung cancer example.展开更多
基金This work is supported by the NSFC(61772280)the national training programs of innovation and entrepreneurship for undergraduates(Nos.201910300123Y,202010300200)the PAPD fund from NUIST.
文摘In the field of electronic record management,especially in the current big data environment,data continuity has become a new topic that is as important as security and needs to be studied.This paper decomposes the data continuity guarantee of electronic record into a set of data protection requirements consisting of data relevance,traceability and comprehensibility,and proposes to use the associated data technology to provide an integrated guarantee mechanism to meet the above three requirements.
基金Supported by the National Natural Science Foun-dation of China (60173058 ,70372024)
文摘With the explosive growth of data available, there is an urgent need to develop continuous data mining which reduces manual interaction evidently. A novel model for data mining is proposed in evolving environment. First, some valid mining task schedules are generated, and then au tonomous and local mining are executed periodically, finally, previous results are merged and refined. The framework based on the model creates a communication mechanism to in corporate domain knowledge into continuous process through ontology service. The local and merge mining are transparent to the end user and heterogeneous data ,source by ontology. Experiments suggest that the framework should be useful in guiding the continuous mining process.
文摘Detecting sophisticated cyberattacks,mainly Distributed Denial of Service(DDoS)attacks,with unexpected patterns remains challenging in modern networks.Traditional detection systems often struggle to mitigate such attacks in conventional and software-defined networking(SDN)environments.While Machine Learning(ML)models can distinguish between benign and malicious traffic,their limited feature scope hinders the detection of new zero-day or low-rate DDoS attacks requiring frequent retraining.In this paper,we propose a novel DDoS detection framework that combines Machine Learning(ML)and Ensemble Learning(EL)techniques to improve DDoS attack detection and mitigation in SDN environments.Our model leverages the“DDoS SDN”dataset for training and evaluation and employs a dynamic feature selection mechanism that enhances detection accuracy by focusing on the most relevant features.This adaptive approach addresses the limitations of conventional ML models and provides more accurate detection of various DDoS attack scenarios.Our proposed ensemble model introduces an additional layer of detection,increasing reliability through the innovative application of ensemble techniques.The proposed solution significantly enhances the model’s ability to identify and respond to dynamic threats in SDNs.It provides a strong foundation for proactive DDoS detection and mitigation,enhancing network defenses against evolving threats.Our comprehensive runtime analysis of Simultaneous Multi-Threading(SMT)on identical configurations shows superior accuracy and efficiency,with significantly reduced computational time,making it ideal for real-time DDoS detection in dynamic,rapidly changing SDNs.Experimental results demonstrate that our model achieves outstanding performance,outperforming traditional algorithms with 99%accuracy using Random Forest(RF)and K-Nearest Neighbors(KNN)and 98%accuracy using XGBoost.
文摘We study continuous data assimilation(CDA)applied to projection and penalty methods for the Navier-Stokes(NS)equations.Penalty and projection methods are more efficient than consistent Ns discretizations,however are less accurate due to modeling error(penalty)and splitting error(projection).We show analytically and numerically that with measurement data and properly chosen parameters,CDA can effectively remove these splitting and modeling errors and provide long time optimally accurate solutions.
基金partially supported by the award PSC-CUNY64335-0052,jointly funded by The Professional Staff Congress and The City University of New York。
文摘This paper is dedicated to the expansion of the framework of general interpolant observables introduced by Azouani,Olson,and Titi for continuous data assimilation of nonlinear partial differential equations.The main feature of this expanded framework is its mesh-free aspect,which allows the observational data itself to dictate the subdivision of the domain via partition of unity in the spirit of the so-called Partition of Unity Method by Babuska and Melenk.As an application of this framework,we consider a nudging-based scheme for data assimilation applied to the context of the two-dimensional Navier-Stokes equations as a paradigmatic example and establish convergence to the reference solution in all higher-order Sobolev topologies in a periodic,mean-free setting.The convergence analysis also makes use of absorbing ball bounds in higherorder Sobolev norms,for which explicit bounds appear to be available in the literature only up to H^(2);such bounds are additionally proved for all integer levels of Sobolev regularity above H^(2).
基金This work is supported by National Science Foundation of China[grant number 71172131 and 71325003]Ministry of Education of China[grant number NCET11-0321]Shanghai Pujiang Programme。
文摘This paper focuses on the problem of detecting the geographical cluster with the most severe status in multiple groups of population given limited medical resources.Populations are grouped based on characteristics such as age,gender,and race.In the early stages of a disease,an outbreak may only present in specific population groups.Therefore,to efficiently detect the outbreak,we are particularly interested in monitoring and evaluating such groups.We define the objective of detection as the most severe cluster(MSC).Taking into account the interactions between population groups,a multivariate normal scan statistic is proposed to simultaneously determine the location and size of a significant MSC,as well as the specific population groups in which the MSC is located.The proposed method is applied to an example of lung cancer in New York State,where the MSC with the highest mortality rate at the aggregate level is detected.Further,the detection capacity of this method is evaluated using a simulation study based on the lung cancer example.