Abstract: This paper considers the problem of applying data mining techniques to the aeronautical field. The truncation method, one of the techniques in aeronautical data mining, can be used to handle air-combat behavior data efficiently. A technique for air-combat behavior data mining based on the truncation method is proposed to discover air-combat rules or patterns. A simulation platform for air-combat behavior data mining that supports two fighters is implemented. The simulation results show that the proposed technique is feasible in terms of both efficiency and effectiveness.
Funding: Supported in part by the Education Reform Key Projects of Heilongjiang Province (Grant Nos. SJGZ20220011, SJGZ20220012) and the Excellent Project of the Ministry of Education and China Higher Education Association on Digital Ideological and Political Education in Universities (Grant No. GXSZSZJPXM001).
Abstract: This paper proposes a quality evaluation model for software talent cultivation based on multivariate data fusion. The model constructs a comprehensive ability and quality evaluation index system for college students from the perspective of engineering courses, especially software engineering. As for the evaluation method, we rely on students' behavioral data from their school years and aim to make the evaluation model as objective as possible, reducing the negative impact of personal subjective assumptions on the evaluation results.
Funding: The authors thank the Deanship of Scientific Research at Shaqra University for funding this research work through Project Number SU-ANN-2023017.
Abstract: E-learning behavior data captures students' activities on an e-learning platform, such as the number of accesses to a set of resources and the number of participants in lectures. This article proposes a new analytics system that supports the academic evaluation of students via their e-learning activities, to overcome the challenges faced by traditional learning environments. The proposed e-learning analytics system includes a new deep forest model. It consists of multistage cascade random forests and needs fewer hyperparameters than traditional deep neural networks. The developed forest model can analyze each student's activities on an e-learning platform to predict the student's performance before the end of the semester and/or the final exam. Experiments were conducted on the Open University Learning Analytics Dataset (OULAD) of 32,593 students. The proposed deep model achieved a competitive accuracy of 98.0% compared with artificial intelligence-based models from previous studies, such as the Convolutional Neural Network (CNN) and Long Short-Term Memory (LSTM). This allows academic advisors to support students who are expected to fail and to improve their academic level at the right time. Consequently, the proposed analytics system can enhance the quality of educational services for students in an innovative e-learning framework.
Funding: Supported by the National Natural Science Foundation of China (Grant No. 71203163) and the Foundation for Humanities and Social Sciences of the Chinese Ministry of Education (Grant No. 12YJC870011).
Abstract: Purpose: The goal of our research is to suggest specific Web metrics that are useful for evaluating and improving user navigation experience on informational websites. Design/methodology/approach: We revised metrics in a Web forensic framework proposed in the literature and defined the metrics of footprint, track and movement. Data were obtained from user clickstreams provided by a real estate site's administrators. Data analysis had two phases: the first examined navigation behavior based on user footprints and tracks, and the second examined navigational transition patterns based on user movements. Findings: Preliminary results suggest that the apartment pages were heavily trafficked, while the agent pages and related information pages were largely underused. Navigation within the same category of pages was prevalent, especially when users navigated among the regional apartment listings. However, navigation of these pages was found to be inefficient. Research limitations: The suggestions for navigation design optimization provided in the paper are specific to this website, and their applicability to other online environments needs to be verified. Preference predictions or personal recommendations are not made at the current stage of research. Practical implications: Our clickstream data analysis results offer a base for future research. Meanwhile, website administrators and managers can make better use of the readily available clickstream data to evaluate the effectiveness and efficiency of their site navigation design. Originality/value: Our empirical study is valuable to those seeking analysis metrics for evaluating and improving user navigation experience on informational websites based on clickstream data. Our attempts to analyze the log file in terms of footprint, track and movement will enrich the use of such trace data and lead to a deeper understanding of users' within-site navigation behavior.
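The abstract above does not define footprint, track or movement precisely, so the following is only a minimal sketch under a stated assumption: a "movement" is taken to be a transition between the page categories of two consecutive clicks within one session. The function names, the category labels and the sample sessions are all hypothetical and are not taken from the paper.

```python
from collections import Counter


def count_movements(sessions):
    """Count transitions between consecutive page categories.

    Assumption (not from the paper): each session is an ordered list of
    page-category labels, and a movement is one consecutive pair.
    """
    counts = Counter()
    for pages in sessions:
        for src, dst in zip(pages, pages[1:]):
            counts[(src, dst)] += 1
    return counts


def same_category_share(counts):
    """Return the fraction of movements that stay within one category."""
    total = sum(counts.values())
    if total == 0:
        return 0.0
    same = sum(n for (src, dst), n in counts.items() if src == dst)
    return same / total


# Hypothetical clickstream sessions, invented for illustration only.
sessions = [
    ["home", "apartment", "apartment", "agent"],
    ["apartment", "apartment", "info"],
]
movements = count_movements(sessions)
share = same_category_share(movements)
```

A high same-category share in such a table would correspond to the kind of within-category navigation the findings describe, although the paper's own metrics may be computed differently.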
Funding: Sponsored by the National Natural Science Foundation of China (72371032) and the National Key Research and Development Program of China (2023YFC3305401).
Abstract: In the financial sector, alternatives to traditional datasets, such as financial statements and Securities and Exchange Commission filings, can provide additional ways to describe the running status of businesses. Nontraditional data sources include individual behaviors, business processes, and various sensors. In recent years, alternative data have been leveraged by businesses and investors to adjust credit scores, mitigate financial fraud, and optimize investment portfolios because they can be used to conduct more in-depth, comprehensive, and timely evaluations of enterprises. Adopting alternative data in developing models for finance and business scenarios has become increasingly popular in academia. In this article, we first identify the advantages of alternative data compared with traditional data, such as having multiple sources, heterogeneity, flexibility, objectivity, and constant evolution. We then survey emerging studies to outline the various types, emerging applications, and effects of alternative data in finance and business by reviewing over 100 papers published from 2015 to 2023. The survey is organized by application scenario, including business return prediction, business risk management, credit evaluation, investment risk prediction, and stock prediction. We discuss the roles of alternative data from the perspective of finance theory to argue that alternative data have the potential to serve as a bridge toward achieving high efficiency in financial markets. The challenges and future trends of alternative data in finance and business are also discussed.
Abstract: This paper presents a driver behavior analysis using microscopic video data measures including vehicle speed, lane-changing ratio, and time to collision. An analytical framework was developed to evaluate the effect of adverse winter weather conditions on highway driving behavior based on automated (computer) and manual methods. The research was conducted through two case studies. The first case study evaluated the feasibility of applying an automated approach to extracting driver behavior data, based on 15 video recordings obtained in winter 2013 at three different locations on the Don Valley Parkway in Toronto, Canada. The automated approach was compared with the manual approach, and issues in collecting data with the automated approach under winter conditions were identified. The second case study was based on high-quality data collected in winter 2014 at a location on Highway 25 in Montreal, Canada. The results demonstrate the effectiveness of the automated analytical framework in analyzing driver behavior, as well as in evaluating the impact of adverse winter weather conditions on driver behavior. This approach could be applied to evaluate winter maintenance strategies and crash risk on highways during adverse winter weather conditions.
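As an illustration of one of the measures named above, time to collision is commonly computed as the gap between two vehicles divided by their closing speed. The abstract does not say how the authors computed it, so the sketch below uses that common car-following definition as an assumption; the function name, parameters and units are hypothetical.

```python
import math


def time_to_collision(gap_m, follower_speed_mps, leader_speed_mps):
    """Return the time to collision in seconds for two vehicles in one lane.

    Assumption (not from the paper): gap divided by closing speed, where
    the closing speed is follower speed minus leader speed. If the
    follower is not closing in, no collision is projected and the result
    is infinity.
    """
    closing_speed = follower_speed_mps - leader_speed_mps
    if closing_speed <= 0:
        return math.inf
    return gap_m / closing_speed


# Hypothetical values: a 30 m gap closing at 5 m/s.
ttc_closing = time_to_collision(30.0, 25.0, 20.0)
# The follower is slower than the leader, so the gap is opening.
ttc_opening = time_to_collision(30.0, 20.0, 25.0)
```

Smaller finite values indicate a more critical interaction, which is why this measure is often used as a surrogate for crash risk.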
Funding: The Major Project of the National Social Science Foundation of China (No. 16ZDA048), the Shanghai Municipal Natural Science Foundation, China (No. 17ZR1445500), and the Humanities and Social Science Research Project of the Ministry of Education, China (No. 15YJCZH148).
Abstract: Because the travel purpose of non-occupied taxis is to find new passengers rather than to arrive at a destination, route choice behavior differs greatly between occupied and non-occupied taxis. With the assistance of a geographic information system (GIS) and taxi-based floating car data (FCD), this paper investigates the behavioral differences between occupied and non-occupied taxi drivers with the same origin and destination. Descriptive statistical indexes from the FCD in Shenzhen, China are explored to identify the route choice characteristics of occupied and non-occupied taxis. Then, a conditional logit model is proposed to model the quantitative relationship between drivers' route choice and the related significant variables. Attributes of the variables related to non-occupied taxis' observed routes are compared with those of occupied ones. The results indicate that, compared with their counterparts, non-occupied taxi drivers generally pay more attention to choosing arterial roads and avoiding congested segments. They are also found to be less sensitive to fewer traffic lights and shorter travel time. Findings from this research can help improve urban road network planning and traffic management.
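In a conditional logit model, the probability of choosing route i from a choice set is exp(V_i) divided by the sum of exp(V_j) over all alternatives, where V is a linear utility of the route attributes. The sketch below shows only that formula; the attribute names and coefficient values are hypothetical and are not the estimates reported in the paper.

```python
import math


def route_choice_probabilities(routes, coefficients):
    """Return conditional logit choice probabilities for candidate routes.

    Each route is a dict of attribute values, and its utility is the sum
    of coefficient * attribute. The maximum utility is subtracted before
    exponentiating for numerical stability, which leaves the
    probabilities unchanged.
    """
    utilities = [
        sum(coefficients[name] * value for name, value in route.items())
        for route in routes
    ]
    max_utility = max(utilities)
    weights = [math.exp(u - max_utility) for u in utilities]
    total = sum(weights)
    return [w / total for w in weights]


# Hypothetical coefficients, invented for illustration only.
coefficients = {
    "travel_time_min": -0.1,
    "traffic_lights": -0.2,
    "arterial_share": 1.0,
}
# Two hypothetical routes that differ only in travel time.
routes = [
    {"travel_time_min": 20, "traffic_lights": 5, "arterial_share": 0.5},
    {"travel_time_min": 25, "traffic_lights": 5, "arterial_share": 0.5},
]
probs = route_choice_probabilities(routes, coefficients)
```

Estimating separate coefficient sets for occupied and non-occupied trips, and comparing their magnitudes, is one way to express the difference in sensitivity that the abstract describes.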
Abstract: Rural intersections account for around 30% of crashes in rural areas and 6% of all fatal crashes, representing a significant but poorly understood safety problem. Crashes at rural intersections are also problematic because high speeds on intersection approaches can exacerbate the impact of a crash. Additionally, rural areas are often underserved by emergency medical services (EMS), which can further contribute to negative crash outcomes. This paper describes an analysis of driver stopping behavior at rural T-intersections using the SHRP 2 Naturalistic Driving Study data. Type of stop was used as a safety surrogate measure, comparing full/rolling stops with non-stops. Time series traces were obtained for 157 drivers at 87 unique intersections, resulting in 1277 samples at the stop-controlled approach for T-intersections. Roadway characteristics (i.e., number of lanes, presence of skew, speed limit, presence of stop bar or other traffic control devices), driver characteristics (age, gender, speeding), and environmental characteristics (time of day, presence of rain) were reduced and included as independent variables. Results of a logistic regression model indicated that drivers were less likely to stop during the nighttime. However, the presence of intersection lighting increased the likelihood of full/rolling stops. The presence of intersection skew was shown to negatively impact stopping behavior. Additionally, drivers who were traveling over the posted speed limit upstream of the intersection approach were less likely to stop at the approach stop sign.
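A logistic regression model of this kind maps a linear combination of indicator variables to a stop probability through the logistic function. The sketch below is illustrative only: the coefficient values are hypothetical and chosen merely to match the signs of the effects reported in the abstract, not the magnitudes estimated in the paper.

```python
import math

# Hypothetical coefficients: only the signs follow the abstract
# (night, skew and speeding reduce stopping; lighting increases it).
HYPOTHETICAL_COEFFS = {
    "intercept": 1.0,
    "night": -0.5,
    "lighting": 0.4,
    "skew": -0.3,
    "speeding": -0.6,
}


def stop_probability(features, coeffs=HYPOTHETICAL_COEFFS):
    """Return the modeled probability of a full/rolling stop.

    `features` maps indicator names to 0 or 1; omitted indicators are
    treated as 0. The log-odds are passed through the logistic function.
    """
    log_odds = coeffs["intercept"] + sum(
        coeffs[name] * value for name, value in features.items()
    )
    return 1.0 / (1.0 + math.exp(-log_odds))


p_day = stop_probability({})
p_night = stop_probability({"night": 1})
p_night_lit = stop_probability({"night": 1, "lighting": 1})
```

With these made-up values, the nighttime probability is lower than the daytime one, and adding lighting recovers part of the difference, mirroring the direction of the reported results.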