Student performance prediction helps the educational stakeholders to take proactive decisions and make interventions,for the improvement of quality of education and to meet the dynamic needs of society.The selection o...Student performance prediction helps the educational stakeholders to take proactive decisions and make interventions,for the improvement of quality of education and to meet the dynamic needs of society.The selection of features for student’s performance prediction not only plays significant role in increasing prediction accuracy,but also helps in building the strategic plans for the improvement of students’academic performance.There are different feature selection algorithms for predicting the performance of students,however the studies reported in the literature claim that there are different pros and cons of existing feature selection algorithms in selection of optimal features.In this paper,a hybrid feature selection framework(using feature-fusion)is designed to identify the significant features and associated features with target class,to predict the performance of students.The main goal of the proposed hybrid feature selection is not only to improve the prediction accuracy,but also to identify optimal features for building productive strategies for the improvement in students’academic performance.The key difference between proposed hybrid feature selection framework and existing hybrid feature selection framework,is two level feature fusion technique,with the utilization of cosine-based fusion.Whereas,according to the results reported in existing literature,cosine similarity is considered as the best similarity measure among existing similarity measures.The proposed hybrid feature selection is validated on four benchmark datasets with variations in number of features and number of instances.The validated results confirm that the proposed hybrid feature selection framework performs better than the existing hybrid feature selection framework,existing feature selection algorithms in terms of accuracy,f-measure,recall,and precision.Results reported in presented paper show that the proposed approach gives more than 90%accuracy on benchmark dataset that is better than the results of existing approach.展开更多
Every application in a smart city environment like the smart grid,health monitoring, security, and surveillance generates non-stationary datastreams. Due to such nature, the statistical properties of data changes over...Every application in a smart city environment like the smart grid,health monitoring, security, and surveillance generates non-stationary datastreams. Due to such nature, the statistical properties of data changes overtime, leading to class imbalance and concept drift issues. Both these issuescause model performance degradation. Most of the current work has beenfocused on developing an ensemble strategy by training a new classifier on thelatest data to resolve the issue. These techniques suffer while training the newclassifier if the data is imbalanced. Also, the class imbalance ratio may changegreatly from one input stream to another, making the problem more complex.The existing solutions proposed for addressing the combined issue of classimbalance and concept drift are lacking in understating of correlation of oneproblem with the other. This work studies the association between conceptdrift and class imbalance ratio and then demonstrates how changes in classimbalance ratio along with concept drift affect the classifier’s performance.We analyzed the effect of both the issues on minority and majority classesindividually. To do this, we conducted experiments on benchmark datasetsusing state-of-the-art classifiers especially designed for data stream classification.Precision, recall, F1 score, and geometric mean were used to measure theperformance. Our findings show that when both class imbalance and conceptdrift problems occur together the performance can decrease up to 15%. Ourresults also show that the increase in the imbalance ratio can cause a 10% to15% decrease in the precision scores of both minority and majority classes.The study findings may help in designing intelligent and adaptive solutionsthat can cope with the challenges of non-stationary data streams like conceptdrift and class imbalance.展开更多
One of the most pressing concerns for the consumer market is the detection of adulteration in meat products due to their preciousness.The rapid and accurate identification mechanism for lard adulteration in meat produ...One of the most pressing concerns for the consumer market is the detection of adulteration in meat products due to their preciousness.The rapid and accurate identification mechanism for lard adulteration in meat products is highly necessary,for developing a mechanism trusted by consumers and that can be used to make a definitive diagnosis.Fourier Transform Infrared Spectroscopy(FTIR)is used in this work to identify lard adulteration in cow,lamb,and chicken samples.A simplified extraction method was implied to obtain the lipids from pure and adulterated meat.Adulterated samples were obtained by mixing lard with chicken,lamb,and beef with different concentrations(10%–50%v/v).Principal component analysis(PCA)and partial least square(PLS)were used to develop a calibration model at 800–3500 cm^(−1).Three-dimension PCA was successfully used by dividing the spectrum in three regions to classify lard meat adulteration in chicken,lamb,and beef samples.The corresponding FTIR peaks for the lard have been observed at 1159.6,1743.4,2853.1,and 2922.5 cm−1,which differentiate chicken,lamb,and beef samples.The wavenumbers offer the highest determination coefficient R2 value of 0.846 and lowest root mean square error of calibration(RMSEC)and root mean square error prediction(RMSEP)with an accuracy of 84.6%.Even the tiniest fat adulteration up to 10%can be reliably discovered using this methodology.展开更多
Long Range Wide Area Network (LoRaWAN) in the Internet ofThings (IoT) domain has been the subject of interest for researchers. Thereis an increasing demand to localize these IoT devices using LoRaWAN dueto the quickly...Long Range Wide Area Network (LoRaWAN) in the Internet ofThings (IoT) domain has been the subject of interest for researchers. Thereis an increasing demand to localize these IoT devices using LoRaWAN dueto the quickly growing number of IoT devices. LoRaWAN is well suited tosupport localization applications in IoTs due to its low power consumptionand long range. Multiple approaches have been proposed to solve the localizationproblem using LoRaWAN. The Expected Signal Power (ESP) basedtrilateration algorithm has the significant potential for localization becauseESP can identify the signal’s energy below the noise floor with no additionalhardware requirements and ease of implementation. This research articleoffers the technical evaluation of the trilateration technique, its efficiency,and its limitations for the localization using LoRa ESP in a large outdoorpopulated campus environment. Additionally, experimental evaluations areconducted to determine the effects of frequency hopping, outlier removal, andincreasing the number of gateways on localization accuracy. Results obtainedfrom the experiment show the importance of calculating the path loss exponentfor every frequency to circumvent the high localization error because ofthe frequency hopping, thus improving the localization performance withoutthe need of using only a single frequency.展开更多
文摘Student performance prediction helps the educational stakeholders to take proactive decisions and make interventions,for the improvement of quality of education and to meet the dynamic needs of society.The selection of features for student’s performance prediction not only plays significant role in increasing prediction accuracy,but also helps in building the strategic plans for the improvement of students’academic performance.There are different feature selection algorithms for predicting the performance of students,however the studies reported in the literature claim that there are different pros and cons of existing feature selection algorithms in selection of optimal features.In this paper,a hybrid feature selection framework(using feature-fusion)is designed to identify the significant features and associated features with target class,to predict the performance of students.The main goal of the proposed hybrid feature selection is not only to improve the prediction accuracy,but also to identify optimal features for building productive strategies for the improvement in students’academic performance.The key difference between proposed hybrid feature selection framework and existing hybrid feature selection framework,is two level feature fusion technique,with the utilization of cosine-based fusion.Whereas,according to the results reported in existing literature,cosine similarity is considered as the best similarity measure among existing similarity measures.The proposed hybrid feature selection is validated on four benchmark datasets with variations in number of features and number of instances.The validated results confirm that the proposed hybrid feature selection framework performs better than the existing hybrid feature selection framework,existing feature selection algorithms in terms of accuracy,f-measure,recall,and precision.Results reported in presented paper show that the proposed approach gives more than 90%accuracy on benchmark dataset that is better than the results of existing approach.
基金The authors would like to extend their gratitude to Universiti Teknologi PETRONAS (Malaysia)for funding this research through grant number (015LA0-037).
文摘Every application in a smart city environment like the smart grid,health monitoring, security, and surveillance generates non-stationary datastreams. Due to such nature, the statistical properties of data changes overtime, leading to class imbalance and concept drift issues. Both these issuescause model performance degradation. Most of the current work has beenfocused on developing an ensemble strategy by training a new classifier on thelatest data to resolve the issue. These techniques suffer while training the newclassifier if the data is imbalanced. Also, the class imbalance ratio may changegreatly from one input stream to another, making the problem more complex.The existing solutions proposed for addressing the combined issue of classimbalance and concept drift are lacking in understating of correlation of oneproblem with the other. This work studies the association between conceptdrift and class imbalance ratio and then demonstrates how changes in classimbalance ratio along with concept drift affect the classifier’s performance.We analyzed the effect of both the issues on minority and majority classesindividually. To do this, we conducted experiments on benchmark datasetsusing state-of-the-art classifiers especially designed for data stream classification.Precision, recall, F1 score, and geometric mean were used to measure theperformance. Our findings show that when both class imbalance and conceptdrift problems occur together the performance can decrease up to 15%. Ourresults also show that the increase in the imbalance ratio can cause a 10% to15% decrease in the precision scores of both minority and majority classes.The study findings may help in designing intelligent and adaptive solutionsthat can cope with the challenges of non-stationary data streams like conceptdrift and class imbalance.
文摘One of the most pressing concerns for the consumer market is the detection of adulteration in meat products due to their preciousness.The rapid and accurate identification mechanism for lard adulteration in meat products is highly necessary,for developing a mechanism trusted by consumers and that can be used to make a definitive diagnosis.Fourier Transform Infrared Spectroscopy(FTIR)is used in this work to identify lard adulteration in cow,lamb,and chicken samples.A simplified extraction method was implied to obtain the lipids from pure and adulterated meat.Adulterated samples were obtained by mixing lard with chicken,lamb,and beef with different concentrations(10%–50%v/v).Principal component analysis(PCA)and partial least square(PLS)were used to develop a calibration model at 800–3500 cm^(−1).Three-dimension PCA was successfully used by dividing the spectrum in three regions to classify lard meat adulteration in chicken,lamb,and beef samples.The corresponding FTIR peaks for the lard have been observed at 1159.6,1743.4,2853.1,and 2922.5 cm−1,which differentiate chicken,lamb,and beef samples.The wavenumbers offer the highest determination coefficient R2 value of 0.846 and lowest root mean square error of calibration(RMSEC)and root mean square error prediction(RMSEP)with an accuracy of 84.6%.Even the tiniest fat adulteration up to 10%can be reliably discovered using this methodology.
基金the ADEK Award for Research Excellence (AARE19-245)2019.
文摘Long Range Wide Area Network (LoRaWAN) in the Internet ofThings (IoT) domain has been the subject of interest for researchers. Thereis an increasing demand to localize these IoT devices using LoRaWAN dueto the quickly growing number of IoT devices. LoRaWAN is well suited tosupport localization applications in IoTs due to its low power consumptionand long range. Multiple approaches have been proposed to solve the localizationproblem using LoRaWAN. The Expected Signal Power (ESP) basedtrilateration algorithm has the significant potential for localization becauseESP can identify the signal’s energy below the noise floor with no additionalhardware requirements and ease of implementation. This research articleoffers the technical evaluation of the trilateration technique, its efficiency,and its limitations for the localization using LoRa ESP in a large outdoorpopulated campus environment. Additionally, experimental evaluations areconducted to determine the effects of frequency hopping, outlier removal, andincreasing the number of gateways on localization accuracy. Results obtainedfrom the experiment show the importance of calculating the path loss exponentfor every frequency to circumvent the high localization error because ofthe frequency hopping, thus improving the localization performance withoutthe need of using only a single frequency.