Measles,an infectious disease caused by the measles virus,remains a significant public health concern worldwide due to its highly contagious nature and potential for severe complications[1].In addition to symptoms suc...Measles,an infectious disease caused by the measles virus,remains a significant public health concern worldwide due to its highly contagious nature and potential for severe complications[1].In addition to symptoms such as high fever,cough,Koplik spots,and rash,measles can lead to serious complications including pneumonia and myocarditis,particularly in vulnerable populations such as young children[1,2].展开更多
Social media has revolutionized the dissemination of real-life information,serving as a robust platform for sharing life events.Twitter,characterized by its brevity and continuous flow of posts,has emerged as a crucia...Social media has revolutionized the dissemination of real-life information,serving as a robust platform for sharing life events.Twitter,characterized by its brevity and continuous flow of posts,has emerged as a crucial source for public health surveillance,offering valuable insights into public reactions during the COVID-19 pandemic.This study aims to leverage a range of machine learning techniques to extract pivotal themes and facilitate text classification on a dataset of COVID-19 outbreak-related tweets.Diverse topic modeling approaches have been employed to extract pertinent themes and subsequently form a dataset for training text classification models.An assessment of coherence metrics revealed that the Gibbs Sampling Dirichlet Mixture Model(GSDMM),which utilizes trigram and bag-of-words(BOW)feature extraction,outperformed Non-negative Matrix Factorization(NMF),Latent Dirichlet Allocation(LDA),and a hybrid strategy involving Bidirectional Encoder Representations from Transformers(BERT)combined with LDA and K-means to pinpoint significant themes within the dataset.Among the models assessed for text clustering,the utilization of LDA,either as a clustering model or for feature extraction combined with BERT for K-means,resulted in higher coherence scores,consistent with human ratings,signifying their efficacy.In particular,LDA,notably in conjunction with trigram representation and BOW,demonstrated superior performance.This underscores the suitability of LDA for conducting topic modeling,given its proficiency in capturing intricate textual relationships.In the context of text classification,models such as Linear Support Vector Classification(LSVC),Long Short-Term Memory(LSTM),Bidirectional Long Short-Term Memory(BiLSTM),Convolutional Neural Network with BiLSTM(CNN-BiLSTM),and BERT have shown outstanding performance,achieving accuracy and weighted F1-Score scores exceeding 80%.These results significantly surpassed other models,such as Multinomial Naive Bayes(MNB),Linear Support Vector Machine(LSVM),and Logistic Regression(LR),which achieved scores in the range of 60 to 70 percent.展开更多
Although the 2022 cryptocurrency market crash prompted despair among investors,the rallying cry,“wagmi”(We’re all gonna make it.)emerged among cryptocurrency enthusiasts in the aftermath.Did cryptocurrency enthusia...Although the 2022 cryptocurrency market crash prompted despair among investors,the rallying cry,“wagmi”(We’re all gonna make it.)emerged among cryptocurrency enthusiasts in the aftermath.Did cryptocurrency enthusiasts respond to this crash differently compared to traditional investors?Using natural language processing techniques applied to Twitter data,this study employed a difference-in-differences method to determine whether the cryptocurrency market crash had a differential effect on investor sentiment toward cryptocurrency enthusiasts relative to more traditional investors.The results indicate that the crash affected investor sentiment among cryptocurrency enthusiastic investors differently from traditional investors.In particular,cryptocurrency enthusiasts’tweets became more neutral and,surprisingly,less negative.This result appears to be primarily driven by a deliberate,collectivist effort to promote positivity within the cryptocurrency community(“wagmi”).Considering the more nuanced emotional content of tweets,it appears that cryptocurrency enthusiasts expressed less joy and surprise in the aftermath of the cryptocurrency crash than traditional investors.Moreover,cryptocurrency enthusiasts tweeted more frequently after the cryptocurrency crash,with a relative increase in tweet frequency of approximately one tweet per day.An analysis of the specific textual content of tweets provides evidence of herding behavior among cryptocurrency enthusiasts.展开更多
文摘Measles,an infectious disease caused by the measles virus,remains a significant public health concern worldwide due to its highly contagious nature and potential for severe complications[1].In addition to symptoms such as high fever,cough,Koplik spots,and rash,measles can lead to serious complications including pneumonia and myocarditis,particularly in vulnerable populations such as young children[1,2].
文摘Social media has revolutionized the dissemination of real-life information,serving as a robust platform for sharing life events.Twitter,characterized by its brevity and continuous flow of posts,has emerged as a crucial source for public health surveillance,offering valuable insights into public reactions during the COVID-19 pandemic.This study aims to leverage a range of machine learning techniques to extract pivotal themes and facilitate text classification on a dataset of COVID-19 outbreak-related tweets.Diverse topic modeling approaches have been employed to extract pertinent themes and subsequently form a dataset for training text classification models.An assessment of coherence metrics revealed that the Gibbs Sampling Dirichlet Mixture Model(GSDMM),which utilizes trigram and bag-of-words(BOW)feature extraction,outperformed Non-negative Matrix Factorization(NMF),Latent Dirichlet Allocation(LDA),and a hybrid strategy involving Bidirectional Encoder Representations from Transformers(BERT)combined with LDA and K-means to pinpoint significant themes within the dataset.Among the models assessed for text clustering,the utilization of LDA,either as a clustering model or for feature extraction combined with BERT for K-means,resulted in higher coherence scores,consistent with human ratings,signifying their efficacy.In particular,LDA,notably in conjunction with trigram representation and BOW,demonstrated superior performance.This underscores the suitability of LDA for conducting topic modeling,given its proficiency in capturing intricate textual relationships.In the context of text classification,models such as Linear Support Vector Classification(LSVC),Long Short-Term Memory(LSTM),Bidirectional Long Short-Term Memory(BiLSTM),Convolutional Neural Network with BiLSTM(CNN-BiLSTM),and BERT have shown outstanding performance,achieving accuracy and weighted F1-Score scores exceeding 80%.These results significantly surpassed other models,such as Multinomial Naive Bayes(MNB),Linear Support Vector Machine(LSVM),and Logistic Regression(LR),which achieved scores in the range of 60 to 70 percent.
文摘Although the 2022 cryptocurrency market crash prompted despair among investors,the rallying cry,“wagmi”(We’re all gonna make it.)emerged among cryptocurrency enthusiasts in the aftermath.Did cryptocurrency enthusiasts respond to this crash differently compared to traditional investors?Using natural language processing techniques applied to Twitter data,this study employed a difference-in-differences method to determine whether the cryptocurrency market crash had a differential effect on investor sentiment toward cryptocurrency enthusiasts relative to more traditional investors.The results indicate that the crash affected investor sentiment among cryptocurrency enthusiastic investors differently from traditional investors.In particular,cryptocurrency enthusiasts’tweets became more neutral and,surprisingly,less negative.This result appears to be primarily driven by a deliberate,collectivist effort to promote positivity within the cryptocurrency community(“wagmi”).Considering the more nuanced emotional content of tweets,it appears that cryptocurrency enthusiasts expressed less joy and surprise in the aftermath of the cryptocurrency crash than traditional investors.Moreover,cryptocurrency enthusiasts tweeted more frequently after the cryptocurrency crash,with a relative increase in tweet frequency of approximately one tweet per day.An analysis of the specific textual content of tweets provides evidence of herding behavior among cryptocurrency enthusiasts.