Empirical Study and Comparison of Models via Multiclass Classification of COVID-19 Tweets using Natural Language Processing
Keywords:
Natural Language Processing, Sentiment Analysis, COVID-19, Machine Learning, Deep Learning, Data AnalysisAbstract
This paper represents the empirical study of sentiment analysis of the Covid-19 tweets during the pandemic period. Various industries have been psychologically affected throughout the country during this period. Through the Covid-19 tweets, a study has been established to determine whether the people’s attitude is positive, negative or neutral during the pandemic. In this work, Natural Language Processing, Exploratory Data Analysis and Machine Learning are used to analyze textual data consisting of Covid- 19 tweets. Different Machine Learning and Deep Learning techniques like Naive Bayes, Logistic Regression, Extreme Gradient Boost (XGBoost), Stochastic Gradient Descent, Random Forest, SVM, Bidirectional LSTMs (BiLSTM) and Backpropagation neural networks have been incorporated to analyze and predict efficiently. Finally, a comparison of each model’s performance based on evaluation metrics like accuracy, precision, recall and F1-score has been done.