Sentiment Analysis of Social Media Posts: Comparative Evaluation of Naive Bayes and Logistic Regression Classifiers

Insiyah Udaipurwala

Paper Contents

Abstract

The proliferation of digital communication, particularly through social media platforms such as YouTube and Instagram, has created massive, real-time repositories of unstructured textual opinion. Analyzing this data through Sentiment Analysis (SA) is valuable to stakeholders across the commercial, political, and public health sectors. This research addresses multi-class (ternary: Positive, Negative, Neutral) sentiment classification of social media text, which is characterized by linguistic noise and severe class imbalance. The study evaluates the predictive performance and training efficiency of two foundational linear classifiers: Multinomial Naive Bayes (MNB) and Multinomial Logistic Regression (LR). A publicly available, pre-labeled Kaggle dataset of social media comments was preprocessed with a Natural Language Toolkit (NLTK) pipeline comprising cleaning, lemmatization, and stopword removal, followed by feature vectorization using Term Frequency-Inverse Document Frequency (TF-IDF). Evaluation relied on standard metrics, with the Macro F1-Score prioritized so that the underrepresented sentiment classes count as much as the majority class. LR achieved the better predictive performance, with an overall Accuracy of 88.00% and a Macro F1-Score of 0.83. MNB yielded an Accuracy of 86.00% and a Macro F1-Score of 0.78. The advantage is attributed to LR's use of L2 regularization, which limits overfitting in the sparse, high-dimensional feature space created by TF-IDF, and to the fact that LR does not rely on the restrictive feature-independence assumption inherent to MNB. However, MNB trained considerably faster (0.52 seconds compared to LR's 1.87 seconds), establishing a performance-efficiency trade-off that matters for real-time deployment.
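
To illustrate why the abstract prioritizes the Macro F1-Score over Accuracy under class imbalance, the following is a minimal sketch in plain Python. It is not the author's code: the function names (per_class_f1, macro_f1, accuracy) and the ten toy labels are hypothetical, chosen only to show that a classifier that always predicts the majority class can score well on Accuracy while scoring poorly on Macro F1.

```python
# Illustrative only: toy labels invented for this sketch, not taken from the paper.
LABELS = ("Positive", "Negative", "Neutral")


def per_class_f1(y_true, y_pred, label):
    """F1 for one class, computed as 2*TP / (2*TP + FP + FN)."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == label and p == label)
    fp = sum(1 for t, p in pairs if t != label and p == label)
    fn = sum(1 for t, p in pairs if t == label and p != label)
    denom = 2 * tp + fp + fn
    # A class that is never predicted and never present gets F1 = 0 here.
    return 2 * tp / denom if denom else 0.0


def macro_f1(y_true, y_pred, labels=LABELS):
    """Unweighted mean of per-class F1, so every class counts equally."""
    return sum(per_class_f1(y_true, y_pred, c) for c in labels) / len(labels)


def accuracy(y_true, y_pred):
    """Fraction of samples whose predicted label matches the true label."""
    return sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)


# Hypothetical imbalanced sample: 8 Positive, 1 Negative, 1 Neutral.
y_true = ["Positive"] * 8 + ["Negative", "Neutral"]
# A degenerate classifier that always predicts the majority class.
y_pred = ["Positive"] * 10

acc = accuracy(y_true, y_pred)
mf1 = macro_f1(y_true, y_pred)
print(f"accuracy = {acc:.2f}")   # accuracy = 0.80
print(f"macro F1 = {mf1:.2f}")   # macro F1 = 0.30
```

In this toy case the Positive class reaches an F1 of about 0.89 while the two minority classes score 0, so the unweighted mean falls to about 0.30 even though 80% of the predictions are correct. This is the behaviour that makes Macro F1 a stricter measure than Accuracy when classes are imbalanced.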

Copyright

Copyright © 2025 Insiyah Udaipurwala. This is an open access article distributed under the Creative Commons Attribution License.

Paper Details
Paper ID: IJPREMS51100062258
ISSN: 2321-9653
Publisher: IJPREMS
About IJPREMS

The International Journal of Progressive Research in Engineering, Management and Science is a peer-reviewed, open access journal that publishes original research articles in engineering, management, and applied sciences.

Contact Us
  • IJPREMS - International Journal of Progressive Research in Engineering, Management and Science, Motinagar, Ujjain, Madhya Pradesh, India
  • Chat with us on WhatsApp: +91 909-885-5509
  • Email us: editor@ijprems.com
  • Sun-Sat: 9:00 AM - 9:00 PM

© 2025 International Journal of Progressive Research in Engineering, Management and Science. All Rights Reserved.
