WhatsApp at (+91-9098855509) Support
ijprems Logo
  • Home
  • About Us
    • Editor Vision
    • Editorial Board
    • Privacy Policy
    • Terms & Conditions
    • Publication Ethics
    • Peer Review Process
  • For Authors
    • Publication Process(up)
    • Submit Paper Online
    • Pay Publication Fee
    • Track Paper
    • Copyright Form
    • Paper Format
    • Topics
  • Fees
  • Indexing
  • Conference
  • Contact
  • Archieves
    • Current Issue
    • Past Issue
  • More
    • FAQs
    • Join As Reviewer
  • Submit Paper

Recent Papers

Dedicated to advancing knowledge through rigorous research and scholarly publication

  1. Home
  2. Recent Papers

DATA QUALITY AND CLEANING TECHNIQUES FOR BIG DATA

Dr. A. Antony Prakash A. Antony Prakash

Download Paper

Paper Contents

Abstract

In the big data era, maintaining data quality is made more difficult by the amount, speed, and diversity of data being produced. Since inaccurate insights, distorted projections, and less-than-ideal decision-making can result from low-quality data, data cleaning is an essential step in the data analysis process. The numerous problems with data quality that come with large data, such as noisy data, outliers, missing values, and inconsistencies, are examined in this work. From conventional procedures like imputation and normalization to more sophisticated machine learning-based strategies like anomaly identification and outlier handling, it explores cutting-edge data cleaning methods and tools designed for large-scale datasets. The study also emphasizes how data preparation systems, such Hadoop and Apache Spark, can help with problems related to data quality at scale. It also addresses the difficulties in cleaning unstructured data (text, photos, etc.) and provides strategies for managing complicated data kinds. The purpose of this paper is to give academics and practitioners the information they need to guarantee high-quality data for effective big data analytics by giving an overview of data cleaning best practices, current trends, and upcoming technologies.

Copyright

Copyright © 2025 Dr. A. Antony Prakash. This is an open access article distributed under the Creative Commons Attribution License.

Paper Details
Paper ID: IJPREMS50900014743
ISSN: 2321-9653
Publisher: ijprems
Page Navigation
  • Abstract
  • Copyright
About IJPREMS

The International Journal of Progressive Research in Engineering, Management and Science is a peer-reviewed, open access journal that publishes original research articles in engineering, management, and applied sciences.

Quick Links
  • Home
  • About Our Journal
  • Editorial Board
  • Publication Ethics
Contact Us
  • IJPREMS - International Journal of Progressive Research in Engineering Management and Science, motinagar, ujjain, Madhya Pradesh., india
  • Chat with us on WhatsApp: +91 909-885-5509
  • Email us: editor@ijprems.com
  • Mon-Fri: 9:00 AM - 5:00 PM

© 2025 International Journal of Progressive Research in Engineering, Management and Science.Designed and Developed by EVG Software Solutions All Rights Reserved.

Terms & Conditions | Privacy Policy | Publication Ethics | Peer Review Process | Contact Us