Integration of Machine Learning Models in Python with Big Data Tools (Hadoop, Spark)

Hariprakash K K

Paper Contents

Abstract

The explosion of data in healthcare, finance, and social media has created a pressing need for tools that can analyze information quickly and at scale. Machine learning (ML) plays a key role here, helping to spot patterns, make predictions, and guide better decisions. Python has become the go-to language for ML because it is simple and backed by powerful libraries such as scikit-learn, TensorFlow, and PyTorch.

But Python on its own struggles with today's massive datasets, since it typically runs on a single machine. That is where big data platforms such as Hadoop and Apache Spark come in. Hadoop offers reliable storage and batch processing, while Spark speeds things up with in-memory computing and specialized tools for ML and streaming. Combining Python with these platforms unlocks both flexibility and scalability.

Approaches such as PySpark, Hadoop Streaming, and distributed deep learning frameworks (TensorFlow on Spark, Horovod) make this integration possible. Studies show Spark can cut training times by up to a factor of five compared to standalone Python. This has real-world impact, from disease prediction in healthcare to fraud detection in finance to the analysis of billions of social media posts. Looking ahead, cloud-native ML, Python frameworks such as Ray and Dask, and federated learning will push these integrations even further.
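
As a rough illustration of the Hadoop Streaming approach named in the abstract, the sketch below shows the usual pattern: a map step and a reduce step written as plain Python functions that exchange tab-separated key/value lines. It is not taken from the paper. The record layout (`label,value`), the function names, and the task (a per-label mean, the kind of statistic a simple classifier might need) are assumptions chosen for illustration, and `run_local` only imitates the cluster's shuffle-and-sort phase on one machine.

```python
from itertools import groupby


def mapper(lines):
    """Map step: turn 'label,value' records into 'label<TAB>value' pairs."""
    for line in lines:
        line = line.strip()
        if not line:
            continue  # skip blank records
        label, value = line.split(",")
        yield f"{label}\t{float(value)}"


def reducer(lines):
    """Reduce step: average the values seen for each label.

    Hadoop Streaming hands the reducer its input sorted by key, so all
    records for one label arrive consecutively and groupby can be used.
    """
    pairs = (line.rstrip("\n").split("\t") for line in lines)
    for label, group in groupby(pairs, key=lambda kv: kv[0]):
        values = [float(v) for _, v in group]
        yield f"{label}\t{sum(values) / len(values)}"


def run_local(lines):
    """Imitate map -> shuffle/sort -> reduce on a single machine (hypothetical helper)."""
    return list(reducer(sorted(mapper(lines))))


# Hypothetical sample records, not data from the paper.
sample = ["spam,1.0", "ham,4.0", "spam,3.0", "ham,6.0"]
for row in run_local(sample):
    print(row)
```

On a real cluster, each step would typically live in its own script that reads standard input and prints to standard output, with Hadoop handling the sort between them. The same logic can be written more compactly in PySpark, which is one reason the abstract lists both approaches.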

Copyright

Copyright © 2025 Hariprakash K. This is an open access article distributed under the Creative Commons Attribution License.

Paper Details
Paper ID: IJPREMS50800040584
ISSN: 2321-9653
Publisher: ijprems
About IJPREMS

The International Journal of Progressive Research in Engineering, Management and Science is a peer-reviewed, open access journal that publishes original research articles in engineering, management, and applied sciences.
