*Result*: Practical big data techniques for end-to-end machine learning deployment: a comprehensive review.

Title:
Practical big data techniques for end-to-end machine learning deployment: a comprehensive review.
Source:
Discover Data; 4/15/2025, Vol. 3 Issue 1, p1-18, 18p
Database:
Complementary Index

*Further Information*

*In recent years, data generated from diverse sources has grown exponentially, giving rise to new challenges for processing and analysis under the umbrella of "big data". Although various methods and platforms have been proposed to tackle these issues, there is still a need for a conceptual guide that synthesizes state-of-the-art tools and techniques across the full machine learning (ML) workflow. In this survey, we consolidate research on big data infrastructures, distributed processing frameworks, and ML methods, mapping out an end-to-end conceptual framework that can serve as a reference architecture for end-to-end deployment. Specifically, we discuss key aspects of big data analytics (storage solutions, platforms like Hadoop and Spark, NoSQL databases), preprocessing approaches (dimensionality reduction, instance selection, noise handling), ML algorithms (supervised, unsupervised, and emerging deep learning paradigms), and deployment considerations (monitoring, continuous integration, and versioning). By gathering and integrating these elements, our survey provides a comprehensive overview of existing solutions, clarifies the design choices developers must consider, and identifies research gaps that remain to be addressed. [ABSTRACT FROM AUTHOR]

Copyright of Discover Data is the property of Springer Nature and its content may not be copied or emailed to multiple sites without the copyright holder's express written permission. Additionally, content may not be used with any artificial intelligence tools or machine learning technologies. However, users may print, download, or email articles for individual use. This abstract may be abridged. No warranty is given about the accuracy of the copy. Users should refer to the original published version of the material for the full abstract. (Copyright applies to all Abstracts.)*