Addison-Wesley Professional, 2018. — 320 p. — ISBN: 013484601X. Solve Data Analytics Problems with Spark, PySpark, and Related Open Source Tools. Spark is at the heart of today’s Big Data revolution, helping data professionals supercharge efficiency and performance in a wide range of data processing and analytics tasks. In this guide, Big Data expert Jeffrey Aven covers...
Packt Publishing, 2017. — 312 p. — ISBN: 978-1-78646-370-8. Apache Spark is an open source framework for efficient cluster computing with a strong interface for data parallelism and fault tolerance. This book will show you how to leverage the power of Python and put it to use in the Spark ecosystem. You will start by getting a firm understanding of the Spark 2.0 architecture...
Packt Publishing, 2017. — 312 p. — ISBN: 978-1-78646-370-8. — True PDF. Apache Spark is an open source framework for efficient cluster computing with a strong interface for data parallelism and fault tolerance. This book will show you how to leverage the power of Python and put it to use in the Spark ecosystem. You will start by getting a firm understanding of the Spark 2.0...
2nd Edition. — Ramcharan Kakarla, Sundar Krishnan, Balaji Dhamodharan, Venkata Gunnu. — Apress Media LLC, 2024. — 450 p. — ISBN-13: 979-8-8688-0819-7. This comprehensive guide with hand-picked examples of daily use cases will walk you through the end-to-end predictive model-building cycle with the latest techniques and tricks of the trade. In Chapters 1, 2 & 3, we will get...
2nd Edition. — Ramcharan Kakarla, Sundar Krishnan, Balaji Dhamodharan, Venkata Gunnu. — Apress Media LLC, 2024. — 450 p. — ISBN-13: 979-8-8688-0820-3. This comprehensive guide with hand-picked examples of daily use cases will walk you through the end-to-end predictive model-building cycle with the latest techniques and tricks of the trade. In Chapters 1, 2 & 3, we will get...
Apress, 2020. — 436 p. — ISBN 9781484264997. Discover the capabilities of PySpark and its application in the realm of data science. This comprehensive guide with hand-picked examples of daily use cases will walk you through the end-to-end predictive model-building cycle with the latest techniques and tricks of the trade. Applied Data Science Using PySpark is divided into six...
Packt Publishing, 2018. — 330 p. - ISBN: 1788835360. Combine the power of Apache Spark and Python to build effective big data applications. Apache Spark is an open source framework for efficient cluster computing with a strong interface for data parallelism and fault tolerance. The PySpark Cookbook presents effective and time-saving recipes for leveraging the power of Python and...
Packt Publishing, 2018. — 330 p. Combine the power of Apache Spark and Python to build effective big data applications. Apache Spark is an open source framework for efficient cluster computing with a strong interface for data parallelism and fault tolerance. The PySpark Cookbook presents effective and time-saving recipes for leveraging the power of Python and putting it to use in...
Apress, 2019. — 323 p. — ISBN: 978-1484243343. Carry out data analysis with PySpark SQL, graphframes, and graph data processing using a problem-solution approach. This book provides solutions to problems related to dataframes, data manipulation summarization, and exploratory analysis. You will improve your skills in graph data analysis using graphframes and see how to optimize...
Amazon Digital Services LLC, 2019. — 682 p. This book is about PySpark: Python API for Spark. Apache Spark is an analytics engine for large-scale data processing. Spark is the open source cluster computing system that makes data analytics fast to write and fast to run. This book provides a large set of recipes for implementing big data processing and analytics using Spark and...
Apress, 2018. — 256 p. Quickly find solutions to common programming problems encountered while processing big data. Content is presented in the popular problem-solution format. Look up the programming problem that you want to solve. Read the solution. Apply the solution directly in your own code. Problem solved! PySpark Recipes covers Hadoop and its shortcomings. The architecture...
Manning Publications Co., 2022. — 458 p. — ISBN 978-1617297205. Think big about your data! PySpark brings the powerful Spark big data processing engine to the Python ecosystem, letting you seamlessly scale up your data tasks and create lightning-fast pipelines. In Data Analysis with Python and PySpark you...
6th version. — Manning Publications, 2020. — 217 p. — ISBN: 978-1617297205. PySpark in Action is a carefully engineered tutorial that helps you use PySpark to deliver your data-driven applications at any scale. This clear and hands-on guide shows you how to enlarge your processing capabilities across multiple machines with data from any source, ranging from Hadoop-based...
New York: Apress, 2019. — 214 p. Contents: History, Data Collection, Data Processing, Spark Architecture, Resource Management, Structured Streaming, Programming Language APIs, Local Setup, Databricks, Data Processing, Creating Dataframes, Null Values, Subset of a Dataframe, Select, Filter, Aggregations, Collect, User-Defined Functions (UDFs), Pandas UDF, Joins, Pivoting, Window Functions or Windowed...
Apress, 2019. — 223 p. — ISBN: 1484241304. Build machine learning models, natural language processing applications, and recommender systems with PySpark to solve various business challenges. This book starts with the fundamentals of Spark and its evolution and then covers the entire spectrum of traditional machine learning algorithms along with natural language processing and...
Apress, 2019. — 230 p. — ISBN: 978-1-4842-4130-4. Build machine learning models, natural language processing applications, and recommender systems with PySpark to solve various business challenges. This book starts with the fundamentals of Spark and its evolution and then covers the entire spectrum of traditional machine learning algorithms along with natural language...
2nd Edition. — Apress Media LLC, 2022. — 230 p. — ISBN: 978-1-4842-7776-8. Master the new features in PySpark 3.1 to develop data-driven, intelligent applications. This updated edition covers topics ranging from building scalable Machine Learning models, to natural language processing, to recommender systems. Machine Learning with PySpark, Second Edition begins with the...
2nd Edition. — Apress Media LLC, 2022. — 230 p. — ISBN-13 (electronic): 978-1-4842-7777-5. Master the new features in PySpark 3.1 to develop data-driven, intelligent applications. This updated edition covers topics ranging from building scalable Machine Learning models, to natural language processing, to recommender systems. Machine Learning with PySpark, Second Edition begins...
Akash Tandon, Sandy Ryza, Uri Laserson, Sean Owen, Josh Wills. — O’Reilly Media, 2022. — 215 p. — ISBN-13 978-1-098-10365-1. The amount of data being generated today is staggering--and growing. Apache Spark has emerged as the de facto tool to analyze big data and is now a critical part of the data science toolbox. Updated for Spark 3.0, this practical guide brings together...
Akash Tandon, Sandy Ryza, Uri Laserson, Sean Owen, Josh Wills. — O’Reilly Media, 2022. — 215 p. — ISBN-13: 978-1-098-10365-1. The amount of data being generated today is staggering--and growing. Apache Spark has emerged as the de facto tool to analyze big data and is now a critical part of the data science toolbox. Updated for Spark 3.0, this practical guide brings together...
Apress Media LLC, 2024. — 490 p. — ISBN-13: 978-1-4842-9750-6. Migrate from pandas and scikit-learn to PySpark to handle vast amounts of data and achieve faster data processing time. This book will show you how to make this transition by adapting your skills and leveraging the similarities in syntax, functionality, and interoperability between these tools. Distributed Machine...
Apress Media LLC, 2024. — 490 p. — ISBN-13: 978-1-4842-9751-3. Migrate from pandas and scikit-learn to PySpark to handle vast amounts of data and achieve faster data processing time. This book will show you how to make this transition by adapting your skills and leveraging the similarities in syntax, functionality, and interoperability between these tools. Distributed Machine...
Feb 05, 2020 - 481 p. Tutorial on PySpark. In this tutorial you will learn a wide array of concepts about PySpark in Data Mining, Text Mining, Machine Learning and Deep Learning.
Translated from English. — St. Petersburg: BHV-Petersburg, 2023. — 224 p.: ill. The book is devoted to practical methods for analyzing large volumes of data using the Python language and the Spark framework; it introduces the Spark programming model and the basics of the open source PySpark system. Each chapter describes a separate aspect of data analysis, showing the basics of data processing in PySpark and...
Translated from English. — St. Petersburg: BHV-Petersburg, 2023. — 224 p.: ill. — ISBN 978-5-9775-1770-6. The book is devoted to practical methods for analyzing large volumes of data using the Python language and the Spark framework; it introduces the Spark programming model and the basics of the open source PySpark system. Each chapter describes a separate aspect of data analysis, showing the basics...