Data science with Python and Dask

Large datasets tend to be distributed, non-uniform, and prone to change. Dask simplifies the process of ingesting, filtering, and transforming data, reducing or eliminating the need for a heavyweight framework like Spark. Data Science at Scale with Python and Dask teaches readers how to build distributed data projects that can handle huge amounts of data. The book introduces Dask Data Frames and teaches helpful code patterns to streamline the reader's analysis. Key Features Working with large structured datasets Writing DataFrames Cleaningand visualizing DataFrames Machine learning with Dask-ML Working with Bags and Arrays Written for data engineers and scientists with experience using Python. Knowledge of the PyData stack (Pandas, NumPy, and Scikit-learn) will be helpful. No experience with low-level parallelism is required. About the technology Dask is a self-contained, easily extendible library designed to query, stream, filter, and consolidate huge datasets. Jesse Daniel has five years of experience writing applications in Python, including three years working with in the PyData stack (Pandas, NumPy, SciPy, Scikit-Learn). Jesse joined the faculty of the University of Denver in 2016 as an adjunct professor of business information and analytics, where he currently teaches a Python for Data Science course.

「Nielsen BookData」より

Google Books

詳細情報

NII書誌ID(NCID)
BC18930576
ISBN
- 9781617295607
LCCN
2019285629
出版国コード
us
タイトル言語コード
eng
本文言語コード
eng
出版地
Shelter Island, N.Y.
ページ数/冊数
xvii, 276 p.
大きさ
24 cm
分類
- DC23 : 006.312
- LCC : QA76.9.B45
件名
- LCSH : Big data
- LCSH : Machine learning
- LCSH : Python (Computer program language)
- LCSH : Information storage and retrieval systems -- Scalability
- LCSH : Data mining

Data science with Python and Dask

著者

書誌事項

大学図書館所蔵 全1件

この図書・雑誌をさがす

注記

内容説明・目次

詳細情報

書き出し

大学図書館所蔵全1件