Name: Data Preparation for Machine Learning: Data Cleaning, Feature Selection, and Data Transforms in Python
Rating: 4.24 (1 reviews)

Ahmed

108 reviews18 followers

February 12, 2025

The English review is below the Arabic review.

أسلوب جيسون برونلي البسيط خلى المفاهيم سهلة الفهم، حتى لو معندكش خلفية رياضية قوية.

إيه اللي عجبني في الكتاب؟
تغطية شاملة لعمليات تحضير البيانات – الكتاب بيشرح كل حاجة من تنظيف البيانات، لاختيار الميزات، للتحويلات الإحصائية، وتقليل الأبعاد، وكل ده ضروري عشان تطلع بنموذج تعلم آلة قوي.
تبسيط المفاهيم المعقدة بشكل عملي – بيشرح بالتفصيل إزاي تتعامل مع القيم المفقودة، تكتشف القيم الشاذة، وتشوف إيه الميزات اللي فعلاً مؤثرة على النموذج.
تفادي تسرب البيانات (Data Leakage) – واحدة من الحاجات اللي ناس كتير بتغلط فيها، والكتاب وضّح إزاي تحضر البيانات بطريقة تمنع أي تسرب بين مجموعة التدريب والاختبار.

إيه اللي كان محتاج يتحسن؟
كنت أتمنى يكون في شرح أعمق شوية للجوانب النظرية، خصوصًا ليه بنختار تحويلات معينة دون غيرها.
بعض الأمثلة كانت مكررة شوية، وكمان بعض الأكواد محتاجة تحديث عشان تتماشى مع أحدث إصدارات
------------------------------------------------------------------------------
Jason Brownlee’s simple writing style made the concepts easy to understand, even if you don’t have a strong mathematical background.

What did I like about the book?
Comprehensive coverage of data preparation – The book explains everything from data cleaning and feature selection to statistical transformations and dimensionality reduction, all of which are essential for building a strong machine learning model.
Simplifying complex concepts in a practical way – It provides clear explanations on handling missing values, detecting outliers, and identifying the most important features for a model.
Avoiding data leakage – One of the most common mistakes in machine learning, and the book does a great job of showing how to prepare data in a way that prevents leakage between training and testing sets.

What could be improved?
I wish there was a deeper explanation of some theoretical aspects, especially why certain transformations are chosen over others.
Some examples felt a bit repetitive, and a few code snippets need updates to align with the latest library versions.

Data Preparation for Machine Learning: Data Cleaning, Feature Selection, and Data Transforms in Python

Jason Brownlee

About the author

Jason Brownlee

Ratings & Reviews

Friends & Following

Community Reviews

Join the discussion

Can't find what you're looking for?