Statistical methods are a key part of data science, yet few data scientists have formal statistical training. Courses and books on basic statistics rarely cover the topic from a data science perspective. The second edition of this popular guide adds comprehensive examples in Python, provides practical guidance on applying statistical methods to data science, tells you how to avoid their misuse, and gives you advice on what’s important and what’s not.

Many data science resources incorporate statistical methods but lack a deeper statistical perspective. If you’re familiar with the R or Python programming languages and have some exposure to statistics, this quick reference bridges the gap in an accessible, readable format.

With this book, you’ll learn:

  • Why exploratory data analysis is a key preliminary step in data science
  • How random sampling can reduce bias and yield a higher-quality dataset, even with big data
  • How the principles of experimental design yield definitive answers to questions
  • How to use regression to estimate outcomes and detect anomalies
  • Key classification techniques for predicting which categories a record belongs to
  • Statistical machine learning methods that “learn” from data
  • Unsupervised learning methods for extracting meaning from unlabeled data.

9 reviews for Practical Statistics for Data Scientists: 50+ Essential Concepts Using R and Python

  1. (9)

    Marina

    The book is amazing and very useful, for beginners also. The most valuable from my point of view is presence of code both for R and Python, which helps understand the syntax better for one language if you know another.

  2. (9)

    Stephen Martin

    This is a very good book to begin your DS stats journey with. I learned more from this book than I did in my DS grad school classes. It covers the basics you’ll need everyday in a practical way.

  3. (9)

    M. W. Hefner

    I’ve taken many stats classes, most of them using R, at the undergraduate and graduate level, and I really wish I found this book before I did. I picked this book up as a refresher, and not only did it succinctly describe all and a bit more of what I learned in those courses, but it has excellent “further readings,” great clarifying synonym lists when it defines “key terms,” and is very readable. Literally blown away.

  4. (9)

    Jonathan

    I had purchased a new physical copy of the book, and realized there were several pages that were blank and missing. I contacted O’Reilly about the problem and they were extremely quick with a resolution! They were able to give me a different copy so I could read it without the missing pages. The content of the book itself is good, except in all black and white, which doesn’t bother me personally but may bother someone else when it comes to the graphs. I think the R and Python content are both great, and it keeps the code concise and quick to the point. Great for R beginners, but for python users I would recommend a little more experience. As for the math parts, its great for those who are new to statistics and gives easy to read explanations, and a great refresher for those who just want to review some of the concepts. I especially like the sections provided for further reading, which have been helpful.

  5. (9)

    Cabiria

    I got this because I am taking a data analytics course that is not explained that well and I need to fill up my gaps in statistics. It is a good book

  6. (9)

    Farshad E.

    Good content/low quality print

  7. (9)

    denverteach

    Very good book- covers more than just implementing same old tactics.

  8. (9)

    Read and think

    What a great book! The authors did a marvellous jobs in packing an incredible amount of information in very little pages AND doing it in a very pleasant style that is direct, informative, and extremely clear.
    I have read many books in statistics. I can tell you there are very very few written so well and so pleasant to read.
    And to top it all, it is one of the very few book of statistics for non-mathematician that *correctly* explain the p-value and t-test. Many statisticians *still* don’t understand what that “significance test” really mean. But these authors do understand it very well and this is very important for anyone new to statistics to know this test correctly and in the hand of these authors they *will* learn it correctly.
    Thanks a lot to the authors. You did a fabulous job.

  9. (9)

    José Luis

    Buen libro con un excelente contenido temático

Add a review

Your email address will not be published. Required fields are marked *

Back to top

New item(s) have been added to your cart.

Quantity: 1
Total: $13,99
Modern Statistics: A Computer-Based Approach with Python
(9)
Original price was: $86,99.Current price is: $19,99.
Linear Algebra for Data Science, Machine Learning, and Signal Processing Original price was: $69,99.Current price is: $19,99.
Linear Algebra: Theory, Intuition, Code
(9)
Original price was: $45,00.Current price is: $19,99.
Mathematics and Statistics for Financial Risk Management
(9)
Original price was: $69,99.Current price is: $19,99.
Mathematical Statistics with Applications in R Original price was: $129,99.Current price is: $19,99.
Computational Statistics in Data Science
(9)
Original price was: $173,00.Current price is: $19,99.
Introduction to Graph Theory (Dover Books on Mathematics)
(9)
Original price was: $21,95.Current price is: $9,99.
Linear Algebra With Machine Learning and Data
(9)
Original price was: $75,00.Current price is: $19,99.
The Self-Taught Programmer: The Definitive Guide to Programming Professionally Original price was: $31,87.Current price is: $9,99.
The Calculus Story: A Mathematical Adventure Original price was: $35,00.Current price is: $9,99.
Mathematical Analysis for Machine Learning and Data Mining Original price was: $178,99.Current price is: $19,97.
Create GUI Applications with Python & Qt6 (PyQt6 Edition): The hands-on guide to making apps with Python
(9)
Original price was: $54,99.Current price is: $14,99.
Storytelling with Data: A Data Visualization Guide for Business Professionals
(9)
Original price was: $37,00.Current price is: $11,99.
Advanced Calculus: Theory and Practice (Textbooks in Mathematics) Original price was: $94,65.Current price is: $19,96.
Computer Science: An Interdisciplinary Approach
(9)
Original price was: $61,99.Current price is: $19,99.
Machine Learning: An Applied Mathematics Introduction
(9)
Original price was: $34,99.Current price is: $14,99.
Numerical Analysis Original price was: $74,99.Current price is: $19,95.
R For College Mathematics and Statistics Original price was: $96,00.Current price is: $19,99.
Technical Analysis of Stock Trends Original price was: $86,95.Current price is: $19,95.
Essential Mathematics for Economic Analysis Original price was: $49,99.Current price is: $19,99.
Encyclopedia of Applied and Computational Mathematics, 2 volume set Original price was: $999,00.Current price is: $29,99.
The Art of Uncertainty: How to Navigate Chance, Ignorance, Risk and Luck
(9)
Original price was: $32,99.Current price is: $14,99.
Data Analysis and Machine Learning through Statistical Computing Original price was: $249,00.Current price is: $34,99.
What's the Point of Maths?
(9)
Original price was: $24,99.Current price is: $9,99.
Trigonometry 11th Edition Original price was: $265,99.Current price is: $14,99.
Quantum Computing: A Primer Course and Its Applications in Machine Learning
(9)
Original price was: $119,99.Current price is: $39,99.
Machine Learning Crash Course for Engineers
(9)
Original price was: $64,99.Current price is: $18,95.
Probabilistic Numerics: Computation as Machine Learning Original price was: $49,99.Current price is: $19,99.
Learn Physics with Calculus Step-by-Step (3 book series)
(9)
Original price was: $159,95.Current price is: $29,99.
Ordinary Differential Equations (Dover Books on Mathematics) Original price was: $38,49.Current price is: $15,00.
The Cartoon Guide to Geometry
(9)
Original price was: $35,99.Current price is: $12,99.
Linear Algebra and Its Applications Original price was: $214,99.Current price is: $19,99.
Mathematics for Machine Learning
(9)
Original price was: $88,99.Current price is: $19,95.
Vector: A Surprising Story of Space, Time, and Mathematical Transformation
(9)
Original price was: $47,99.Current price is: $19,99.
Numsense! Data Science for the Layman: No Math Added
(9)
Original price was: $43,99.Current price is: $15,00.
Calculus with Multiple Variables Essential Skills Workbook: Includes Vector Calculus and Full Solutions
(9)
Original price was: $35,00.Current price is: $17,99.
Mathematics for Human Flourishing
(9)
Original price was: $49,99.Current price is: $14,99.
Introduction to Probability, Statistics, and Random Processes
(9)
Original price was: $49,99.Current price is: $19,95.
GCSE Mathematics: Essential Foundations
(9)
Original price was: $669,99.Current price is: $49,99.