Practical Statistics for Data Scientists: 50+ Essential Concepts Using R and Python
- Pickup from New Mail
- New Mail Courier
- Pickup from the store
- Other transport services
- Cash upon receipt
- Bank transfer
- Privat 24
- WebMoney
- Автор: Peter Bruce | Andrew Bruce | Peter Gedeck
- ISBN-10: 149207294X
- ISBN-13: 978-1492072942
- Edition: 2nd
- Publisher: O'Reilly Media
- Publication date: June 16, 2020
- Language: English
- Dimensions: 7 x 0.9 x 9.1 inches
- Print length: 360 pages
From the brand
-
Explore further 'R' resources
Visit the Store
-
Sharing the knowledge of experts
O'Reilly's mission is to change the world by sharing the knowledge of innovators. For over 40 years, we've inspired companies and individuals to do new things (and do them better) by providing the skills and understanding that are necessary for success.
Our customers are hungry to build the innovations that propel the world forward. And we help them do just that.
From the Publisher
From the Preface
This book is aimed at the data scientist with some familiarity with the R and/or Python programming languages, and with some prior (perhaps spotty or ephemeral) exposure to statistics. Two of the authors came to the world of data science from the world of statistics, and have some appreciation of the contribution that statistics can make to the art of data science. At the same time, we are well aware of the limitations of traditional statistics instruction: statistics as a discipline is a century and a half old, and most statistics textbooks and courses are laden with the momentum and inertia of an ocean liner. All the methods in this book have some connection—historical or methodological—to the discipline of statistics. Methods that evolved mainly out of computer science, such as neural nets, are not included.
In all cases, this book gives code examples first in R and then in Python. In order to avoid unnecessary repetition, we generally show only output and plots created by the R code. We also skip the code required to load the required packages and data sets. You can find the complete code as well as the data sets for download at GitHub.
Two goals underlie this book:
- To lay out, in digestible, navigable, and easily referenced form, key concepts from statistics that are relevant to data science.
- To explain which concepts are important and useful from a data science perspective, which are less so, and why.