Book picks similar to
Hands-On Machine Learning with R by Brad Boehmke
Hadley Wickham - 2015
This practical book shows you how to bundle reusable R functions, sample data, and documentation together by applying author Hadley Wickham’s package development philosophy. In the process, you’ll work with devtools, roxygen, and testthat, a set of R packages that automate common development tasks. Devtools encapsulates best practices that Hadley has learned from years of working with this programming language. Ideal for developers, data scientists, and programmers with various backgrounds, this book starts you with the basics and shows you how to improve your package writing over time. You’ll learn to focus on what you want your package to do, rather than think about package structure. Learn about the most useful components of an R package, including vignettes and unit tests Automate anything you can, taking advantage of the years of development experience embodied in devtools Get tips on good style, such as organizing functions into files Streamline your development process with devtools Learn the best way to submit your package to the Comprehensive R Archive Network (CRAN) Learn from a well-respected member of the R community who created 30 R packages, including ggplot2, dplyr, and tidyr
The Midrange Theory
Seth Partnow - 2021
But what is a “good” shot? Are all good shots created equally? And how might one identify players who are more or less likely to make and prevent those shots in the first place? The concept of basketball “analytics,” for lack of a better term, has been lauded, derided, and misunderstood. The incorporation of more data into NBA decision-making has been credited—or blamed—for everything from the death of the traditional center to the proliferation of three-point shooting to the alleged abandonment of the area of the court known as the midrange. What is beyond doubt is that understanding its methods has never been more important to watching and appreciating the NBA. In The Midrange Theory, Seth Partnow, NBA analyst for The Athletic and former Director of Basketball Research for the Milwaukee Bucks, explains how numbers have affected the modern NBA game, and how those numbers seek not to “solve” the game of basketball but instead urge us toward thinking about it in new ways.The relative value of Russell Westbrook’s triple-doublesWhy some players succeed in the playoffs while others don’tHow NBA teams think about constructing their rosters through the draft and free agencyThe difficulty in measuring defensive achievementThe fallacy of the “quick two”From shot selection to evaluating prospects to considering aesthetics and ethics while analyzing the box scores, Partnow deftly explores where the NBA is now, how it got here, and where it might be going next.
Tell Me The Odds: A 15 Page Introduction To Bayes Theorem
Scott Hartshorn - 2017
Essentially, you make an initial guess, and then get more data to improve it. Bayes Theorem, or Bayes Rule, has a ton of real world applications, from estimating your risk of a heart attack to making recommendations on Netflix But It Isn't That Complicated This book is a short introduction to Bayes Theorem. It is only 15 pages long, and is intended to show you how Bayes Theorem works as quickly as possible. The examples are intentionally kept simple to focus solely on Bayes Theorem without requiring that the reader know complicated probability distributions. If you want to learn the basics of Bayes Theorem as quickly as possible, with some easy to duplicate examples, this is a good book for you.
Introduction to Statistical Quality Control
Douglas C. Montgomery - 1985
It provides comprehensive coverage of the subject from basic principles to state-of-art concepts and applications. The objective is to give the reader a sound understanding of the principles and the basis for applying them in a variety of both product and nonproduct situations. While statistical techniques are emphasized throughout, the book has a strong engineering and management orientation. Guidelines are given throughout the book for selecting the proper type of statistical technique to use in a wide variety of product and nonproduct situations. By presenting theory, and supporting the theory with clear and relevant examples, Montgomery helps the reader to understand the big picture of important concepts. Updated to reflect contemporary practice and provide more information on management aspects of quality improvement.
Hands-On Programming with R: Write Your Own Functions and Simulations
Garrett Grolemund - 2014
With this book, you'll learn how to load data, assemble and disassemble data objects, navigate R's environment system, write your own functions, and use all of R's programming tools.RStudio Master Instructor Garrett Grolemund not only teaches you how to program, but also shows you how to get more from R than just visualizing and modeling data. You'll gain valuable programming skills and support your work as a data scientist at the same time.Work hands-on with three practical data analysis projects based on casino gamesStore, retrieve, and change data values in your computer's memoryWrite programs and simulations that outperform those written by typical R usersUse R programming tools such as if else statements, for loops, and S3 classesLearn how to write lightning-fast vectorized R codeTake advantage of R's package system and debugging toolsPractice and apply R programming concepts as you learn them
Statistics for Six SIGMA Made Easy
Warren Brussee - 2004
Yet although the heart of Six Sigma is statistics, most books on Six Sigma give a general overview, with little detail on actually Using the Six Sigma tools. Books that cover Six Sigma statistics typically are 500+ pages and cost near USD100. Basic Statistics for Six Sigma gives the information needed to apply the statistical tools of Six Sigma to real-world problems in a simplified, USD24.95 paperback format, for Greenbelts and Six Sigma Project Team Leaders. A typical company's Greenbelt training in Six Sigma includes 7 books, 4 software packages, and 3 weeks of class work. It is very extensive in statistics. Although this kind of course is excellent, not all companies or individuals want to commit to that level of instruction. Statistics for Six Sigma--Made Easy! describes only the tools used by most successful Six Sigma practitioners. The only software package needed is Excel, and the 14 formulas and 5 tables included in the book enable you to use all the basic Six Sigma statistical tools. Using these tools will enable a person t
Business Statistics: Contemporary Decision Making
Ken Black - 1991
eGrade Plus offers an integrated suite of teaching and learning resources, including an online version of Black's Business Statistics for Contemporary Decision Making, Fourth Edition Update, in one easy-to-use Web site. Organized around the essential activities you perform in class, eGrade Plus helps you: Create class presentation using a wealth of Wiley-provided resources. you may easily adapt, customize, and add to his content to meet the needs of your course. Automate the assigning and grading of homework or quizzes by using Wiley-provided question banks, or by writing your won. Student results will be automatically graded and recorded in your gradebook. Track your students' progress. An instructor's gradebook allows you to an analyze individual and overall class results to determine each student's progress and level of understanding. Administer your course. eGrade Plus can easily be integrated with another course management system, gradebook, or other resources you are using in your class. Provide students with problem-solving support. eGrade Plus can link homework problems to the relevant section of the online text, providing context-sensitive help. Best of all, instructors can arrange to have eGrade Plus packaged FREE with new copies of Business Statistics for Contemporary Decision Making, Fourth Edition Update, All instructors have to do is adopt the eGrade Plus version of this book and activate their eGrade Plus course.
Big Data Now: 2012 Edition
O'Reilly Media Inc. - 2012
It's not just a technical book or just a businessguide. Data is ubiquitous and it doesn't pay much attention toborders, so we've calibrated our coverage to follow it wherever itgoes.In the first edition of Big Data Now, the O'Reilly team tracked thebirth and early development of data tools and data science. Now, withthis second edition, we're seeing what happens when big data grows up:how it's being applied, where it's playing a role, and theconsequences -- good and bad alike -- of data's ascendance.We've organized the second edition of Big Data Now into five areas:Getting Up to Speed With Big Data -- Essential information on thestructures and definitions of big data.Big Data Tools, Techniques, and Strategies -- Expert guidance forturning big data theories into big data products.The Application of Big Data -- Examples of big data in action,including a look at the downside of data.What to Watch for in Big Data -- Thoughts on how big data will evolveand the role it will play across industries and domains.Big Data and Health Care -- A special section exploring thepossibilities that arise when data and health care come together.
David M. Diez - 2012
Our inaugural effort is OpenIntro Statistics. Probability is optional, inference is key, and we feature real data whenever possible. Files for the entire book are freely available at openintro.org, and anybody can purchase a paperback copy from amazon.com for under $10.The future for OpenIntro depends on the involvement and enthusiasm of our community. Visit our website, openintro.org. We provide free course management tools, including an online question bank, utilities for creating course quizzes, and many other helpful resources.CERTAIN CONTENT THAT APPEARS ON THIS SITE COMES FROM AMAZON SERVICES LLC. THIS CONTENT IS PROVIDED ‘AS IS’ AND IS SUBJECT TO CHANGE OR REMOVAL AT ANY TIME.Can’t find it here? Search Amazon.com Search: All Products Apparel & AccessoriesBabyBeautyBooksCamera & PhotoCell Phones & ServiceClassical MusicComputersComputer & Video GamesDVDElectronicsGourmet FoodHome & GardenMiscellaneousHealth & Personal CareJewelry & WatchesKitchen & HousewaresMagazine SubscriptionsMusicMusical InstrumentsSoftwareSports & OutdoorsTools & HardwareToys & GamesVHS Keywords:
Neil A. Weiss - 1987
This book develops statistical thinking over rote drill and practice. The Nature of Statistics; Organizing Data; Descriptive Measures; Probability Concepts; Discrete Random Variables; The Normal Distribution; The Sampling Distribution of the Sample Menu; Confidence Intervals for One Population Mean; Hypothesis Tests for One Population Mean; Inferences for Two Population Means; Inferences for Population Standard Deviations; Inferences for Population Proportions; Chi-Square Procedures; Descriptive Methods in Regression and Correlation; Inferential Methods in Regression and Correlation; Analysis of Variance (ANOVA) For all readers interested in Introductory Statistics.
A Whirlwind Tour of Python
Jake Vanderplas - 2016
This report provides a brief yet comprehensive introduction to Python for engineers, researchers, and data scientists who are already familiar with another programming language.Author Jake VanderPlas, an interdisciplinary research director at the University of Washington, explains Python’s essential syntax and semantics, built-in data types and structures, function definitions, control flow statements, and more, using Python 3 syntax.You’ll explore:- Python syntax basics and running Python codeBasic semantics of Python variables, objects, and operators- Built-in simple types and data structures- Control flow statements for executing code blocks conditionally- Methods for creating and using reusable functionsIterators, list comprehensions, and generators- String manipulation and regular expressions- Python’s standard library and third-party modules- Python’s core data science tools- Recommended resources to help you learn more
Statistics for Business & Economics
James T. McClave - 1991
Theoretical, yet applied. Statistics for Business and Economics, Eleventh Edition, gives you the best of both worlds. Using a rich array of applications from a variety of industries, McClave/Sincich/Benson clearly demonstrates how to use statistics effectively in a business environment.The book focuses on developing statistical thinking so the reader can better assess the credibility and value of inferences made from data. As consumers and future producers of statistical inferences, readers are introduced to a wide variety of data collection and analysis techniques to help them evaluate data and make informed business decisions. As with previous editions, this revision offers an abundance of applications with many new and updated exercises that draw on real business situations and recent economic events. The authors assume a background of basic algebra.
Data Smart: Using Data Science to Transform Information into Insight
John W. Foreman - 2013
Major retailers are predicting everything from when their customers are pregnant to when they want a new pair of Chuck Taylors. It's a brave new world where seemingly meaningless data can be transformed into valuable insight to drive smart business decisions.But how does one exactly do data science? Do you have to hire one of these priests of the dark arts, the "data scientist," to extract this gold from your data? Nope.Data science is little more than using straight-forward steps to process raw data into actionable insight. And in Data Smart, author and data scientist John Foreman will show you how that's done within the familiar environment of a spreadsheet. Why a spreadsheet? It's comfortable! You get to look at the data every step of the way, building confidence as you learn the tricks of the trade. Plus, spreadsheets are a vendor-neutral place to learn data science without the hype. But don't let the Excel sheets fool you. This is a book for those serious about learning the analytic techniques, the math and the magic, behind big data.Each chapter will cover a different technique in a spreadsheet so you can follow along: - Mathematical optimization, including non-linear programming and genetic algorithms- Clustering via k-means, spherical k-means, and graph modularity- Data mining in graphs, such as outlier detection- Supervised AI through logistic regression, ensemble models, and bag-of-words models- Forecasting, seasonal adjustments, and prediction intervals through monte carlo simulation- Moving from spreadsheets into the R programming languageYou get your hands dirty as you work alongside John through each technique. But never fear, the topics are readily applicable and the author laces humor throughout. You'll even learn what a dead squirrel has to do with optimization modeling, which you no doubt are dying to know.