R for Data Science: Import, Tidy, Transform, Visualize, and Model Data


Hadley Wickham - 2016
    This book introduces you to R, RStudio, and the tidyverse, a collection of R packages designed to work together to make data science fast, fluent, and fun. Suitable for readers with no previous programming experience, R for Data Science is designed to get you doing data science as quickly as possible. Authors Hadley Wickham and Garrett Grolemund guide you through the steps of importing, wrangling, exploring, and modeling your data and communicating the results. You’ll get a complete, big-picture understanding of the data science cycle, along with basic tools you need to manage the details. Each section of the book is paired with exercises to help you practice what you’ve learned along the way. You’ll learn how to: Wrangle—transform your datasets into a form convenient for analysis Program—learn powerful R tools for solving data problems with greater clarity and ease Explore—examine your data, generate hypotheses, and quickly test them Model—provide a low-dimensional summary that captures true "signals" in your dataset Communicate—learn R Markdown for integrating prose, code, and results

Evaluation: A Systematic Approach


Peter H. Rossi - 1979
    Rossi, Mark W. Lipsey, and Howard E. Freeman first published Evaluation: A Systematic Approach, more than 90,000 readers have considered it the premier text on how to design, implement, and appraise social programs through evaluation. In this, the completely revised Seventh Edition, authors Rossi and Lipsey include the latest techniques and approaches to evaluation as well as guidelines to tailor evaluations to fit programs and social contexts.With decades of hands-on experience conducting evaluations, the authors provide scores of examples to help students understand how evaluators deal with various critical issues. They include a glossary of key terms and concepts, making this the most comprehensive and authoritative evaluation text available.Thoroughly revised, the Seventh Edition now includes* Substantially more attention to outcome measurement* Lengthy discussions of program theory, including a section about detecting program effects and interpreting their practical significance* An augmented and updated discussion of major evaluation designs* A detailed exposition of meta-analysis as an approach to the synthesis of evaluation studies* Alternative approaches to evaluation* Examples of successful evaluations* Discussions of the political and social contexts of evaluation

The Art of R Programming: A Tour of Statistical Software Design


Norman Matloff - 2011
    No statistical knowledge is required, and your programming skills can range from hobbyist to pro.Along the way, you'll learn about functional and object-oriented programming, running mathematical simulations, and rearranging complex data into simpler, more useful formats. You'll also learn to: Create artful graphs to visualize complex data sets and functions Write more efficient code using parallel R and vectorization Interface R with C/C++ and Python for increased speed or functionality Find new R packages for text analysis, image manipulation, and more Squash annoying bugs with advanced debugging techniques Whether you're designing aircraft, forecasting the weather, or you just need to tame your data, The Art of R Programming is your guide to harnessing the power of statistical computing.

Machine Learning with R


Brett Lantz - 2014
    This practical guide that covers all of the need to know topics in a very systematic way. For each machine learning approach, each step in the process is detailed, from preparing the data for analysis to evaluating the results. These steps will build the knowledge you need to apply them to your own data science tasks.Intended for those who want to learn how to use R's machine learning capabilities and gain insight from your data. Perhaps you already know a bit about machine learning, but have never used R; or perhaps you know a little R but are new to machine learning. In either case, this book will get you up and running quickly. It would be helpful to have a bit of familiarity with basic programming concepts, but no prior experience is required.

Statistics for Management


Richard I. Levin - 1978
    Like its predecessors, the seventh edition includes the absolute minimum of mathematical/statistical notation necessary to teach the material. Concepts are fully explained in simple, easy-to-understand language as they are presented, making the book an excellent source from which to learn and teach. After each discussion, readers are guided through real-world examples to show how book principles work in professional practice. Includes easy-to-understand explanations of difficult statistical topics, such as sampling distributions, relationship between confidence level and confidence interval, interpreting r-square. A complete package of teaching/learning aids is provided in every chapter, including chapter review exercises, chapter concepts tests,"Statistics at Work" conceptual cases, "Computer Database Exercises," "From the Textbook to the Real-World Examples." This ISBN is in two volumes Part A and Part B.

The Baseball Economist: The Real Game Exposed


J.C. Bradbury - 2007
     Two hot topics team up in The Baseball Economist, and the result is a refreshing, clear- eyed survey of a playing field that has changed radically in recent years. Utilizing the latest economic methods and statistical analysis, writer, economics professor, and popular blogger J. C. Bradbury dissects burning baseball topics with his original Sabernomic perspective, such as: • Did steroids have nothing to do with the recent home run records? Incredibly, Bradbury's research, reviewed by Stanford economists, reveals steroids had little statistical significance. • Is the big-city versus small-city competition really lopsided? Bradbury shows why the Marlins and Indians are likely to dominate big-city franchises in the coming years. • Which players are ridiculously overvalued? Bradbury lists all players by team with their revenue value to the team listed in dollars—including a dishonor role of those players with negative values. • Is major league baseball a monopoly that can't govern itself? Bradbury sets out what rules the owners really need to play by, and what the players' union should be doing. • Does it help to lobby for balls and strikes? How would Babe Ruth perform in today's game? And who killed all the left-handed catchers, anyway? The Baseball Economist knows. Providing far more than a mere collection of numbers, Bradbury shines the light of his economic thinking on baseball, exposing the power of tradeoffs, competition, and incentives. Statistics alone aren't enough anymore. Fans, fantasy buffs, and players, as well as coaches at all levels who want to grasp what is really happening on the field today and in the coming years, will use and enjoy Bradbury's brilliant new understanding of the national pastime.

Introduction to Econometrics (Addison-Wesley Series in Economics)


James H. Stock - 2002
    This text aims to motivate the need for tools with concrete applications, providing simple assumptions that match the application.

Bayesian Reasoning and Machine Learning


David Barber - 2012
    They are established tools in a wide range of industrial applications, including search engines, DNA sequencing, stock market analysis, and robot locomotion, and their use is spreading rapidly. People who know the methods have their choice of rewarding jobs. This hands-on text opens these opportunities to computer science students with modest mathematical backgrounds. It is designed for final-year undergraduates and master's students with limited background in linear algebra and calculus. Comprehensive and coherent, it develops everything from basic reasoning to advanced techniques within the framework of graphical models. Students learn more than a menu of techniques, they develop analytical and problem-solving skills that equip them for the real world. Numerous examples and exercises, both computer based and theoretical, are included in every chapter. Resources for students and instructors, including a MATLAB toolbox, are available online.

Multivariate Data Analysis


Joseph F. Hair Jr. - 1979
    This book provides an applications-oriented introduction to multivariate data analysis for the non-statistician, by focusing on the fundamental concepts that affect the use of specific techniques.

Deep Learning with Python


François Chollet - 2017
    It is the technology behind photo tagging systems at Facebook and Google, self-driving cars, speech recognition systems on your smartphone, and much more.In particular, Deep learning excels at solving machine perception problems: understanding the content of image data, video data, or sound data. Here's a simple example: say you have a large collection of images, and that you want tags associated with each image, for example, "dog," "cat," etc. Deep learning can allow you to create a system that understands how to map such tags to images, learning only from examples. This system can then be applied to new images, automating the task of photo tagging. A deep learning model only has to be fed examples of a task to start generating useful results on new data.

Elements of Information Theory


Thomas M. Cover - 1991
    Readers are provided once again with an instructive mix of mathematics, physics, statistics, and information theory.All the essential topics in information theory are covered in detail, including entropy, data compression, channel capacity, rate distortion, network information theory, and hypothesis testing. The authors provide readers with a solid understanding of the underlying theory and applications. Problem sets and a telegraphic summary at the end of each chapter further assist readers. The historical notes that follow each chapter recap the main points.The Second Edition features: * Chapters reorganized to improve teaching * 200 new problems * New material on source coding, portfolio theory, and feedback capacity * Updated referencesNow current and enhanced, the Second Edition of Elements of Information Theory remains the ideal textbook for upper-level undergraduate and graduate courses in electrical engineering, statistics, and telecommunications.

Social Network Analysis: Methods and Applications


Stanley Wasserman - 1994
    Social Network Analysis: Methods and Applications reviews and discusses methods for the analysis of social networks with a focus on applications of these methods to many substantive examples. As the first book to provide a comprehensive coverage of the methodology and applications of the field, this study is both a reference book and a textbook.

Time Series Analysis


James Douglas Hamilton - 1994
    This book synthesizes these recent advances and makes them accessible to first-year graduate students. James Hamilton provides the first adequate text-book treatments of important innovations such as vector autoregressions, generalized method of moments, the economic and statistical consequences of unit roots, time-varying variances, and nonlinear time series models. In addition, he presents basic tools for analyzing dynamic systems (including linear representations, autocovariance generating functions, spectral analysis, and the Kalman filter) in a way that integrates economic theory with the practical difficulties of analyzing and interpreting real-world data. Time Series Analysis fills an important need for a textbook that integrates economic theory, econometrics, and new results.The book is intended to provide students and researchers with a self-contained survey of time series analysis. It starts from first principles and should be readily accessible to any beginning graduate student, while it is also intended to serve as a reference book for researchers.-- "Journal of Economics"

Introduction to Probability


Dimitri P. Bertsekas - 2002
    This is the currently used textbook for "Probabilistic Systems Analysis," an introductory probability course at the Massachusetts Institute of Technology, attended by a large number of undergraduate and graduate students. The book covers the fundamentals of probability theory (probabilistic models, discrete and continuous random variables, multiple random variables, and limit theorems), which are typically part of a first course on the subject. It also contains, a number of more advanced topics, from which an instructor can choose to match the goals of a particular course. These topics include transforms, sums of random variables, least squares estimation, the bivariate normal distribution, and a fairly detailed introduction to Bernoulli, Poisson, and Markov processes. The book strikes a balance between simplicity in exposition and sophistication in analytical reasoning. Some of the more mathematically rigorous analysis has been just intuitively explained in the text, but is developed in detail (at the level of advanced calculus) in the numerous solved theoretical problems. The book has been widely adopted for classroom use in introductory probability courses within the USA and abroad.

Mathematical Statistics and Data Analysis


John A. Rice - 1988
    The book's approach interweaves traditional topics with data analysis and reflects the use of the computer with close ties to the practice of statistics. The author stresses analysis of data, examines real problems with real data, and motivates the theory. The book's descriptive statistics, graphical displays, and realistic applications stand in strong contrast to traditional texts which are set in abstract settings.