Book picks similar to
Applied Longitudinal Analysis by Garrett M. Fitzmaurice
data-science
statistics
public-health-epidemiology
references
Organic Chemistry: Structure and Function
K. Peter C. Vollhardt - 1987
By emphasizing the relationship between structure and function, the authors provide a framework for understanding mechanisms and reactions. Stressing the importance of synthetic strategies and biological and industrial applications, the text introduces students to real chemistry as it is actually practised. This fourth edition offers significant updates in coverage and learning tools and enhanced media support at the book's companion website.
Python for Data Analysis
Wes McKinney - 2011
It is also a practical, modern introduction to scientific computing in Python, tailored for data-intensive applications. This is a book about the parts of the Python language and libraries you'll need to effectively solve a broad set of data analysis problems. This book is not an exposition on analytical methods using Python as the implementation language.Written by Wes McKinney, the main author of the pandas library, this hands-on book is packed with practical cases studies. It's ideal for analysts new to Python and for Python programmers new to scientific computing.Use the IPython interactive shell as your primary development environmentLearn basic and advanced NumPy (Numerical Python) featuresGet started with data analysis tools in the pandas libraryUse high-performance tools to load, clean, transform, merge, and reshape dataCreate scatter plots and static or interactive visualizations with matplotlibApply the pandas groupby facility to slice, dice, and summarize datasetsMeasure data by points in time, whether it's specific instances, fixed periods, or intervalsLearn how to solve problems in web analytics, social sciences, finance, and economics, through detailed examples
R in Action
Robert Kabacoff - 2011
The book begins by introducing the R language, including the development environment. Focusing on practical solutions, the book also offers a crash course in practical statistics and covers elegant methods for dealing with messy and incomplete data using features of R.About the TechnologyR is a powerful language for statistical computing and graphics that can handle virtually any data-crunching task. It runs on all important platforms and provides thousands of useful specialized modules and utilities. This makes R a great way to get meaningful information from mountains of raw data.About the BookR in Action is a language tutorial focused on practical problems. It presents useful statistics examples and includes elegant methods for handling messy, incomplete, and non-normal data that are difficult to analyze using traditional methods. And statistical analysis is only part of the story. You'll also master R's extensive graphical capabilities for exploring and presenting data visually. Purchase of the print book comes with an offer of a free PDF, ePub, and Kindle eBook from Manning. Also available is all code from the book. What's InsidePractical data analysis, step by stepInterfacing R with other softwareUsing R to visualize dataOver 130 graphsEight reference appendixes================================Table of ContentsPart I Getting startedIntroduction to RCreating a datasetGetting started with graphsBasic data managementAdvanced data managementPart II Basic methodsBasic graphsBasic statisticsPart III Intermediate methodsRegressionAnalysis of variancePower analysisIntermediate graphsRe-sampling statistics and bootstrappingPart IV Advanced methodsGeneralized linear modelsPrincipal components and factor analysisAdvanced methods for missing dataAdvanced graphics
The Hundred-Page Machine Learning Book
Andriy Burkov - 2019
During that week, you will learn almost everything modern machine learning has to offer. The author and other practitioners have spent years learning these concepts.Companion wiki — the book has a continuously updated wiki that extends some book chapters with additional information: Q&A, code snippets, further reading, tools, and other relevant resources.Flexible price and formats — choose from a variety of formats and price options: Kindle, hardcover, paperback, EPUB, PDF. If you buy an EPUB or a PDF, you decide the price you pay!Read first, buy later — download book chapters for free, read them and share with your friends and colleagues. Only if you liked the book or found it useful in your work, study or business, then buy it.
Foundations of Statistical Natural Language Processing
Christopher D. Manning - 1999
This foundational text is the first comprehensive introduction to statistical natural language processing (NLP) to appear. The book contains all the theory and algorithms needed for building NLP tools. It provides broad but rigorous coverage of mathematical and linguistic foundations, as well as detailed discussion of statistical methods, allowing students and researchers to construct their own implementations. The book covers collocation finding, word sense disambiguation, probabilistic parsing, information retrieval, and other applications.
Mining of Massive Datasets
Anand Rajaraman - 2011
This book focuses on practical algorithms that have been used to solve key problems in data mining and which can be used on even the largest datasets. It begins with a discussion of the map-reduce framework, an important tool for parallelizing algorithms automatically. The authors explain the tricks of locality-sensitive hashing and stream processing algorithms for mining data that arrives too fast for exhaustive processing. The PageRank idea and related tricks for organizing the Web are covered next. Other chapters cover the problems of finding frequent itemsets and clustering. The final chapters cover two applications: recommendation systems and Web advertising, each vital in e-commerce. Written by two authorities in database and Web technologies, this book is essential reading for students and practitioners alike.
Data Visualization: A Practical Introduction
Kieran Healy - 2018
It explains what makes some graphs succeed while others fail, how to make high-quality figures from data using powerful and reproducible methods, and how to think about data visualization in an honest and effective way.Data Visualization builds the reader's expertise in ggplot2, a versatile visualization library for the R programming language. Through a series of worked examples, this accessible primer then demonstrates how to create plots piece by piece, beginning with summaries of single variables and moving on to more complex graphics. Topics include plotting continuous and categorical variables; layering information on graphics; producing effective "small multiple" plots; grouping, summarizing, and transforming data for plotting; creating maps; working with the output of statistical models; and refining plots to make them more comprehensible.Effective graphics are essential to communicating ideas and a great way to better understand data. This book provides the practical skills students and practitioners need to visualize quantitative data and get the most out of their research findings.Provides hands-on instruction using R and ggplot2Shows how the "tidyverse" of data analysis tools makes working with R easier and more consistentIncludes a library of data sets, code, and functions
Experimental and Quasi-Experimental Designs for Generalized Causal Inference
William R. Shadish - 2001
The book covers four major topics in field experimentation:
CliffsNotes Math Review for Standardized Tests
Jerry Bobrow - 2010
Your guide to a higher math score on standardized tests*SATACT(R)ASVABGMAT(R)GRE(R)CBEST(R)PRAXIS I(R)GED(R) And More!Why CliffsNotes?Go with the name you know and trustGet the information you need-fast!About the Contents:IntroductionHow to use this bookOverview of the examsPart I: Basic Skills ReviewArithmetic and Data AnalysisAlgebraPart II: Strategies and PracticeMathematical AbilityQuantitative ComparisonData SufficiencyEach section includes a diagnostic test, explanations of rules, concepts withexamples, practice problems with complete explanations, a review test, and aglossary!Test-Prep Essentials from the Experts at CliffsNotes(R)For more test-prep help, visit CliffsNotes.com(R)*SAT is a registered trademark of the College Board, which was not involved inthe production of, and does not endorse, this product.
Statistical Inference
George Casella - 2001
Starting from the basics of probability, the authors develop the theory of statistical inference using techniques, definitions, and concepts that are statistical and are natural extensions and consequences of previous concepts. This book can be used for readers who have a solid mathematics background. It can also be used in a way that stresses the more practical uses of statistical theory, being more concerned with understanding basic statistical concepts and deriving reasonable statistical procedures for a variety of situations, and less concerned with formal optimality investigations.
Networks: An Introduction
M.E.J. Newman - 2010
The rise of the Internet and the wide availability of inexpensive computers have made it possible to gather and analyze network data on a large scale, and the development of a variety of new theoretical tools has allowed us to extract new knowledge from many different kinds of networks.The study of networks is broadly interdisciplinary and important developments have occurred in many fields, including mathematics, physics, computer and information sciences, biology, and the social sciences. This book brings together for the first time the most important breakthroughs in each of these fields and presents them in a coherent fashion, highlighting the strong interconnections between work in different areas.Subjects covered include the measurement and structure of networks in many branches of science, methods for analyzing network data, including methods developed in physics, statistics, and sociology, the fundamentals of graph theory, computer algorithms, and spectral methods, mathematical models of networks, including random graph models and generative models, and theories of dynamical processes taking place on networks.
Baseball Between the Numbers: Why Everything You Know About the Game Is Wrong
Jonah Keri - 2006
Properly understood, they can tell us how the teams we root for could employ better strategies, put more effective players on the field, and win more games. The revolution in baseball statistics that began in the 1970s is a controversial subject that professionals and fans alike argue over without end. Despite this fundamental change in the way we watch and understand the sport, no one has written the book that reveals, across every area of strategy and management, how the best practitioners of statistical analysis in baseball-people like Bill James, Billy Beane, and Theo Epstein-think about numbers and the game. Baseball Between the Numbers is that book. In separate chapters covering every aspect of the game, from hitting, pitching, and fielding to roster construction and the scouting and drafting of players, the experts at Baseball Prospectus examine the subtle, hidden aspects of the game, bring them out into the open, and show us how our favorite teams could win more games. This is a book that every fan, every follower of sports radio, every fantasy player, every coach, and every player, at every level, can learn from and enjoy.
Introductory Graph Theory
Gary Chartrand - 1984
Introductory Graph Theory presents a nontechnical introduction to this exciting field in a clear, lively, and informative style. Author Gary Chartrand covers the important elementary topics of graph theory and its applications. In addition, he presents a large variety of proofs designed to strengthen mathematical techniques and offers challenging opportunities to have fun with mathematics. Ten major topics — profusely illustrated — include: Mathematical Models, Elementary Concepts of Graph Theory, Transportation Problems, Connection Problems, Party Problems, Digraphs and Mathematical Models, Games and Puzzles, Graphs and Social Psychology, Planar Graphs and Coloring Problems, and Graphs and Other Mathematics. A useful Appendix covers Sets, Relations, Functions, and Proofs, and a section devoted to exercises — with answers, hints, and solutions — is especially valuable to anyone encountering graph theory for the first time. Undergraduate mathematics students at every level, puzzlists, and mathematical hobbyists will find well-organized coverage of the fundamentals of graph theory in this highly readable and thoroughly enjoyable book.
Essentials of Statistics for the Behavioral Sciences
Frederick J. Gravetter - 1991
The authors take time to explain statistical procedures so that you can go beyond memorizing formulas and gain a conceptual understanding of statistics. The authors also take care to show you how having an understanding of statistical procedures will help you comprehend published findings and will lead you to become a savvy consumer of information. Known for its exceptional accuracy and examples, this text also has a complete supplements package to support your learning.
Power Generation, Operation, and Control
Allen J. Wood - 1983
Wood and Bruce F. Wollenberg presented their comprehensive introduction to the engineering and economic factors involved in operating and controlling power generation systems in electric utilities, the electric power industry has undergone unprecedented change. Deregulation, open access to transmission systems, and the birth of independent power producers have altered the structure of the industry, while technological advances have created a host of new opportunities and challenges. In Power Generation, Operation, and Control, Second Edition, Wood and Wollenberg bring professionals and students alike up to date on the nuts and bolts of the field. Continuing in the tradition of the first edition, they offer a practical, hands-on guide to theoretical developments and to the application of advanced operations research methods to realistic electric power engineering problems. This one-of-a-kind text also addresses the interaction between human and economic factors to prepare readers to make real-world decisions that go beyond the limits of mere technical calculations. The Second Edition features vital new material, including: * A computer disk developed by the authors to help readers solve complicated problems * Examination of Optimal Power Flow (OPF) * Treatment of unit commitment expanded to incorporate the Lagrange relaxation technique * Introduction to the use of bounding techniques and other contingency selection methods * Applications suited to the new, deregulated systems as well as to the traditional, vertically organized utilities company Wood and Wollenberg draw upon nearly 30 years of classroom testing to provide valuable data on operations research, state estimation methods, fuel scheduling techniques, and more. Designed for clarity and ease of use, this invaluable reference prepares industry professionals and students to meet the future challenges of power generation, operation, and control.