Book picks similar to
The Geometry of Multivariate Statistics by Thomas D. Wickens
statistics
mathematics
stats
data-science
Learning From Data: A Short Course
Yaser S. Abu-Mostafa - 2012
Its techniques are widely applied in engineering, science, finance, and commerce. This book is designed for a short course on machine learning. It is a short course, not a hurried course. From over a decade of teaching this material, we have distilled what we believe to be the core topics that every student of the subject should know. We chose the title `learning from data' that faithfully describes what the subject is about, and made it a point to cover the topics in a story-like fashion. Our hope is that the reader can learn all the fundamentals of the subject by reading the book cover to cover. ---- Learning from data has distinct theoretical and practical tracks. In this book, we balance the theoretical and the practical, the mathematical and the heuristic. Our criterion for inclusion is relevance. Theory that establishes the conceptual framework for learning is included, and so are heuristics that impact the performance of real learning systems. ---- Learning from data is a very dynamic field. Some of the hot techniques and theories at times become just fads, and others gain traction and become part of the field. What we have emphasized in this book are the necessary fundamentals that give any student of learning from data a solid foundation, and enable him or her to venture out and explore further techniques and theories, or perhaps to contribute their own. ---- The authors are professors at California Institute of Technology (Caltech), Rensselaer Polytechnic Institute (RPI), and National Taiwan University (NTU), where this book is the main text for their popular courses on machine learning. The authors also consult extensively with financial and commercial companies on machine learning applications, and have led winning teams in machine learning competitions.
Bit by Bit: Social Research in the Digital Age
Matthew J. Salganik - 2017
In addition to changing how we live, these tools enable us to collect and process data about human behavior on a scale never before imaginable, offering entirely new approaches to core questions about social behavior. Bit by Bit is the key to unlocking these powerful methods--a landmark book that will fundamentally change how the next generation of social scientists and data scientists explores the world around us.Bit by Bit is the essential guide to mastering the key principles of doing social research in this fast-evolving digital age. In this comprehensive yet accessible book, Matthew Salganik explains how the digital revolution is transforming how social scientists observe behavior, ask questions, run experiments, and engage in mass collaborations. He provides a wealth of real-world examples throughout and also lays out a principles-based approach to handling ethical challenges.Bit by Bit is an invaluable resource for social scientists who want to harness the research potential of big data and a must-read for data scientists interested in applying the lessons of social science to tomorrow's technologies.Illustrates important ideas with examples of outstanding researchCombines ideas from social science and data science in an accessible style and without jargonGoes beyond the analysis of "found" data to discuss the collection of "designed" data such as surveys, experiments, and mass collaborationFeatures an entire chapter on ethicsIncludes extensive suggestions for further reading and activities for the classroom or self-study
The Calculus Direct
John Weiss - 2009
The calculus is not a hard subject and I prove this through an easy to read and obvious approach spanning only 100 pages. I have written this book with the following type of student in mind; the non-traditional student returning to college after a long break, a notoriously weak student in math who just needs to get past calculus to obtain a degree, and the garage tinkerer who wishes to understand a little more about the technical subjects. This book is meant to address the many fundamental thought-blocks that keep the average 'mathaphobe' (or just an interested person who doesn't have the time to enroll in a course) from excelling in mathematics in a clear and concise manner. It is my sincerest hope that this book helps you with your needs.Show more Show less
The Flaw of Averages: Why We Underestimate Risk in the Face of Uncertainty
Sam L. Savage - 2009
As the recent collapse on Wall Street shows, we are often ill-equipped to deal with uncertainty and risk. Yet every day we base our personal and business plans on uncertainties, whether they be next month's sales, next year's costs, or tomorrow's stock price. In The Flaw of Averages, Sam Savage-known for his creative exposition of difficult subjects- describes common avoidable mistakes in assessing risk in the face of uncertainty. Along the way, he shows why plans based on average assumptions are wrong, on average, in areas as diverse as healthcare, accounting, the War on Terror, and climate change. In his chapter on Sex and the Central Limit Theorem, he bravely grasps the literary third rail of gender differences.Instead of statistical jargon, Savage presents complex concepts in plain English. In addition, a tightly integrated web site contains numerous animations and simulations to further connect the seat of the reader's intellect to the seat of their pants.The Flaw of Averages typically results when someone plugs a single number into a spreadsheet to represent an uncertain future quantity. Savage finishes the book with a discussion of the emerging field of Probability Management, which cures this problem though a new technology that can pack thousands of numbers into a single spreadsheet cell.Praise for The Flaw of Averages"Statistical uncertainties are pervasive in decisions we make every day in business, government, and our personal lives. Sam Savage's lively and engaging book gives any interested reader the insight and the tools to deal effectively with those uncertainties. I highly recommend The Flaw of Averages." --William J. Perry, Former U.S. Secretary of Defense"Enterprise analysis under uncertainty has long been an academic ideal. . . . In this profound and entertaining book, Professor Savage shows how to make all this practical, practicable, and comprehensible." ---Harry Markowitz, Nobel Laureate in Economics
Mining of Massive Datasets
Anand Rajaraman - 2011
This book focuses on practical algorithms that have been used to solve key problems in data mining and which can be used on even the largest datasets. It begins with a discussion of the map-reduce framework, an important tool for parallelizing algorithms automatically. The authors explain the tricks of locality-sensitive hashing and stream processing algorithms for mining data that arrives too fast for exhaustive processing. The PageRank idea and related tricks for organizing the Web are covered next. Other chapters cover the problems of finding frequent itemsets and clustering. The final chapters cover two applications: recommendation systems and Web advertising, each vital in e-commerce. Written by two authorities in database and Web technologies, this book is essential reading for students and practitioners alike.
Data Analysis with Open Source Tools: A Hands-On Guide for Programmers and Data Scientists
Philipp K. Janert - 2010
With this insightful book, intermediate to experienced programmers interested in data analysis will learn techniques for working with data in a business environment. You'll learn how to look at data to discover what it contains, how to capture those ideas in conceptual models, and then feed your understanding back into the organization through business plans, metrics dashboards, and other applications.Along the way, you'll experiment with concepts through hands-on workshops at the end of each chapter. Above all, you'll learn how to think about the results you want to achieve -- rather than rely on tools to think for you.Use graphics to describe data with one, two, or dozens of variablesDevelop conceptual models using back-of-the-envelope calculations, as well asscaling and probability argumentsMine data with computationally intensive methods such as simulation and clusteringMake your conclusions understandable through reports, dashboards, and other metrics programsUnderstand financial calculations, including the time-value of moneyUse dimensionality reduction techniques or predictive analytics to conquer challenging data analysis situationsBecome familiar with different open source programming environments for data analysisFinally, a concise reference for understanding how to conquer piles of data.--Austin King, Senior Web Developer, MozillaAn indispensable text for aspiring data scientists.--Michael E. Driscoll, CEO/Founder, Dataspora
Social and Economic Networks
Matthew O. Jackson - 2008
The many aspects of our lives that are governed by social networks make it critical to understand how they impact behavior, which network structures are likely to emerge in a society, and why we organize ourselves as we do. In Social and Economic Networks, Matthew Jackson offers a comprehensive introduction to social and economic networks, drawing on the latest findings in economics, sociology, computer science, physics, and mathematics. He provides empirical background on networks and the regularities that they exhibit, and discusses random graph-based models and strategic models of network formation. He helps readers to understand behavior in networked societies, with a detailed analysis of learning and diffusion in networks, decision making by individuals who are influenced by their social neighbors, game theory and markets on networks, and a host of related subjects. Jackson also describes the varied statistical and modeling techniques used to analyze social networks. Each chapter includes exercises to aid students in their analysis of how networks function.This book is an indispensable resource for students and researchers in economics, mathematics, physics, sociology, and business.
Intuitive Biostatistics
Harvey Motulsky - 1995
Intuitive Biostatistics covers all the topics typically found in an introductory statistics text, but with the emphasis on confidence intervals rather than P values, making it easier for students to understand both. Additionally, it introduces a broad range of topics left out of most other introductory texts but used frequently in biomedical publications, including survival curves. multiple comparisons, sensitivity and specificity of lab tests, Bayesian thinking, lod scores, and logistic, proportional hazards and nonlinear regression. By emphasizing interpretation rather than calculation, this text provides a clear and virtually painless introduction to statistical principles for those students who will need to use statistics constantly in their work. In addition, its practical approach enables readers to understand the statistical results published in biological and medical journals.
Foundations of Statistical Natural Language Processing
Christopher D. Manning - 1999
This foundational text is the first comprehensive introduction to statistical natural language processing (NLP) to appear. The book contains all the theory and algorithms needed for building NLP tools. It provides broad but rigorous coverage of mathematical and linguistic foundations, as well as detailed discussion of statistical methods, allowing students and researchers to construct their own implementations. The book covers collocation finding, word sense disambiguation, probabilistic parsing, information retrieval, and other applications.
Chances Are . . .: Adventures in Probability
Michael Kaplan - 2003
All things are possible, only one thing actually happens; everything else is in the realm of probability. The twin disciplines of probability and statistics underpin every modern science and sketch the shape of all purposeful group activity- politics, economics, medicine, law, sports-giving humans a handle on the essential uncertainty of their existence. Yet while we are all aware of the hard facts, most of us still refuse to take account of probability-preferring to drive, not fly; buying into market blips; smoking cigarettes; denying we will ever age. There are some people, though-gamblers, risk buyers, forensic experts, doctors, strategists- who find probability's mass of incomplete uncertainties delightful and revelatory. "Chances Are" is their story. Combining philosophical and historical background with portraits of the men and women who command the forces of probability, this engaging, wide-ranging, and clearly written volume will be welcomed not only by the proven audiences for popular books like "E=MC2" and "The Golden Ratio" but by anyone interested in the workings of fate.
Mathematics In The Modern World: Readings From Scientific American
Morris Kline - 1968
Networks: An Introduction
M.E.J. Newman - 2010
The rise of the Internet and the wide availability of inexpensive computers have made it possible to gather and analyze network data on a large scale, and the development of a variety of new theoretical tools has allowed us to extract new knowledge from many different kinds of networks.The study of networks is broadly interdisciplinary and important developments have occurred in many fields, including mathematics, physics, computer and information sciences, biology, and the social sciences. This book brings together for the first time the most important breakthroughs in each of these fields and presents them in a coherent fashion, highlighting the strong interconnections between work in different areas.Subjects covered include the measurement and structure of networks in many branches of science, methods for analyzing network data, including methods developed in physics, statistics, and sociology, the fundamentals of graph theory, computer algorithms, and spectral methods, mathematical models of networks, including random graph models and generative models, and theories of dynamical processes taking place on networks.
Probability And Statistics For Engineers And Scientists
Ronald E. Walpole - 1978
Offers extensively updated coverage, new problem sets, and chapter-ending material to enhance the book’s relevance to today’s engineers and scientists. Includes new problem sets demonstrating updated applications to engineering as well as biological, physical, and computer science. Emphasizes key ideas as well as the risks and hazards associated with practical application of the material. Includes new material on topics including: difference between discrete and continuous measurements; binary data; quartiles; importance of experimental design; “dummy” variables; rules for expectations and variances of linear functions; Poisson distribution; Weibull and lognormal distributions; central limit theorem, and data plotting. Introduces Bayesian statistics, including its applications to many fields. For those interested in learning more about probability and statistics.
Convex Optimization
Stephen Boyd - 2004
A comprehensive introduction to the subject, this book shows in detail how such problems can be solved numerically with great efficiency. The focus is on recognizing convex optimization problems and then finding the most appropriate technique for solving them. The text contains many worked examples and homework exercises and will appeal to students, researchers and practitioners in fields such as engineering, computer science, mathematics, statistics, finance, and economics.
Counting: How We Use Numbers to Decide What Matters
Deborah Stone - 2020
With help from Dr. Seuss and Cookie Monster, she explains why numbers can’t be objective: in order to count, one must first decide what counts. Every number is the ending to a story built on cultural assumptions, social conventions, and personal judgments.And yet, in this age of big data and metric mania, numbers shape almost every facet of our lives: whether we get hired, fired, or promoted; whether we get into college or out of prison; how our opinions are gathered and portrayed to politicians; or how government designs health and safety regulations. In warm and playful prose, Counting explores what happens when we measure nebulous notions like merit, race, poverty, pain, or productivity.When so much rides on numbers, they can become instruments of social welfare, justice, and democracy—or not. The citizens of Flint, Michigan, for instance, used numbers to prove how their household water got contaminated and to force their government to take remedial action. In stark contrast, the Founding Fathers finessed an intractable conflict by counting each slave as three-fifths of a person in the national census. They set a terrible precedent for today’s politicians who claim to solve moral and political dilemmas with arithmetic.Suffused with moral reflection and ending with a powerful epilogue on COVID-19’s dizzying statistics, Counting will forever change our relationship with numbers.