Data Smart: Using Data Science to Transform Information into Insight


John W. Foreman - 2013
    Major retailers are predicting everything from when their customers are pregnant to when they want a new pair of Chuck Taylors. It's a brave new world where seemingly meaningless data can be transformed into valuable insight to drive smart business decisions.But how does one exactly do data science? Do you have to hire one of these priests of the dark arts, the "data scientist," to extract this gold from your data? Nope.Data science is little more than using straight-forward steps to process raw data into actionable insight. And in Data Smart, author and data scientist John Foreman will show you how that's done within the familiar environment of a spreadsheet. Why a spreadsheet? It's comfortable! You get to look at the data every step of the way, building confidence as you learn the tricks of the trade. Plus, spreadsheets are a vendor-neutral place to learn data science without the hype. But don't let the Excel sheets fool you. This is a book for those serious about learning the analytic techniques, the math and the magic, behind big data.Each chapter will cover a different technique in a spreadsheet so you can follow along: - Mathematical optimization, including non-linear programming and genetic algorithms- Clustering via k-means, spherical k-means, and graph modularity- Data mining in graphs, such as outlier detection- Supervised AI through logistic regression, ensemble models, and bag-of-words models- Forecasting, seasonal adjustments, and prediction intervals through monte carlo simulation- Moving from spreadsheets into the R programming languageYou get your hands dirty as you work alongside John through each technique. But never fear, the topics are readily applicable and the author laces humor throughout. You'll even learn what a dead squirrel has to do with optimization modeling, which you no doubt are dying to know.

Big Data: Does Size Matter?


Timandra Harkness - 2016
    It can help us do things faster and more efficiently than ever before, from tracking wolves through Minnesota by GPS to predicting which crimes are likely to happen where. Mega data has led to scientific and social achievements that would have been impossible just a few years ago. But being too dazzled by the scale, the speed, and the geeky jargon can lead us astray. It's big, but it's not always clever.Timandra Harkness cuts through the hype to put data science into its real-life context using a wide range of stories, people, and places to reveal what is essentially a human science--demystifying big data, telling us where it comes from and what it can do. BIG DATA then asks the awkward questions: What are the unspoken assumptions underlying its methods? Are we being bamboozled by mega data's size, its speed, and its shiny technology?Nobody needs a degree in computer science to follow Harkness's exploration of what mega data can do for us--and what it can't or shouldn't. BIG DATA asks you to decide: Are you a data point, or a human being?

Data Science from Scratch: First Principles with Python


Joel Grus - 2015
    In this book, you’ll learn how many of the most fundamental data science tools and algorithms work by implementing them from scratch. If you have an aptitude for mathematics and some programming skills, author Joel Grus will help you get comfortable with the math and statistics at the core of data science, and with hacking skills you need to get started as a data scientist. Today’s messy glut of data holds answers to questions no one’s even thought to ask. This book provides you with the know-how to dig those answers out. Get a crash course in Python Learn the basics of linear algebra, statistics, and probability—and understand how and when they're used in data science Collect, explore, clean, munge, and manipulate data Dive into the fundamentals of machine learning Implement models such as k-nearest Neighbors, Naive Bayes, linear and logistic regression, decision trees, neural networks, and clustering Explore recommender systems, natural language processing, network analysis, MapReduce, and databases

Nine Algorithms That Changed the Future: The Ingenious Ideas That Drive Today's Computers


John MacCormick - 2012
    A simple web search picks out a handful of relevant needles from the world's biggest haystack: the billions of pages on the World Wide Web. Uploading a photo to Facebook transmits millions of pieces of information over numerous error-prone network links, yet somehow a perfect copy of the photo arrives intact. Without even knowing it, we use public-key cryptography to transmit secret information like credit card numbers; and we use digital signatures to verify the identity of the websites we visit. How do our computers perform these tasks with such ease? This is the first book to answer that question in language anyone can understand, revealing the extraordinary ideas that power our PCs, laptops, and smartphones. Using vivid examples, John MacCormick explains the fundamental "tricks" behind nine types of computer algorithms, including artificial intelligence (where we learn about the "nearest neighbor trick" and "twenty questions trick"), Google's famous PageRank algorithm (which uses the "random surfer trick"), data compression, error correction, and much more. These revolutionary algorithms have changed our world: this book unlocks their secrets, and lays bare the incredible ideas that our computers use every day.

Scale: The Universal Laws of Growth, Innovation, Sustainability, and the Pace of Life in Organisms, Cities, Economies, and Companies


Geoffrey B. West - 2017
    The term “complexity” can be misleading, however, because what makes West’s discoveries so beautiful is that he has found an underlying simplicity that unites the seemingly complex and diverse phenomena of living systems, including our bodies, our cities and our businesses. Fascinated by issues of aging and mortality, West applied the rigor of a physicist to the biological question of why we live as long as we do and no longer. The result was astonishing, and changed science, creating a new understanding of energy use and metabolism: West found that despite the riotous diversity in the sizes of mammals, they are all, to a large degree, scaled versions of each other. If you know the size of a mammal, you can use scaling laws to learn everything from how much food it eats per day, what its heart-rate is, how long it will take to mature, its lifespan, and so on. Furthermore, the efficiency of the mammal’s circulatory systems scales up precisely based on weight: if you compare a mouse, a human and an elephant on a logarithmic graph, you find with every doubling of average weight, a species gets 25% more efficient—and lives 25% longer. This speaks to everything from how long we can expect to live to how many hours of sleep we need. Fundamentally, he has proven, the issue has to do with the fractal geometry of the networks that supply energy and remove waste from the organism's body. West's work has been game-changing for biologists, but then he made the even bolder move of exploring his work's applicability to cities. Cities, too, are constellations of networks and laws of scalability relate with eerie precision to them. For every doubling in a city's size, the city needs 15% less road, electrical wire, and gas stations to support the same population. More amazingly, for every doubling in size, cities produce 15% more patents and more wealth, as well as 15% more crime and disease. This broad pattern lays the groundwork for a new science of cities. Recently, West has applied his revolutionary work on cities and biological life to the business world. This investigation has led to powerful insights into why some companies thrive while others fail. The implications of these discoveries are far-reaching, and are just beginning to be explored. Scale is a thrilling scientific adventure story about the elemental natural laws that bind us together in simple but profound ways. Through the brilliant mind of Geoffrey West, we can envision how cities, companies and biological life alike are dancing to the same simple, powerful tune, however diverse and unrelated they are to each other.From the Hardcover edition.

Nonzero: The Logic of Human Destiny


Robert Wright - 1999
    Now Wright attempts something even more ambitious: explaining the direction of evolution and human history–and discerning where history will lead us next.In Nonzero: The Logic of Human Destiny, Wright asserts that, ever since the primordial ooze, life has followed a basic pattern. Organisms and human societies alike have grown more complex by mastering the challenges of internal cooperation. Wright's narrative ranges from fossilized bacteria to vampire bats, from stone-age villages to the World Trade Organization, uncovering such surprises as the benefits of barbarian hordes and the useful stability of feudalism. Here is history endowed with moral significance–a way of looking at our biological and cultural evolution that suggests, refreshingly, that human morality has improved over time, and that our instinct to discover meaning may itself serve a higher purpose. Insightful, witty, profound, Nonzero offers breathtaking implications for what we believe and how we adapt to technology's ongoing transformation of the world.From the Trade Paperback edition.

Everydata: The Misinformation Hidden in the Little Data You Consume Every Day


John H. Johnson - 2016
    The average person consumes approximately 30 gigabytes of data every single day, but has no idea how to interpret it correctly. EVERYDATA explains, through the eyes of an expert economist and statistician, how to decipher the small bytes of data we consume in a day.EVERYDATA is filled with countless examples of people misconstruing data—with results that range from merely frustrating to catastrophic:The space shuttle Challenger exploded in part because the engineers were reviewing a limited sample set.Millions of women avoid caffeine during pregnancy because they interpret correlation as causation.Attorneys faced a $1 billion jury verdict because of outlier data.Each chapter highlights one commonly misunderstood data concept, using both realworld and hypothetical examples from a wide range of topics, including business, politics, advertising, law, engineering, retail, parenting, and more. You’ll find the answer to the question—“Now what?”—along with concrete ways you can use this information to immediately start making smarter decisions, today and every day.

Automate This: How Algorithms Came to Rule Our World


Christopher Steiner - 2012
    It used to be that to diagnose an illness, interpret legal documents, analyze foreign policy, or write a newspaper article you needed a human being with specific skills—and maybe an advanced degree or two. These days, high-level tasks are increasingly being handled by algorithms that can do precise work not only with speed but also with nuance. These “bots” started with human programming and logic, but now their reach extends beyond what their creators ever expected. In this fascinating, frightening book, Christopher Steiner tells the story of how algorithms took over—and shows why the “bot revolution” is about to spill into every aspect of our lives, often silently, without our knowledge. The May 2010 “Flash Crash” exposed Wall Street’s reliance on trading bots to the tune of a 998-point market drop and $1 trillion in vanished market value. But that was just the beginning. In Automate This, we meet bots that are driving cars, penning haiku, and writing music mistaken for Bach’s. They listen in on our customer service calls and figure out what Iran would do in the event of a nuclear standoff. There are algorithms that can pick out the most cohesive crew of astronauts for a space mission or identify the next Jeremy Lin. Some can even ingest statistics from baseball games and spit out pitch-perfect sports journalism indistinguishable from that produced by humans. The interaction of man and machine can make our lives easier. But what will the world look like when algorithms control our hospitals, our roads, our culture, and our national security? What hap­pens to businesses when we automate judgment and eliminate human instinct? And what role will be left for doctors, lawyers, writers, truck drivers, and many others?  Who knows—maybe there’s a bot learning to do your job this minute.

Risk Savvy: How to Make Good Decisions


Gerd Gigerenzer - 2013
    But as risk expert Gerd Gigerenzer shows, the surprising truth is that in the real world, we often get better results by using simple rules and considering less information. In Risk Savvy, Gigerenzer reveals that most of us, including doctors, lawyers, financial advisers, and elected officials, misunderstand statistics much more often than we think, leaving us not only misinformed, but vulnerable to exploitation. Yet there is hope. Anyone can learn to make better decisions for their health, finances, family, and business without needing to consult an expert or a super computer, and Gigerenzer shows us how.Risk Savvy is an insightful and easy-to-understand remedy to our collective information overload and an essential guide to making smart, confident decisions in the face of uncertainty.

Factfulness: Ten Reasons We're Wrong About the World – and Why Things Are Better Than You Think


Hans Rosling - 2018
    So wrong that a chimpanzee choosing answers at random will consistently outguess teachers, journalists, Nobel laureates, and investment bankers.In Factfulness, Professor of International Health and global TED phenomenon Hans Rosling, together with his two long-time collaborators, Anna and Ola, offers a radical new explanation of why this happens. They reveal the ten instincts that distort our perspective—from our tendency to divide the world into two camps (usually some version of us and them) to the way we consume media (where fear rules) to how we perceive progress (believing that most things are getting worse).Our problem is that we don’t know what we don’t know, and even our guesses are informed by unconscious and predictable biases.It turns out that the world, for all its imperfections, is in a much better state than we might think. That doesn’t mean there aren’t real concerns. But when we worry about everything all the time instead of embracing a worldview based on facts, we can lose our ability to focus on the things that threaten us most.Inspiring and revelatory, filled with lively anecdotes and moving stories, Factfulness is an urgent and essential book that will change the way you see the world and empower you to respond to the crises and opportunities of the future.

Being Wrong: Adventures in the Margin of Error


Kathryn Schulz - 2010
    Kathryn Schulz, editor of Grist magazine, argues that error is the fundamental human condition and should be celebrated as such. Guiding the reader through the history and psychology of error, from Socrates to Alan Greenspan, Being Wrong will change the way you perceive screw-ups, both of the mammoth and daily variety, forever.

The Joy of x: A Guided Tour of Math, from One to Infinity


Steven H. Strogatz - 2012
    do it? How should you flip your mattress to get the maximum wear out of it? How does Google search the Internet? How many people should you date before settling down? Believe it or not, math plays a crucial role in answering all of these questions and more.Math underpins everything in the cosmos, including us, yet too few of us understand this universal language well enough to revel in its wisdom, its beauty — and its joy. This deeply enlightening, vastly entertaining volume translates math in a way that is at once intelligible and thrilling. Each trenchant chapter of The Joy of x offers an “aha!” moment, starting with why numbers are so helpful, and progressing through the wondrous truths implicit in π, the Pythagorean theorem, irrational numbers, fat tails, even the rigors and surprising charms of calculus. Showing why he has won awards as a professor at Cornell and garnered extensive praise for his articles about math for the New York Times, Strogatz presumes of his readers only curiosity and common sense. And he rewards them with clear, ingenious, and often funny explanations of the most vital and exciting principles of his discipline.Whether you aced integral calculus or aren’t sure what an integer is, you’ll find profound wisdom and persistent delight in The Joy of x.

Data Science


John D. Kelleher - 2018
    Today data science determines the ads we see online, the books and movies that are recommended to us online, which emails are filtered into our spam folders, and even how much we pay for health insurance. This volume in the MIT Press Essential Knowledge series offers a concise introduction to the emerging field of data science, explaining its evolution, current uses, data infrastructure issues, and ethical challenges.It has never been easier for organizations to gather, store, and process data. Use of data science is driven by the rise of big data and social media, the development of high-performance computing, and the emergence of such powerful methods for data analysis and modeling as deep learning. Data science encompasses a set of principles, problem definitions, algorithms, and processes for extracting non-obvious and useful patterns from large datasets. It is closely related to the fields of data mining and machine learning, but broader in scope. This book offers a brief history of the field, introduces fundamental data concepts, and describes the stages in a data science project. It considers data infrastructure and the challenges posed by integrating data from multiple sources, introduces the basics of machine learning, and discusses how to link machine learning expertise with real-world problems. The book also reviews ethical and legal issues, developments in data regulation, and computational approaches to preserving privacy. Finally, it considers the future impact of data science and offers principles for success in data science projects.

Complexity: A Guided Tour


Melanie Mitchell - 2009
    Based on her work at the Santa Fe Institute and drawing on its interdisciplinary strategies, Mitchell brings clarity to the workings of complexity across a broad range of biological, technological, and social phenomena, seeking out the general principles or laws that apply to all of them. Richly illustrated, Complexity: A Guided Tour--winner of the 2010 Phi Beta Kappa Book Award in Science--offers a wide-ranging overview of the ideas underlying complex systems science, the current research at the forefront of this field, and the prospects for its contribution to solving some of the most important scientific questions of our time.

Scorecasting: The Hidden Influences Behind How Sports Are Played and Games Are Won


Tobias J. Moskowitz - 2011
    Jon Wertheim to overturn some of the most cherished truisms of sports, and reveal the hidden forces that shape how basketball, baseball, football, and hockey games are played, won and lost.Drawing from Moskowitz's original research, as well as studies from fellow economists such as bestselling author Richard Thaler, the authors look at: the influence home-field advantage has on the outcomes of games in all sports and why it exists; the surprising truth about the universally accepted axiom that defense wins championships;  the subtle biases that umpires exhibit in calling balls and strikes in key situations; the unintended consequences of referees' tendencies in every sport to "swallow the whistle," and more.Among the insights that Scorecasting reveals:Why Tiger Woods is prone to the same mistake in high-pressure putting situations that you and I areWhy professional teams routinely overvalue draft picks The myth of momentum  or the "hot hand" in sports, and why so many fans, coaches, and broadcasters fervently subscribe to itWhy NFL coaches rarely go for a first down on fourth-down situations--even when their reluctance to do so reduces their chances of winning.In an engaging narrative that takes us from the putting greens of Augusta to the grid iron of a small parochial high school in Arkansas, Scorecasting will forever change how you view the game, whatever your favorite sport might be.