Superforecasting: The Art and Science of Prediction


Philip E. Tetlock - 2015
    Unfortunately, people tend to be terrible forecasters. As Wharton professor Philip Tetlock showed in a landmark 2005 study, even experts’ predictions are only slightly better than chance. However, an important and underreported conclusion of that study was that some experts do have real foresight, and Tetlock has spent the past decade trying to figure out why. What makes some people so good? And can this talent be taught?   In Superforecasting, Tetlock and coauthor Dan Gardner offer a masterwork on prediction, drawing on decades of research and the results of a massive, government-funded forecasting tournament. The Good Judgment Project involves tens of thousands of ordinary people—including a Brooklyn filmmaker, a retired pipe installer, and a former ballroom dancer—who set out to forecast global events. Some of the volunteers have turned out to be astonishingly good. They’ve beaten other benchmarks, competitors, and prediction markets. They’ve even beaten the collective judgment of intelligence analysts with access to classified information. They are "superforecasters."   In this groundbreaking and accessible book, Tetlock and Gardner show us how we can learn from this elite group. Weaving together stories of forecasting successes (the raid on Osama bin Laden’s compound) and failures (the Bay of Pigs) and interviews with a range of high-level decision makers, from David Petraeus to Robert Rubin, they show that good forecasting doesn’t require powerful computers or arcane methods. It involves gathering evidence from a variety of sources, thinking probabilistically, working in teams, keeping score, and being willing to admit error and change course. Superforecasting offers the first demonstrably effective way to improve our ability to predict the future—whether in business, finance, politics, international affairs, or daily life—and is destined to become a modern classic.

Database Internals: A deep-dive into how distributed data systems work


Alex Petrov - 2019
    But with so many distributed databases and tools available today, it’s often difficult to understand what each one offers and how they differ. With this practical guide, Alex Petrov guides developers through the concepts behind modern database and storage engine internals. Throughout the book, you’ll explore relevant material gleaned from numerous books, papers, blog posts, and the source code of several open source databases. These resources are listed at the end of parts one and two. You’ll discover that the most significant distinctions among many modern databases reside in subsystems that determine how storage is organized and how data is distributed. This book examines:
    - Storage engines: Explore storage classification and taxonomy, and dive into B-Tree-based and immutable log structured storage engines, with differences and use-cases for each
    - Distributed systems: Learn step-by-step how nodes and processes connect and build complex communication patterns, from UDP to reliable consensus protocols
    - Database clusters: Discover how to achieve consistent models for replicated data

Learning SPARQL


Bob DuCharme - 2011
    With this concise book, you will learn how to use the latest version of this W3C standard to retrieve and manipulate the increasing amount of public and private data available via SPARQL endpoints. Several open source and commercial tools already support SPARQL, and this introduction gets you started right away. Begin with how to write and run simple SPARQL 1.1 queries, then dive into the language's powerful features and capabilities for manipulating the data you retrieve. Learn what you need to know to add to, update, and delete data in RDF datasets, and give web applications access to this data.
    - Understand SPARQL’s connection with RDF, the semantic web, and related specifications
    - Query and combine data from local and remote sources
    - Copy, convert, and create new RDF data
    - Learn how datatype metadata, standardized functions, and extension functions contribute to your queries
    - Incorporate SPARQL queries into web-based applications

Python for Everybody: Exploring Data in Python 3


Charles Severance - 2016
    You can think of the Python programming language as your tool to solve data problems that are beyond the capability of a spreadsheet. Python is an easy to use and easy to learn programming language that is freely available on Macintosh, Windows, or Linux computers. So once you learn Python you can use it for the rest of your career without needing to purchase any software. This book uses the Python 3 language. The earlier Python 2 version of this book is titled "Python for Informatics: Exploring Information".

Automate the Boring Stuff with Python: Practical Programming for Total Beginners


Al Sweigart - 2014
    But what if you could have your computer do them for you? In "Automate the Boring Stuff with Python," you'll learn how to use Python to write programs that do in minutes what would take you hours to do by hand, with no prior programming experience required. Once you've mastered the basics of programming, you'll create Python programs that effortlessly perform useful and impressive feats of automation to:
    - Search for text in a file or across multiple files
    - Create, update, move, and rename files and folders
    - Search the Web and download online content
    - Update and format data in Excel spreadsheets of any size
    - Split, merge, watermark, and encrypt PDFs
    - Send reminder emails and text notifications
    - Fill out online forms
    Step-by-step instructions walk you through each program, and practice projects at the end of each chapter challenge you to improve those programs and use your newfound skills to automate similar tasks. Don't spend your time doing work a well-trained monkey could do. Even if you've never written a line of code, you can make your computer do the grunt work. Learn how in "Automate the Boring Stuff with Python."

Digital Copyright: Protecting Intellectual Property on the Internet


Jessica Litman - 2000
    The efforts to enforce these new rights have resulted in highly publicized legal battles between established media, such as major record labels and motion picture studios, and upstart Internet companies, such as MP3.com and Napster. The general public is used to thinking of copyright (if it thinks of it at all) as marginal and arcane, and it hasn't paid much attention as legislation to expand copyright moved through Congress. But copyright law is central to our society's information policy, and affects what we can read, view, hear, use, or learn. In this enlightening and well-argued book, law professor Jessica Litman questions whether copyright laws crafted by lawyers and their lobbyists really make sense for the vast majority of us. Should every interaction between ordinary consumers and copyright-protected works be governed by laws drafted without ordinary consumers in mind? Is it practical to enforce such laws, or expect consumers to obey them? Most important, what are the effects of such laws on the exchange of information in a free society? Litman's critique exposes the 1998 copyright law as an incoherent patchwork. She argues for reforms that reflect common sense and the way people actually behave in their daily digital interactions.

Bit by Bit: Social Research in the Digital Age


Matthew J. Salganik - 2017
    In addition to changing how we live, these tools enable us to collect and process data about human behavior on a scale never before imaginable, offering entirely new approaches to core questions about social behavior. Bit by Bit is the key to unlocking these powerful methods--a landmark book that will fundamentally change how the next generation of social scientists and data scientists explores the world around us. Bit by Bit is the essential guide to mastering the key principles of doing social research in this fast-evolving digital age. In this comprehensive yet accessible book, Matthew Salganik explains how the digital revolution is transforming how social scientists observe behavior, ask questions, run experiments, and engage in mass collaborations. He provides a wealth of real-world examples throughout and also lays out a principles-based approach to handling ethical challenges. Bit by Bit is an invaluable resource for social scientists who want to harness the research potential of big data and a must-read for data scientists interested in applying the lessons of social science to tomorrow's technologies.
    - Illustrates important ideas with examples of outstanding research
    - Combines ideas from social science and data science in an accessible style and without jargon
    - Goes beyond the analysis of "found" data to discuss the collection of "designed" data such as surveys, experiments, and mass collaboration
    - Features an entire chapter on ethics
    - Includes extensive suggestions for further reading and activities for the classroom or self-study

Data and Reality


William Kent - 1978
    

The Book of Trees: Visualizing Branches of Knowledge


Manuel Lima - 2014
    Particularly noteworthy are the numerous historical examples of past efforts to make sense of complex systems of information. In this new companion volume, The Book of Trees, data viz expert Manuel Lima examines the more than eight hundred year history of the tree diagram, from its roots in the illuminated manuscripts of medieval monasteries to its current resurgence as an elegant means of visualization. Lima presents two hundred intricately detailed tree diagram illustrations on a remarkable variety of subjects—from some of the earliest known examples from ancient Mesopotamia to the manuscripts of medieval monasteries to contributions by leading contemporary designers. A timeline of capsule biographies on key figures in the development of the tree diagram rounds out this one-of-a-kind visual compendium.

The Ascent of Information: Books, Bits, Genes, Machines, and Life's Unending Algorithm


Caleb Scharf - 2021
    But in our rush to build the infrastructure for the 20 quintillion bits we create every day, we've failed to ask exactly why we're expending ever-increasing amounts of energy, resources, and human effort to maintain all this data. Drawing on deep ideas and frontier thinking in evolutionary biology, computer science, information theory, and astrobiology, Caleb Scharf argues that information is, in a very real sense, alive. All the data we create--all of our emails, tweets, selfies, A.I.-generated text and funny cat videos--amounts to an aggregate lifeform. It has goals and needs. It can control our behavior and influence our well-being. And it's an organism that has evolved right alongside us. This symbiotic relationship with information offers a startling new lens for looking at the world. Data isn't just something we produce; it's the reason we exist. This powerful idea has the potential to upend the way we think about our technology, our role as humans, and the fundamental nature of life. The Ascent of Information offers a humbling vision of a universe built of and for information. Scharf explores how our relationship with data will affect our ongoing evolution as a species. Understanding this relationship will be crucial to preventing our data from becoming more of a burden than an asset, and to preserving the possibility of a human future.

Breaking the Social Media Prism: How to Make Our Platforms Less Polarizing


Christopher A. Bail - 2021
    We use social media as a mirror to decipher our place in society but, as Chris Bail explains, it functions more like a prism that distorts our identities, empowers status-seeking extremists, and renders moderates all but invisible. Breaking the Social Media Prism challenges common myths about echo chambers, foreign misinformation campaigns, and radicalizing algorithms, revealing that the solution to political tribalism lies deep inside ourselves. Drawing on innovative online experiments and in-depth interviews with social media users from across the political spectrum, this book explains why stepping outside of our echo chambers can make us more polarized, not less. Bail takes you inside the minds of online extremists through vivid narratives that trace their lives on the platforms and off--detailing how they dominate public discourse at the expense of the moderate majority. Wherever you stand on the spectrum of user behavior and political opinion, he offers fresh solutions to counter political tribalism from the bottom up and the top down. He introduces new apps and bots to help readers avoid misperceptions and engage in better conversations with the other side. Finally, he explores what the virtual public square might look like if we could hit reset and redesign social media from scratch through a first-of-its-kind experiment on a new social media platform built for scientific research. Providing data-driven recommendations for strengthening our social media connections, Breaking the Social Media Prism shows how to combat online polarization without deleting our accounts.

Introduction to Algorithms


Thomas H. Cormen - 1989
    Each chapter is relatively self-contained and can be used as a unit of study. The algorithms are described in English and in a pseudocode designed to be readable by anyone who has done a little programming. The explanations have been kept elementary without sacrificing depth of coverage or mathematical rigor.

Data Modeling Essentials


Graeme Simsion - 1992
    In order to enable students to apply the basics of data modeling to real models, the book addresses the realities of developing systems in real-world situations by assessing the merits of a variety of possible solutions as well as using language and diagramming methods that represent industry practice. This revised edition has been given significantly expanded coverage and reorganized for greater reader comprehension even as it retains its distinctive hallmarks of readability and usefulness. Beginning with the basics, the book provides a thorough grounding in theory before guiding the reader through the various stages of applied data modeling and database design. Later chapters address advanced subjects, including business rules, data warehousing, enterprise-wide modeling and data management. It includes an entirely new section discussing the development of logical and physical modeling, along with new material describing a powerful technique for model verification. It also provides an excellent resource for additional lectures and exercises. This text is the ideal reference for data modelers, data architects, database designers, DBAs, and systems analysts, as well as undergraduate and graduate-level students looking for a real-world perspective.

Moral Panics and the Copyright Wars


William Patry - 2009
    In Moral Panics and the Copyright Wars, William Patry lays bare how we got to where we are: a bloated, punitive legal regime that has strayed far from its modest, but important roots. Patry demonstrates how copyright is a utilitarian government program--not a property or moral right. As a government program, copyright must be regulated and held accountable to ensure it is serving its public purpose. Just as Wall Street must serve Main Street, neither can copyright be left to a Reaganite magic of the market. The way we have come to talk about copyright--metaphoric language demonizing everyone involved--has led to bad business and bad policy decisions. Unless we recognize that the debates over copyright are debates over business models, we will never be able to make the correct business and policy decisions. A centrist and believer in appropriately balanced copyright laws, Patry concludes that calls for strong copyright laws, just like calls for weak copyright laws, miss the point entirely: the only laws we need are effective laws, laws that further the purpose of encouraging the creation of new works and learning. Our current regime, unfortunately, creates too many bad incentives, leading to bad conduct. Just as President Obama has called for re-tooling and re-imagining the auto industry, Patry calls for a remaking of our copyright laws so that they may once again be respected.

Being Digital


Nicholas Negroponte - 1995
    Negroponte's fans will want to get a copy of Being Digital, which is an edited version of the 18 articles he wrote for Wired about "being digital." Negroponte's text is mostly a history of media technology rather than a set of predictions for future technologies. In the beginning, he describes the evolution of CD-ROMs, multimedia, hypermedia, HDTV (high-definition television), and more. The section on interfaces is informative, offering an up-to-date history on visual interfaces, graphics, virtual reality (VR), holograms, teleconferencing hardware, the mouse and touch-sensitive interfaces, and speech recognition. In the last chapter and the epilogue, Negroponte offers visionary insight on what "being digital" means for our future. Negroponte praises computers for their educational value but recognizes certain dangers of technological advances, such as increased software and data piracy and huge shifts in our job market that will require workers to transfer their skills to the digital medium. Overall, Being Digital provides an informative history of the rise of technology and some interesting predictions for its future.