Book picks similar to
"Raw Data" Is An Oxymoron by Lisa Gitelman
nonfiction
non-fiction
science
tech
Mining the Social Web: Analyzing Data from Facebook, Twitter, LinkedIn, and Other Social Media Sites
Matthew A. Russell - 2011
You’ll learn how to combine social web data, analysis techniques, and visualization to find what you’ve been looking for in the social haystack—as well as useful information you didn’t know existed.Each standalone chapter introduces techniques for mining data in different areas of the social Web, including blogs and email. All you need to get started is a programming background and a willingness to learn basic Python tools.Get a straightforward synopsis of the social web landscapeUse adaptable scripts on GitHub to harvest data from social network APIs such as Twitter, Facebook, LinkedIn, and Google+Learn how to employ easy-to-use Python tools to slice and dice the data you collectExplore social connections in microformats with the XHTML Friends NetworkApply advanced mining techniques such as TF-IDF, cosine similarity, collocation analysis, document summarization, and clique detectionBuild interactive visualizations with web technologies based upon HTML5 and JavaScript toolkits"A rich, compact, useful, practical introduction to a galaxy of tools, techniques, and theories for exploring structured and unstructured data." --Alex Martelli, Senior Staff Engineer, Google
Naked Statistics: Stripping the Dread from the Data
Charles Wheelan - 2012
How can we catch schools that cheat on standardized tests? How does Netflix know which movies you’ll like? What is causing the rising incidence of autism? As best-selling author Charles Wheelan shows us in Naked Statistics, the right data and a few well-chosen statistical tools can help us answer these questions and more.For those who slept through Stats 101, this book is a lifesaver. Wheelan strips away the arcane and technical details and focuses on the underlying intuition that drives statistical analysis. He clarifies key concepts such as inference, correlation, and regression analysis, reveals how biased or careless parties can manipulate or misrepresent data, and shows us how brilliant and creative researchers are exploiting the valuable data from natural experiments to tackle thorny questions.And in Wheelan’s trademark style, there’s not a dull page in sight. You’ll encounter clever Schlitz Beer marketers leveraging basic probability, an International Sausage Festival illuminating the tenets of the central limit theorem, and a head-scratching choice from the famous game show Let’s Make a Deal—and you’ll come away with insights each time. With the wit, accessibility, and sheer fun that turned Naked Economics into a bestseller, Wheelan defies the odds yet again by bringing another essential, formerly unglamorous discipline to life.
Mining of Massive Datasets
Anand Rajaraman - 2011
This book focuses on practical algorithms that have been used to solve key problems in data mining and which can be used on even the largest datasets. It begins with a discussion of the map-reduce framework, an important tool for parallelizing algorithms automatically. The authors explain the tricks of locality-sensitive hashing and stream processing algorithms for mining data that arrives too fast for exhaustive processing. The PageRank idea and related tricks for organizing the Web are covered next. Other chapters cover the problems of finding frequent itemsets and clustering. The final chapters cover two applications: recommendation systems and Web advertising, each vital in e-commerce. Written by two authorities in database and Web technologies, this book is essential reading for students and practitioners alike.
The Human Use of Human Beings: Cybernetics and Society
Norbert Wiener - 1949
Norbert Wiener's classic is one in that small company. Founder of the science of cybernetics—the study of the relationship between computers and the human nervous system—Wiener was widely misunderstood as one who advocated the automation of human life. As this book reveals, his vision was much more complex and interesting. He hoped that machines would release people from relentless and repetitive drudgery in order to achieve more creative pursuits. At the same time he realized the danger of dehumanizing and displacement. His book examines the implications of cybernetics for education, law, language, science, technology, as he anticipates the enormous impact—in effect, a third industrial revolution—that the computer has had on our lives.
An Introduction to Statistical Learning: With Applications in R
Gareth James - 2013
This book presents some of the most important modeling and prediction techniques, along with relevant applications. Topics include linear regression, classification, resampling methods, shrinkage approaches, tree- based methods, support vector machines, clustering, and more. Color graphics and real-world examples are used to illustrate the methods presented. Since the goal of this textbook is to facilitate the use of these statistical learning techniques by practitioners in science, industry, and other fields, each chapter contains a tutorial on implementing the analyses and methods presented in R, an extremely popular open source statistical software platform. Two of the authors co-wrote The Elements of Statistical Learning (Hastie, Tibshirani and Friedman, 2nd edition 2009), a popular reference book for statistics and machine learning researchers. An Introduction to Statistical Learning covers many of the same topics, but at a level accessible to a much broader audience. This book is targeted at statisticians and non-statisticians alike who wish to use cutting-edge statistical learning techniques to analyze their data. The text assumes only a previous course in linear regression and no knowledge of matrix algebra.
Remediation: Understanding New Media
Jay David Bolter - 1998
In this richly illustrated study, Jay David Bolter and Richard Grusin offer a theory of mediation for our digital age that challenges this assumption. They argue that new visual media achieve their cultural significance precisely by paying homage to, rivaling, and refashioning such earlier media as perspective painting, photography, film, and television. They call this process of refashioning "remediation," and they note that earlier media have also refashioned one another: photography remediated painting, film remediated stage production and photography, and television remediated film, vaudeville, and radio.
The Case for Books: Past, Present, and Future
Robert Darnton - 2009
With the printed book, information and ideas could disseminate more widely and effectively than ever before—and in some cases, affect and redirect the sway of history. Today, nearly one million books are published each year. But is the era of the book as we know it—a codex of bound pages—coming to an end? And if it is, should we celebrate its demise and the creation of a democratic digital future, or mourn an irreplaceable loss? The digital age is revolutionizing the information landscape. Already, more books have been scanned and digitized than were housed in the great library in Alexandria, making available millions of texts for a curious reader at the click of a button, and electronic book sales are growing exponentially. Will this revolution in the delivery of information and entertainment make for more transparent and far-reaching dissemination or create a monopolistic stranglehold? In The Case for Books, Robert Darnton, an intellectual pioneer in the field of the history of the book and director of Harvard University's Library, offers an in-depth examination of the book from its earliest beginnings to its shifting role today in popular culture, commerce, and the academy. As an author, editorial advisor, and publishing entrepreneur, Darnton is a unique authority on the life and role of the book in society. This book is a wise work of scholarship—one that requires readers to carefully consider how the digital revolution will broadly affect the marketplace of ideas.
Data and Goliath: The Hidden Battles to Collect Your Data and Control Your World
Bruce Schneier - 2015
Your online and in-store purchasing patterns are recorded, and reveal if you're unemployed, sick, or pregnant. Your e-mails and texts expose your intimate and casual friends. Google knows what you’re thinking because it saves your private searches. Facebook can determine your sexual orientation without you ever mentioning it.The powers that surveil us do more than simply store this information. Corporations use surveillance to manipulate not only the news articles and advertisements we each see, but also the prices we’re offered. Governments use surveillance to discriminate, censor, chill free speech, and put people in danger worldwide. And both sides share this information with each other or, even worse, lose it to cybercriminals in huge data breaches.Much of this is voluntary: we cooperate with corporate surveillance because it promises us convenience, and we submit to government surveillance because it promises us protection. The result is a mass surveillance society of our own making. But have we given up more than we’ve gained? In Data and Goliath, security expert Bruce Schneier offers another path, one that values both security and privacy. He brings his bestseller up-to-date with a new preface covering the latest developments, and then shows us exactly what we can do to reform government surveillance programs, shake up surveillance-based business models, and protect our individual privacy. You'll never look at your phone, your computer, your credit cards, or even your car in the same way again.
To Save Everything, Click Here: The Folly of Technological Solutionism
Evgeny Morozov - 2013
But how will these be affected once we delegate much of the responsibility for them to technology? The temptation of the digital age is to fix everything—from crime to corruption to pollution to obesity—by digitally quantifying, tracking, or gamifiying behavior. But when we change the motivations for our moral, ethical, and civic behavior, we may also change the very nature of that behavior itself. Technology, Evgeny Morozov proposes, can be a force for improvement—but only if we abandon the idea that it is necessarily revolutionary and instead genuinely interrogate what we are doing with it and what it is doing to us.From urging us to abandon monolithic ideas of “the Internet” to showing how to design more humane and democratic technological solutions, To Save Everything, Click Here is a dazzling tour of our technological future, and a searching investigation into the digital version of an enduring struggle: between man and his machines.
Research Design: Qualitative, Quantitative, and Mixed Methods Approaches
John W. Creswell - 1994
It is a book that models the types of issues that best suit different approaches and allows students to understand when to use mixed methods. Furthermore, its focus on theory and paradigms is done in a way that helps students decode their meaning." --MARTHA MONTERO-SIEBURTH, University of Massachusetts, BostonNew to the Second Edition:Because mixed methods research has come into its own since the publication of the first edition, every chapter now shows how to implement a mixed methods design in your proposal or plan as well as showing how to implement qualitative and quantitative approaches Ethical issues that may arise in quantitative, qualitative and mixed methods designs have been added to a new section in Chapter 3 Writing tips and considerations have been expanded and moved to the first part of the book to get your research plan started in the right direction The latest developments in qualitative inquiry, including advocacy, participatory, and emancipatory approaches have been added to Chapter 10 Mixed methods procedures (Chapter 11) show readers how to identify types of mixed methods strategy, select data collection and analysis approaches, and plan the overall structure of the study
Metadata for Digital Collections: A How-To-Do-It Manual
Steven J. Miller - 2011
It's Complicated: The Social Lives of Networked Teens
Danah Boyd - 2014
. . It’s Complicated will update your mind.”—Alissa Quart, New York Times Book Review “A fascinating, well-researched and (mostly) reassuring look at how today's tech-savvy teenagers are using social media.”—People “The briefest possible summary? The kids are all right, but society isn’t.”—Andrew Leonard, Salon What is new about how teenagers communicate through services such as Facebook, Twitter, and Instagram? Do social media affect the quality of teens’ lives? In this eye-opening book, youth culture and technology expert danah boyd uncovers some of the major myths regarding teens' use of social media. She explores tropes about identity, privacy, safety, danger, and bullying. Ultimately, boyd argues that society fails young people when paternalism and protectionism hinder teenagers’ ability to become informed, thoughtful, and engaged citizens through their online interactions. Yet despite an environment of rampant fear-mongering, boyd finds that teens often find ways to engage and to develop a sense of identity. Boyd’s conclusions are essential reading not only for parents, teachers, and others who work with teens but also for anyone interested in the impact of emerging technologies on society, culture, and commerce in years to come. Offering insights gleaned from more than a decade of original fieldwork interviewing teenagers across the United States, boyd concludes reassuringly that the kids are all right. At the same time, she acknowledges that coming to terms with life in a networked era is not easy or obvious. In a technologically mediated world, life is bound to be complicated.
When We Are No More: How Digital Memory Is Shaping Our Future
Abby Smith Rumsey - 2015
Our stories, ideas, and innovations--in a word, our "culture"--can be recorded and passed on to future generations. Our enduring culture and restless curiosity have enabled us to invent powerful information technologies that give us invaluable perspective on our past and define our future. Today, we stand at the very edge of a vast, uncharted digital landscape, where our collective memory is stored in ephemeral bits and bytes and lives in air-conditioned server rooms. What sources will historians turn to in 100, let alone 1,000 years to understand our own time if all of our memory lives in digital codes that may no longer be decipherable?In When We Are No More Abby Smith Rumsey explores human memory from pre-history to the present to shed light on the grand challenge facing our world--the abundance of information and scarcity of human attention. Tracing the story from cuneiform tablets and papyrus scrolls, to movable type, books, and the birth of the Library of Congress, Rumsey weaves a compelling narrative that explores how humans have dealt with the problem of too much information throughout our history, and indeed how we might begin solve the same problem for our digital future. Serving as a call to consciousness, When We Are No More explains why data storage is not memory; why forgetting is the first step towards remembering; and above all, why memory is about the future, not the past."If we're thinking 1,000 years, 3,000 years ahead in the future, we have to ask ourselves, how do we preserve all the bits that we need in order to correctly interpret the digital objects we create? We are nonchalantly throwing all of our data into what could become an information black hole without realizing it." --Vint Cerf, Chief Evangelist at Google, at a press conference in February, 2015.
Macroanalysis: Digital Methods and Literary History
Matthew L. Jockers - 2013
Jockers introduces readers to large-scale literary computing and the revolutionary potential of macroanalysis--a new approach to the study of the literary record designed for probing the digital-textual world as it exists today, in digital form and in large quantities. Using computational analysis to retrieve key words, phrases, and linguistic patterns across thousands of texts in digital libraries, researchers can draw conclusions based on quantifiable evidence regarding how literary trends are employed over time, across periods, within regions, or within demographic groups, as well as how cultural, historical, and societal linkages may bind individual authors, texts, and genres into an aggregate literary culture. Moving beyond the limitations of literary interpretation based on the "close-reading" of individual works, Jockers describes how this new method of studying large collections of digital material can help us to better understand and contextualize the individual works within those collections.
Double Fold: Libraries and the Assault on Paper
Nicholson Baker - 2001
But for fifty years our country's libraries—including the Library of Congress—have been doing just the opposite, destroying hundreds of thousands of historic newspapers and replacing them with microfilm copies that are difficult to read, lack all the color and quality of the original paper and illustrations, and deteriorate with age.With meticulous detective work and Baker's well-known explanatory power, Double Fold reveals a secret history of microfilm lobbyists, former CIA agents, and warehouses where priceless archives are destroyed with a machine called a guillotine. Baker argues passionately for preservation, even cashing in his own retirement account to save one important archive—all twenty tons of it. Written the brilliant narrative style that Nicholson Baker fans have come to expect, Double Fold is a persuasive and often devastating book that may turn out to be The Jungle of the American library system.