Data Analysis with Open Source Tools: A Hands-On Guide for Programmers and Data Scientists
Philipp K. Janert - 2010
With this insightful book, intermediate to experienced programmers interested in data analysis will learn techniques for working with data in a business environment. You'll learn how to look at data to discover what it contains, how to capture those ideas in conceptual models, and then feed your understanding back into the organization through business plans, metrics dashboards, and other applications.Along the way, you'll experiment with concepts through hands-on workshops at the end of each chapter. Above all, you'll learn how to think about the results you want to achieve -- rather than rely on tools to think for you.Use graphics to describe data with one, two, or dozens of variablesDevelop conceptual models using back-of-the-envelope calculations, as well asscaling and probability argumentsMine data with computationally intensive methods such as simulation and clusteringMake your conclusions understandable through reports, dashboards, and other metrics programsUnderstand financial calculations, including the time-value of moneyUse dimensionality reduction techniques or predictive analytics to conquer challenging data analysis situationsBecome familiar with different open source programming environments for data analysisFinally, a concise reference for understanding how to conquer piles of data.--Austin King, Senior Web Developer, MozillaAn indispensable text for aspiring data scientists.--Michael E. Driscoll, CEO/Founder, Dataspora
Machine Learning in Action
Peter Harrington - 2011
"Machine learning," the process of automating tasks once considered the domain of highly-trained analysts and mathematicians, is the key to efficiently extracting useful information from this sea of raw data. Machine Learning in Action is a unique book that blends the foundational theories of machine learning with the practical realities of building tools for everyday data analysis. In it, the author uses the flexible Python programming language to show how to build programs that implement algorithms for data classification, forecasting, recommendations, and higher-level features like summarization and simplification.
The Tree of Knowledge: The Biological Roots of Human Understanding
Humberto R. Maturana - 1984
Its authors present a new view of cognition that has important social and ethical implications, for, they assert, the only world we humans can have is the one we create together through the actions of our coexistence. Written for a general audience as well as for students, scholars, and scientists and abundantly illustrated with examples from biology, linguistics, and new social and cultural phenomena, this revised edition includes a new afterword by Dr. Varela, in which he discusses the effect the book has had in the years since its first publication.
Artificial Intelligence: Structures and Strategies for Complex Problem Solving
George F. Luger - 1997
It is suitable for a one or two semester university course on AI, as well as for researchers in the field.
Shapes: Nature's Patterns: A Tapestry in Three Parts
Philip Ball - 2008
Where does this order and regularity come from? It creates itself. The patterns we see come from self-organization. Whether living or non-living, scientists have found that there is a pattern-forming tendency inherent in the basic structure and processes of nature, so that from a few simple themes, and the repetition of simple rules, endless beautiful variations can arise.Part of a trilogy of books exploring the science of patterns in nature, acclaimed science writer Philip Ball here looks at how shapes form. From soap bubbles to honeycombs, delicate shell patterns, and even the developing body parts of a complex animal like ourselves, he uncovers patterns in growth and form in all corners of the natural world, explains how these patterns are self-made, and why similar shapes and structures may be found in very different settings, orchestrated by nothing more than simple physical forces. This book will make you look at the world with fresh eyes, seeing order and form even in the places you'd least expect.
How the Leopard Changed Its Spots: The Evolution of Complexity
Brian Goodwin - 1994
Genes are important, but only as part of a process constrained by environment, physical laws, and the universal tendencies of complex adaptive systems. In a new preface for this edition, Goodwin reflects on the advances in both genetics and the sciences of complexity since the book's original publication.
Mining of Massive Datasets
Anand Rajaraman - 2011
This book focuses on practical algorithms that have been used to solve key problems in data mining and which can be used on even the largest datasets. It begins with a discussion of the map-reduce framework, an important tool for parallelizing algorithms automatically. The authors explain the tricks of locality-sensitive hashing and stream processing algorithms for mining data that arrives too fast for exhaustive processing. The PageRank idea and related tricks for organizing the Web are covered next. Other chapters cover the problems of finding frequent itemsets and clustering. The final chapters cover two applications: recommendation systems and Web advertising, each vital in e-commerce. Written by two authorities in database and Web technologies, this book is essential reading for students and practitioners alike.
The Systems Thinking Playbook: Exercises to Stretch and Build Learning and Systems Thinking Capabilities
Linda Booth Sweeney - 2010
It provides short gaming exercises that illustrate the subtleties of systems thinking. The companion DVD shows the authors introducing and running each of the thirty games. The thirty games are classified by these areas of learning: Systems Thinking, Mental Models, Team Learning, Shared Vision, and Personal Mastery. Each description clearly explains when, how, and why the game is useful. There are explicit instructions for debriefing each exercise as well as a list of all required materials. A summary matrix has been added for a quick glance at all thirty games. When you are in a hurry to find just the right initiative for some part of your course, the matrix will help you find it. Linda Booth Sweeney and Dennis Meadows both have many years of experience in teaching complex concepts. This book reflects their insights. Every game works well and provokes a deep variety of new insights about paradigms, system boundaries, causal-loop diagrams, reference modes, and leverage points. Each of the thirty exercises here was tested and refined many times until it became a reliable source of learning. Some of the games are adapted from classics of the outdoor education field. Others are completely new. But all of them complement readings and lectures to help participants understand intuitively the principles of systems thinking. Biography Linda Booth Sweeney, Ed. D., is a researcher and writer dedicated to making the principles of systems thinking and sustainability accessible to children and others. She has worked with Outward Bound, Sloan School of M.I.T., and Schlumberger Excellence in Educational Development, or SEED. She is the author of The Systems Thinking Playbook; When a Butterfly Sneezes: A guide for helping children explore interconnections in our world through favorite stories; The SEED Water Book ; and numerous academic journals and newsletters. Sweeney liv
Open Sources
Chris DiBona - 1999
Open Source has grabbed the computer industry's attention. Netscape has opened the source code to Mozilla; IBM supports Apache; major database vendors haved ported their products to Linux. As enterprises realize the power of the open-source development model, Open Source is becoming a viable mainstream alternative to commercial software.Now in Open Sources, leaders of Open Source come together for the first time to discuss the new vision of the software industry they have created. The essays in this volume offer insight into how the Open Source movement works, why it succeeds, and where it is going.For programmers who have labored on open-source projects, Open Sources is the new gospel: a powerful vision from the movement's spiritual leaders. For businesses integrating open-source software into their enterprise, Open Sources reveals the mysteries of how open development builds better software, and how businesses can leverage freely available software for a competitive business advantage.The contributors here have been the leaders in the open-source arena:Brian Behlendorf (Apache) Kirk McKusick (Berkeley Unix) Tim O'Reilly (Publisher, O'Reilly & Associates) Bruce Perens (Debian Project, Open Source Initiative) Tom Paquin and Jim Hamerly (mozilla.org, Netscape) Eric Raymond (Open Source Initiative) Richard Stallman (GNU, Free Software Foundation, Emacs) Michael Tiemann (Cygnus Solutions) Linus Torvalds (Linux) Paul Vixie (Bind) Larry Wall (Perl) This book explains why the majority of the Internet's servers use open- source technologies for everything from the operating system to Web serving and email. Key technology products developed with open-source software have overtaken and surpassed the commercial efforts of billion dollar companies like Microsoft and IBM to dominate software markets. Learn the inside story of what led Netscape to decide to release its source code using the open-source mode. Learn how Cygnus Solutions builds the world's best compilers by sharing the source code. Learn why venture capitalists are eagerly watching Red Hat Software, a company that gives its key product -- Linux -- away.For the first time in print, this book presents the story of the open- source phenomenon told by the people who created this movement.Open Sources will bring you into the world of free software and show you the revolution.
The Art of Doing Science and Engineering: Learning to Learn
Richard Hamming - 1996
By presenting actual experiences and analyzing them as they are described, the author conveys the developmental thought processes employed and shows a style of thinking that leads to successful results is something that can be learned. Along with spectacular successes, the author also conveys how failures contributed to shaping the thought processes. Provides the reader with a style of thinking that will enhance a person's ability to function as a problem-solver of complex technical issues. Consists of a collection of stories about the author's participation in significant discoveries, relating how those discoveries came about and, most importantly, provides analysis about the thought processes and reasoning that took place as the author and his associates progressed through engineering problems.
Architects of Intelligence: The truth about AI from the people building it
Martin Ford - 2018
of Toronto and Google), Rodney Brooks (Rethink Robotics), Yann LeCun (Facebook) , Fei-Fei Li (Stanford and Google), Yoshua Bengio (Univ. of Montreal), Andrew Ng (AI Fund), Daphne Koller (Stanford), Stuart Russell (UC Berkeley), Nick Bostrom (Univ. of Oxford), Barbara Grosz (Harvard), David Ferrucci (Elemental Cognition), James Manyika (McKinsey), Judea Pearl (UCLA), Josh Tenenbaum (MIT), Rana el Kaliouby (Affectiva), Daniela Rus (MIT), Jeff Dean (Google), Cynthia Breazeal (MIT), Oren Etzioni (Allen Institute for AI), Gary Marcus (NYU), and Bryan Johnson (Kernel).Martin Ford is a prominent futurist, and author of Financial Times Business Book of the Year, Rise of the Robots. He speaks at conferences and companies around the world on what AI and automation might mean for the future. Editorial reviews: "In his newest book, Architects of Intelligence, Martin Ford provides us with an invaluable opportunity to learn from some of the most prominent thought leaders about the emerging fields of science that are shaping our future."
-Al Gore, Former Vice President of the US
"AI is going to shape our future, and Architects of Intelligence offers a unique and fascinating collection of perspectives from the top researchers and entrepreneurs who are driving progress in the field."
- Eric Schmidt, former Chairman and CEO, Google
"The best way to understand the challenges and consequences of AGI is to see inside the minds of industry experts shaping the field. Architects of Intelligence gives you that power."
-Sam Altman, President of Y Combinator and co-chairman of OpenAI
"Architects of Intelligence gets you inside the minds of the people building the technology that is going to transform our world. This is a book that everyone should read."
-Reid Hoffman, Co-founder of LinkedIn
The Shareholder Value Myth: How Putting Shareholders First Harms Investors, Corporations, and the Public
Lynn Stout - 2004
Stout shows how shareholder value thinking endangers not only investors but the rest of us as well, leading managers to focus myopically on short-term earnings; discouraging investment and innovation; harming employees, customers, and communities; and causing companies to indulge in reckless, sociopathic, and irresponsible behaviors. And she looks at new models of corporate purpose that better serve the needs of investors, corporations, and society.
Machine Learning: A Probabilistic Perspective
Kevin P. Murphy - 2012
Machine learning provides these, developing methods that can automatically detect patterns in data and then use the uncovered patterns to predict future data. This textbook offers a comprehensive and self-contained introduction to the field of machine learning, based on a unified, probabilistic approach.The coverage combines breadth and depth, offering necessary background material on such topics as probability, optimization, and linear algebra as well as discussion of recent developments in the field, including conditional random fields, L1 regularization, and deep learning. The book is written in an informal, accessible style, complete with pseudo-code for the most important algorithms. All topics are copiously illustrated with color images and worked examples drawn from such application domains as biology, text processing, computer vision, and robotics. Rather than providing a cookbook of different heuristic methods, the book stresses a principled model-based approach, often using the language of graphical models to specify models in a concise and intuitive way. Almost all the models described have been implemented in a MATLAB software package—PMTK (probabilistic modeling toolkit)—that is freely available online. The book is suitable for upper-level undergraduates with an introductory-level college math background and beginning graduate students.
Computer Power and Human Reason: From Judgment to Calculation
Joseph Weizenbaum - 1976
A classic text by the author who developed ELIZA, a natural-language processing system.
Python Data Science Handbook: Tools and Techniques for Developers
Jake Vanderplas - 2016
Several resources exist for individual pieces of this data science stack, but only with the Python Data Science Handbook do you get them all—IPython, NumPy, Pandas, Matplotlib, Scikit-Learn, and other related tools.Working scientists and data crunchers familiar with reading and writing Python code will find this comprehensive desk reference ideal for tackling day-to-day issues: manipulating, transforming, and cleaning data; visualizing different types of data; and using data to build statistical or machine learning models. Quite simply, this is the must-have reference for scientific computing in Python.With this handbook, you’ll learn how to use: * IPython and Jupyter: provide computational environments for data scientists using Python * NumPy: includes the ndarray for efficient storage and manipulation of dense data arrays in Python * Pandas: features the DataFrame for efficient storage and manipulation of labeled/columnar data in Python * Matplotlib: includes capabilities for a flexible range of data visualizations in Python * Scikit-Learn: for efficient and clean Python implementations of the most important and established machine learning algorithms