data science

back to index

description: an interdisciplinary field that uses scientific methods, algorithms, and systems to extract knowledge and insights from structured and unstructured data.

461 results

Data Science from Scratch: First Principles with Python

by Joel Grus  · 13 Apr 2015  · 579pp  · 76,657 words

See http://oreilly.com/catalog/errata.csp?isbn=9781491901427 for release details. The O’Reilly logo is a registered trademark of O’Reilly Media, Inc. Data Science from Scratch, the cover image of a Rock Ptarmigan, and related trade dress are trademarks of O’Reilly Media, Inc. While the publisher and

, it is your responsibility to ensure that your use thereof complies with such licenses and/or rights. 978-1-491-90142-7 [LSI] Preface Data Science Data scientist has been called “the sexiest job of the 21st century,” presumably by someone who has never visited a fire station. Nonetheless

10 years, we’ll need billions and billions more data scientists than we currently have. But what is data science? After all, we can’t produce data scientists if we don’t know what data science is. According to a Venn diagram that is somewhat famous in the industry, data science lies at the intersection of: Hacking skills Math

. This is emphatically not a math book, and for the most part, we won’t be “doing mathematics.” However, you can’t really do data science without some understanding of probability and statistics and linear algebra. This means that, where appropriate, we will dive into mathematical equations, mathematical intuition, mathematical axioms

tax preparation or coal mining.) From Scratch There are lots and lots of data science libraries, frameworks, modules, and toolkits that efficiently implement the most common (as well as the least common) data science algorithms and techniques. If you become a data scientist, you will become intimately familiar with NumPy, with scikit-learn, with pandas,

and with a panoply of other libraries. They are great for doing data science. But they are also a good way to start doing data science without actually understanding data science. In this

book, we will be approaching data science from scratch.

of our introduction to data science will take this same approach  —  going into detail where going into detail seems crucial or illuminating, at other times leaving details for you to figure out yourself (or look up on Wikipedia). Over the years, I’ve trained a number of data scientists. While not all of

data ninja rockstars, I’ve left them all better data scientists than I found them. And I’ve grown to believe that anyone who has some amount of mathematical aptitude and some amount of programming skill has the necessary raw materials to do data science. All she needs is an inquisitive mind, a

element indicates a warning or caution. Using Code Examples Supplemental material (code examples, exercises, etc.) is available for download at https://github.com/joelgrus/data-science-from-scratch. This book is here to help you get your job done. In general, if example code is offered with this book, you may

’s documentation does require permission. We appreciate, but do not require, attribution. An attribution usually includes the title, author, publisher, and ISBN. For example: “Data Science from Scratch by Joel Grus (O’Reilly). Copyright 2015 Joel Grus, 978-1-4919-0142-7.” If you feel your use of code examples falls

have a web page for this book, where we list errata, examples, and any additional information. You can access this page at http://bit.ly/data-science-from-scratch. To comment or ask technical questions about this book, send email to bookquestions@oreilly.com. For more information about our books, courses,

questions that no one’s ever thought to ask. In this book, we’ll learn how to find them. What Is Data Science? There’s a joke that says a data scientist is someone who knows more statistics than a computer scientist and more computer science than a statistician. (I didn’t say

them, though). In short, pretty much no matter how you define data science, you’ll find practitioners for whom the definition is totally, absolutely wrong. Nonetheless, we won’t let that stop us from trying. We’ll say that a data scientist is someone who extracts insights from messy data. Today’s world

political campaigns of the future will become more and more data-driven, resulting in a never-ending arms race of data science and data collection. Now, before you start feeling too jaded: some data scientists also occasionally use their skills for good — using data to make government more effective, to help the homeless,

to click on advertisements. Motivating Hypothetical: DataSciencester Congratulations! You’ve just been hired to lead the data science efforts at DataSciencester, the social network for data scientists. Despite being for data scientists, DataSciencester has never actually invested in building its own data science practice. (In fairness, DataSciencester has never really invested in building its product either.) That will

be your job! Throughout the book, we’ll be learning about data science concepts by solving problems that you

(id 0) but only one mutual friend with Clive (id 5). As a data scientist, you know that you also might enjoy meeting users with similar interests. (This is a good example of the “substantive expertise” aspect of data science.) After asking around, you manage to get your hands on this data, as

creating many, many functions. Strings Strings can be delimited by single or double quotation marks (but the quotes have to match): single_quoted_string = 'data science' double_quoted_string = "data science" Python uses backslashes to encode special characters. For example: tab_string = "\t" # represents the tab character len(tab_string) # is 1 If

= len(grades) # equals 3 We will frequently use dictionaries as a simple way to represent structured data: tweet = { "user" : "joelgrus", "text" : "Data Science is Awesome", "retweet_count" : 100, "hashtags" : ["#data", "#science", "#datascience", "#awesome", "#yolo"] } Besides looking for specific keys we can look at all of them: tweet_keys = tweet.keys() # list of keys

data scientist it would be a good idea to read a statistics textbook. Many are freely available online. A couple that I like are: OpenIntro Statistics OpenStax Introductory Statistics Chapter 6. Probability The laws of probability, so true in general, so fallacious in particular. Edward Gibbon It is hard to do data science

What will we do with all this statistics and probability theory? The science part of data science frequently involves forming and testing hypotheses about our data and the processes that generate it. Statistical Hypothesis Testing Often, as data scientists, we’ll want to test whether a certain hypothesis is likely to be true.

the null. Warning Make sure your data is roughly normally distributed before using normal_probability_above to compute p-values. The annals of bad data science are filled with examples of people opining that the chance of some observed event occurring at random is one in a million, when what they

Object Notation (JSON). JavaScript objects look quite similar to Python dicts, which makes their string representations easy to interpret: { "title" : "Data Science Book", "author" : "Joel Grus", "publicationYear" : 2014, "topics" : [ "data", "science", "data science"] } We can parse JSON using Python’s json module. In particular, we will use its loads function, which deserializes a string representing

a JSON object into a Python object: import json serialized = """{ "title" : "Data Science Book", "author" : "Joel Grus", "publicationYear" : 2014, "topics" : [ "data", "science", "data science"] }""" # parse the JSON to create a Python dict deserialized = json.loads(serialized) if "data science" in deserialized["topics"]: print deserialized Sometimes an API provider hates you and only provides responses

, not the access token or secret: from twython import Twython twitter = Twython(CONSUMER_KEY, CONSUMER_SECRET) # search for tweets containing the phrase "data science" for status in twitter.search(q='"data science"')["statuses"]: user = status["user"]["screen_name"].encode('utf-8') text = status["text"].encode('utf-8') print user, ":", text print Note The

Unicode text. If you run this, you should get some tweets back like: haithemnyc: Data scientists with the technical savvy & analytical chops to derive meaning from big data are in demand. http://t.co/HsF9Q0dShP RPubsRecent: Data Science http://t.co/6hcHUz2PHM spleonard1: Using #dplyr in #R to work through a procrastinated

to save them to a file or a database, so that you’d have them permanently. For Further Exploration pandas is the primary library that data science types use for working with (and, in particular, importing) data. Scrapy is a more full-featured library for building more complicated web scrapers that

. Histogram of uniform Two Dimensions Now imagine you have a data set with two dimensions. Maybe in addition to daily minutes you have years of data science experience. Of course you’d want to understand each dimension individually. But you probably also want to scatter the data. For example, consider another

Machine Learning I am always ready to learn although I do not always like being taught. Winston Churchill Many people imagine that data science is mostly machine learning and that data scientists mostly build and train and tweak machine-learning models all day long. (Then again, many of those people don’t actually

know what machine learning is.) In fact, data science is mostly turning business problems into data problems and collecting data and understanding data and cleaning

the trendiest subfields of data science. However, most neural networks are “black boxes” — inspecting their details doesn’t give you much understanding of how they’re solving a problem. And large neural networks can be difficult to train. For most problems you’ll encounter as a budding data scientist, they’re probably not

appears on resumes: data = [ ("big data", 100, 15), ("Hadoop", 95, 25), ("Python", 75, 50), ("R", 50, 40), ("machine learning", 80, 20), ("statistics", 20, 60), ("data science", 60, 70), ("analytics", 90, 3), ("team player", 85, 85), ("dynamic", 2, 90), ("synergies", 70, 0), ("actionable insights", 40, 30), ("think out of the box", 45

where sentences end). We can do this using re.findall(): from bs4 import BeautifulSoup import requests url = "http://radar.oreilly.com/2010/06/what-is-data-science.html" html = requests.get(url).text soup = BeautifulSoup(html, 'html5lib') content = soup.find("div", "entry-content") # find entry-content div regex = r"[\w']+|[\.]" # matches

seen verbatim in the original data. Having more data would help; it would also work better if you collected n-grams from multiple essays about data science. Grammars A different approach to modeling language is with grammars, rules for generating acceptable sentences. In elementary school, you probably learned about parts of

. We’ll define a slightly more complicated grammar: grammar = { "_S" : ["_NP _VP"], "_NP" : ["_N", "_A _NP _P _A _N"], "_VP" : ["_V", "_V _NP"], "_N" : ["data science", "Python", "regression"], "_A" : ["big", "linear", "logistic"], "_P" : ["about", "near"], "_V" : ["learns", "trains", "tests", "is"] } I made up the convention that names starting with underscores refer

trains','logistic','_NP','_P','_A','_N'] ['Python','trains','logistic','_N','_P','_A','_N'] ['Python','trains','logistic','data science','_P','_A','_N'] ['Python','trains','logistic','data science','about','_A', '_N'] ['Python','trains','logistic','data science','about','logistic','_N'] ['Python','trains','logistic','data science','about','logistic','Python'] How do we implement this? Well, to start, we’ll create a

[word].append(count) return [output for word, counts in collector.iteritems() for output in wc_reducer(word, counts)] Imagine that we have three documents ["data science", "big data", "science fiction"]. Then wc_mapper applied to the first document yields the two pairs ("data", 1) and ("science", 1). After we’ve gone through all

status updates. You manage to extract a data set of status updates that look like: {"id": 1, "username" : "joelgrus", "text" : "Is anyone interested in a data science book?", "created_at" : datetime.datetime(2013, 12, 21, 11, 47, 0), "liked_by" : ["data_guy", "data_gal", "mike"] } Let’s say we need to

. And if we emit a value of 1 for each update that contains “data science,” we can simply get the total number using sum: def data_science_day_mapper(status_update): """yields (day_of_week, 1) if status_update contains "data science" """ if "data science" in status_update["text"].lower(): day_of_week = status_update["created_at

Bokeh is a project that brings D3-style functionality into Python. R Although you can totally get away with not learning R, a lot of data scientists and data science projects use it, so it’s worth getting at least familiar with it. In part, this is so that you can understand people’s

Data rescaling, Rescaling-Rescaling data mining, What Is Machine Learning? data scienceabout, Data Science defined, What Is Data Science? doing, projects of the author, Do Data Science from scratch, From Scratch learning more about, Go Forth and Do Data Science-And You? skills needed for, Data Science using libraries, Not from Scratch data visualization, Visualizing Data-For Further Explorationbar

with vectors, Vectors About the Author Joel Grus is a software engineer at Google. Previously he worked as a data scientist at several startups. He lives in Seattle, where he regularly attends data science happy hours. He blogs infrequently at joelgrus.com and tweets all day long at @joelgrus. Colophon The animal on

in This Book Using Code Examples Safari® Books Online How to Contact Us Acknowledgments 1. Introduction The Ascendance of Data What Is Data Science? Motivating Hypothetical: DataSciencester Finding Key Connectors Data Scientists You May Know Salaries and Experience Paid Accounts Topics of Interest Onward 2. A Crash Course in Python The Basics Getting Python

Data Science for Business: What You Need to Know About Data Mining and Data-Analytic Thinking

by Foster Provost and Tom Fawcett  · 30 Jun 2013  · 660pp  · 141,595 words

Business is intended for several sorts of readers: Business people who will be working with data scientists, managing data science–oriented projects, or investing in data science ventures, Developers who will be implementing data science solutions, and Aspiring data scientists. This is not a book about algorithms, nor is it a replacement for a book about algorithms. We deliberately

data mining techniques and algorithms, as well as data science applications, quite generally. Chemistry Is Not About Test Tubes: Data Science Versus the Work of the Data Scientist Before proceeding, we should briefly revisit the engineering side of data science. At the time of this writing, discussions of data science commonly mention not just analytical skills and techniques for

and to be willing to invest in data and experimentation. Furthermore, unless the firm has on its management team a seasoned, practical data scientist, often the management must steer the data science team carefully to make sure that the team stays on track toward an eventually useful business solution. This is very difficult if

business, but generally few companies are able to execute it. For example, you must have confidence that you have one of the best data science teams, since the effectiveness of data scientists has a huge variance, with the best being much more talented than the average. If you have a great team, you may

Your model is not what your data scientists design, it’s what your engineers implement. Superior Data Scientists Maybe our data scientists simply are much better than our competitors’. There is a huge variance in the quality and ability of data scientists. Even among well-trained data scientists, it is well accepted within the data science community that certain individuals have the

, we have a substantial and sustained advantage over competitors who are having trouble hiring data scientists. Further, top-notch data scientists like to work with other top-notch data scientists, which compounds our advantage. We also must embrace the fact that data science is in part a craft. Analytical expertise takes time to acquire, and all the

in the sense of what one might find in an online professional networking system; an effective data scientist needs to have deep connections to other data scientists throughout the data science community. The reason is simply that the field of data science is immense and there are far too many diverse topics for any individual to master. A

And, the best data scientists have the best connections. Superior Data Science Management Possibly even more critical to success for data science in business is having good management of the data science team. Good data science managers are especially hard to find. They need to understand the fundamentals of data science well, possibly even being competent data scientists themselves. Good data science managers also must possess

or expensive for a competitor to duplicate because we can hire data scientists and data science managers better. This may be due to our reputation and brand appeal with data scientists—a data scientist may prefer to work for a company known as being friendly to data science and data scientists. Or our firm may have a more subtle appeal. So

we mentioned above, there can be a huge difference between the effectiveness of a great data scientist and an average data scientist, and between a great data science team and an individually great data scientist. But how can one confidently engage top-notch data scientists? How can we create great teams? This is a very difficult question to answer in

that have an advantage in hiring are those that create an environment for nurturing data science and data scientists. If you do not have a critical mass of data scientists, be creative. Encourage your data scientists to become part of local data science technical communities and global data science academic communities. A note on publishing Science is a social endeavor, and the

a very small number of clients to help them develop their data science capabilities (such as Data Scientists, LLC).[76] You can find a large list of data-science service companies, as well as a wide variety of other data science resources, at KDnuggets. A caveat about engaging data science consulting firms is that their interests are not always well

deal-with issues that can be quickly dispatched, or even written about well as a section or chapter of a data science book. If you are either a data scientist or a business stakeholder in data science projects, you should care about privacy concerns, and you will need to invest serious time in thinking carefully about

whether the target variable has been defined appropriately for the problem to be solved, etc. Knowing what are the different sorts of data science tasks helps to keep the data scientist from treating all business problems as nails for the particular hammers that he knows well. Thinking carefully about what is important to the

errors, Regression via Mathematical Functions accuracy (term), Plain Accuracy and Its Problems accuracy results, From Holdout Evaluation to Cross-Validation ACM SIGKDD, Superior Data Scientists, Is There More to Data Science? ad impressions, Example: Targeting Online Consumers With Advertisements adding variables to functions, Example: Overfitting Linear Functions advertising, Example: Targeting Online Consumers With

Pieces business strategy, Data Science and Business Strategy–A Firm’s Data Science Maturity accepting creative ideas, Be Ready to Accept Creative Ideas from Any Source case studies, examining, Examine Data Science Case Studies competitive advantages, Achieving Competitive Advantage with Data Science–Achieving Competitive Advantage with Data Science, Sustaining Competitive Advantage with Data Science–Superior Data Science Management data scientists, evaluating, Superior Data Scientists–Superior Data Scientists evaluating proposals, Be

Ready to Evaluate Proposals for Data Science Projects–Flaws in the Big Red Proposal historical advantages and, Formidable

Historical Advantage intangible collateral assets and, Unique Intangible Collateral Assets intellectual property and, Unique Intellectual Property managing data scientists

effectively, Superior Data Science Management–Superior Data Science

Management maturity of the data science

, A Firm’s Data Science

Maturity–A Firm’s Data Science Maturity thinking data-analytically for, Thinking Data-Analytically, Redux–Thinking Data-Analytically, Redux C Caesars Entertainment, Data

Final Words and adding value to applications, Decision Analytic Thinking I: What Is a Good Model? as craft, Superior Data Scientists as strategic asset, Data and Data Science Capability as a Strategic Asset–Data and Data Science Capability as a Strategic Asset baseline methods of, Summary behavior predictions based on past actions, Example: Hurricane Frances Big

“Big Data” data-driven decision-making, Data Science, Engineering, and Data-Driven Decision Making–Data Science, Engineering, and Data-Driven Decision Making engineering, Data Science, Engineering, and Data-Driven Decision Making–Data Science, Engineering, and Data-Driven Decision Making engineering and, Chemistry Is Not About Test Tubes: Data Science Versus the Work of the Data Scientist evolving uses for, From Big Data

development vs., A Firm’s Data Science Maturity structure, Machine Learning and Data Mining techniques, Data Science, Engineering, and Data-Driven Decision Making technology vs. theory of, Chemistry Is Not About Test Tubes: Data Science Versus the Work of the Data Scientist–Chemistry Is Not About Test Tubes: Data Science Versus the Work of the Data Scientist understanding, The Ubiquity of Data

Opportunities, Data Processing and “Big Data” data science maturity, of firms, A Firm’s Data Science Maturity–A Firm’s Data Science Maturity data scientists academic, Attracting and Nurturing Data Scientists and Their Teams as

scientific advisors, Attracting and Nurturing Data Scientists and Their

Teams attracting/nurturing, Attracting and Nurturing Data Scientists and

Their Teams–Attracting and Nurturing Data Scientists and Their Teams evaluating, Superior Data Scientists–Superior Data Scientists managing, Superior Data Science Management–Superior Data Science Management Data Scientists,

LLC, Attracting and Nurturing Data Scientists and

Their Teams data sources, Evaluation, Baseline Performance, and Implications for Investments in Data data understanding, Data Understanding–Data Understanding expected value decomposition and, From an Expected Value Decomposition to a Data Science Solution–From

an Expected Value Decomposition to a Data Science

Example: Jazz Musicians, Example: Jazz Musicians email, Why Text Is Important engineering, Chemistry Is Not About Test Tubes: Data Science Versus the Work of the Data Scientist, Business Understanding engineering problems, business problems vs., Other Data Science Tasks and Techniques ensemble method, Bias, Variance, and Ensemble Methods–Bias, Variance, and Ensemble Methods entropy, Selecting Informative Attributes

Other Analytics Techniques and Technologies big-data, Data Processing and “Big Data” theory in data science vs., Chemistry Is Not About Test Tubes: Data Science Versus the Work of the Data Scientist–Chemistry Is Not About Test Tubes: Data Science Versus the Work of the Data Scientist term frequency (TF), Term Frequency–Term Frequency defined, Term Frequency in TFIDF, Combining

Doing Data Science: Straight Talk From the Frontline

by Cathy O'Neil and Rachel Schutt  · 8 Oct 2013  · 523pp  · 112,185 words

. Who knows what you might find? After I got my PhD, I worked at Google for a few years around the same time that “data science” and “data scientist” were becoming terms in Silicon Valley. The world is opening up with possibilities for people who are quantitatively minded and interested in putting their brains

. Reason 1: I wanted to give students an education in what it’s like to be a data scientist in industry and give them some of the skills data scientists have. I was working on the Google+ data science team with an interdisciplinary team of PhDs. There was me (a statistician), a social scientist, an engineer

Engineering in July 2012. This course created an opportunity to develop the theory of data science and to formalize it as a legitimate science. Reason 3: I kept hearing from data scientists in industry that you can’t teach data science in a classroom or university setting, and I took that on as a challenge. I

thought of my classroom as an incubator of data science teams. The students I had were very impressive and are turning into top-notch data scientists. They’ve contributed a chapter to this book, in fact. Origins of the Book The class would not

implement it yourself and play around with it to truly understand it. Who This Book Is For Because of the media coverage around data science and the characterization of data scientists as “rock stars,” you may feel like it’s impossible for you to enter into this realm. If you’re the type of

here, which constitute an ongoing discussion of the difference between a statistician and a data scientist. Cosma basically argues that any statistics department worth its salt does all the stuff in the descriptions of data science that he sees, and therefore data science is just a rebranding and unwelcome takeover of statistics. For a slightly different

. Much of the development of the field is happening in industry, not academia. That is, there are people with the job title data scientist in companies, but no professors of data science in academia. (Though this may be changing.) Not long ago, DJ Patil described how he and Jeff Hammerbacher—then at LinkedIn and

Facebook, respectively—coined the term “data scientist” in 2008. So that is when “data scientist” emerged as a job title. (Wikipedia finally gained an entry on data science in 2012.) It makes sense to us that once the skill set required to thrive at Google

social sciences, which could be thought of as a subset of data science. But we can go back even further. In 2001, William Cleveland wrote a position paper about data science called “Data Science: An action plan to expand the field of statistics.” So data science existed before data scientists? Is this semantics, or does it make sense? This all

begs a few questions: can you define data science by what data scientists do? Who gets to define the field, anyway? There

specialize in all those things. We’ll talk about this more after we look at the composite set of skills in demand for today’s data scientists. A Data Science Profile In the class, Rachel handed out index cards and asked everyone to profile themselves (on a relative rather than absolute scale) with respect

expertise Communication and presentation skills Data visualization As an example, Figure 1-2 shows Rachel’s data science profile. Figure 1-2. Rachel’s data science profile, which she created to illustrate trying to visualize oneself as a data scientist; she wanted students and guest lecturers to “riff” on this—to add buckets or remove skills

us wonder if it might be more worthwhile to define a “data science team”—as shown in Figure 1-3—than to define a data scientist. Figure 1-3. Data science team profiles can be constructed from data scientist profiles; there should be alignment between the data science team profile and the profile of the data problems they try to

Harris, Sean Murphy, and Marck Vaisman based on a survey of several hundred data science practitioners in mid-2012 OK, So What Is a Data Scientist, Really? Perhaps the most concrete approach is to define data science is by its usage—e.g., what data scientists get paid to do. With that as motivation, we’ll describe what

data scientists do. And we’ll cheat a bit by talking first about data scientists in academia. In Academia The

” at a university, or for applying for a grant that supplies money for data science research. Instead, let’s ask a related question: who in academia plans to become a data scientist? There were 60 students in the Intro to Data Science class at Columbia. When Rachel proposed the course, she assumed the makeup of

information to form an opinion. Given that this is how the popular press is currently describing and influencing public perception of data science and modeling, it’s incumbent upon us as data scientists to be aware of it and to chime in with informed comments. With that context, then, what do we mean when

and making a plan for how the problem will be attacked. And that someone is the data scientist or our beloved data science team. Let’s revise or at least add an overlay to make clear that the data scientist needs to be involved in this process throughout, meaning they are involved in the actual coding

as in the higher-level process, as shown in Figure 2-3. Figure 2-3. The data scientist is involved in every part of this process Connection to the Scientific Method We can think of the data science process as an extension of or variation of the scientific method: Ask a question. Do background

), and, as we’ll learn a little later, Kaggle itself. Some remarks about data science competitions are warranted. First, data science competitions are part of the data science ecosystem—one of the cultural forces at play in the current data science landscape, and so aspiring data scientists ought to be aware of them. Second, creating these competitions puts one in

point: what attributes of the existing competitions capture data science, and what aspects of data science are missing? Finally, competitors in the the various competitions get ranked, and so one metric of a “top” data scientist could be their standings in these competitions. But notice that many top data scientists, especially women, and including the authors of this

and more about more and more, until you know nothing about everything. — Will Cukierski Kaggle is a company whose tagline is, “We’re making data science a sport.” Kaggle forms relationships with companies and with data scientists. For a fee, Kaggle hosts competitions for businesses that essentially want to crowdsource (or leverage the wider

data science community) to solve their data problems. Kaggle provides the infrastructure and attracts the data science talent. They also have in house a bunch of top-notch data scientists, including Will himself. The companies are their paying customers, and they provide datasets

an interview. There were 422 competitors. We think it’s convenient for Facebook to have interviewees for data science positions in such a posture of gratitude for the mere interview. Cathy thinks this distracts data scientists from asking hard questions about what the data policies are and the underlying ethics of the company. Kaggle

engines, also called recommendation systems, are the quintessential data product and are a good starting point when you’re explaining to non–data scientists what you do or what data science really is. This is because many people have interacted with recommendation systems when they’ve been suggested books on Amazon.com or gotten

computer science, and got her PhD in information systems at NYU. She now teaches a class to business students on data science, where she addresses how to assess data science work and how to manage data scientists. Claudia is also a famously successful data mining competition winner. She won the KDD Cup in 2003, 2007, 2008

Claudia started by asking what people’s reference point might be to evaluate where they stand with their own data science profile (hers is shown in Table 13-1. Referring to the data scientist profile from Chapter 1, she said, “There is one skill that you do not have here and that is the

external communication to represent M6D; and 20% of her time in “leadership” of her data group. On Being a Female Data Scientist Being a woman works well in the field of data science, where intuition is useful and is regularly applied. One’s nose gets so well developed by now that one can smell

the creators of Hadoop, and Jeff Hammerbacher, who we mentioned back in Chapter 1 because he co-coined the job title “data scientist” when he worked at Facebook and built the data science team there. Cloudera is like Red Hat for Hadoop, by which we mean they took an open source project and built

, but…” The lectures were discretized in this way, and it was our job to interpolate a continuous narrative about data science. We had to create our own meaning from the course, just as data scientists continue to construct the field to which they belong. That is not to say that Rachel left us in

the dark. On the first day of class, she proposed a working definition of data science. A data scientist, she said, was a person whose aptitude was distributed over the following: mathematics, statistics, computer science, machine learning, visualization, communication, and domain expertise. We

members of the class to create an essay-grading algorithm. The homework often simulated the data scientist’s experience in industry, but Kaggle competitions could be described as the dick-measuring contests of data science. It encouraged all of the best data science practices we had learned while putting us in the thick of a quintessential

of problem you might be assigned in a more general machine learning or data mining class. As fledgling data scientists, our first reaction is now skeptical. At which point in the process of doing data science would a problem like this even present itself? How much would we have to have already done to

the book. It was the theme of the book, the central question, and the mantra. Data science could be defined simply as what data scientists do, as we did earlier when we talked about profiles of data scientists. In fact, before Rachel taught the data science course at Columbia, she wrote up a list of all the things

your abilities—give yourself reality checks by making sure you can code what you speak and by chatting with other data scientists about approaches. Thought Experiment Revisited: Teaching Data Science How would you design a data science class around habits of mind rather than technical skills? How would you quantify it? How would you evaluate it

traction they have. The preceding chapters—dedicated to the means for cultivating the diverse capacities of the data scientist—make mincemeat of any facile dichotomy of the data expert and the traditional expert. Doing data science has put a tempering of hubris, especially algorithmic hubris, at the center of its technical training. Obama’

thought experiment, Thought Experiment: Meta-Definition privacy and, Privacy process of, The Data Science Process–A Data Scientist’s Role in This Process RealDirect case study, Case Study: RealDirect–Sample R code scientific method and, A Data Scientist’s Role in This Process scientists, A Data Science Profile–Thought Experiment: Meta-Definition sociology and, Gabriel Tarde teams, A

Data Science Profile Venn diagram of, The Current Landscape (with a Little History) data science competitions, Background: Data Science Competitions Kaggle and, A Single Contestant data scientists, A Data Science Profile–Thought Experiment: Meta-Definition as

problem solvers, Being Problem Solvers chief, The Life of a Chief Data Scientist defining, A Data Science Profile ethics of, Being an Ethical Data Scientist–Being an Ethical Data Scientist female

, On Being a Female Data Scientist hubris and, Being an Ethical Data Scientist–Being an Ethical Data Scientist

in academia, In Academia in industry, In Industry next generation of, What Are Next-Gen Data Scientists?–Being Question Askers questioning as, Being

Question Askers role of, in data science process, A Data Scientist

learning Suriowiecki, James, Background: Crowdsourcing Survival Analysis, Example: User Retention T tacit knowledge, Being an Ethical Data Scientist Tarde, Gabriel, Gabriel Tarde, Your Mileage May Vary Idea of Quantification, Gabriel Tarde Taylor Series, In practice teaching data science thought experiment, What Just Happened? test sets, Training and test sets tests, Code readability and reusability

Becoming Data Literate: Building a great business, culture and leadership through data and analytics

by David Reed  · 31 Aug 2021  · 168pp  · 49,067 words

the enterprise by getting the foundations right. Empower – unlocking the value of data to support and empower the business. Beat the market – using AI and data science to take analytics to the next level. Mindset – building a data-driven mindset across the enterprise. Lloyd’s of London, in another example of an

figure report Compliance KYC manager Identity validation Ecommerce Channel manager Identity validation Information security Information security officer Identity validation Customer experience Cx manager Behavioural modelling Data science Data scientist Behavioural modelling Centralisations of roles, for example into a data and analytics centre of excellence, removes role and task duplication while supporting multiple internal

analyst Data and analytics centre of excellence Board, Marketing Churn propensity modelling Customer churn analyst Data and analytics centre of excellence Marketing Behavioural modelling Data scientist Data and analytics centre of excellence Cx management Identity validation Information security officer Information security Ecommerce The reality of multi-stakeholder data and analytics tasks

The market for data practitioners is overheated at almost all levels, leading to salary inflation, which is exaggerated within certain in-demand roles, such as data science or data engineering. If these are broken down into task sets, it may be possible to define a role that does not have the inflationary

tasks required to avoid duplication of effort. Chapter 3: Becoming data literate Roadmap – in this chapter: Aviva changed its culture to be customer-centric using data science. Data literacy has five dimensions: vision, business strategy, value creation, culture and data foundations. Each of these needs the support of four pillars: strategy,

in 2016 with the creation of its customer science team as part of a vision to create engaging, relevant customer experiences through intelligent use of data science. A key moment occurred in 2019 when this team realised it needed to make customer data central and relevant to the whole organisation. This

would enable data science to avoid being just another siloed department by bringing data science to life for colleagues and providing a common language for the business to talk about customers. Of course, data foundations

on 16m customers covering product holding, demographics and behavioural indicators – the largest customer data set in UK insurance. But as Tom Spencer, head of customer data science at Aviva, notes, the data asset in itself would not lead to customer-centricity. “Essential to the success of this work was close collaboration with

said. At the level of business strategy, the company wanted to move forward from conventional age- and affluence-based rules models by developing a new data science-led customer segmentation using machine learning (ML). Crucially, says Spencer, “in order to achieve cultural change, the segmentation had to be simple and to

CDO brings to the table. For business strategy – test-and-learn will become part of the working method across the enterprise, not just within data science and analytics. There will be no fear of failure, but rather a desire to fail fast in order to identify successful strategies. Ensuring engagement by

own these successes. Once data practitioners get to work examining the available data, they will often uncover contradictory truths. This is especially true of data science if it is given free licence to look at the organisation end-to-end. Few of those business myths survive this examination. It is the

data scientists were recruited and tasked to “move fast and break things”. Significant changes did start to be realised, including a reduction in the friction during insurance pricing and quotation through predictive modelling based on customer data. Robust data underpinnings and platforms were put in place to support both customer science and data science

could misinterpret BI if they were not data literate and numerate, for example. But equally, there was a need to get data scientists to understand the commercial business. It had 30 data scientists who needed to make things “painfully simple” for business users, rather than following their natural desire to explain the supporting

identified a similar need and are moving to create cross-departmental courses that allow everybody to learn, adopt and deploy new data and analytics skills. Data science is a prime example. There is a large appetite from the C-suite down to bring leading-edge techniques, such as AI and ML,

an understanding of multiple disciplines, including data, coding and mathematics. In DataIQ research, for example, 39.5% of data practitioners identified a weak understanding of data science in their business as the number one challenge they face. But these concepts are teachable when broken down into digestible components that build towards a

aspect of learning and development. NatWest Group started its data academy – the first one to be launched by a UK bank – initially with a data science course, working with the Bayes Centre in Edinburgh to build a community of practice across the group. It set a clear goal of developing techniques

pathways to allow it to drive innovation and be at the leading edge of disruptive data trends. To do this, it needed to move data science higher up the value chain by training its analysts to automate basic tasks, deliver self-service data tools and support the rollout of AI initiatives

, it is better to accept an estimated figure than to fail to agree and miss out on the eventual benefit being seen as stemming from data science. While these metrics provide departmental-level indicators of productivity, the data leader will also need to measure individual-level output. A growing method for

governance roles, for example. These individuals are ‘force multipliers’ since their actions help to enhance what other team members are able to achieve. A data scientist may be heavily reliant on a data engineer to deliver a working model into the production environment – their individual outputs are very different, but mutually

tend to have well-established graduate programmes. Recently, some of these have adapted to new skills requirements in the data department, such as the data science graduate scheme at a global bank which is now in its second year and offers a combination of experience across customer science, ML and data

technology department or as part of the compliance team and has no direct line to value creation – this takes place inside advanced analytics or data science functions that operate within a host department. It is also the case that a data practitioner drawn to this type of CDO role is unlikely

may sit separately within lines of business, such as data engineering being part of IT, or as stand-alone functions, such as advanced analytics or data science. According to DataIQ research, while 34.5% of organisations have a centralised data department, the same percentage have federated data while 13.8% operate

hub-and-spoke basis. That means nearly half of data teams (48.3%) could find themselves working alongside stakeholders rather than their own peers. Data science operates in a more consolidated way with 40.7% having a centralised function and 26.7% a team working within a centralised analytics function (see

Figure 7.1). Figure 7.1: How is data science set up in your organisation? Whatever approach has been taken to the data department and its component functions, the data leader is presented with a

creation for the organisation. Data can appear to its internal customers as something of a ‘black box’, especially the work of functions like analytics and data science which generate models through an invisible, quasi-magical process. The level of communication with stakeholders has a significant impact on this, especially if data teams

is true only if the audience for what scientists produce is only other scientists and then only if they specialise in the same domain. For data scientists – indeed, for all analysts and everybody working in data – the audience will most often be non-technical, business-oriented. These stakeholders may be highly

insights digestible, winning belief and support even when what they have to say does not want to be heard. As Tom Spencer, head of customer data science at Aviva, says: “Keep communication clear and concise. In a technical field, it’s all too tempting to fall into jargon and complexity. Senior

While the data department does not operate in the same way as R&D, it can still incubate pioneering solutions, especially if it houses a data science function and allows a proportion of time to be given over to free exploration. One major initiative that can support this is a partnership with

academia, especially by building links with universities that run analytics, data science or related courses. These have expanded significantly in number in recent years and a key feature of most is an appetite for access to real

funding for a new university in Milton Keynes, which will have 5,000 students and be the UK’s first university focused on digital and data science skills. Similarly, the University of Cambridge has established the Cambridge Centre for Data-Driven Discovery (C2D3), a hub intended to harness the knowledge of

academics in data science, ethics and a wide range of disciplines. C2D3 will allow experts across a number of areas, including those with technical, mathematical and topic knowledge, to

spectrum of skills is essential, not just specific technical abilities. A personal brand gives credibility, visibility and accessibility to data practitioners. Data individuals – especially data scientists – can be at the heart of digital transformations, but cultural obstacles can prevent this from happening. To continue to develop as individuals (and as a

data native at the point where it starts a digital transformation (or has one thrust upon it). DataIQ researched the issue of how to get data science – often a key innovator that drives digital transformation – transferred into production. What the survey revealed is that culture factors make up three out of

five of the biggest challenges identified. Significantly, a lack of understanding of data science within the business emerged as the leading challenge, even above legacy infrastructure (39.5% vs 38.4% – see Figure 8.2). This underlines just

it is for data teams to demonstrate the art of the possible to stakeholders, as discussed in Chapter 7. Similarly, the issue of conflict between data science and IT, as identified by 33.6%, shows how difficult it is to establish data as an accepted and engaged department. Assuming, that is,

that enough data practitioners can be recruited, trained and retained – some 32.7% struggle to staff up their data science function. Again, this is where a positive data culture helps because it makes the organisation more appealing to candidates and incumbent practitioners. Figure 8.2

: What are your biggest data science challenges? Becoming a data champion One way to avoid these obstacles to getting data accepted as an enabler of digital transformation is by becoming a

identify since automation, AI, ML and the rest all require significant stretch by both data practitioners and the organisation. Recalling the challenges identified around data science in Figure 8.2, there is still a lot of work to be done in bringing these tools into the production environment. What is often

potential stakeholders. Should I do a Masters? A notable feature of recent years has been the emergence of formal, academic qualifications in data (including data science). It is now possible to build on degree-level technical skills, re-skill or up-skill from non-technical disciplines, or validate abilities that have

and analytics enjoyed a relatively prolonged honeymoon phase during the 2010s and some functions, such as data science, still do. In DataIQ research in H2 of 2020, 4.9% of organisations said they were still running data science as a cost centre, for example. By contrast, in a LinkedIn post in January 2021,

his team. Just as significantly, the same DataIQ study found 29.1% of organisations did not know what level of revenue uplift to expect from data science. At some point, all investments come under scrutiny and need to prove their worth, with the CFO likely to want to see a significant

in 2013 by Michal Kosinski, then deputy director of the University of Cambridge Psychometric Centre, revealed just what had become possible when ‘big data’ met data science. He started with the research question of whether psychological traits – in particular the ‘big five’ personality traits (openness, conscientiousness, extraversion, agreeableness, neuroticism) together with

one of his peers – Aleksandr Kogan – sold a similar solution to a then-nascent business called Cambridge Analytica which specialised in applying big data and data science to political campaigns. (Kosinski demurred from having his own work applied in the political realm on principle.) As is now only too familiar, this

and over 2,900 other orders for critical services during April and May 2020 alone. Transport for London (TfL): TfL’s transport analytics and data science team is instrumental in providing analysis of travel patterns on the public and private transport network. During the Covid-19 crisis, it provided analysis of

Mood Machine: The Rise of Spotify and the Costs of the Perfect Playlist

by Liz Pelly  · 7 Jan 2025  · 293pp  · 104,461 words

a head when I started researching how Warner A&Rs actually operate. In 2021, a couple of executives from the in-house Warner Music Group data science team explained, in a video, that the company was then processing information about its roughly 4.5 billion streams per day, all of which powered

Designing the Mind: The Principles of Psychitecture

by Designing The Mind and Ryan A Bush  · 10 Jan 2021

, or that movie rental is probably not a great industry to build a career in right now, or that you have a master’s in data science. Honestly, I have no idea what you saw in that job in the first place, Sarah. Once you learn and strengthen your ability to use

The Antisocial Network: The GameStop Short Squeeze and the Ragtag Group of Amateur Traders That Brought Wall Street to Its Knees

by Ben Mezrich  · 6 Sep 2021  · 239pp  · 74,845 words

was privileged—his dad still sent him checks every month to cover his living expenses and tuition. Before Covid, he had worked part time doing data science for a professor to help cover his student loans, but now he was mostly on the parental dole. He knew there were a lot of

will want to be susceptible to these types of dynamics. I think there will be a lot closer monitoring of message boards…we have a data science team that will be looking at that…You know, whatever regulation that you guys come up with—certainly we’ll abide by.” Even over livestream

The Biology of Belief: Unleashing the Power of Consciousness, Matter & Miracles

by Bruce H. Lipton  · 1 Jan 2005  · 220pp  · 66,518 words

The Outside In.” Science 292: 1284-1286. Jansen, R., H. Yu, et al. (2003). “A Bayesian Networks Approach for Predicting Protein-Protein Interactions from Genomic Data.” Science 302: 449-453. Jin, M., M. Blank, et al. (2000). “ERK1/2 Phosphorylation, Induced by Electromagnetic Fields, Diminishes During Neoplastic Transformation.” Journal of Cell Biology

The Dealmaker: Lessons From a Life in Private Equity

by Guy Hands  · 4 Nov 2021  · 341pp  · 107,933 words

marketing skills. This meant they were much less likely to be impressed by an arts student from Cambridge than someone with a Ph.D. in data science. With the bank’s financial support I set up what became known as the Cyber Room – a room full of extremely analytical, ludicrously intelligent, quantitative

The God Species: Saving the Planet in the Age of Humans

by Mark Lynas  · 3 Oct 2011  · 369pp  · 98,776 words

Circumpolar Permafrost Region,” Global Biogeochemical Cycles, 23, GB2023. 41. A. Bloom et al., 2010: “Large-Scale Controls of Methanogenesis Inferred from Methane and Gravity Spaceborne Data,” Science, 327, 5963, 322–5. 42. N. Shakhova et al., 2010: “Extensive Methane Venting to the Atmosphere from Sediments of the East Siberian Arctic Shelf,” Science

Bread, Wine, Chocolate: The Slow Loss of Foods We Love

by Simran Sethi  · 10 Nov 2015  · 396pp  · 112,832 words

Skin in the Game: Hidden Asymmetries in Daily Life

by Nassim Nicholas Taleb  · 20 Feb 2018  · 306pp  · 82,765 words

The End of College: Creating the Future of Learning and the University of Everywhere

by Kevin Carey  · 3 Mar 2015  · 319pp  · 90,965 words

The Man Behind the Microchip: Robert Noyce and the Invention of Silicon Valley

by Leslie Berlin  · 9 Jun 2005

Deep Time Reckoning: How Future Thinking Can Help Earth Now

by Vincent Ialenti  · 22 Sep 2020  · 224pp  · 69,593 words

Boom: Bubbles and the End of Stagnation

by Byrne Hobart and Tobias Huber  · 29 Oct 2024  · 292pp  · 106,826 words

American Marxism

by Mark R. Levin  · 12 Jul 2021  · 314pp  · 88,524 words

In Our Own Image: Savior or Destroyer? The History and Future of Artificial Intelligence

by George Zarkadakis  · 7 Mar 2016  · 405pp  · 117,219 words

How to Survive a Pandemic

by Michael Greger, M.D., FACLM  · 1,072pp  · 237,186 words

Apollo's Arrow: The Profound and Enduring Impact of Coronavirus on the Way We Live

by Nicholas A. Christakis  · 27 Oct 2020  · 475pp  · 127,389 words

Brotopia: Breaking Up the Boys' Club of Silicon Valley

by Emily Chang  · 6 Feb 2018  · 334pp  · 104,382 words

Army of None: Autonomous Weapons and the Future of War

by Paul Scharre  · 23 Apr 2018  · 590pp  · 152,595 words

The Buddha and the Badass: The Secret Spiritual Art of Succeeding at Work

by Vishen Lakhiani  · 14 Sep 2020

The Art of Corporate Success: The Story of Schlumberger

by Ken Auletta  · 28 Sep 2015  · 161pp  · 52,058 words

The Sirens of Mars: Searching for Life on Another World

by Sarah Stewart Johnson  · 6 Jul 2020  · 400pp  · 99,489 words

Superminds: The Surprising Power of People and Computers Thinking Together

by Thomas W. Malone  · 14 May 2018  · 344pp  · 104,077 words

Blueprint: The Evolutionary Origins of a Good Society

by Nicholas A. Christakis  · 26 Mar 2019

The Long History of the Future: Why Tomorrow's Technology Still Isn't Here

by Nicole Kobie  · 3 Jul 2024  · 348pp  · 119,358 words

Nexus: A Brief History of Information Networks From the Stone Age to AI

by Yuval Noah Harari  · 9 Sep 2024  · 566pp  · 169,013 words

Collaborative Society

by Dariusz Jemielniak and Aleksandra Przegalinska  · 18 Feb 2020  · 187pp  · 50,083 words

The Quiet Coup: Neoliberalism and the Looting of America

by Mehrsa Baradaran  · 7 May 2024  · 470pp  · 158,007 words

How Elites Ate the Social Justice Movement

by Fredrik Deboer  · 4 Sep 2023  · 211pp  · 78,547 words

Morning After the Revolution: Dispatches From the Wrong Side of History

by Nellie Bowles  · 13 May 2024  · 207pp  · 62,397 words

The Coming Wave: Technology, Power, and the Twenty-First Century's Greatest Dilemma

by Mustafa Suleyman  · 4 Sep 2023  · 444pp  · 117,770 words

Nobody's Fool: Why We Get Taken in and What We Can Do About It

by Daniel Simons and Christopher Chabris  · 10 Jul 2023  · 338pp  · 104,815 words

Rule of the Robots: How Artificial Intelligence Will Transform Everything

by Martin Ford  · 13 Sep 2021  · 288pp  · 86,995 words

The Creative Curve: How to Develop the Right Idea, at the Right Time

by Allen Gannett  · 11 Jun 2018  · 247pp  · 69,593 words

Explaining Humans: What Science Can Teach Us About Life, Love and Relationships

by Camilla Pang  · 12 Mar 2020  · 256pp  · 67,563 words

What Algorithms Want: Imagination in the Age of Computing

by Ed Finn  · 10 Mar 2017  · 285pp  · 86,853 words

The Myth of Artificial Intelligence: Why Computers Can't Think the Way We Do

by Erik J. Larson  · 5 Apr 2021

Artificial Intelligence: A Modern Approach

by Stuart Russell and Peter Norvig  · 14 Jul 2019  · 2,466pp  · 668,761 words

Places of the Heart: The Psychogeography of Everyday Life

by Colin Ellard  · 14 May 2015  · 313pp  · 92,053 words

Randomistas: How Radical Researchers Changed Our World

by Andrew Leigh  · 14 Sep 2018  · 340pp  · 94,464 words

The Blockchain Alternative: Rethinking Macroeconomic Policy and Economic Theory

by Kariappa Bheemaiah  · 26 Feb 2017  · 492pp  · 118,882 words

Small Wars, Big Data: The Information Revolution in Modern Conflict

by Eli Berman, Joseph H. Felter, Jacob N. Shapiro and Vestal Mcintyre  · 12 May 2018  · 517pp  · 147,591 words

The Half-Life of Facts: Why Everything We Know Has an Expiration Date

by Samuel Arbesman  · 31 Aug 2012  · 284pp  · 79,265 words

The End of Theory: Financial Crises, the Failure of Economics, and the Sweep of Human Interaction

by Richard Bookstaber  · 1 May 2017  · 293pp  · 88,490 words

No Is Not Enough: Resisting Trump’s Shock Politics and Winning the World We Need

by Naomi Klein  · 12 Jun 2017  · 357pp  · 94,852 words

Inside the Nudge Unit: How Small Changes Can Make a Big Difference

by David Halpern  · 26 Aug 2015  · 387pp  · 120,155 words

Big Data and the Welfare State: How the Information Revolution Threatens Social Solidarity

by Torben Iversen and Philipp Rehm  · 18 May 2022

Imaginable: How to See the Future Coming and Feel Ready for Anything―Even Things That Seem Impossible Today

by Jane McGonigal  · 22 Mar 2022  · 420pp  · 135,569 words

The Star Builders: Nuclear Fusion and the Race to Power the Planet

by Arthur Turrell  · 2 Aug 2021  · 297pp  · 84,447 words

New Power: How Power Works in Our Hyperconnected World--And How to Make It Work for You

by Jeremy Heimans and Henry Timms  · 2 Apr 2018  · 416pp  · 100,130 words

Two and Twenty: How the Masters of Private Equity Always Win

by Sachin Khajuria  · 13 Jun 2022  · 229pp  · 75,606 words

The Singularity Is Nearer: When We Merge with AI

by Ray Kurzweil  · 25 Jun 2024

Literary Theory for Robots: How Computers Learned to Write

by Dennis Yi Tenen  · 6 Feb 2024  · 169pp  · 41,887 words

The Science of Hate: How Prejudice Becomes Hate and What We Can Do to Stop It

by Matthew Williams  · 23 Mar 2021  · 592pp  · 125,186 words

I, Warbot: The Dawn of Artificially Intelligent Conflict

by Kenneth Payne  · 16 Jun 2021  · 339pp  · 92,785 words

Chasing My Cure: A Doctor's Race to Turn Hope Into Action; A Memoir

by David Fajgenbaum  · 9 Sep 2019

The Metropolitan Revolution: How Cities and Metros Are Fixing Our Broken Politics and Fragile Economy

by Bruce Katz and Jennifer Bradley  · 10 Jun 2013

Data and Goliath: The Hidden Battles to Collect Your Data and Control Your World

by Bruce Schneier  · 2 Mar 2015  · 598pp  · 134,339 words

Know Thyself

by Stephen M Fleming  · 27 Apr 2021

Why We Drive: Toward a Philosophy of the Open Road

by Matthew B. Crawford  · 8 Jun 2020  · 386pp  · 113,709 words

Soonish: Ten Emerging Technologies That'll Improve And/or Ruin Everything

by Kelly Weinersmith and Zach Weinersmith  · 16 Oct 2017  · 398pp  · 105,032 words

Blockchain: Blueprint for a New Economy

by Melanie Swan  · 22 Jan 2014  · 271pp  · 52,814 words

Platform Revolution: How Networked Markets Are Transforming the Economy--And How to Make Them Work for You

by Sangeet Paul Choudary, Marshall W. van Alstyne and Geoffrey G. Parker  · 27 Mar 2016  · 421pp  · 110,406 words

Lost in Math: How Beauty Leads Physics Astray

by Sabine Hossenfelder  · 11 Jun 2018  · 340pp  · 91,416 words

The Ethical Algorithm: The Science of Socially Aware Algorithm Design

by Michael Kearns and Aaron Roth  · 3 Oct 2019

World Without Mind: The Existential Threat of Big Tech

by Franklin Foer  · 31 Aug 2017  · 281pp  · 71,242 words

User Friendly: How the Hidden Rules of Design Are Changing the Way We Live, Work & Play

by Cliff Kuang and Robert Fabricant  · 7 Nov 2019

They Don't Represent Us: Reclaiming Our Democracy

by Lawrence Lessig  · 5 Nov 2019  · 404pp  · 115,108 words

Your Face Belongs to Us: A Secretive Startup's Quest to End Privacy as We Know It

by Kashmir Hill  · 19 Sep 2023  · 487pp  · 124,008 words

The 5 Types of Wealth: A Transformative Guide to Design Your Dream Life

by Sahil Bloom  · 4 Feb 2025  · 363pp  · 94,341 words

Breaking News: The Remaking of Journalism and Why It Matters Now

by Alan Rusbridger  · 14 Oct 2018  · 579pp  · 160,351 words

When McKinsey Comes to Town: The Hidden Influence of the World's Most Powerful Consulting Firm

by Walt Bogdanich and Michael Forsythe  · 3 Oct 2022  · 689pp  · 134,457 words

The Death of Truth: Notes on Falsehood in the Age of Trump

by Michiko Kakutani  · 17 Jul 2018  · 137pp  · 38,925 words

Fluke: Chance, Chaos, and Why Everything We Do Matters

by Brian Klaas  · 23 Jan 2024  · 250pp  · 96,870 words

The Art of R Programming

by Norman Matloff  · 404pp  · 43,442 words

Free Speech: Ten Principles for a Connected World

by Timothy Garton Ash  · 23 May 2016  · 743pp  · 201,651 words

The Sharing Economy: The End of Employment and the Rise of Crowd-Based Capitalism

by Arun Sundararajan  · 12 May 2016  · 375pp  · 88,306 words

The Elements of Statistical Learning (Springer Series in Statistics)

by Trevor Hastie, Robert Tibshirani and Jerome Friedman  · 25 Aug 2009  · 764pp  · 261,694 words

Throwing Rocks at the Google Bus: How Growth Became the Enemy of Prosperity

by Douglas Rushkoff  · 1 Mar 2016  · 366pp  · 94,209 words

Hit Refresh: The Quest to Rediscover Microsoft's Soul and Imagine a Better Future for Everyone

by Satya Nadella, Greg Shaw and Jill Tracie Nichols  · 25 Sep 2017  · 391pp  · 71,600 words

Laziness Does Not Exist

by Devon Price  · 5 Jan 2021  · 362pp  · 87,462 words

Safe Haven: Investing for Financial Storms

by Mark Spitznagel  · 9 Aug 2021  · 231pp  · 64,734 words

Ask Your Developer: How to Harness the Power of Software Developers and Win in the 21st Century

by Jeff Lawson  · 12 Jan 2021  · 282pp  · 85,658 words

The Man Who Solved the Market: How Jim Simons Launched the Quant Revolution

by Gregory Zuckerman  · 5 Nov 2019  · 407pp  · 104,622 words

Posh Boys: How English Public Schools Ruin Britain

by Robert Verkaik  · 14 Apr 2018  · 419pp  · 119,476 words

The Measure of Progress: Counting What Really Matters

by Diane Coyle  · 15 Apr 2025  · 321pp  · 112,477 words

Mapmatics: How We Navigate the World Through Numbers

by Paulina Rowinska  · 5 Jun 2024  · 361pp  · 100,834 words

The Digital Party: Political Organisation and Online Democracy

by Paolo Gerbaudo  · 19 Jul 2018  · 302pp  · 84,881 words

Designing Data-Intensive Applications: The Big Ideas Behind Reliable, Scalable, and Maintainable Systems

by Martin Kleppmann  · 16 Mar 2017  · 1,237pp  · 227,370 words

Designing Data-Intensive Applications: The Big Ideas Behind Reliable, Scalable, and Maintainable Systems

by Martin Kleppmann  · 17 Apr 2017

Python for Data Analysis: Data Wrangling with Pandas, NumPy, and IPython

by Wes McKinney  · 25 Sep 2017  · 1,829pp  · 135,521 words

Shadow Work: The Unpaid, Unseen Jobs That Fill Your Day

by Craig Lambert  · 30 Apr 2015  · 229pp  · 72,431 words

Numpy Beginner's Guide - Third Edition

by Ivan Idris  · 23 Jun 2015  · 681pp  · 64,159 words

Pearls of Functional Algorithm Design

by Richard Bird  · 15 Sep 2010

The Participation Revolution: How to Ride the Waves of Change in a Terrifyingly Turbulent World

by Neil Gibb  · 15 Feb 2018  · 217pp  · 63,287 words

Lurking: How a Person Became a User

by Joanne McNeil  · 25 Feb 2020  · 239pp  · 80,319 words

Life as a Passenger: How Driverless Cars Will Change the World

by David Kerrigan  · 18 Jun 2017  · 472pp  · 80,835 words

Machine Learning for Email

by Drew Conway and John Myles White  · 25 Oct 2011  · 163pp  · 42,402 words

Binge Times: Inside Hollywood's Furious Billion-Dollar Battle to Take Down Netflix

by Dade Hayes and Dawn Chmielewski  · 18 Apr 2022  · 414pp  · 117,581 words

Searches: Selfhood in the Digital Age

by Vauhini Vara  · 8 Apr 2025  · 301pp  · 105,209 words

This Is Service Design Doing: Applying Service Design Thinking in the Real World: A Practitioners' Handbook

by Marc Stickdorn, Markus Edgar Hormess, Adam Lawrence and Jakob Schneider  · 12 Jan 2018  · 704pp  · 182,312 words

Who Will Defend Europe?: An Awakened Russia and a Sleeping Continent

by Keir Giles  · 24 Oct 2024  · 296pp  · 81,440 words

Chaos Monkeys: Obscene Fortune and Random Failure in Silicon Valley

by Antonio Garcia Martinez  · 27 Jun 2016  · 559pp  · 155,372 words

Machine Learning for Hackers

by Drew Conway and John Myles White  · 10 Feb 2012  · 451pp  · 103,606 words

Powerful: Teams, Leaders and the Culture of Freedom and Responsibility

by Patty McCord  · 9 Jan 2018  · 130pp  · 43,665 words

This Is Not Normal: The Collapse of Liberal Britain

by William Davies  · 28 Sep 2020  · 210pp  · 65,833 words

An Elegant Puzzle: Systems of Engineering Management

by Will Larson  · 19 May 2019  · 227pp  · 63,186 words

Roads and Bridges

by Nadia Eghbal  · 139pp  · 35,022 words

Blitzscaling: The Lightning-Fast Path to Building Massively Valuable Companies

by Reid Hoffman and Chris Yeh  · 14 Apr 2018  · 286pp  · 87,401 words

Platform Scale: How an Emerging Business Model Helps Startups Build Large Empires With Minimum Investment

by Sangeet Paul Choudary  · 14 Sep 2015  · 302pp  · 73,581 words

Blockchain Chicken Farm: And Other Stories of Tech in China's Countryside

by Xiaowei Wang  · 12 Oct 2020  · 196pp  · 61,981 words

Robot, Take the Wheel: The Road to Autonomous Cars and the Lost Art of Driving

by Jason Torchinsky  · 6 May 2019  · 175pp  · 54,755 words

Underwater: How Our American Dream of Homeownership Became a Nightmare

by Ryan Dezember  · 13 Jul 2020  · 279pp  · 87,875 words

Ours to Hack and to Own: The Rise of Platform Cooperativism, a New Vision for the Future of Work and a Fairer Internet

by Trebor Scholz and Nathan Schneider  · 14 Aug 2017  · 237pp  · 67,154 words

Fulfillment: Winning and Losing in One-Click America

by Alec MacGillis  · 16 Mar 2021  · 426pp  · 136,925 words

Algospeak: How Social Media Is Transforming the Future of Language

by Adam Aleksic  · 15 Jul 2025  · 278pp  · 71,701 words

Flash Crash: A Trading Savant, a Global Manhunt, and the Most Mysterious Market Crash in History

by Liam Vaughan  · 11 May 2020  · 268pp  · 81,811 words

Learn Algorithmic Trading

by Sebastien Donadio  · 7 Nov 2019

Network Security Through Data Analysis: Building Situational Awareness

by Michael S Collins  · 23 Feb 2014  · 446pp  · 102,421 words

The Network Imperative: How to Survive and Grow in the Age of Digital Business Models

by Barry Libert and Megan Beck  · 6 Jun 2016  · 285pp  · 58,517 words

Common Stocks and Uncommon Profits and Other Writings

by Philip A. Fisher  · 13 Apr 2015

Solr in Action

by Trey Grainger and Timothy Potter  · 14 Sep 2014  · 1,085pp  · 219,144 words

The Art of Invisibility: The World's Most Famous Hacker Teaches You How to Be Safe in the Age of Big Brother and Big Data

by Kevin Mitnick, Mikko Hypponen and Robert Vamosi  · 14 Feb 2017  · 305pp  · 93,091 words

Capital Allocators: How the World’s Elite Money Managers Lead and Invest

by Ted Seides  · 23 Mar 2021  · 199pp  · 48,162 words

Ethics of Big Data: Balancing Risk and Innovation

by Kord Davis and Doug Patterson  · 30 Dec 2011  · 98pp  · 25,753 words

Why Startups Fail: A New Roadmap for Entrepreneurial Success

by Tom Eisenmann  · 29 Mar 2021  · 387pp  · 106,753 words

Natural Language Processing with Python and spaCy

by Yuli Vasiliev  · 2 Apr 2020

The War on Normal People: The Truth About America's Disappearing Jobs and Why Universal Basic Income Is Our Future

by Andrew Yang  · 2 Apr 2018  · 300pp  · 76,638 words

Makers and Takers: The Rise of Finance and the Fall of American Business

by Rana Foroohar  · 16 May 2016  · 515pp  · 132,295 words

The Great Fragmentation: And Why the Future of All Business Is Small

by Steve Sammartino  · 25 Jun 2014  · 247pp  · 81,135 words

Hadoop: The Definitive Guide

by Tom White  · 29 May 2009  · 933pp  · 205,691 words

Platform Capitalism

by Nick Srnicek  · 22 Dec 2016  · 116pp  · 31,356 words

How to Turn Down a Billion Dollars: The Snapchat Story

by Billy Gallagher  · 13 Feb 2018  · 359pp  · 96,019 words

Python for Finance

by Yuxing Yan  · 24 Apr 2014  · 408pp  · 85,118 words

Open for Business Harnessing the Power of Platform Ecosystems

by Lauren Turner Claire, Laure Claire Reillier and Benoit Reillier  · 14 Oct 2017  · 240pp  · 78,436 words

Working Backwards: Insights, Stories, and Secrets From Inside Amazon

by Colin Bryar and Bill Carr  · 9 Feb 2021  · 302pp  · 100,493 words

Re-Educated: Why It’s Never Too Late to Change Your Life

by Lucy Kellaway  · 30 Jun 2021  · 184pp  · 60,229 words

Just Keep Buying: Proven Ways to Save Money and Build Your Wealth

by Nick Maggiulli  · 15 May 2022  · 287pp  · 62,824 words

Python for Data Analysis

by Wes McKinney  · 30 Dec 2011  · 752pp  · 131,533 words

The Jobs to Be Done Playbook: Align Your Markets, Organization, and Strategy Around Customer Needs

by Jim Kalbach  · 6 Apr 2020

Technically Wrong: Sexist Apps, Biased Algorithms, and Other Threats of Toxic Tech

by Sara Wachter-Boettcher  · 9 Oct 2017  · 223pp  · 60,909 words

Agile Project Management With Kanban

by Eric Brechner  · 25 Feb 2015

The Upstarts: How Uber, Airbnb, and the Killer Companies of the New Silicon Valley Are Changing the World

by Brad Stone  · 30 Jan 2017  · 373pp  · 112,822 words

Measure What Matters: How Google, Bono, and the Gates Foundation Rock the World With OKRs

by John Doerr  · 23 Apr 2018  · 280pp  · 71,268 words

Python for Algorithmic Trading: From Idea to Cloud Deployment

by Yves Hilpisch  · 8 Dec 2020  · 1,082pp  · 87,792 words

Winning Now, Winning Later

by David M. Cote  · 17 Apr 2020  · 297pp  · 93,882 words

Extreme Teams: Why Pixar, Netflix, AirBnB, and Other Cutting-Edge Companies Succeed Where Most Fail

by Robert Bruce Shaw, James Foster and Brilliance Audio  · 14 Oct 2017  · 280pp  · 82,355 words

Where Does It Hurt?: An Entrepreneur's Guide to Fixing Health Care

by Jonathan Bush and Stephen Baker  · 14 May 2014  · 238pp  · 68,914 words

Hands-On RESTful API Design Patterns and Best Practices

by Harihara Subramanian  · 31 Jan 2019  · 422pp  · 86,414 words

The Gig Economy: A Critical Introduction

by Jamie Woodcock and Mark Graham  · 17 Jan 2020  · 207pp  · 59,298 words

Going Infinite: The Rise and Fall of a New Tycoon

by Michael Lewis  · 2 Oct 2023  · 263pp  · 92,618 words

Number Go Up: Inside Crypto's Wild Rise and Staggering Fall

by Zeke Faux  · 11 Sep 2023  · 385pp  · 106,848 words

Palace Coup: The Billionaire Brawl Over the Bankrupt Caesars Gaming Empire

by Sujeet Indap and Max Frumes  · 16 Mar 2021  · 362pp  · 116,497 words

The Revolution That Wasn't: GameStop, Reddit, and the Fleecing of Small Investors

by Spencer Jakab  · 1 Feb 2022  · 420pp  · 94,064 words

Radicalized

by Cory Doctorow  · 19 Mar 2019  · 444pp  · 84,486 words

The Big Fix: How Companies Capture Markets and Harm Canadians

by Denise Hearn and Vass Bednar  · 14 Oct 2024  · 175pp  · 46,192 words

Careless People: A Cautionary Tale of Power, Greed, and Lost Idealism

by Sarah Wynn-Williams  · 11 Mar 2025  · 370pp  · 115,318 words

Nginx Essentials

by Valery Kholodkov  · 21 Jul 2015  · 174pp  · 34,672 words

The Rust Programming Language

by Steve Klabnik and Carol Nichols  · 14 Jun 2018  · 821pp  · 178,631 words

Personal Finance with Python

by Max Humber  · 156pp  · 15,746 words

How to Work Without Losing Your Mind

by Cate Sevilla  · 14 Jan 2021

Python Tricks: The Book

by Dan Bader  · 14 Oct 2017  · 262pp  · 60,248 words

Easy Money: Cryptocurrency, Casino Capitalism, and the Golden Age of Fraud

by Ben McKenzie and Jacob Silverman  · 17 Jul 2023  · 329pp  · 99,504 words

The Long Good Buy: Analysing Cycles in Markets

by Peter Oppenheimer  · 3 May 2020  · 333pp  · 76,990 words

Super Pumped: The Battle for Uber

by Mike Isaac  · 2 Sep 2019  · 444pp  · 127,259 words

Growth Hacker Marketing: A Primer on the Future of PR, Marketing, and Advertising

by Ryan Holiday  · 2 Sep 2013  · 52pp  · 14,333 words

Getting Started with D3

by Mike Dewar  · 26 Jun 2012  · 100pp  · 15,500 words

What's Yours Is Mine: Against the Sharing Economy

by Tom Slee  · 18 Nov 2015  · 265pp  · 69,310 words

Gigged: The End of the Job and the Future of Work

by Sarah Kessler  · 11 Jun 2018  · 246pp  · 68,392 words

Relevant Search: With Examples Using Elasticsearch and Solr

by Doug Turnbull and John Berryman  · 30 Apr 2016  · 593pp  · 118,995 words

Python Data Analytics: With Pandas, NumPy, and Matplotlib

by Fabio Nelli  · 27 Sep 2018  · 688pp  · 107,867 words

Tech Titans of China: How China's Tech Sector Is Challenging the World by Innovating Faster, Working Harder, and Going Global

by Rebecca Fannin  · 2 Sep 2019  · 269pp  · 70,543 words

The Nation City: Why Mayors Are Now Running the World

by Rahm Emanuel  · 25 Feb 2020  · 212pp  · 69,846 words

How I Built This: The Unexpected Paths to Success From the World's Most Inspiring Entrepreneurs

by Guy Raz  · 14 Sep 2020  · 361pp  · 107,461 words

Whistleblower: My Journey to Silicon Valley and Fight for Justice at Uber

by Susan Fowler  · 18 Feb 2020  · 205pp  · 71,872 words

Subscribed: Why the Subscription Model Will Be Your Company's Future - and What to Do About It

by Tien Tzuo and Gabe Weisert  · 4 Jun 2018  · 244pp  · 66,977 words

The Wealth Ladder: Proven Strategies for Every Step of Your Financial Life

by Nick Maggiulli  · 22 Jul 2025

App Kid: How a Child of Immigrants Grabbed a Piece of the American Dream

by Michael Sayman  · 20 Sep 2021  · 285pp  · 91,144 words

Filterworld: How Algorithms Flattened Culture

by Kyle Chayka  · 15 Jan 2024  · 321pp  · 105,480 words

It's Not TV: The Spectacular Rise, Revolution, and Future of HBO

by Felix Gillette and John Koblin  · 1 Nov 2022  · 575pp  · 140,384 words

No Filter: The Inside Story of Instagram

by Sarah Frier  · 13 Apr 2020  · 484pp  · 114,613 words

Rebooting India: Realizing a Billion Aspirations

by Nandan Nilekani  · 4 Feb 2016  · 332pp  · 100,601 words

The Airbnb Story: How Three Ordinary Guys Disrupted an Industry, Made Billions...and Created Plenty of Controversy

by Leigh Gallagher  · 14 Feb 2017  · 290pp  · 87,549 words

Tomorrow's Lawyers: An Introduction to Your Future

by Richard Susskind  · 10 Jan 2013  · 160pp  · 45,516 words

The Science of Storytelling: Why Stories Make Us Human, and How to Tell Them Better

by Will Storr  · 3 Apr 2019  · 234pp  · 68,798 words

Still Broke: Walmart's Remarkable Transformation and the Limits of Socially Conscious Capitalism

by Rick Wartzman  · 15 Nov 2022  · 215pp  · 69,370 words

The Shame Machine: Who Profits in the New Age of Humiliation

by Cathy O'Neil  · 15 Mar 2022  · 318pp  · 73,713 words

Failed State: The Sunday Times Bestselling Investigation Into Why Britain Is Struggling

by Sam Freedman  · 10 Jul 2024  · 368pp  · 101,133 words

Everybody Loses: The Tumultuous Rise of American Sports Gambling

by Danny Funt  · 20 Jan 2026  · 285pp  · 100,897 words

Text Analytics With Python: A Practical Real-World Approach to Gaining Actionable Insights From Your Data

by Dipanjan Sarkar  · 1 Dec 2016

Trillions: How a Band of Wall Street Renegades Invented the Index Fund and Changed Finance Forever

by Robin Wigglesworth  · 11 Oct 2021  · 432pp  · 106,612 words

Masters of Scale: Surprising Truths From the World's Most Successful Entrepreneurs

by Reid Hoffman, June Cohen and Deron Triff  · 14 Oct 2021  · 309pp  · 96,168 words

The Truth About Lies: The Illusion of Honesty and the Evolution of Deceit

by Aja Raden  · 10 May 2021  · 291pp  · 85,822 words

Interactive Data Visualization for the Web

by Scott Murray  · 15 Mar 2013  · 186pp  · 50,651 words

Zucked: Waking Up to the Facebook Catastrophe

by Roger McNamee  · 1 Jan 2019  · 382pp  · 105,819 words

Understanding Sponsored Search: Core Elements of Keyword Advertising

by Jim Jansen  · 25 Jul 2011  · 298pp  · 43,745 words

The Data Warehouse Toolkit: The Definitive Guide to Dimensional Modeling

by Ralph Kimball and Margy Ross  · 30 Jun 2013

Terms of Service: Social Media and the Price of Constant Connection

by Jacob Silverman  · 17 Mar 2015  · 527pp  · 147,690 words

Hacker, Hoaxer, Whistleblower, Spy: The Story of Anonymous

by Gabriella Coleman  · 4 Nov 2014  · 457pp  · 126,996 words

An Ugly Truth: Inside Facebook's Battle for Domination

by Sheera Frenkel and Cecilia Kang  · 12 Jul 2021  · 372pp  · 100,947 words

Finding Alphas: A Quantitative Approach to Building Trading Strategies

by Igor Tulchinsky  · 30 Sep 2019  · 321pp

Hands-On Machine Learning With Scikit-Learn, Keras, and TensorFlow: Concepts, Tools, and Techniques to Build Intelligent Systems

by Aurelien Geron  · 14 Aug 2019

Politics on the Edge: The Instant #1 Sunday Times Bestseller From the Host of Hit Podcast the Rest Is Politics

by Rory Stewart  · 13 Sep 2023  · 534pp  · 157,700 words

Cabin Fever: The Harrowing Journey of a Cruise Ship at the Dawn of a Pandemic

by Michael Smith and Jonathan Franklin  · 14 Jul 2022  · 244pp  · 78,238 words

The Messy Middle: Finding Your Way Through the Hardest and Most Crucial Part of Any Bold Venture

by Scott Belsky  · 1 Oct 2018  · 425pp  · 112,220 words

The Future Is Analog: How to Create a More Human World

by David Sax  · 15 Jan 2022  · 282pp  · 93,783 words

Vassal State

by Angus Hanton  · 25 Mar 2024  · 277pp  · 81,718 words

Don't Be Evil: How Big Tech Betrayed Its Founding Principles--And All of US

by Rana Foroohar  · 5 Nov 2019  · 380pp  · 109,724 words

Extremely Hardcore: Inside Elon Musk's Twitter

by Zoë Schiffer  · 13 Feb 2024  · 343pp  · 92,693 words

Mattering: The Secret to Building a Life of Deep Connection and Purpose

by Jennifer Breheny Wallace  · 13 Jan 2026  · 206pp  · 68,830 words

Rich White Men: What It Takes to Uproot the Old Boys' Club and Transform America

by Garrett Neiman  · 19 Jun 2023  · 386pp  · 112,064 words

Arriving Today: From Factory to Front Door -- Why Everything Has Changed About How and What We Buy

by Christopher Mims  · 13 Sep 2021  · 385pp  · 112,842 words

Data Wrangling With Python: Tips and Tools to Make Your Life Easier

by Jacqueline Kazil  · 4 Feb 2016

The Ones We've Been Waiting For: How a New Generation of Leaders Will Transform America

by Charlotte Alter  · 18 Feb 2020  · 504pp  · 129,087 words

Traffic: Genius, Rivalry, and Delusion in the Billion-Dollar Race to Go Viral

by Ben Smith  · 2 May 2023

Age of the City: Why Our Future Will Be Won or Lost Together

by Ian Goldin and Tom Lee-Devlin  · 21 Jun 2023  · 248pp  · 73,689 words

Transaction Man: The Rise of the Deal and the Decline of the American Dream

by Nicholas Lemann  · 9 Sep 2019  · 354pp  · 118,970 words

Peers Inc: How People and Platforms Are Inventing the Collaborative Economy and Reinventing Capitalism

by Robin Chase  · 14 May 2015  · 330pp  · 91,805 words

Forward: Notes on the Future of Our Democracy

by Andrew Yang  · 15 Nov 2021

Exploring Everyday Things with R and Ruby

by Sau Sheong Chang  · 27 Jun 2012

Super Founders: What Data Reveals About Billion-Dollar Startups

by Ali Tamaseb  · 14 Sep 2021  · 251pp  · 80,831 words

Automating Inequality

by Virginia Eubanks  · 294pp  · 77,356 words

Happy City: Transforming Our Lives Through Urban Design

by Charles Montgomery  · 12 Nov 2013  · 432pp  · 124,635 words

The History of the Future: Oculus, Facebook, and the Revolution That Swept Virtual Reality

by Blake J. Harris  · 19 Feb 2019  · 561pp  · 163,916 words

Digital Minimalism: Choosing a Focused Life in a Noisy World

by Cal Newport  · 5 Feb 2019  · 279pp  · 71,542 words

Advances in Financial Machine Learning

by Marcos Lopez de Prado  · 2 Feb 2018  · 571pp  · 105,054 words

Bullshit Jobs: A Theory

by David Graeber  · 14 May 2018  · 385pp  · 123,168 words

The Choice Factory: 25 Behavioural Biases That Influence What We Buy

by Richard Shotton  · 12 Feb 2018  · 184pp  · 46,395 words

A World of Three Zeros: The New Economics of Zero Poverty, Zero Unemployment, and Zero Carbon Emissions

by Muhammad Yunus  · 25 Sep 2017  · 278pp  · 74,880 words

Billion Dollar Brand Club: How Dollar Shave Club, Warby Parker, and Other Disruptors Are Remaking What We Buy

by Lawrence Ingrassia  · 28 Jan 2020  · 290pp  · 90,057 words

Modern Vim: Craft Your Development Environment With Vim 8 and Neovim

by Drew Neil  · 2 May 2018  · 241pp  · 43,252 words

The Smart Wife: Why Siri, Alexa, and Other Smart Home Devices Need a Feminist Reboot

by Yolande Strengers and Jenny Kennedy  · 14 Apr 2020

Digital Empires: The Global Battle to Regulate Technology

by Anu Bradford  · 25 Sep 2023  · 898pp  · 236,779 words

Seeking SRE: Conversations About Running Production Systems at Scale

by David N. Blank-Edelman  · 16 Sep 2018

Algorithms of Oppression: How Search Engines Reinforce Racism

by Safiya Umoja Noble  · 8 Jan 2018  · 290pp  · 73,000 words

Like, Comment, Subscribe: Inside YouTube's Chaotic Rise to World Domination

by Mark Bergen  · 5 Sep 2022  · 642pp  · 141,888 words

The Wires of War: Technology and the Global Struggle for Power

by Jacob Helberg  · 11 Oct 2021  · 521pp  · 118,183 words

Digital Transformation at Scale: Why the Strategy Is Delivery

by Andrew Greenway,Ben Terrett,Mike Bracken,Tom Loosemore  · 18 Jun 2018

The Internet of Us: Knowing More and Understanding Less in the Age of Big Data

by Michael P. Lynch  · 21 Mar 2016  · 230pp  · 61,702 words

Beautiful Visualization

by Julie Steele  · 20 Apr 2010

Smarter Faster Better: The Secrets of Being Productive in Life and Business

by Charles Duhigg  · 8 Mar 2016  · 401pp  · 119,488 words

On the Edge: The Art of Risking Everything

by Nate Silver  · 12 Aug 2024  · 848pp  · 227,015 words

Work in the Future The Automation Revolution-Palgrave MacMillan (2019)

by Robert Skidelsky Nan Craig  · 15 Mar 2020

This Is Not Propaganda: Adventures in the War Against Reality

by Peter Pomerantsev  · 29 Jul 2019  · 240pp  · 74,182 words

Cogs and Monsters: What Economics Is, and What It Should Be

by Diane Coyle  · 11 Oct 2021  · 305pp  · 75,697 words

Demystifying Smart Cities

by Anders Lisdorf

Data and the City

by Rob Kitchin,Tracey P. Lauriault,Gavin McArdle  · 2 Aug 2017

Unit X: How the Pentagon and Silicon Valley Are Transforming the Future of War

by Raj M. Shah and Christopher Kirchhoff  · 8 Jul 2024  · 272pp  · 103,638 words

We Are the Nerds: The Birth and Tumultuous Life of Reddit, the Internet's Culture Laboratory

by Christine Lagorio-Chafkin  · 1 Oct 2018

Lessons from the Titans: What Companies in the New Economy Can Learn from the Great Industrial Giants to Drive Sustainable Success

by Scott Davis, Carter Copeland and Rob Wertheimer  · 13 Jul 2020  · 372pp  · 101,678 words

Amazon Unbound: Jeff Bezos and the Invention of a Global Empire

by Brad Stone  · 10 May 2021  · 569pp  · 156,139 words

The Best Interface Is No Interface: The Simple Path to Brilliant Technology (Voices That Matter)

by Golden Krishna  · 10 Feb 2015  · 271pp  · 62,538 words

Coders: The Making of a New Tribe and the Remaking of the World

by Clive Thompson  · 26 Mar 2019  · 499pp  · 144,278 words

The Deep Learning Revolution (The MIT Press)

by Terrence J. Sejnowski  · 27 Sep 2018

Industry 4.0: The Industrial Internet of Things

by Alasdair Gilchrist  · 27 Jun 2016

Big Data: A Revolution That Will Transform How We Live, Work, and Think

by Viktor Mayer-Schonberger and Kenneth Cukier  · 5 Mar 2013  · 304pp  · 82,395 words

Move Fast and Break Things: How Facebook, Google, and Amazon Cornered Culture and Undermined Democracy

by Jonathan Taplin  · 17 Apr 2017  · 222pp  · 70,132 words

Lean Analytics: Use Data to Build a Better Startup Faster

by Alistair Croll and Benjamin Yoskovitz  · 1 Mar 2013  · 567pp  · 122,311 words

Machine, Platform, Crowd: Harnessing Our Digital Future

by Andrew McAfee and Erik Brynjolfsson  · 26 Jun 2017  · 472pp  · 117,093 words

Everydata: The Misinformation Hidden in the Little Data You Consume Every Day

by John H. Johnson  · 27 Apr 2016  · 250pp  · 64,011 words

The Inner Lives of Markets: How People Shape Them—And They Shape Us

by Tim Sullivan  · 6 Jun 2016  · 252pp  · 73,131 words

The Inevitable: Understanding the 12 Technological Forces That Will Shape Our Future

by Kevin Kelly  · 6 Jun 2016  · 371pp  · 108,317 words

Web Scraping With Python: Collecting Data From the Modern Web

by Ryan Mitchell  · 14 Jun 2015  · 255pp  · 78,207 words

Hello World: Being Human in the Age of Algorithms

by Hannah Fry  · 17 Sep 2018  · 296pp  · 78,631 words

Infonomics: How to Monetize, Manage, and Measure Information as an Asset for Competitive Advantage

by Douglas B. Laney  · 4 Sep 2017  · 374pp  · 94,508 words

Ten Arguments for Deleting Your Social Media Accounts Right Now

by Jaron Lanier  · 28 May 2018  · 151pp  · 39,757 words

Radical Markets: Uprooting Capitalism and Democracy for a Just Society

by Eric Posner and E. Weyl  · 14 May 2018  · 463pp  · 105,197 words

Uncanny Valley: A Memoir

by Anna Wiener  · 14 Jan 2020  · 237pp  · 74,109 words

The Elements of Data Analytic Style

by Jeff Leek  · 1 Mar 2015  · 50pp  · 13,399 words

Frenemies: The Epic Disruption of the Ad Business

by Ken Auletta  · 4 Jun 2018  · 379pp  · 109,223 words

PostgreSQL: Up and Running, 3rd Edition

by Unknown

Bold: How to Go Big, Create Wealth and Impact the World

by Peter H. Diamandis and Steven Kotler  · 3 Feb 2015  · 368pp  · 96,825 words

The Levelling: What’s Next After Globalization

by Michael O’sullivan  · 28 May 2019  · 756pp  · 120,818 words

Them and Us: How Immigrants and Locals Can Thrive Together

by Philippe Legrain  · 14 Oct 2020  · 521pp  · 110,286 words

The Fifth Domain: Defending Our Country, Our Companies, and Ourselves in the Age of Cyber Threats

by Richard A. Clarke and Robert K. Knake  · 15 Jul 2019  · 409pp  · 112,055 words

Science Fictions: How Fraud, Bias, Negligence, and Hype Undermine the Search for Truth

by Stuart Ritchie  · 20 Jul 2020

Future Politics: Living Together in a World Transformed by Tech

by Jamie Susskind  · 3 Sep 2018  · 533pp

Robot Rules: Regulating Artificial Intelligence

by Jacob Turner  · 29 Oct 2018  · 688pp  · 147,571 words

The Smartphone Society

by Nicole Aschoff

How Medicine Works and When It Doesn't: Learning Who to Trust to Get and Stay Healthy

by F. Perry Wilson  · 24 Jan 2023  · 286pp  · 92,521 words

The People's Republic of Walmart: How the World's Biggest Corporations Are Laying the Foundation for Socialism

by Leigh Phillips and Michal Rozworski  · 5 Mar 2019  · 202pp  · 62,901 words

Escape From Model Land: How Mathematical Models Can Lead Us Astray and What We Can Do About It

by Erica Thompson  · 6 Dec 2022  · 250pp  · 79,360 words

The Nature of Software Development: Keep It Simple, Make It Valuable, Build It Piece by Piece

by Ron Jeffries  · 14 Aug 2015  · 444pp  · 118,393 words

The Drunkard's Walk: How Randomness Rules Our Lives

by Leonard Mlodinow  · 12 May 2008  · 266pp  · 86,324 words

Rationality: What It Is, Why It Seems Scarce, Why It Matters

by Steven Pinker  · 14 Oct 2021  · 533pp  · 125,495 words

Nervous States: Democracy and the Decline of Reason

by William Davies  · 26 Feb 2019  · 349pp  · 98,868 words

Why Machines Learn: The Elegant Math Behind Modern AI

by Anil Ananthaswamy  · 15 Jul 2024  · 416pp  · 118,522 words

Everything Is Predictable: How Bayesian Statistics Explain Our World

by Tom Chivers  · 6 May 2024  · 283pp  · 102,484 words

The Attention Merchants: The Epic Scramble to Get Inside Our Heads

by Tim Wu  · 14 May 2016  · 515pp  · 143,055 words

Humankind: A Hopeful History

by Rutger Bregman  · 1 Jun 2020  · 578pp  · 131,346 words

Stories Are Weapons: Psychological Warfare and the American Mind

by Annalee Newitz  · 3 Jun 2024  · 251pp  · 68,713 words

The Future Is Faster Than You Think: How Converging Technologies Are Transforming Business, Industries, and Our Lives

by Peter H. Diamandis and Steven Kotler  · 28 Jan 2020  · 501pp  · 114,888 words

What If We Get It Right?: Visions of Climate Futures

by Ayana Elizabeth Johnson  · 17 Sep 2024  · 588pp  · 160,825 words

The Premonition: A Pandemic Story

by Michael Lewis  · 3 May 2021  · 285pp  · 98,832 words

Rage Inside the Machine: The Prejudice of Algorithms, and How to Stop the Internet Making Bigots of Us All

by Robert Elliott Smith  · 26 Jun 2019  · 370pp  · 107,983 words

Tools and Weapons: The Promise and the Peril of the Digital Age

by Brad Smith and Carol Ann Browne  · 9 Sep 2019  · 482pp  · 121,173 words

Bezonomics: How Amazon Is Changing Our Lives and What the World's Best Companies Are Learning From It

by Brian Dumaine  · 11 May 2020  · 411pp  · 98,128 words

If Then: How Simulmatics Corporation Invented the Future

by Jill Lepore  · 14 Sep 2020  · 467pp  · 149,632 words

The Coming of Neo-Feudalism: A Warning to the Global Middle Class

by Joel Kotkin  · 11 May 2020  · 393pp  · 91,257 words

The Second Machine Age: Work, Progress, and Prosperity in a Time of Brilliant Technologies

by Erik Brynjolfsson and Andrew McAfee  · 20 Jan 2014  · 339pp  · 88,732 words

The Hype Machine: How Social Media Disrupts Our Elections, Our Economy, and Our Health--And How We Must Adapt

by Sinan Aral  · 14 Sep 2020  · 475pp  · 134,707 words

The AI-First Company

by Ash Fontana  · 4 May 2021  · 296pp  · 66,815 words

The Alignment Problem: Machine Learning and Human Values

by Brian Christian  · 5 Oct 2020  · 625pp  · 167,349 words

Conflicted: How Productive Disagreements Lead to Better Outcomes

by Ian Leslie  · 23 Feb 2021  · 280pp  · 82,393 words

Enlightenment Now: The Case for Reason, Science, Humanism, and Progress

by Steven Pinker  · 13 Feb 2018  · 1,034pp  · 241,773 words

What to Think About Machines That Think: Today's Leading Thinkers on the Age of Machine Intelligence

by John Brockman  · 5 Oct 2015  · 481pp  · 125,946 words

Head, Hand, Heart: Why Intelligence Is Over-Rewarded, Manual Workers Matter, and Caregivers Deserve More Respect

by David Goodhart  · 7 Sep 2020  · 463pp  · 115,103 words

Leadership by Algorithm: Who Leads and Who Follows in the AI Era?

by David de Cremer  · 25 May 2020  · 241pp  · 70,307 words

Futureproof: 9 Rules for Humans in the Age of Automation

by Kevin Roose  · 9 Mar 2021  · 208pp  · 57,602 words

The Data Journalism Handbook

by Jonathan Gray, Lucy Chambers and Liliana Bounegru  · 9 May 2012

Succeeding With AI: How to Make AI Work for Your Business

by Veljko Krunic  · 29 Mar 2020

The Job: The Future of Work in the Modern Era

by Ellen Ruppel Shell  · 22 Oct 2018  · 402pp  · 126,835 words

The Age of Surveillance Capitalism

by Shoshana Zuboff  · 15 Jan 2019  · 918pp  · 257,605 words

Superforecasting: The Art and Science of Prediction

by Philip Tetlock and Dan Gardner  · 14 Sep 2015  · 317pp  · 100,414 words

Human + Machine: Reimagining Work in the Age of AI

by Paul R. Daugherty and H. James Wilson  · 15 Jan 2018  · 523pp  · 61,179 words

Architects of Intelligence

by Martin Ford  · 16 Nov 2018  · 586pp  · 186,548 words

Radical Abundance: How a Revolution in Nanotechnology Will Change Civilization

by K. Eric Drexler  · 6 May 2013  · 445pp  · 105,255 words

Mining Social Media: Finding Stories in Internet Data

by Lam Thuy Vo  · 21 Nov 2019  · 237pp  · 65,794 words

Mastering Pandas

by Femi Anthony  · 21 Jun 2015  · 589pp  · 69,193 words

Trees on Mars: Our Obsession With the Future

by Hal Niedzviecki  · 15 Mar 2015  · 343pp  · 102,846 words

The Seventh Sense: Power, Fortune, and Survival in the Age of Networks

by Joshua Cooper Ramo  · 16 May 2016  · 326pp  · 103,170 words

Surveillance Valley: The Rise of the Military-Digital Complex

by Yasha Levine  · 6 Feb 2018  · 474pp  · 130,575 words

Capitalism: the unknown ideal

by Ayn Rand  · 15 Aug 1966  · 400pp  · 129,841 words

The Big Nine: How the Tech Titans and Their Thinking Machines Could Warp Humanity

by Amy Webb  · 5 Mar 2019  · 340pp  · 97,723 words

The World in 2050: Four Forces Shaping Civilization's Northern Future

by Laurence C. Smith  · 22 Sep 2010  · 421pp  · 120,332 words

Pinpoint: How GPS Is Changing Our World

by Greg Milner  · 4 May 2016  · 385pp  · 103,561 words

Lab Rats: How Silicon Valley Made Work Miserable for the Rest of Us

by Dan Lyons  · 22 Oct 2018  · 252pp  · 78,780 words

The Organized Mind: Thinking Straight in the Age of Information Overload

by Daniel J. Levitin  · 18 Aug 2014  · 685pp  · 203,949 words

Overcomplicated: Technology at the Limits of Comprehension

by Samuel Arbesman  · 18 Jul 2016  · 222pp  · 53,317 words

Hacking Growth: How Today's Fastest-Growing Companies Drive Breakout Success

by Sean Ellis and Morgan Brown  · 24 Apr 2017  · 344pp  · 96,020 words

Algorithms to Live By: The Computer Science of Human Decisions

by Brian Christian and Tom Griffiths  · 4 Apr 2016  · 523pp  · 143,139 words

Predictive Analytics: The Power to Predict Who Will Click, Buy, Lie, or Die

by Eric Siegel  · 19 Feb 2013  · 502pp  · 107,657 words

But What if We're Wrong? Thinking About the Present as if It Were the Past

by Chuck Klosterman  · 6 Jun 2016  · 281pp  · 78,317 words

Prediction Machines: The Simple Economics of Artificial Intelligence

by Ajay Agrawal, Joshua Gans and Avi Goldfarb  · 16 Apr 2018  · 345pp  · 75,660 words

AI Superpowers: China, Silicon Valley, and the New World Order

by Kai-Fu Lee  · 14 Sep 2018  · 307pp  · 88,180 words

The Fifth Risk

by Michael Lewis  · 1 Oct 2018  · 157pp  · 53,125 words

Invisible Women

by Caroline Criado Perez  · 12 Mar 2019  · 480pp  · 119,407 words

Beautiful Data: The Stories Behind Elegant Data Solutions

by Toby Segaran and Jeff Hammerbacher  · 1 Jul 2009

Big Data Analytics: Turning Big Data Into Big Money

by Frank J. Ohlhorst  · 28 Nov 2012  · 133pp  · 42,254 words

Thank You for Being Late: An Optimist's Guide to Thriving in the Age of Accelerations

by Thomas L. Friedman  · 22 Nov 2016  · 602pp  · 177,874 words

The Future of the Brain: Essays by the World's Leading Neuroscientists

by Gary Marcus and Jeremy Freeman  · 1 Nov 2014  · 336pp  · 93,672 words

Utopia Is Creepy: And Other Provocations

by Nicholas Carr  · 5 Sep 2016  · 391pp  · 105,382 words

The Water Will Come: Rising Seas, Sinking Cities, and the Remaking of the Civilized World

by Jeff Goodell  · 23 Oct 2017  · 292pp  · 92,588 words

The Internet Is Not the Answer

by Andrew Keen  · 5 Jan 2015  · 361pp  · 81,068 words

Uberland: How Algorithms Are Rewriting the Rules of Work

by Alex Rosenblat  · 22 Oct 2018  · 343pp  · 91,080 words

Who Owns the Future?

by Jaron Lanier  · 6 May 2013  · 510pp  · 120,048 words

Rise of the Robots: Technology and the Threat of a Jobless Future

by Martin Ford  · 4 May 2015  · 484pp  · 104,873 words

Future Crimes: Everything Is Connected, Everyone Is Vulnerable and What We Can Do About It

by Marc Goodman  · 24 Feb 2015  · 677pp  · 206,548 words

The Industries of the Future

by Alec Ross  · 2 Feb 2016  · 364pp  · 99,897 words

Facebook: The Inside Story

by Steven Levy  · 25 Feb 2020  · 706pp  · 202,591 words

The Black Box Society: The Secret Algorithms That Control Money and Information

by Frank Pasquale  · 17 Nov 2014  · 320pp  · 87,853 words

The Patient Will See You Now: The Future of Medicine Is in Your Hands

by Eric Topol  · 6 Jan 2015  · 588pp  · 131,025 words

Data-Ism: The Revolution Transforming Decision Making, Consumer Behavior, and Almost Everything Else

by Steve Lohr  · 10 Mar 2015  · 239pp  · 70,206 words

The Data Detective: Ten Easy Rules to Make Sense of Statistics

by Tim Harford  · 2 Feb 2021  · 428pp  · 103,544 words

WTF?: What's the Future and Why It's Up to Us

by Tim O'Reilly  · 9 Oct 2017  · 561pp  · 157,589 words

The Data Revolution: Big Data, Open Data, Data Infrastructures and Their Consequences

by Rob Kitchin  · 25 Aug 2014

Uncharted: How to Map the Future

by Margaret Heffernan  · 20 Feb 2020  · 335pp  · 97,468 words

Hit Makers: The Science of Popularity in an Age of Distraction

by Derek Thompson  · 7 Feb 2017  · 416pp  · 108,370 words

The Costs of Connection: How Data Is Colonizing Human Life and Appropriating It for Capitalism

by Nick Couldry and Ulises A. Mejias  · 19 Aug 2019  · 458pp  · 116,832 words

Human Compatible: Artificial Intelligence and the Problem of Control

by Stuart Russell  · 7 Oct 2019  · 416pp  · 112,268 words

Whiteshift: Populism, Immigration and the Future of White Majorities

by Eric Kaufmann  · 24 Oct 2018  · 691pp  · 203,236 words

Democracy for Sale: Dark Money and Dirty Politics

by Peter Geoghegan  · 2 Jan 2020  · 388pp  · 111,099 words

Augmented: Life in the Smart Lane

by Brett King  · 5 May 2016  · 385pp  · 111,113 words

The Creativity Code: How AI Is Learning to Write, Paint and Think

by Marcus Du Sautoy  · 7 Mar 2019  · 337pp  · 103,522 words

The Loop: How Technology Is Creating a World Without Choices and How to Fight Back

by Jacob Ward  · 25 Jan 2022  · 292pp  · 94,660 words

The Ransomware Hunting Team: A Band of Misfits' Improbable Crusade to Save the World From Cybercrime

by Renee Dudley and Daniel Golden  · 24 Oct 2022  · 392pp  · 114,189 words

Possible Minds: Twenty-Five Ways of Looking at AI

by John Brockman  · 19 Feb 2019  · 339pp  · 94,769 words

Model Thinker: What You Need to Know to Make Data Work for You

by Scott E. Page  · 27 Nov 2018  · 543pp  · 153,550 words

SuperBetter: The Power of Living Gamefully

by Jane McGonigal  · 14 Sep 2015  · 525pp  · 147,008 words

Applied Artificial Intelligence: A Handbook for Business Leaders

by Mariya Yao, Adelyn Zhou and Marlene Jia  · 1 Jun 2018  · 161pp  · 39,526 words

The Great Disruption: Why the Climate Crisis Will Bring on the End of Shopping and the Birth of a New World

by Paul Gilding  · 28 Mar 2011  · 337pp  · 103,273 words

Thinking with Data

by Max Shron  · 15 Aug 2014

Likewar: The Weaponization of Social Media

by Peter Warren Singer and Emerson T. Brooking  · 15 Mar 2018

Ghost Work: How to Stop Silicon Valley From Building a New Global Underclass

by Mary L. Gray and Siddharth Suri  · 6 May 2019  · 346pp  · 97,330 words

How to Prevent the Next Pandemic

by Bill Gates  · 2 May 2022  · 406pp  · 88,977 words

More From Less: The Surprising Story of How We Learned to Prosper Using Fewer Resources – and What Happens Next

by Andrew McAfee  · 30 Sep 2019  · 372pp  · 94,153 words

Spike: The Virus vs The People - The Inside Story

by Jeremy Farrar and Anjana Ahuja  · 15 Jan 2021  · 245pp  · 71,886 words

AIQ: How People and Machines Are Smarter Together

by Nick Polson and James Scott  · 14 May 2018  · 301pp  · 85,126 words

A World Without Email: Reimagining Work in an Age of Communication Overload

by Cal Newport  · 2 Mar 2021  · 350pp  · 90,898 words

Democracy's Data: The Hidden Stories in the U.S. Census and How to Read Them

by Dan Bouk  · 22 Aug 2022  · 424pp  · 123,180 words

Empire of AI: Dreams and Nightmares in Sam Altman's OpenAI

by Karen Hao  · 19 May 2025  · 660pp  · 179,531 words

Survival of the Richest: Escape Fantasies of the Tech Billionaires

by Douglas Rushkoff  · 7 Sep 2022  · 205pp  · 61,903 words

The Book of Why: The New Science of Cause and Effect

by Judea Pearl and Dana Mackenzie  · 1 Mar 2018

The Bill Gates Problem: Reckoning With the Myth of the Good Billionaire

by Tim Schwab  · 13 Nov 2023  · 618pp  · 179,407 words

More Everything Forever: AI Overlords, Space Empires, and Silicon Valley's Crusade to Control the Fate of Humanity

by Adam Becker  · 14 Jun 2025  · 381pp  · 119,533 words

The Geek Way: The Radical Mindset That Drives Extraordinary Results

by Andrew McAfee  · 14 Nov 2023  · 381pp  · 113,173 words

Outnumbered: From Facebook and Google to Fake News and Filter-Bubbles – the Algorithms That Control Our Lives

by David Sumpter  · 18 Jun 2018  · 276pp  · 81,153 words

When It All Burns: Fighting Fire in a Transformed World

by Jordan Thomas  · 27 May 2025  · 347pp  · 105,327 words

The Technology Trap: Capital, Labor, and Power in the Age of Automation

by Carl Benedikt Frey  · 17 Jun 2019  · 626pp  · 167,836 words

The Equality Machine: Harnessing Digital Technology for a Brighter, More Inclusive Future

by Orly Lobel  · 17 Oct 2022  · 370pp  · 112,809 words

Don't Trust Your Gut: Using Data to Get What You Really Want in LIfe

by Seth Stephens-Davidowitz  · 9 May 2022  · 287pp  · 69,655 words

The Globotics Upheaval: Globalisation, Robotics and the Future of Work

by Richard Baldwin  · 10 Jan 2019  · 301pp  · 89,076 words

Four Battlegrounds

by Paul Scharre  · 18 Jan 2023

The Chaos Machine: The Inside Story of How Social Media Rewired Our Minds and Our World

by Max Fisher  · 5 Sep 2022  · 439pp  · 131,081 words

This Could Be Our Future: A Manifesto for a More Generous World

by Yancey Strickler  · 29 Oct 2019  · 254pp  · 61,387 words

Strength in Numbers: How Polls Work and Why We Need Them

by G. Elliott Morris  · 11 Jul 2022  · 252pp  · 71,176 words

Code Dependent: Living in the Shadow of AI

by Madhumita Murgia  · 20 Mar 2024  · 336pp  · 91,806 words

Co-Intelligence: Living and Working With AI

by Ethan Mollick  · 2 Apr 2024  · 189pp  · 58,076 words

Not the End of the World

by Hannah Ritchie  · 9 Jan 2024  · 335pp  · 101,992 words

Bad Data Handbook

by Q. Ethan McCallum  · 14 Nov 2012  · 398pp  · 86,855 words

Dark Mirror: Edward Snowden and the Surveillance State

by Barton Gellman  · 20 May 2020  · 562pp  · 153,825 words

She Has Her Mother's Laugh

by Carl Zimmer  · 29 May 2018

Exponential: How Accelerating Technology Is Leaving Us Behind and What to Do About It

by Azeem Azhar  · 6 Sep 2021  · 447pp  · 111,991 words

Artificial Unintelligence: How Computers Misunderstand the World

by Meredith Broussard  · 19 Apr 2018  · 245pp  · 83,272 words

The Theory That Would Not Die: How Bayes' Rule Cracked the Enigma Code, Hunted Down Russian Submarines, and Emerged Triumphant From Two Centuries of Controversy

by Sharon Bertsch McGrayne  · 16 May 2011  · 561pp  · 120,899 words

AI 2041: Ten Visions for Our Future

by Kai-Fu Lee and Qiufan Chen  · 13 Sep 2021

Uncontrolled Spread: Why COVID-19 Crushed Us and How We Can Defeat the Next Pandemic

by Scott Gottlieb  · 20 Sep 2021

System Error: Where Big Tech Went Wrong and How We Can Reboot

by Rob Reich, Mehran Sahami and Jeremy M. Weinstein  · 6 Sep 2021

The Knowledge Machine: How Irrationality Created Modern Science

by Michael Strevens  · 12 Oct 2020

Kill It With Fire: Manage Aging Computer Systems

by Marianne Bellotti  · 17 Mar 2021  · 232pp  · 71,237 words

The Happiness Industry: How the Government and Big Business Sold Us Well-Being

by William Davies  · 11 May 2015  · 317pp  · 87,566 words

Physics of the Future: How Science Will Shape Human Destiny and Our Daily Lives by the Year 2100

by Michio Kaku  · 15 Mar 2011  · 523pp  · 148,929 words

Failures of State: The Inside Story of Britain's Battle With Coronavirus

by Jonathan Calvert and George Arbuthnott  · 18 Mar 2021  · 432pp  · 143,491 words

Keeping Up With the Quants: Your Guide to Understanding and Using Analytics

by Thomas H. Davenport and Jinho Kim  · 10 Jun 2013  · 204pp  · 58,565 words

Big Data at Work: Dispelling the Myths, Uncovering the Opportunities

by Thomas H. Davenport  · 4 Feb 2014

Dataclysm: Who We Are (When We Think No One's Looking)

by Christian Rudder  · 8 Sep 2014  · 366pp  · 76,476 words

The People vs Tech: How the Internet Is Killing Democracy (And How We Save It)

by Jamie Bartlett  · 4 Apr 2018  · 170pp  · 49,193 words

The Art of Statistics: Learning From Data

by David Spiegelhalter  · 14 Oct 2019  · 442pp  · 94,734 words

The Weather of the Future

by Heidi Cullen  · 2 Aug 2010  · 391pp  · 99,963 words

Your Computer Is on Fire

by Thomas S. Mullaney, Benjamin Peters, Mar Hicks and Kavita Philip  · 9 Mar 2021  · 661pp  · 156,009 words

Artificial Whiteness

by Yarden Katz

Data Action: Using Data for Public Good

by Sarah Williams  · 14 Sep 2020

The Art of Statistics: How to Learn From Data

by David Spiegelhalter  · 2 Sep 2019  · 404pp  · 92,713 words

Immortality, Inc.

by Chip Walter  · 7 Jan 2020  · 232pp  · 72,483 words

New Laws of Robotics: Defending Human Expertise in the Age of AI

by Frank Pasquale  · 14 May 2020  · 1,172pp  · 114,305 words

Climate Change

by Joseph Romm  · 3 Dec 2015  · 358pp  · 93,969 words

Life as We Made It: How 50,000 Years of Human Innovation Refined--And Redefined--Nature

by Beth Shapiro  · 15 Dec 2021  · 338pp  · 105,112 words

AI in Museums: Reflections, Perspectives and Applications

by Sonja Thiel and Johannes C. Bernhardt  · 31 Dec 2023  · 321pp  · 113,564 words

Upscale: What It Takes to Scale a Startup. By the People Who've Done It.

by James Silver  · 15 Nov 2018  · 291pp  · 90,771 words

The Master Algorithm: How the Quest for the Ultimate Learning Machine Will Remake Our World

by Pedro Domingos  · 21 Sep 2015  · 396pp  · 117,149 words

The Future of the Professions: How Technology Will Transform the Work of Human Experts

by Richard Susskind and Daniel Susskind  · 24 Aug 2015  · 742pp  · 137,937 words

NumPy Cookbook

by Ivan Idris  · 30 Sep 2012  · 197pp  · 35,256 words

Only Humans Need Apply: Winners and Losers in the Age of Smart Machines

by Thomas H. Davenport and Julia Kirby  · 23 May 2016  · 347pp  · 97,721 words

Data for the Public Good

by Alex Howard  · 21 Feb 2012  · 25pp  · 5,789 words

Deep Medicine: How Artificial Intelligence Can Make Healthcare Human Again

by Eric Topol  · 1 Jan 2019  · 424pp  · 114,905 words

Everybody Lies: Big Data, New Data, and What the Internet Can Tell Us About Who We Really Are

by Seth Stephens-Davidowitz  · 8 May 2017  · 337pp  · 86,320 words

Merchants of Truth: The Business of News and the Fight for Facts

by Jill Abramson  · 5 Feb 2019  · 788pp  · 223,004 words

Calling Bullshit: The Art of Scepticism in a Data-Driven World

by Jevin D. West and Carl T. Bergstrom  · 3 Aug 2020

We Are Data: Algorithms and the Making of Our Digital Selves

by John Cheney-Lippold  · 1 May 2017  · 420pp  · 100,811 words

Mindf*ck: Cambridge Analytica and the Plot to Break America

by Christopher Wylie  · 8 Oct 2019

Growth: A Reckoning

by Daniel Susskind  · 16 Apr 2024  · 358pp  · 109,930 words

No Ordinary Disruption: The Four Global Forces Breaking All the Trends

by Richard Dobbs and James Manyika  · 12 May 2015  · 389pp  · 87,758 words

Reinventing Capitalism in the Age of Big Data

by Viktor Mayer-Schönberger and Thomas Ramge  · 27 Feb 2018  · 267pp  · 72,552 words

Weapons of Math Destruction: How Big Data Increases Inequality and Threatens Democracy

by Cathy O'Neil  · 5 Sep 2016  · 252pp  · 72,473 words

Exponential Organizations: Why New Organizations Are Ten Times Better, Faster, and Cheaper Than Yours (And What to Do About It)

by Salim Ismail and Yuri van Geest  · 17 Oct 2014  · 292pp  · 85,151 words

Mastering Machine Learning With Scikit-Learn

by Gavin Hackeling  · 31 Oct 2014

We Are Bellingcat: Global Crime, Online Sleuths, and the Bold Future of News

by Eliot Higgins  · 2 Mar 2021  · 277pp  · 70,506 words

Targeted: The Cambridge Analytica Whistleblower's Inside Story of How Big Data, Trump, and Facebook Broke Democracy and How It Can Happen Again

by Brittany Kaiser  · 21 Oct 2019  · 391pp  · 123,597 words

Scikit-Learn Cookbook

by Trent Hauck  · 3 Nov 2014

Devil's Bargain: Steve Bannon, Donald Trump, and the Storming of the Presidency

by Joshua Green  · 17 Jul 2017  · 296pp  · 78,112 words

Architecting Modern Data Platforms: A Guide to Enterprise Hadoop at Scale

by Jan Kunigk, Ian Buss, Paul Wilkinson and Lars George  · 8 Jan 2019  · 1,409pp  · 205,237 words

Designing Great Data Products

by Jeremy Howard, Mike Loukides and Margit Zwemer  · 23 Mar 2012  · 23pp  · 5,264 words

Machine Learning Design Patterns: Solutions to Common Challenges in Data Preparation, Model Building, and MLOps

by Valliappa Lakshmanan, Sara Robinson and Michael Munn  · 31 Oct 2020

Character Limit: How Elon Musk Destroyed Twitter

by Kate Conger and Ryan Mac  · 17 Sep 2024

Hands-On Machine Learning With Scikit-Learn and TensorFlow: Concepts, Tools, and Techniques to Build Intelligent Systems

by Aurélien Géron  · 13 Mar 2017  · 1,331pp  · 163,200 words

Applied Text Analysis With Python: Enabling Language-Aware Data Products With Machine Learning

by Benjamin Bengfort, Rebecca Bilbro and Tony Ojeda  · 10 Jun 2018  · 125pp  · 27,675 words

Mining the Social Web: Finding Needles in the Social Haystack

by Matthew A. Russell  · 15 Jan 2011  · 541pp  · 109,698 words

Pandas for Everyone: Python Data Analysis

by Unknown