June 1, 2016 (Vol. 36, No. 11)
Vicki Glaser Writer GEN
Computer Scientist Pedro Domingos Is Possessed Of an Ambition That Is Archimedean
Pedro Domingos, Ph.D., professor of computer science and engineering at the University of Washington, believes that cancer can be eliminated if we get serious about machine learning, an approach to artificial intelligence (AI) that gives computers the ability to “think” for themselves. Dr. Domingos argues that a thinking machine—one that learns from experience, not programming—will draw inferences, generate discoveries, and offer ever more accurate predictions about each cancer’s origins and vulnerabilities. The ultimate output, suggests Dr. Domingos, will consist of treatments and cures.
Dr. Domingos is the author of “The Master Algorithm: How the Quest for the Ultimate Learning Machine Will Remake Our World.” This book holds that with access to enough data, a Master Algorithm could derive all knowledge including scientific knowledge.
More and more scientific data is becoming available to “learners,” algorithms that can figure things out on their own and essentially program themselves. Much of this scientific data is relevant to bioinformatics.
When the field of bioinformatics was new, data was relatively scarce, and there was a clearly perceived disconnect between the black-and-white nature of computational analysis and the gray, fuzzy nature of biology. Now bioinformatics is maturing, and large amounts of data are being produced by high-throughput screening, DNA microarrays, and next-generation sequencing. And so, as Dr. Domingos explains in this interview, bioinformatics is gaining the heft needed to take on cancer.
GEN: How is machine learning different than what is commonly known as AI?
Dr. Domingos: Machine learning is a subfield of artificial intelligence. Different subfields of AI deal with different aspects of intelligence: reasoning, language, vision, problem-solving, etc. Learning is arguably the most important one. If a computer were as intelligent as a human but had no ability to learn, it would immediately fall behind and never recover. Machine learning is what’s driving the current wave of progress in AI.
GEN: What do you view as the biggest misconceptions about machine learning at present and the greatest flaws in the arguments of machine-learning skeptics?
GEN: In your book, you present a scenario in which a Master Algorithm could cure cancer. To realize this scenario, would all aspects of biology and pathology have to be translated into digital information? Is this an achievable goal now or in the near future?
GEN: You describe machine learning as “the scientific method on steroids,” being able to generate, test, discard, and refine hypotheses in silico. What would the Master Algorithm you propose, a single, universal learning algorithm—which would have access to all of the information in the biomedical literature and to patient records—look like and be capable of in terms of discovering a cure for cancer? How would that work?
Dr. Domingos: It would provide a detailed model of how cells work, both healthy and cancerous. We would then be able to instantiate that model to each particular patient and cancer, and probe it with different drugs until we were able to find one that works, or even design a new drug, if needed. All of this would be done at high speed, giving results the same day that the tumor is sequenced.
GEN: In fact, is it more appropriate to talk about a Master Algorithm discovering “cures” for cancer, since cancer is not one disease, has many different causes and presentations, and tumors can mutate as they grow and spread?
Dr. Domingos: Exactly. There is no single cure for cancer in the sense of a single drug that cures all cancers. The real cure is a machine learning system that inputs the cancer’s details as well as the patient’s genome, medical history, etc., and outputs the recommended treatment.
GEN: You write that machine learning alone will not cure cancer. Instead, you suggest that machine learning will do so in concert with cancer patients who will share their data for the benefit of future patients. Why is access to patient data and clinical outcomes so important? How does it contribute to “inverse deduction,” which you describe as the first step in curing cancer?
GEN: You call the complex, machine learning-based program that will one day be able to input a cancer’s genome and output a drug to kill the tumor CanceRx, and write that it is now possible to picture what that program will look like. Can you describe it? Can you tell us how far along it is in development?
Dr. Domingos: As I mentioned, CanceRx could be as simple as a system that directly predicts which drug to use from the patient’s data, or as complex as using a detailed model of how cells work to test candidate drugs in silico. Rapid progress is being made across the full spectrum, from assembling patient data and learning from it to modeling metabolic and gene regulatory networks, but there is still a long way to go.
GEN: Can some of the same concepts and uses of machine learning discussed above be applied to vaccine development, and is it being used for vaccine discovery? When an infectious agent such as Zika virus emerges and rapidly spreads, do you envision that a Master Algorithm could relatively quickly identify an effective vaccine?