October 1, 2009 (Vol. 29, No. 17)
Each Finding Adds to Knowledge Base and Helps Advance Field
Biomarkers are rapidly gaining importance in biological research and drug discovery and development. A biomarker, which is measurable by minimally invasive means, reveals important physiologic information about an organism. Most people associate biomarkers with serum protein diagnostics. Biomarkers can, however, also be genetic, epigenetic, small molecule, or even an imaging marker.
As often as they are sought in diagnostics, they are also of interest for prognostic prediction, therapeutic monitoring, surrogate endpoints in clinical studies, and as companion diagnostics in drug development. All major pharmaceutical companies now have biomarker programs that complement their drug discovery programs from the earliest stages.
Protein biomarkers in human blood are some of the most desired because of their obvious clinical utility. Estimates of the number of proteins in the human plasma proteome start at around 50,000 and trend upward, depending on how one counts isomers, post-translational modifications, and other variations.
The range of abundance of proteins in blood or plasma covers about 12 orders of magnitude. Albumin is by far the most abundant protein in blood, and must usually be removed before studying low-abundance proteins. Devising a platform for protein biomarker discovery in the blood is, therefore, challenging.
Mass spectrometry is one of the most powerful methods for analyzing protein biomarkers, and Monarch Life Sciences uses it for preclinical and clinical biomarker sample testing, and biomarker discovery and validation. An assay based on mass spectrometry can be much more sensitive and specific in distinguishing isoforms or protein modifications.
In a case study presented by Monarch scientists at CHI’s “Biomarker World Congress” earlier this year, procollagen-1 n-terminal peptide (P1NP), was detected in rat plasma by selected reaction monitoring, a technique that measures the abundance of a target molecule based on peptide fragments from a trypsin digest. P1NP is an indicator of bone growth or other bone-related changes and has recently been implicated in some other diseases and conditions.
A critical part of assay development is establishing a linear working range for protein concentration. For P1NP, the effective range was from about 2 nM to 200 nM, which covers the effective physiologic range of the protein. By changing individual amino acids in the peptide targeted by mass spec, the assay can follow a therapeutic program from preclinical to clinical stages, progressing from rat, through dog and monkey, to human subjects.
“The hardest thing is the dynamic range of the proteome,” says Mu Wang, Ph.D., vp of research. Efforts are ongoing at Monarch to expand the effective dynamic range of its assays. In spite of this drawback, the mass spectrometry based approach has some major advantages over immunoassay-based methods, not least of which is that there is no need for antibodies and the development time can be as short as a few weeks.
Where antibodies are available, however, an immunoassay is an effective way to monitor a biomarker without investment in a mass spectrometer. Caspase-cleaved cytokeratin 18 (ccCK18) is a marker for apoptosis that is useful for monitoring the effects of cancer therapy. An immunoassay for ccCK18 (M30-Apoptosense) has been developed by Peviva in collaboration with Stig Linder, Ph.D., at the Karolinska Institute. The company also has an assay for total epithelial cell death (M65 ELISA). The combined use of these assays facilitates the determination of cell-death mode (i.e., distinguishing apoptosis from necrosis).
If a therapeutic agent is effective, cancer cells will begin to undergo apoptosis and release ccCK18 into the blood where it can be detected within a couple of days of treatment. Such early detection of therapeutic response is highly desirable—patients may, otherwise, endure weeks or months of chemotherapy before tumor response can be measured.
Because CK18 is expressed in epithelial cells, and most tumors (e.g., breast, prostate, lung, colon, liver, ovarian) are derived from epithelial cells, the assay can be very specific. “Most cancer therapeutical agents,” says Dr. Linder, “are toxic to white blood cells” which, he notes, do not express cytokeratins. “If we see an increased signal in patient blood using the M30-Apoptosense ELISA, we know that it is derived from an apoptosis product formed in epithelially derived cells (and not from white cells).” Dr. Linder will talk about his work at the upcoming Select Biosciences “European Biomarker Summit” in Barcelona.
Mining for Multivariate Markers
One of the biggest changes in recent years is the migration to multivariate biomarkers. It is becoming clear that the serum proteome does not offer many clear, individual biomarkers of disease, and that further advances will require looking for panels or profiles that can track changes in a number of molecules simultaneously.
Darius Dziuda, Ph.D., a professor in the department of mathematical sciences at Central Connecticut State University, is working to educate scientists about using data-mining methods to identify multivariate biomarkers. Using a multivariate approach is critically important, according to Dr. Dziuda.
“Too many studies are still limited to the univariate approach, if some of them result in efficient classifiers, it’s ok. However, the univariate approach not only neglects correlations between genes, but also removes from considerations, genes that are not significant univariately, but are very important in combination with other genes.”
Using a multivariate approach means looking for a set of genes or variables that can differentiate between classes or disease states. The focus of Dr. Dziuda’s paper at the Barcelona meeting will be his methods for identification of stable multivariate biomarkers. “First, using heuristic multivariate methods, we identify the informative set of genes that includes all significant discriminatory information. There are typically a few hundred genes in such a set. Some of them are univariately significant, others could not be identified by univariate methods.
“Then, we build a large number of bootstrap-based classifiers, which are used to vote for variables and to identify the most important expression patterns. Finally, feature selection performed on these patterns leads to small multivariate biomarkers that are stable and biologically interpretable.”
The next step is validating the resulting multivariate biomarker using external data. Validation of biomarkers is a somewhat contentious subject. There is an argument to be made that a biomarker panel does not need to be validated or mechanistically characterized in order to be useful—that the pattern alone is sufficient for clinical or research purposes.
It is becoming more and more apparent, however, that in order to make the best use of a set of genes, their function and relationship should be discovered. (The function of the individual molecule does not necessarily translate to the biological interpretation of the set.) So, while multivariate biomarkers could be useful without a biological context, this will inevitably be a temporary situation.
One of the projects Dr. Dziuda has finished uses publicly available data from acute lymphoblastic leukemia. “After filtering noise, we had about 7,000 genes. The informative set of genes included about 200 genes. Using ensembles of classifiers we identified the most frequently used genes and the most important expression patterns. Then, heuristic feature selection identified a multivariate biomarker of five genes.
“This biomarker worked well on independent test data. This and other case studies indicate that this approach works very well and results in robust multivariate biomarkers.” The method is applicable, not just for early diagnosis of disease, but for prognosis, therapeutic response, and many other situations.
“Whenever you have a case that has a number of classes that are not that easy to differentiate, it is possible that there’s a multivariate gene- or protein-expression pattern that can be used for efficient classification.”
Epigenetics Ease Cell Identification
One limitation of genetic markers is that in many cases they can only indicate a predisposition for a disease, but do not offer a quantitative differential between a diagnosable disease state and nondisease state. Cancerous tumors are a notable exception, because genetic changes can be observed in the tumor cells. However, for many conditions, a different type of marker is needed.
Epiontis is developing epigenetic markers for cell identification. The company’s initial focus was quality control tests for cellular therapeutics, working in collaboration with companies such as Genzyme. Some of its first assays identified chondrocytes in cartilage transplants for knee injuries. The marker they used was DNA methylation of specific gene regions, and the test was eventually licensed to Genzyme.
“Epiontis is now pursuing the same technology for immune monitoring,” says CBO Ulrich Hoffmueller, who will speak at the Barcelona conference. “We had several candidates and identified that demethylation of the FOXP3 gene was the best marker that could be found for regulatory T cells.”
FOXP3 has two benefits. First, it has much higher specificity than the protein that was used before. Second, the qPCR assay is technically much simpler than the fluorescence-activated cell sorting (FACS) analysis that would be required to identify the T cells otherwise. This is helpful on two levels. It is simpler and quicker to do in the laboratory, and it is also possible to compare results from different laboratories.
For large, multicenter clinical trials, it is often necessary to collect and analyze samples at separate laboratories. Using FACS, it is necessary to analyze all samples at the same time on one machine, in order to get the best results. Epiontis currently uses its regulatory T-cell qPCR assay for clinical trial immune monitoring services for clients.
Biomarkers encompass many seemingly unrelated branches of science and many disparate applications. However, each time a new biomarker, or set of biomarkers, is discovered it brings into focus a portion of the biological system from which it came. Biomarkers contribute to the understanding of biological systems, which contributes to the understanding of disease mechanism, which contributes to the discovery of new and better biomarkers.
Significant hurdles remain, especially when it comes to global biomarker discovery. It may be quite a long time before scientists can sift through the entire proteome, genome, or metabolome for the perfect biomarker. Improved technology platforms boost discovery and development efforts for biomarkers. Bioinformatics and systems biology help researchers study molecular networks, instead of single molecules, increasing the odds of developing successful biomarker assays.