Leading the Way in Life Science Technologies

GEN Exclusives

More »


More »
February 15, 2012 (Vol. 32, No. 4)

Finding New, High-Quality Compounds

Combining Chemists’ Expertise and Software Capabilities to Explore Different Paths with StarDrop

  • What Makes a Good Idea?

    Click Image To Enlarge +
    Figure 2. The goal of Pareto optimization is a compound with both high solubility and potency (represented by the yellow star). While this ideal is not achievable, the red points represent Pareto optimal points, i.e., for each of these compounds there are no compounds with both better potency and solubility (for example the point labeled A). Conversely, the point B is not Pareto optimal because there is a point that has both higher potency and solubility.

    A successful drug must possess a delicate balance of many properties in order to be efficacious and safe, including potency against its therapeutic target, appropriate physicochemical and absorption, distribution, metabolism, and elimination (ADME) properties, and a lack of off-target effects and nonspecific toxicity. The simultaneous optimization of many of these properties is commonly described as multiparameter optimization (MPO), and many approaches have been developed to facilitate this in drug discovery.

    A predictive model of a single property can be used to prioritize compound ideas for one objective, while MPO can be used to simultaneously consider many criteria that a drug discovery project must optimize. However, the greatest benefit can be gained by coupling algorithms for idea generation with MPO, to generate new ideas that have the best balance of properties for a drug discovery project’s therapeutic objectives. This ensures that efforts are focused on those chemistries with the highest chance of downstream success.

    Pareto Ligand Designer takes the approach of combining a transformation-based approach for idea generation with a Pareto optimization algorithm. Pareto optimization does not select compounds based on a single ideal profile, but explores a range of solutions each with a different, optimal balance of properties (Figure 2).

    This approach is most useful when the best combination of properties is not known a priori and therefore it is advantageous to explore a range of different property profiles.

  • Click Image To Enlarge +
    Figure 3. An example of a scoring profile that defines the property criteria for an ideal compound and the importance of each individual criterion to the overall objective of a project. Because it is often impossible to find an ideal compound that meets all of the criteria, the importance values define the acceptable compromises. Underlying each of the criteria is a desirability function that can be used to make more subtle distinctions between property values. An example is shown for blood brain barrier penetration (the log of the ratio between concentrations in brain and blood); the ideal range is between -0.2 and 1 and a compound with a value below this range is worse than a compound with too high a value.

    An alternative approach, employed by StarDrop, is to prioritize compounds with the best balance of properties relative to an ideal profile, as defined by a drug discovery project team (an example is shown in Figure 3). Examples of such methods include the calculation of a score for each compound considering the desirability of its property values or a probabilistic scoring approach that prioritizes compounds with the highest chance of success against the required profile.

    This conjures a utopian vision (at least for some!) of a computer that automatically generates and explores new compound ideas before selecting a small number for synthesis and testing, confident in the knowledge that these will yield a high-quality drug. This would dramatically reduce the time and effort required for drug discovery, and increase the chance of finding a high-quality candidate drug by exploring a very large diversity of possibilities.

    Alas, the realization of such a vision remains remote in practice. This is because the accuracy of predictive methods is insufficient to identify a compound and say with confidence that it will achieve the requirements for a high-quality outcome. The accuracies of predictive models range from a factor of two to ten in prediction of binding affinities or other biological properties. Therefore, while models can be very useful in discarding poor compounds, they can only identify those compounds most likely to meet the ideal criteria. A computer cannot yet design a perfect compound.

    Furthermore, other requirements of a good compound, such as synthetic tractability and novelty from an IP perspective, remain subjective factors that rely on the expertise and creativity of a medicinal chemist. Therefore, the ideal approach combines a chemist’s expertise with the capabilities of a computer. The computer can help to explore a wide range of possibilities and prioritize those most likely to be of interest for detailed consideration, while the expert will make the final decision on the strategy to be adopted.

Related content