Your privacy, your choice

We use essential cookies to make sure the site can function. We also use optional cookies for advertising, personalisation of content, usage analysis, and social media.

By accepting optional cookies, you consent to the processing of your personal data - including transfers to third parties. Some third parties are outside of the European Economic Area, with varying standards of data protection.

See our privacy policy for more information on the use of your personal data.

for further information and to change your choices.

Skip to main content

Integrating logic-based machine learning and virtual screening to discover new drugs

Investigational Novel Drug Discovery by Example (INDDExâ„¢) is a technology developed to guide hit to lead discovery by learning rules from existing active compounds that link activity to chemical substructure. INDDEx is based on Inductive Logic Programming [1], which learns easily interpretable qualitative logic rules from active ligands that give an insight into chemistry, relate molecular substructure to activity, and can be used to guide the next steps of drug design chemistry. Support Vector Machines weight the rules to produce a quantitative model of structure-activity relationships. Whereas earlier testing [2, 3] was performed on single dataset examples, this talk presents the largest and fullest test of the method. The method was benchmarked on the Directory of Useful Decoys (DUD) datasets [4], using the same methodology described in the paper on the assessment of LASSO [5] and DOCK. For each of the DUD datasets, the known active ligands were mixed with all the decoy compounds in DUD, and the retrieval rates of INDDEx and DUD were measured when they were trained on 2, 4, and 8 of the known active ligands (Figure 2). Early retrieved compounds showed high topological differences to molecules used as training data, showing the strength of this method for scaffold hopping. This work was supported by a BBSRC case studentship with Equinox Pharma Ltd (http://www.equinoxpharma.com).

Figure 1
figure 1

Recovery of actives in each of the DUD datasets from all decoys in the DUD, averaged across all 40 datasets.

References

  1. Muggleton SH: Inductive logic programming. New Gen Comp. 1995, 13: 245-286. 10.1007/BF03037227.

    Article  Google Scholar 

  2. Amini A, et al: A Novel Logic-Based Approach for Quantitative Toxicology Prediction. J Chem Inf Model. 2007, 47: 998-1006. 10.1021/ci600223d.

    Article  CAS  Google Scholar 

  3. Cannon EO, et al: Support vector inductive logic programming outperforms the naive Bayes classifier and inductive logic programming for the classification of bioactive chemical compounds. J Comput Aided Mol Des. 2007, 21: 269-280. 10.1007/s10822-007-9113-3.

    Article  CAS  Google Scholar 

  4. Huang N, et al: Benchmarking Sets for Molecular Docking. J Med Chem. 2006, 49: 6789-6801. 10.1021/jm0608356.

    Article  CAS  Google Scholar 

  5. Reid D, et al: LASSO-ligand activity by surface similarity order: a new tool for ligand based virtual screening. J Comput Aided Mol Des. 2008, 22: 479-487. 10.1007/s10822-007-9164-5.

    Article  CAS  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Christopher R Reynolds.

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 2.0 International License (https://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Reprints and permissions

About this article

Cite this article

Reynolds, C.R., Sternberg, M.J. Integrating logic-based machine learning and virtual screening to discover new drugs. J Cheminform 4 (Suppl 1), O10 (2012). https://doi.org/10.1186/1758-2946-4-S1-O10

Download citation

  • Published:

  • DOI: https://doi.org/10.1186/1758-2946-4-S1-O10

Keywords