Full Paper
Visual Interpretation of Kernel-Based Prediction Models
Article first published online: 5 SEP 2011
DOI: 10.1002/minf.201100059
Copyright © 2011 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim
Issue

Molecular Informatics
Special Issue: Charting Chemical Space: „Challenges and Opportunities for Artificial Intelligence and Machine Learning”
Volume 30, Issue 9, pages 817–826, September 2011
Additional Information
How to Cite
Hansen, K., Baehrens, D., Schroeter, T., Rupp , M. and Müller , K.-R. (2011), Visual Interpretation of Kernel-Based Prediction Models. Molecular Informatics, 30: 817–826. doi: 10.1002/minf.201100059
Publication History
- Issue published online: 14 SEP 2011
- Article first published online: 5 SEP 2011
- Manuscript Accepted: 21 JUL 2011
- Manuscript Received: 13 APR 2011
Funded by
- FP7-ICT Programme
- European Community, under the PASCAL2 Network of Excellence
- ICT-216886
- DFG. Grant Number: MU 987/4-2
Keywords:
- Kernel-based learning;
- Confidence estimation;
- Domain of applicability;
- QSAR;
- QSPR
Abstract
Statistical models are frequently used to estimate molecular properties, e.g., to establish quantitative structure-activity and structure-property relationships. For such models, interpretability, knowledge of the domain of applicability, and an estimate of confidence in the predictions are essential. We develop and validate a method for the interpretation of kernel-based prediction models. As a consequence of interpretability, the method helps to assess the domain of applicability of a model, to judge the reliability of a prediction, and to determine relevant molecular features. Increased interpretability also facilitates the acceptance of such models. Our method is based on visualization: For each prediction, the most contributing training samples are computed and visualized. We quantitatively show the effectiveness of our approach by conducting a questionnaire study with 71 participants, resulting in significant improvements of the participants’ ability to distinguish between correct and incorrect predictions of a Gaussian process model for Ames mutagenicity.

1868-1751/asset/2022_left.gif?v=1&s=55861aec609bfeb0bc3c0534a51214d53d9fdc7d)
1868-1751/asset/olbannerright.gif?v=1&s=ecd199dedfd0b2cddec070f1d2c6b8962951f728)
