Get access

The Calculation of Molecular Structural Similarity: Principles and Practice

Authors

  • Peter Willett

    Corresponding author
    1. Information School, University of Sheffield, 211 Portobello Street, Sheffield S1 4DP, UK phone: +44-114-2222633
    • Information School, University of Sheffield, 211 Portobello Street, Sheffield S1 4DP, UK phone: +44-114-2222633===

    Search for more papers by this author

Abstract

Measures of structural similarity play an important role in chemoinformatics for applications such as similarity searching, database clustering and molecular diversity analysis. A similarity measure comprises three components: a structure representation; a weighting scheme; and a similarity coefficient. The paper introduces these components and describes methods for comparing different measures. The use of similarity measures in chemoinformatics research is illustrated by recent projects in the author’s laboratory on: the interactions between a weighting scheme and a similarity coefficient; the design of comparative studies of similarity measures; the use of 2D fingerprints for scaffold-hopping searches; and the registration of orphan drugs for rare diseases.

Get access to the full text of this article

Ancillary