The full text of this article hosted at iucr.org is unavailable due to technical difficulties.

Article
Free Access

Six mathematical gems from the history of distance geometry

Leo Liberti

E-mail address:liberti@lix.polytechnique.fr

CNRS LIX, École Polytechnique, Palaiseau, France

Search for more papers by this author
Carlile Lavor

E-mail address:clavor@ime.unicamp.br

IMECC, University of Campinas, Campinas, SP, Brazil

Search for more papers by this author
First published: 30 April 2015
Cited by: 4

Abstract

This is a partial account of the fascinating history of distance geometry. We make no claim to completeness, but we do promise a dazzling display of beautiful, elementary mathematics. We prove Heron's formula, Cauchy's theorem on the rigidity of polyhedra, Cayley's generalization of Heron's formula to higher dimensions, Menger's characterization of abstract semimetric spaces, a result of Gödel on metric spaces on the sphere, and Schoenberg's equivalence of distance and positive semidefinite matrices, which is at the basis of multidimensional scaling.

1. Introduction

Distance geometry (DG) is the study of geometry with the basic entity being distance (instead of lines, planes, circles, polyhedra, conics, surfaces, and varieties). As did much of mathematics, it all began with the Greeks: specifically Heron, or Hero, of Alexandria, sometime between 150 BC and 250 AD, showed how to compute the area of a triangle given its side lengths.

After a hiatus of almost two thousand years, we reach Arthur Cayley: the first paper of volume I of his Collected Papers, dated 1841, which is about the relationships between the distances of five points in space (Cayley, 1841). The gist of what he showed is that a tetrahedron can only exist in a plane if it is flat (in fact, he discussed the situation in one more dimension). This yields algebraic relations on the side lengths of the tetrahedron.

Hilbert's influence on foundations and axiomatization was very strong in the 1930s Mitteleuropa (Hilbert, 1903). This pushed many people toward axiomatizing existing mathematical theories (Henkin et al., 1959). Karl Menger, a young professor of geometry at the University of Vienna and an attendee of the Vienna Circle, proposed in 1928 a new axiomatization of metric spaces using the concept of distance and the relation of congruence, and, using an extension of Cayley's algebraic machinery (which is now known as Cayley–Menger determinant), generalized Heron's theorem to compute the volume of arbitrary K‐dimensional simplices using their side lengths (Menger, 1928).

The Vienna Circle was a group of philosophers and mathematicians who convened in Vienna's Reichsrat café around the 1930s to discuss philosophy, mathematics, and, presumably, drink coffee. When the meetings became excessively politicized, Menger distanced himself from the Circle, and organized instead a seminar series, which ran from 1929 to 1937 (Menger, 1998). A notable name crops up in the intersection of Menger's geometry students, the Vienna Circle participants, and the speakers at Menger's Kolloquium: Kurt Gödel. Most of the papers Gödel published in the Kolloquium proceedings are about logic and foundations, but two, dated 1933, are about the geometry of distances on spheres and surfaces. The first (Menger, 1998, 18 Feb. 1932, p. 198) answers a question posed at a previous seminar by Laura Klanfer, and shows that a set X of four points in any metric space, congruent to four noncoplanar points in urn:x-wiley:09696016:media:itor12170:itor12170-math-0001, can be realized on the surface of a three‐dimensional sphere using geodesic distances. The second (Menger, 1998, 17 May 1933, p. 252) shows that Cayley's relationship hold locally on certain surfaces that behave locally like Euclidean spaces (Fig. 1).

The pace quickens: in 1935, Isaac Schoenberg published some remarks on a paper (Schoenberg, 1935) by Fréchet on the Annals of Mathematics, and gave, among other things, an algebraic proof of equivalence between Euclidean distance matrices (EDM) and Gram matrices. This is the same result that is nowadays given to show the validity of the classical multidimensional scaling (MDS) technique (Borg and Groenen, 2010, § 12.1).

This brings us to the computer era, where the historical account ends and the contemporary treatment begins. Computers allow the efficient treatment of masses of data, some of which are incomplete and noisy. Many of these data concern, or can be reduced to, distances, and DG techniques are the subject of an application‐oriented renaissance (Mucherino et al., 2013; Liberti et al., 2014). Motivated by the global positioning system, for example, the old geographical concept of trilateration (a system for computing the position of a point given its distances from three known points) makes its way into DG in wireless sensor networks (Eren, 2004). Wüthrich's Nobel Prize for using nuclear magnetic resonance techniques in the study of proteins brings DG to the forefront of structural bioinformatics research (Havel and Wüthrich, 1985). The massive use of robotics in mechanical production lines requires mathematical methods based on DG (Rojas and Thomas, 2013).

DG is also tightly connected with graph rigidity (Graver et al., 1993). This is an abstract mathematical formulation of statics, the study of structures under the action of balanced forces (Maxwell, 1864), which is on the basis of architecture (Varignon, 1725). Rigidity of polyhedra gave rise to a conjecture of Euler's (1736) about closed polyhedral surfaces, which was proved correct only for some polyhedra: strictly convex (Cauchy, 1813), convex and higher dimensional (Alexandrov, 2005), and generic (a polyhedron is generic if no algebraic relations on urn:x-wiley:09696016:media:itor12170:itor12170-math-0002 hold on the components of the vectors that represent its vertices) (Gluck , 1975). It was however disproved in general by means of a very special, nongeneric nonconvex polyhedron (Connelly, 1879).

The rest of this paper will focus on the following results, listed here in a chronological order: Heron's theorem (Section 2.), Euler's conjecture and Cauchy's proof for strictly convex polyhedra (Section 3.), Cayley–Menger determinants (Section 4.), Menger's axiomatization of geometry by means of distances (Section 5.), a result by Gödel's concerning DG on the sphere (Section 6.), and Schoenberg's equivalence (Section 7.) between EDM and positive semidefinite matrices (PSD). There are many more results in DG: this is simply a choice of our favorite results among those we know best.

2. Heron's formula

Heron's formula, which is usually taught at school, relates the area urn:x-wiley:09696016:media:itor12170:itor12170-math-0003 of a triangle to the length of its sides urn:x-wiley:09696016:media:itor12170:itor12170-math-0004 and its semiperimeter urn:x-wiley:09696016:media:itor12170:itor12170-math-0005 as follows:

(1)

There are many ways to prove its validity. Shannon Umberger, a student of the “Foundations of Geometry I” course given at the University of Georgia in the fall of 2000, proposes, as part of his final project (http://jwilson.coe.uga.edu/emt668/emat6680.2000/umberger/MATH7200/HeronFormulaProject/finalproject.html), three detailed proofs: an algebraic one, a geometric one, and a trigonometric one. John Conway and Peter Doyle discuss Heron's formula proofs in a publicly available e‐mail exchange (https://math.dartmouth.edu/∼doyle/docs/heron/heron.txt) from 1997 to 2001.

Our favorite proof is based on complex numbers, and was submitted (http://www.artofproblemsolving.com/Resources/Papers/Heron.pdf) to the “Art of Problem Solving” online school for gifted mathematics students by Miles Edwards (also see http://newsinfo.iu.edu/news/page/normal/13885.html and http://www.jstor.org/stable/10.4169/amer.math.monthly.121.02.149 for more recent career achievements of this gifted student) when he was studying at Lassiter High School in Marietta, Georgia.

Theorem 2.1. ([Heron's formula Heron of Alexandria])Let urn:x-wiley:09696016:media:itor12170:itor12170-math-0007 be the area of a triangle with side lengths urn:x-wiley:09696016:media:itor12170:itor12170-math-0008 and semiperimeter length urn:x-wiley:09696016:media:itor12170:itor12170-math-0009. Then urn:x-wiley:09696016:media:itor12170:itor12170-math-0010.

Proof.(Edwards, 2011) Consider a triangle with sides urn:x-wiley:09696016:media:itor12170:itor12170-math-0011 (opposite to the vertices urn:x-wiley:09696016:media:itor12170:itor12170-math-0012, respectively) and its inscribed circle centered at O with radius r. The perpendiculars from O to the triangle sides split a into urn:x-wiley:09696016:media:itor12170:itor12170-math-0013, b into urn:x-wiley:09696016:media:itor12170:itor12170-math-0014, and c into urn:x-wiley:09696016:media:itor12170:itor12170-math-0015 as shown in Fig. 2. Let urn:x-wiley:09696016:media:itor12170:itor12170-math-0016 be the segments joining O with urn:x-wiley:09696016:media:itor12170:itor12170-math-0017, respectively.

First, we note that urn:x-wiley:09696016:media:itor12170:itor12170-math-0018, which implies urn:x-wiley:09696016:media:itor12170:itor12170-math-0019. Next, the following complex identities are easy to verify geometrically in Fig. 2:

These imply:
where the last step uses Euler's identity urn:x-wiley:09696016:media:itor12170:itor12170-math-0022 (Euler, 1922, I–VIII, § 138–140, p. 148). Since urn:x-wiley:09696016:media:itor12170:itor12170-math-0023 is real, the imaginary part of urn:x-wiley:09696016:media:itor12170:itor12170-math-0024 must be zero. Expanding the product and rearranging terms, we get urn:x-wiley:09696016:media:itor12170:itor12170-math-0025. Solving for r, we have the nonnegative root
(2)
We can write the semiperimeter of the triangle urn:x-wiley:09696016:media:itor12170:itor12170-math-0027 as urn:x-wiley:09696016:media:itor12170:itor12170-math-0028. Moreover,
so urn:x-wiley:09696016:media:itor12170:itor12170-math-0030, which implies that Equation 2 becomes
We now write the area urn:x-wiley:09696016:media:itor12170:itor12170-math-0032 of the triangle urn:x-wiley:09696016:media:itor12170:itor12170-math-0033 by summing it over the areas of the three triangles urn:x-wiley:09696016:media:itor12170:itor12170-math-0034, urn:x-wiley:09696016:media:itor12170:itor12170-math-0035, and urn:x-wiley:09696016:media:itor12170:itor12170-math-0036, which yields
as claimed. urn:x-wiley:09696016:media:itor12170:itor12170-math-0038

We cannot refrain from mentioning Gödel's incompleteness theorem.
Heron's formula: a proof using complex numbers.

3. Euler's conjecture and the rigidity of polyhedra

Consider a square with unit sides, in the plane. One can shrink two opposite angles and correspondingly widen the other two to obtain a rhombus (see Fig. 3), which has the same side lengths but a different shape: no sequence of rotations, translations, or reflections can turn one into the other. In other words, a square is flexible. By contrast, a triangle is not flexible, i.e., it is rigid.

A square is flexed into a rhombus. The set of faces (the edges) are the same, and each maintains pairwise distances through the flexing, i.e., two points on the same edge have the same distance on the left as on the right figure.

Euler conjectured in 1766 (Euler, 1862) that all three‐dimensional polyhedra are rigid. The conjecture appears at the end of the discussion about the problem Invenire duas superficies, quarum alteram in alteram transformare liceat, ita ut in utraque singula puncta homologa easdem inter se teneat distantias, i.e.:

To find two surfaces for which it is possible to transform one into the other, in such a way that corresponding points on either keep the same pairwise distance. (†)

Toward the end of the paper, Euler writes Statim enim atque figura undique est clausa, nullam amplius mutationem patitur, which means “As soon as the shape is everywhere closed, it can no longer be transformed.” Although the wording appears ambiguous by today's standards, scholars of Euler and rigidity agree: what Euler really meant is that 3D polyhedra are rigid (Gluck, 1975).

To better understand this statement, we borrow from Alexandrov (1956) the precise definition of a polyhedron: a family urn:x-wiley:09696016:media:itor12170:itor12170-math-0039 of points, open segments and open triangles is a triangulation if (a) no two elements of urn:x-wiley:09696016:media:itor12170:itor12170-math-0040 have common points, and (b) all sides and vertices of the closure of any triangle of urn:x-wiley:09696016:media:itor12170:itor12170-math-0041, and both extreme points of the closure of any segment of urn:x-wiley:09696016:media:itor12170:itor12170-math-0042 are all in urn:x-wiley:09696016:media:itor12170:itor12170-math-0043 themselves. Note that the definition we are about to give is different from the usual definition employed in convex analysis, i.e.,  a polyhedron is an intersection of half‐spaces; however, a convex polyhedron in the sense given below is the same as a polytope in the sense of convex analysis. Here is the definition: given a triangulation urn:x-wiley:09696016:media:itor12170:itor12170-math-0044 in urn:x-wiley:09696016:media:itor12170:itor12170-math-0045 (where urn:x-wiley:09696016:media:itor12170:itor12170-math-0046), the union of all points of urn:x-wiley:09696016:media:itor12170:itor12170-math-0047 and all the points belonging to the segments and triangles of urn:x-wiley:09696016:media:itor12170:itor12170-math-0048 is called a polyhedron. Note that several triangular faces can belong to the same affine space, thereby forming polygonal faces.

Each polyhedron has an incidence structure of points on segments and segments on polygonal (not necessarily triangular) faces, which induces a partial order (p.o.) based on set inclusion. For example, the closure of the square urn:x-wiley:09696016:media:itor12170:itor12170-math-0049 contains the closures of the segments urn:x-wiley:09696016:media:itor12170:itor12170-math-0050, urn:x-wiley:09696016:media:itor12170:itor12170-math-0051, urn:x-wiley:09696016:media:itor12170:itor12170-math-0052, urn:x-wiley:09696016:media:itor12170:itor12170-math-0053, each of which contains the corresponding adjacent points urn:x-wiley:09696016:media:itor12170:itor12170-math-0054, urn:x-wiley:09696016:media:itor12170:itor12170-math-0055, urn:x-wiley:09696016:media:itor12170:itor12170-math-0056, urn:x-wiley:09696016:media:itor12170:itor12170-math-0057. Accordingly, the p.o. is urn:x-wiley:09696016:media:itor12170:itor12170-math-0058; urn:x-wiley:09696016:media:itor12170:itor12170-math-0059; urn:x-wiley:09696016:media:itor12170:itor12170-math-0060; urn:x-wiley:09696016:media:itor12170:itor12170-math-0061; urn:x-wiley:09696016:media:itor12170:itor12170-math-0062. Since this p.o. has a bottom element (the empty set) and a top element (the whole polyhedron), it is a lattice. A lattice isomorphism is a bijective mapping between two lattices, which preserves the p.o. Two polyhedra urn:x-wiley:09696016:media:itor12170:itor12170-math-0063 are combinatorially equivalent if their triangulations are lattice isomorphic. If, moreover, all the lattice isomorphic polygonal faces of urn:x-wiley:09696016:media:itor12170:itor12170-math-0064 are exactly equal, the polyhedra are said to be facewise equal.

Under the above definition, nothing prevents a polyhedron from being nonconvex (see Fig. 4). It is known that every closed surface, independently of the convexity of its interior, is homeomorphic (intuitively: smoothly deformable in) to some polyhedron (again Alexandrov, 1956, § 2.2). This is why we can replace “closed shape” with “polyhedron” in Euler's conjecture.

A nonconvex polyhedron.

The “rigidity” implicit in Euler's conjecture should be taken to mean that no point of the polyhedron can undergo a continuous motion under the constraint that the shape be the same at each point of the motion. As for the concept of “shape,” it is linked to that of distance, as appears clear from (†). The following is therefore a formal restatement of Euler's conjecture: two combinatorially equivalent facewise equal polyhedra must be isometric under the Euclidean distance, i.e., each pair of points in one polyhedron is equidistant with the corresponding pair in the other.

A natural question about the Euler conjecture stems from generalizing the example in Fig. 3 to 3D (see Fig. 5). Does this not disprove the conjecture?

A cube can be transformed into a rhomboid, but the set of faces is not the same anymore (accordingly, corresponding point pairs may not preserve their distance, as shown).

The answer is no: all the polygonal faces in the cube are squares, but this does not hold in the rhomboid. The question is more complicated than it looks at first sight, which is why it took 211 years to disprove it.

3.1. Strictly convex polyhedra: Cauchy's proof

Although Euler's conjecture is false in general, it is true for many important subclasses of polyhedra. Cauchy proved it true for strictly convex polyhedra (in fact Cauchy's proof contained two mistakes, corrected by Steinitz (Lyusternik, 1966, p. 67) and Lebesgue). There are many accounts of Cauchy's proof: Cauchy's original text, still readable today (Cauchy, 1813); Alexandrov's book (Alexandrov, 2005), Lyusternik's book (Lyusternik, 1966, § 20), Stoker's paper (Stoker, 1968), Connelly's chapter (Connelly, 1993) just to name a few. Here we follow the treatment given by Pak (2010).

We consider two combinatorially equivalent, facewise equal strictly convex polyhedra urn:x-wiley:09696016:media:itor12170:itor12170-math-0065, and aim to show that P and Q are isometric.

For a polyhedron P we consider its associated graph urn:x-wiley:09696016:media:itor12170:itor12170-math-0066, where V are the points of P and E its segments. Note that urn:x-wiley:09696016:media:itor12170:itor12170-math-0067 only depends on the incidence structure of the polygonal faces, segments, and points of P. Since urn:x-wiley:09696016:media:itor12170:itor12170-math-0068 are combinatorially equivalent, urn:x-wiley:09696016:media:itor12170:itor12170-math-0069. Consider the dihedral angles (i.e., the angle smaller than π formed by two half‐planes in urn:x-wiley:09696016:media:itor12170:itor12170-math-0070 intersecting on a line L) urn:x-wiley:09696016:media:itor12170:itor12170-math-0071 (resp., urn:x-wiley:09696016:media:itor12170:itor12170-math-0072) in P (resp., Q) induced by the line containing the segment corresponding to the edge urn:x-wiley:09696016:media:itor12170:itor12170-math-0073. We assign to each edge urn:x-wiley:09696016:media:itor12170:itor12170-math-0074 a label urn:x-wiley:09696016:media:itor12170:itor12170-math-0075 (so urn:x-wiley:09696016:media:itor12170:itor12170-math-0076), and consider, for each urn:x-wiley:09696016:media:itor12170:itor12170-math-0077, the edge sequence urn:x-wiley:09696016:media:itor12170:itor12170-math-0078, where urn:x-wiley:09696016:media:itor12170:itor12170-math-0079 is the set of nodes adjacent to v. The order of the edges in urn:x-wiley:09696016:media:itor12170:itor12170-math-0080 is given by any circuit around the polygon urn:x-wiley:09696016:media:itor12170:itor12170-math-0081 obtained by intersecting P with a plane γ that separates v from the other vertices in V (this is possible by strict convexity, see Fig. 6).

The plane γ separating v from the other vertices in the strictly convex polyhedron P, and the intersection polygon urn:x-wiley:09696016:media:itor12170:itor12170-math-0082 defined by urn:x-wiley:09696016:media:itor12170:itor12170-math-0083. The line L (lying in urn:x-wiley:09696016:media:itor12170:itor12170-math-0084) separates the +1 and −1 labels applied to the points urn:x-wiley:09696016:media:itor12170:itor12170-math-0085 of intersections between the edges of P and urn:x-wiley:09696016:media:itor12170:itor12170-math-0086.

It is easy to see in Fig. 6 that every edge urn:x-wiley:09696016:media:itor12170:itor12170-math-0087 corresponds to a vertex of urn:x-wiley:09696016:media:itor12170:itor12170-math-0088. Therefore, a circuit over urn:x-wiley:09696016:media:itor12170:itor12170-math-0089 defines an order over urn:x-wiley:09696016:media:itor12170:itor12170-math-0090. We also assume that this order is periodic, i.e., its last element precedes the first one. Any such sequence urn:x-wiley:09696016:media:itor12170:itor12170-math-0091 naturally induces a sign sequence urn:x-wiley:09696016:media:itor12170:itor12170-math-0092; we let urn:x-wiley:09696016:media:itor12170:itor12170-math-0093 be the sequence urn:x-wiley:09696016:media:itor12170:itor12170-math-0094 without the zeros, and we count the number urn:x-wiley:09696016:media:itor12170:itor12170-math-0095 of sign changes in urn:x-wiley:09696016:media:itor12170:itor12170-math-0096, including the sign change occurring between the last and first elements.

Lemma 3.1.For all urn:x-wiley:09696016:media:itor12170:itor12170-math-0097, urn:x-wiley:09696016:media:itor12170:itor12170-math-0098 is even.

Proof.Suppose urn:x-wiley:09696016:media:itor12170:itor12170-math-0099 is odd, and proceed by induction on urn:x-wiley:09696016:media:itor12170:itor12170-math-0100: if urn:x-wiley:09696016:media:itor12170:itor12170-math-0101, then there is only one sign change. So, the first edge urn:x-wiley:09696016:media:itor12170:itor12170-math-0102 in urn:x-wiley:09696016:media:itor12170:itor12170-math-0103 to be labeled with urn:x-wiley:09696016:media:itor12170:itor12170-math-0104 has the property that, going around the periodic sequence with only one sign change, urn:x-wiley:09696016:media:itor12170:itor12170-math-0105 is also labeled with urn:x-wiley:09696016:media:itor12170:itor12170-math-0106, which yields urn:x-wiley:09696016:media:itor12170:itor12170-math-0107, a contradiction. A trivial induction step yields the same contradiction for all odd urn:x-wiley:09696016:media:itor12170:itor12170-math-0108. urn:x-wiley:09696016:media:itor12170:itor12170-math-0109

We now state a fundamental technical lemma, and provide what is essentially Cauchy's proof, rephrased as in Lemma 2 in § 20 of Lyusternik (1966).

Lemma 3.2.If P is strictly convex, then for each urn:x-wiley:09696016:media:itor12170:itor12170-math-0110 we have either urn:x-wiley:09696016:media:itor12170:itor12170-math-0111 or urn:x-wiley:09696016:media:itor12170:itor12170-math-0112.

Proof.By Lemma 3.1, for each urn:x-wiley:09696016:media:itor12170:itor12170-math-0113 we have urn:x-wiley:09696016:media:itor12170:itor12170-math-0114, so we aim to show that urn:x-wiley:09696016:media:itor12170:itor12170-math-0115. Suppose, to get a contradiction, that urn:x-wiley:09696016:media:itor12170:itor12170-math-0116, and consider the polygon urn:x-wiley:09696016:media:itor12170:itor12170-math-0117 as in Fig. 6. By the correspondence between edges in urn:x-wiley:09696016:media:itor12170:itor12170-math-0118 and vertices of urn:x-wiley:09696016:media:itor12170:itor12170-math-0119, the labels urn:x-wiley:09696016:media:itor12170:itor12170-math-0120 are vertex labels in urn:x-wiley:09696016:media:itor12170:itor12170-math-0121. Since there are only two sign changes, the sequence of vertex labels can be partitioned in two contiguous sets of +1 and −1 (possibly interspersed by zeros). By convexity, there exists a line L separating the +1 and the −1 vertices (see Fig. 6). Since all of the angles marked +1 strictly increase, the segment urn:x-wiley:09696016:media:itor12170:itor12170-math-0122 also strictly increases (this statement was also proved in Cauchy's paper (Cauchy, 1813), but this proof contained a serious flaw, later corrected by Steinitz); but, at the same time, all of the angles marked −1 strictly decrease, so the segment urn:x-wiley:09696016:media:itor12170:itor12170-math-0123 also strictly decreases, which means that the same segment urn:x-wiley:09696016:media:itor12170:itor12170-math-0124 both strictly increases and decreases, which is a contradiction (see Fig. 7). urn:x-wiley:09696016:media:itor12170:itor12170-math-0125

Visual representation of the contradiction in the proof of Lemma 7. The angles at all vertices labeled +1 increase their magnitude, and those at −1 decrease: it follows that urn:x-wiley:09696016:media:itor12170:itor12170-math-0126 both increases and decreases its length, a contradiction.

Theorem 3.3. ([Cauchy's Theorem Cauchy, 1813])If two closed convex polyhedra urn:x-wiley:09696016:media:itor12170:itor12170-math-0127 are combinatorially equivalent and facewise equal, they are isometric.

We only present the proof of the base case where

(3)
and urn:x-wiley:09696016:media:itor12170:itor12170-math-0129 is a connected graph, and refer the reader to p. 251 of Pak (2010) for the other cases (which are mostly variations of the ideas given in the proof below).

Proof.If urn:x-wiley:09696016:media:itor12170:itor12170-math-0130 for all urn:x-wiley:09696016:media:itor12170:itor12170-math-0131, it means that all of the dihedral angles in P are equals to those of Q, which implies isometry. So we assume the alternative w.r.t. Equation 3 above: urn:x-wiley:09696016:media:itor12170:itor12170-math-0132, and aim for a contradiction. Let urn:x-wiley:09696016:media:itor12170:itor12170-math-0133: by Lemma 3.2 and because urn:x-wiley:09696016:media:itor12170:itor12170-math-0134 for each v, we have urn:x-wiley:09696016:media:itor12170:itor12170-math-0135, a lower bound for M. We now construct a contradicting upper bound for M. For every urn:x-wiley:09696016:media:itor12170:itor12170-math-0136, we let urn:x-wiley:09696016:media:itor12170:itor12170-math-0137 be the number of polygonal faces of P with h sides (or edges). The total number of polygonal faces in P (or Q) is urn:x-wiley:09696016:media:itor12170:itor12170-math-0138, and the total number of edges is therefore urn:x-wiley:09696016:media:itor12170:itor12170-math-0139 (we divide by 2 since each edge is counted twice in the sum—one per adjacent face—given that urn:x-wiley:09696016:media:itor12170:itor12170-math-0140 are closed). A simple term by term comparison of urn:x-wiley:09696016:media:itor12170:itor12170-math-0141 and urn:x-wiley:09696016:media:itor12170:itor12170-math-0142 yields urn:x-wiley:09696016:media:itor12170:itor12170-math-0143. Since each polygonal face f of P is itself closed, the number urn:x-wiley:09696016:media:itor12170:itor12170-math-0144 of sign changes of the quantities urn:x-wiley:09696016:media:itor12170:itor12170-math-0145 over all edges urn:x-wiley:09696016:media:itor12170:itor12170-math-0146 adjacent to the face f is even, by the same argument given in Lemma 3.1. It follows that if the number h of edges adjacent to the face f is even, then urn:x-wiley:09696016:media:itor12170:itor12170-math-0147, and urn:x-wiley:09696016:media:itor12170:itor12170-math-0148 if h is odd. This allows us to compute an upper bound on M:

The middle step follows by simply increasing each coefficient. The last step is based on Euler's characteristic (Euler, 1843): urn:x-wiley:09696016:media:itor12170:itor12170-math-0150. Hence we have urn:x-wiley:09696016:media:itor12170:itor12170-math-0151, which is a contradiction.

3.2. Euler was wrong: Connelly's counterexample

Proofs behind counterexamples can rarely be termed “beautiful” since they usually lack generality (as they are applied to one particular example). Counterexamples can nonetheless be dazzling by themselves. Connelly's counterexample (Connelly, 1879) to the Euler's conjecture consists in a very special nongeneric nonconvex polyhedron that flexes, while keeping combinatorial equivalence and facewise equality with all polyhedra in the flex. Some years later, Klaus Steffen produced a much simpler polyhedron with the same properties (see http://demonstrations.wolfram.com/SteffensFlexiblePolyhedron/). It is this polyhedron we exhibit in Fig. 8.

Steffen's polyhedron: the flex (these two images were obtained as snapshot from the Mathematica (2014) demonstration cited on page 7). There is a rotation, in the direction showed by the arrows, around the edge, which is emphasized on the right picture. The short upper right edge only appears shorter on the right because of perspective.

4. Cayley–Menger determinants and the simplex volume

The foundation of modern DG, as investigated by Menger (1931) and Blumenthal (1953), rests on the fact that:

the four‐dimensional volume of a four‐dimensional simplex embedded in three‐dimensional space is zero, (*)

which we could also informally state as “flat simplices have zero volume.” This is related to DG because the volume of a simplex can be expressed in terms of the lengths of the simplex sides, which yields a polynomial in the length of the simplex side lengths that can be equated to zero. If these lengths are expressed in function of the vertex positions as urn:x-wiley:09696016:media:itor12170:itor12170-math-0152, this yields a polynomial equation in the positions urn:x-wiley:09696016:media:itor12170:itor12170-math-0153 of the simplex vertices in terms of its side lengths. Thus, if we know the positions of urn:x-wiley:09696016:media:itor12170:itor12170-math-0154, we can compute the unknown position of x5 or prove that no such position exists, through a process called trilateration (Lavor et al., 2012).

The proof of (*) was published by Arthur Cayley in 1841, during his undergraduate studies. It is based on the following well‐known lemma about determinants (stated without proof in Cayley's paper).

Lemma 4.1.If urn:x-wiley:09696016:media:itor12170:itor12170-math-0155 are square matrices having the same size, urn:x-wiley:09696016:media:itor12170:itor12170-math-0156.

Theorem 4.2. ([Cayley, 1841])Given five points urn:x-wiley:09696016:media:itor12170:itor12170-math-0157 all belonging to an affine 3D subspace of urn:x-wiley:09696016:media:itor12170:itor12170-math-0158, let urn:x-wiley:09696016:media:itor12170:itor12170-math-0159 for each urn:x-wiley:09696016:media:itor12170:itor12170-math-0160. Then

(4)

We note that Cayley's theorem is expressed for urn:x-wiley:09696016:media:itor12170:itor12170-math-0162 points in urn:x-wiley:09696016:media:itor12170:itor12170-math-0163, but it also holds for any urn:x-wiley:09696016:media:itor12170:itor12170-math-0164 points in urn:x-wiley:09696016:media:itor12170:itor12170-math-0165 (Blumenthal, 1953). Cayley explicitly remarks that it holds for the cases urn:x-wiley:09696016:media:itor12170:itor12170-math-0166 and urn:x-wiley:09696016:media:itor12170:itor12170-math-0167 (see VIII, § 5 of Sommerville (1958) for the proof of general n). The determinant on the right‐hand side of Equation 4 is called Cayley–Menger determinant, denoted by Δ. We remark that in the proof below urn:x-wiley:09696016:media:itor12170:itor12170-math-0168 is the kth component of urn:x-wiley:09696016:media:itor12170:itor12170-math-0169, for each urn:x-wiley:09696016:media:itor12170:itor12170-math-0170.

Proof.We follow Cayley's treatment. He pulls the following two matrices

out of a magic hat. He performs the product urn:x-wiley:09696016:media:itor12170:itor12170-math-0172, rearranging and collecting terms, and obtains a 6 × 6 matrix where the last row and column are (1, 1, 1, 1, 1, 0), and the urn:x-wiley:09696016:media:itor12170:itor12170-math-0173‐th component is urn:x-wiley:09696016:media:itor12170:itor12170-math-0174 for every urn:x-wiley:09696016:media:itor12170:itor12170-math-0175. To see this, it suffices to carry out the computations using Mathematica (2014); by way of an example, the first diagonal component of urn:x-wiley:09696016:media:itor12170:itor12170-math-0176 is urn:x-wiley:09696016:media:itor12170:itor12170-math-0177, and the component on the first row, second column of urn:x-wiley:09696016:media:itor12170:itor12170-math-0178 is urn:x-wiley:09696016:media:itor12170:itor12170-math-0179. In other words, urn:x-wiley:09696016:media:itor12170:itor12170-math-0180 is the Cayley–Menger determinant in Equation 4. On the other hand, if we set urn:x-wiley:09696016:media:itor12170:itor12170-math-0181 for each urn:x-wiley:09696016:media:itor12170:itor12170-math-0182, effectively projecting the five four‐dimensional points in three‐dimensional space, it is easy to show that urn:x-wiley:09696016:media:itor12170:itor12170-math-0183 since the fifth columns of both A and the fifth row of B are zero. Hence we have urn:x-wiley:09696016:media:itor12170:itor12170-math-0184 by Lemma 4.1, and urn:x-wiley:09696016:media:itor12170:itor12170-math-0185 is precisely Equation 4 as claimed. urn:x-wiley:09696016:media:itor12170:itor12170-math-0186

The missing link is the relationship of the Cayley–Menger determinant with the volume of an n‐simplex. Since this is not part of Cayley's paper, we only establish the relationship for urn:x-wiley:09696016:media:itor12170:itor12170-math-0187. Let urn:x-wiley:09696016:media:itor12170:itor12170-math-0188, urn:x-wiley:09696016:media:itor12170:itor12170-math-0189, urn:x-wiley:09696016:media:itor12170:itor12170-math-0190. Then

where urn:x-wiley:09696016:media:itor12170:itor12170-math-0192 (this identity can be established by using, e.g., Mathematica, 2014). By Heron's theorem (Theorem 2.1 above) we know that the area of a triangle with side lengths urn:x-wiley:09696016:media:itor12170:itor12170-math-0193 is urn:x-wiley:09696016:media:itor12170:itor12170-math-0194. So, for urn:x-wiley:09696016:media:itor12170:itor12170-math-0195, the determinant on the left‐hand side is proportional to the negative of the square of the triangle area. This result can be generalized to every value of n (Blumenthal, 1953, II, § 40, p. 98): it turns out that the n‐dimensional volume urn:x-wiley:09696016:media:itor12170:itor12170-math-0196 of an n‐simplex in urn:x-wiley:09696016:media:itor12170:itor12170-math-0197 with side length matrix urn:x-wiley:09696016:media:itor12170:itor12170-math-0198 is

The beauty of Cayley's proof is in its extreme compactness: it uses determinants to hide all the details of elimination theory that would be necessary otherwise. His paper also shows some of these details for the simplest case urn:x-wiley:09696016:media:itor12170:itor12170-math-0200. The starting equations, as well as the symbolic manipulation steps, depend on n. Although Cayley's proof is only given for urn:x-wiley:09696016:media:itor12170:itor12170-math-0201, Cayley's treatment goes through essentially unchanged for any number n of points in dimension urn:x-wiley:09696016:media:itor12170:itor12170-math-0202.

5. Menger's characterization of abstract metric spaces

At a time where mathematicians were heeding Hilbert's call to formalization and axiomatization, Menger presented new axioms for geometry based on the notion of distance, and provided conditions for arbitrary sets to “look like” Euclidean spaces, at least distancewise (Menger, 1928, 1931). Menger's system allows a formal treatment of geometry based on distances as “internal coordinates.” The starting point is to consider the relations of geometrical figures having proportional distances between pairs of corresponding points, i.e., congruence. Menger's definition of a congruence system is defined axiomatically, and the resulting characterization of abstract distance spaces with respect to subsets of Euclidean spaces (possibly his most important result) transforms a possibly infinite verification procedure (any subset of any number of points) into a finitistic one (any subset of urn:x-wiley:09696016:media:itor12170:itor12170-math-0203 points, where n is the dimension of the Euclidean space).

It is remarkable that almost none of the results mentioned next offers an intuitive geometrical grasp, such as the proofs of Heron's formula and Cayley's theorem do. As formal mathematics has it, part of the beauty in Menger's work consists in turning the “visual” geometrical proofs based on intuition into formal symbolic arguments based on sets and relations. On the other hand, Menger himself gave a geometric intuition of his results on p. 335 of Menger (1935), which we comment in Section 5.4..

5.1. Menger's axioms

Let urn:x-wiley:09696016:media:itor12170:itor12170-math-0204 be a system of sets, and for any set urn:x-wiley:09696016:media:itor12170:itor12170-math-0205 and any two (not necessarily distinct) points urn:x-wiley:09696016:media:itor12170:itor12170-math-0206, denote the couple urn:x-wiley:09696016:media:itor12170:itor12170-math-0207 by urn:x-wiley:09696016:media:itor12170:itor12170-math-0208. Menger defines a relation ≈ by means of the following axioms.

  • 1. urn:x-wiley:09696016:media:itor12170:itor12170-math-0209, urn:x-wiley:09696016:media:itor12170:itor12170-math-0210, and urn:x-wiley:09696016:media:itor12170:itor12170-math-0211, we have either urn:x-wiley:09696016:media:itor12170:itor12170-math-0212 or urn:x-wiley:09696016:media:itor12170:itor12170-math-0213 but not both.
  • 2. urn:x-wiley:09696016:media:itor12170:itor12170-math-0214 and urn:x-wiley:09696016:media:itor12170:itor12170-math-0215 we have urn:x-wiley:09696016:media:itor12170:itor12170-math-0216.
  • 3. urn:x-wiley:09696016:media:itor12170:itor12170-math-0217, urn:x-wiley:09696016:media:itor12170:itor12170-math-0218, and urn:x-wiley:09696016:media:itor12170:itor12170-math-0219, we have urn:x-wiley:09696016:media:itor12170:itor12170-math-0220 if and only if urn:x-wiley:09696016:media:itor12170:itor12170-math-0221.
  • 4. urn:x-wiley:09696016:media:itor12170:itor12170-math-0222, urn:x-wiley:09696016:media:itor12170:itor12170-math-0223, and urn:x-wiley:09696016:media:itor12170:itor12170-math-0224, if urn:x-wiley:09696016:media:itor12170:itor12170-math-0225 then urn:x-wiley:09696016:media:itor12170:itor12170-math-0226.
  • 5. urn:x-wiley:09696016:media:itor12170:itor12170-math-0227, urn:x-wiley:09696016:media:itor12170:itor12170-math-0228, urn:x-wiley:09696016:media:itor12170:itor12170-math-0229, and urn:x-wiley:09696016:media:itor12170:itor12170-math-0230, if urn:x-wiley:09696016:media:itor12170:itor12170-math-0231 and urn:x-wiley:09696016:media:itor12170:itor12170-math-0232 then urn:x-wiley:09696016:media:itor12170:itor12170-math-0233.

The couple urn:x-wiley:09696016:media:itor12170:itor12170-math-0234 is called a congruence system, and the ≈ relation is called congruence.

Today, we think of relations as defined on a single set. We remark that, in Menger's treatment, congruence is a binary relation defined on sets of ordered pairs of points, where each point in each pair belongs to the same set as the other, yet left‐hand and right‐hand side terms may belong to different sets. We now interpret each axiom from a more contemporary point of view.

  • 1. Axiom 1 states that Menger's congruence relation is in fact a partial relation on urn:x-wiley:09696016:media:itor12170:itor12170-math-0235 (the Cartesian product of the union of all sets urn:x-wiley:09696016:media:itor12170:itor12170-math-0236 by itself), which is only defined for a couple urn:x-wiley:09696016:media:itor12170:itor12170-math-0237 whenever urn:x-wiley:09696016:media:itor12170:itor12170-math-0238 such that urn:x-wiley:09696016:media:itor12170:itor12170-math-0239.
  • 2. By Axiom 2, the ≈ relation acts on sets of unordered pairs of (not necessarily distinct) points; we call urn:x-wiley:09696016:media:itor12170:itor12170-math-0240 the set of all unordered pairs of points from all sets urn:x-wiley:09696016:media:itor12170:itor12170-math-0241.
  • 3. By Axiom 3, urn:x-wiley:09696016:media:itor12170:itor12170-math-0242 is congruent to a pair urn:x-wiley:09696016:media:itor12170:itor12170-math-0243 where urn:x-wiley:09696016:media:itor12170:itor12170-math-0244 if and only if urn:x-wiley:09696016:media:itor12170:itor12170-math-0245.
  • 4. Axiom 4 states that ≈ is a symmetric relation.
  • 5. Axiom states that ≈ is a transitive relation.

Note that ≈ is also reflexive (i.e., urn:x-wiley:09696016:media:itor12170:itor12170-math-0246) since urn:x-wiley:09696016:media:itor12170:itor12170-math-0247 by two successive applications of Section 5.1.. So, using today's terminology, ≈ is an equivalence relation defined on a subset of urn:x-wiley:09696016:media:itor12170:itor12170-math-0248.

5.2. A model for the axioms

Menger's model for his axioms is a semimetric space S, i.e., a set S of points such that to each unordered pair urn:x-wiley:09696016:media:itor12170:itor12170-math-0249 of points in S we assign a nonnegative real number urn:x-wiley:09696016:media:itor12170:itor12170-math-0250, which we call distance between p and q. Under this interpretation, Axiom 2 tells us that urn:x-wiley:09696016:media:itor12170:itor12170-math-0251 for each pair of points urn:x-wiley:09696016:media:itor12170:itor12170-math-0252, and Axiom 3 tells us that urn:x-wiley:09696016:media:itor12170:itor12170-math-0253 is congruent to a single point if and only if urn:x-wiley:09696016:media:itor12170:itor12170-math-0254, which, together with non‐negativity, are the defining properties of semimetrics (the remaining property, the triangular inequality, tells semimetrics apart from metrics). Thus, the set urn:x-wiley:09696016:media:itor12170:itor12170-math-0255 of all semimetric spaces together with the relation given by urn:x-wiley:09696016:media:itor12170:itor12170-math-0256 is a congruence system.

5.3. A finitistic characterization of semimetric spaces

Two sets urn:x-wiley:09696016:media:itor12170:itor12170-math-0257 are congruent if there is a map (called congruence map) urn:x-wiley:09696016:media:itor12170:itor12170-math-0258, such that urn:x-wiley:09696016:media:itor12170:itor12170-math-0259 for all urn:x-wiley:09696016:media:itor12170:itor12170-math-0260. We denote this relation by urn:x-wiley:09696016:media:itor12170:itor12170-math-0261, dropping the ϕ if it is clear from the context.

Lemma 5.1.Any congruence map urn:x-wiley:09696016:media:itor12170:itor12170-math-0262 is injective.

Proof.Suppose, to get a contradiction, that urn:x-wiley:09696016:media:itor12170:itor12170-math-0263 with urn:x-wiley:09696016:media:itor12170:itor12170-math-0264 and urn:x-wiley:09696016:media:itor12170:itor12170-math-0265: then urn:x-wiley:09696016:media:itor12170:itor12170-math-0266 and so, by Section 5.1., urn:x-wiley:09696016:media:itor12170:itor12170-math-0267 against assumption. urn:x-wiley:09696016:media:itor12170:itor12170-math-0268

If S is congruent to a subset of T, then we say that S is congruently embeddable in T.

5.3.1. Congruence order

Now consider a set urn:x-wiley:09696016:media:itor12170:itor12170-math-0269 and an integer urn:x-wiley:09696016:media:itor12170:itor12170-math-0270 with the following property: for any urn:x-wiley:09696016:media:itor12170:itor12170-math-0271, if all n‐point subsets of T are congruent to an n‐point subset of S, then T is congruently embeddable in S. If this property holds, then S is said to have congruence order n. Formally, the property is written as follows:

(5)

If urn:x-wiley:09696016:media:itor12170:itor12170-math-0273 for some positive integer n, then S can have congruence order n, since the definition is vacuously satisfied. So we assume in the following that urn:x-wiley:09696016:media:itor12170:itor12170-math-0274.

Proposition 5.2.If S has congruence order n in urn:x-wiley:09696016:media:itor12170:itor12170-math-0275, then it also has congruence order m for each urn:x-wiley:09696016:media:itor12170:itor12170-math-0276.

Proof.By hypothesis, for every urn:x-wiley:09696016:media:itor12170:itor12170-math-0277, if every n‐point subset urn:x-wiley:09696016:media:itor12170:itor12170-math-0278 of T is congruent to an n‐point subset of S, then there is a subset R of S such that urn:x-wiley:09696016:media:itor12170:itor12170-math-0279. Now any m‐point subset of S is mapped by ϕ to a congruent m‐point subset of S, and again urn:x-wiley:09696016:media:itor12170:itor12170-math-0280, so Equation 5 is satisfied for S and m. urn:x-wiley:09696016:media:itor12170:itor12170-math-0281

In view of Proposition 5.2, it becomes important to find the minimum congruence order of a given metric space.

Proposition 5.3.urn:x-wiley:09696016:media:itor12170:itor12170-math-0282 (i.e., the Euclidean space that simply consists of the origin) has minimum congruence order 2 in urn:x-wiley:09696016:media:itor12170:itor12170-math-0283.

Proof.Pick any urn:x-wiley:09696016:media:itor12170:itor12170-math-0284 with urn:x-wiley:09696016:media:itor12170:itor12170-math-0285. None of its two‐point subsets is congruent to any two‐point subset of urn:x-wiley:09696016:media:itor12170:itor12170-math-0286, since none exists. Moreover, T itself cannot be congruently embedded in urn:x-wiley:09696016:media:itor12170:itor12170-math-0287, since urn:x-wiley:09696016:media:itor12170:itor12170-math-0288 and no injective congruence map can be defined, against Lemma 5.1. So the integer 2 certainly (vacuously) satisfies Equation 5 for urn:x-wiley:09696016:media:itor12170:itor12170-math-0289, which means that urn:x-wiley:09696016:media:itor12170:itor12170-math-0290 has congruence order 2. In view of Proposition 5.2, it also has congruence order m for each urn:x-wiley:09696016:media:itor12170:itor12170-math-0291. Hence we have to show next that the integer 1 cannot be a congruence order for urn:x-wiley:09696016:media:itor12170:itor12170-math-0292. To reach a contradiction, suppose the contrary, and let T be as above. By Axiom 3, every singleton subset of T is congruent to a subset of urn:x-wiley:09696016:media:itor12170:itor12170-math-0293, namely the subset containing the origin. Thus, by Equation 5, T must be congruent to a subset of urn:x-wiley:09696016:media:itor12170:itor12170-math-0294; but, again, urn:x-wiley:09696016:media:itor12170:itor12170-math-0295 contradicts Lemma 5.1: so T cannot be congruently embedded in urn:x-wiley:09696016:media:itor12170:itor12170-math-0296, which negates Equation 5. Hence the integer 1 cannot be a congruence order for urn:x-wiley:09696016:media:itor12170:itor12170-math-0297, as claimed. urn:x-wiley:09696016:media:itor12170:itor12170-math-0298

5.3.2. Menger's fundamental result

The fundamental result proved by Menger (1928) is that the Euclidean space urn:x-wiley:09696016:media:itor12170:itor12170-math-0299 has congruence order urn:x-wiley:09696016:media:itor12170:itor12170-math-0300 but not urn:x-wiley:09696016:media:itor12170:itor12170-math-0301 for each urn:x-wiley:09696016:media:itor12170:itor12170-math-0302 in the family urn:x-wiley:09696016:media:itor12170:itor12170-math-0303 of all semimetric spaces. The important implication of Menger's result is that in order to verify whether an abstract semimetric space is congruent to a subset of a Euclidean space, we only need to verify congruence of each of its urn:x-wiley:09696016:media:itor12170:itor12170-math-0304 point subsets.

We follow Blumenthal's treatment (Blumenthal, 1953), based on the following preliminary definitions and properties, which we will not prove:

  • 1. A congruent mapping of a semimetric space onto itself is called a motion;
  • 2. urn:x-wiley:09696016:media:itor12170:itor12170-math-0305 points in urn:x-wiley:09696016:media:itor12170:itor12170-math-0306 are independent if they are not affinely dependent (i.e., if they do not all belong to a single hyperplane in urn:x-wiley:09696016:media:itor12170:itor12170-math-0307);
  • 3. two congruent urn:x-wiley:09696016:media:itor12170:itor12170-math-0308‐point subsets of urn:x-wiley:09696016:media:itor12170:itor12170-math-0309 are either both independent or both dependent;
  • 4. there is at most one point of urn:x-wiley:09696016:media:itor12170:itor12170-math-0310 with given distances from an independent urn:x-wiley:09696016:media:itor12170:itor12170-math-0311‐point subset;
  • 5. any congruence between any two subsets of urn:x-wiley:09696016:media:itor12170:itor12170-math-0312 can be extended to a motion;
  • 6. any congruence between any two independent urn:x-wiley:09696016:media:itor12170:itor12170-math-0313‐point subsets of urn:x-wiley:09696016:media:itor12170:itor12170-math-0314 can be extended to a unique motion.

Theorem 5.4. ([Menger, 1928])A nonempty semimetric space S is congruently embeddable in urn:x-wiley:09696016:media:itor12170:itor12170-math-0315 (but not in any urn:x-wiley:09696016:media:itor12170:itor12170-math-0316 for urn:x-wiley:09696016:media:itor12170:itor12170-math-0317) if and only if: (a) S contains an urn:x-wiley:09696016:media:itor12170:itor12170-math-0318‐point subset urn:x-wiley:09696016:media:itor12170:itor12170-math-0319, which is congruent with an independent urn:x-wiley:09696016:media:itor12170:itor12170-math-0320‐point subset of urn:x-wiley:09696016:media:itor12170:itor12170-math-0321; and (b) each urn:x-wiley:09696016:media:itor12170:itor12170-math-0322‐point subset U of S containing urn:x-wiley:09696016:media:itor12170:itor12170-math-0323 is congruent to an urn:x-wiley:09696016:media:itor12170:itor12170-math-0324‐point subset of urn:x-wiley:09696016:media:itor12170:itor12170-math-0325.

The proof of Menger's theorem is very formal (see next) and somewhat difficult to follow. It is nonetheless a good example of a proof in an axiomatic setting, where logical reasoning is based on syntactical transformations induced by inference rules on the given axioms. An intuitive discussion is provided in Section 5.4..

Proof.(⇒) Assume first that urn:x-wiley:09696016:media:itor12170:itor12170-math-0326, where the affine closure of T has dimension n. Then T must contain an independent subset urn:x-wiley:09696016:media:itor12170:itor12170-math-0327 with urn:x-wiley:09696016:media:itor12170:itor12170-math-0328, which we can map back to a subset urn:x-wiley:09696016:media:itor12170:itor12170-math-0329 using urn:x-wiley:09696016:media:itor12170:itor12170-math-0330. Since urn:x-wiley:09696016:media:itor12170:itor12170-math-0331 are injective, urn:x-wiley:09696016:media:itor12170:itor12170-math-0332, and by Axiom 3 we have urn:x-wiley:09696016:media:itor12170:itor12170-math-0333, so urn:x-wiley:09696016:media:itor12170:itor12170-math-0334, which establishes (a). Now take any urn:x-wiley:09696016:media:itor12170:itor12170-math-0335 with urn:x-wiley:09696016:media:itor12170:itor12170-math-0336 and urn:x-wiley:09696016:media:itor12170:itor12170-math-0337: this can be mapped via ϕ to a subset urn:x-wiley:09696016:media:itor12170:itor12170-math-0338: Lemma 5.1 ensures injectivity of ϕ and hence urn:x-wiley:09696016:media:itor12170:itor12170-math-0339, establishing (b).

(⇐) Conversely, assume (a) and (b) hold. By (a), let urn:x-wiley:09696016:media:itor12170:itor12170-math-0340 with urn:x-wiley:09696016:media:itor12170:itor12170-math-0341 and urn:x-wiley:09696016:media:itor12170:itor12170-math-0342, with urn:x-wiley:09696016:media:itor12170:itor12170-math-0343 independent and urn:x-wiley:09696016:media:itor12170:itor12170-math-0344. We claim that ϕ can be extended to a mapping of S into urn:x-wiley:09696016:media:itor12170:itor12170-math-0345. Take any urn:x-wiley:09696016:media:itor12170:itor12170-math-0346: by (b), urn:x-wiley:09696016:media:itor12170:itor12170-math-0347 with urn:x-wiley:09696016:media:itor12170:itor12170-math-0348. Note that urn:x-wiley:09696016:media:itor12170:itor12170-math-0349 by Section 5.1., which implies that for any urn:x-wiley:09696016:media:itor12170:itor12170-math-0350, we have urn:x-wiley:09696016:media:itor12170:itor12170-math-0351. Moreover, by Property 5.3 above, urn:x-wiley:09696016:media:itor12170:itor12170-math-0352 is independent and has cardinality urn:x-wiley:09696016:media:itor12170:itor12170-math-0353, which by Property 5.3 above implies that ω can be extended to a unique motion in urn:x-wiley:09696016:media:itor12170:itor12170-math-0354. So the action of ω is extended to q, and we can define urn:x-wiley:09696016:media:itor12170:itor12170-math-0355. We now show that this extension of ϕ is a congruence. Let urn:x-wiley:09696016:media:itor12170:itor12170-math-0356: we aim to prove that urn:x-wiley:09696016:media:itor12170:itor12170-math-0357. Consider the set urn:x-wiley:09696016:media:itor12170:itor12170-math-0358: since urn:x-wiley:09696016:media:itor12170:itor12170-math-0359, by (b) there is urn:x-wiley:09696016:media:itor12170:itor12170-math-0360 with urn:x-wiley:09696016:media:itor12170:itor12170-math-0361 such that urn:x-wiley:09696016:media:itor12170:itor12170-math-0362. As above, we note that there is a subset urn:x-wiley:09696016:media:itor12170:itor12170-math-0363 such that urn:x-wiley:09696016:media:itor12170:itor12170-math-0364 and urn:x-wiley:09696016:media:itor12170:itor12170-math-0365, that urn:x-wiley:09696016:media:itor12170:itor12170-math-0366 for each urn:x-wiley:09696016:media:itor12170:itor12170-math-0367, and that ω is a motion of urn:x-wiley:09696016:media:itor12170:itor12170-math-0368. Hence urn:x-wiley:09696016:media:itor12170:itor12170-math-0369, as claimed.

5.4. An intuitive interpretation

Although we stated initially that part of the beauty of the formal treatment of geometry is that it is based on symbolic manipulation rather than visual intuition, we quote from a survey paper that Menger himself wrote (in Italian, with the help of L. Geymonat) to disseminate the work carried out at his seminar (Menger, 1935).

Affinché uno spazio metrico reale R sia applicabile a un insieme parziale di urn:x-wiley:09696016:media:itor12170:itor12170-math-0370 è necessario e sufficiente che per ogni urn:x-wiley:09696016:media:itor12170:itor12170-math-0371 e per ogni urn:x-wiley:09696016:media:itor12170:itor12170-math-0372 punti di esso sia urn:x-wiley:09696016:media:itor12170:itor12170-math-0373 e inoltre che ogni urn:x-wiley:09696016:media:itor12170:itor12170-math-0374 punti di R siano applicabili a punti di urn:x-wiley:09696016:media:itor12170:itor12170-math-0375.

The translation is “a real metric space R is embeddable in a subset of urn:x-wiley:09696016:media:itor12170:itor12170-math-0376 if and only if urn:x-wiley:09696016:media:itor12170:itor12170-math-0377 for each urn:x-wiley:09696016:media:itor12170:itor12170-math-0378‐ and urn:x-wiley:09696016:media:itor12170:itor12170-math-0379‐point subsets or R, and that each urn:x-wiley:09696016:media:itor12170:itor12170-math-0380‐point subset of R is embeddable in urn:x-wiley:09696016:media:itor12170:itor12170-math-0381.”

Since we know that Δ, the Cayley–Menger determinant of the pairwise distances of a set S of points (see Equation 4), is proportional to the volume of the simplex on S embedded in urn:x-wiley:09696016:media:itor12170:itor12170-math-0382 dimensions, what Menger is saying is that his result on the congruence order of Euclidean spaces can be intuitively interpreted as follows:

An abstract semimetric space R is congruently embeddable in urn:x-wiley:09696016:media:itor12170:itor12170-math-0383 if and only if: (i) there are urn:x-wiley:09696016:media:itor12170:itor12170-math-0384 points in R that are congruently embeddable in urn:x-wiley:09696016:media:itor12170:itor12170-math-0385; (ii) the volume of the simplex on each urn:x-wiley:09696016:media:itor12170:itor12170-math-0386 points of R is zero; (iii) the volume of the simplex on each urn:x-wiley:09696016:media:itor12170:itor12170-math-0387 points of R is zero.

This result is exploited in the algorithm for computing point positions from distances given on p. 2284 of Sippl and Scheraga (1986).

6. Gödel on spherical distances

Kurt Gödel's name is attached to what is possibly the most revolutionary result in all of mathematics, i.e., Gödel's incompleteness theorem, according to which any formal axiomatic system sufficient to encode the integers is either inconsistent (it proves A and urn:x-wiley:09696016:media:itor12170:itor12170-math-0388) or incomplete (there is some true statement A which the system cannot prove). This shattered Hilbert's dream of a formal system in which every true mathematical statement could be proved. Few people know that Gödel, who attended the Vienna Circle, Menger's course in geometry, and Menger's seminar, also contributed two results that are completely outside the domain of logic. These results only appeared in the proceedings of Menger's seminar (Menger, 1998), and concern DG on a spherical surface.

6.1. Four points on the surface of a sphere

The result we discuss here is a proof to the following theorem, conjectured at a previous seminar session by Laura Klanfer. We remark that a sphere in urn:x-wiley:09696016:media:itor12170:itor12170-math-0389 is a semimetric space whenever it is endowed with a distance corresponding to the length of a geodesic curve joining two points.

Theorem 6.1. ([Gödel, 1986])Given a semimetric space S of four points, congruently embeddable in urn:x-wiley:09696016:media:itor12170:itor12170-math-0390 but not urn:x-wiley:09696016:media:itor12170:itor12170-math-0391, is S also congruently embeddable on the surface of a sphere in urn:x-wiley:09696016:media:itor12170:itor12170-math-0392.

Gödel's proof looks at the circumscribed sphere around a tetrahedron in urn:x-wiley:09696016:media:itor12170:itor12170-math-0393, and analyzes the relationship of the geodesics, their corresponding chords, and the sphere radius. It then uses a fixed point argument to find the radius that corresponds to geodesics that are as long as the given sides.

Proof.The congruence embedding of S in urn:x-wiley:09696016:media:itor12170:itor12170-math-0394 defines a tetrahedron T having six (straight) sides with lengths urn:x-wiley:09696016:media:itor12170:itor12170-math-0395. Let r be the radius of the sphere circumscribed around T (i.e., the smallest sphere containing T). We will now consider a family of tetrahedra urn:x-wiley:09696016:media:itor12170:itor12170-math-0396, parametrized on a scalar urn:x-wiley:09696016:media:itor12170:itor12170-math-0397, defined as follows: urn:x-wiley:09696016:media:itor12170:itor12170-math-0398 is the tetrahedron in urn:x-wiley:09696016:media:itor12170:itor12170-math-0399 having side lengths urn:x-wiley:09696016:media:itor12170:itor12170-math-0400, where urn:x-wiley:09696016:media:itor12170:itor12170-math-0401 is the length of the chord subtending a geodesic having length α on a sphere of radius urn:x-wiley:09696016:media:itor12170:itor12170-math-0402. As x tends toward zero, each urn:x-wiley:09696016:media:itor12170:itor12170-math-0403 tends toward urn:x-wiley:09696016:media:itor12170:itor12170-math-0404 (for each urn:x-wiley:09696016:media:itor12170:itor12170-math-0405), since the radius of the sphere tends toward infinity and each geodesic length tends toward the length of the subtending chord. This means that urn:x-wiley:09696016:media:itor12170:itor12170-math-0406 tends toward T, since T is precisely the tetrahedron having side lengths urn:x-wiley:09696016:media:itor12170:itor12170-math-0407. For each urn:x-wiley:09696016:media:itor12170:itor12170-math-0408, let urn:x-wiley:09696016:media:itor12170:itor12170-math-0409 be the inverse of the radius of the sphere circumscribed about urn:x-wiley:09696016:media:itor12170:itor12170-math-0410. Since urn:x-wiley:09696016:media:itor12170:itor12170-math-0411 as urn:x-wiley:09696016:media:itor12170:itor12170-math-0412, and the radius circumscribed about T is r, it follows that urn:x-wiley:09696016:media:itor12170:itor12170-math-0413 as urn:x-wiley:09696016:media:itor12170:itor12170-math-0414. Also, since T exists by hypothesis, we can define urn:x-wiley:09696016:media:itor12170:itor12170-math-0415 and urn:x-wiley:09696016:media:itor12170:itor12170-math-0416. Also note that it is well known by elementary spherical geometry that:

(6)

Claim.if urn:x-wiley:09696016:media:itor12170:itor12170-math-0418 then ϕ has a fixed point in the open interval urn:x-wiley:09696016:media:itor12170:itor12170-math-0419.

Proof of the claim.First of all notice that τ(0) exists, and urn:x-wiley:09696016:media:itor12170:itor12170-math-0420 is a continuous function for urn:x-wiley:09696016:media:itor12170:itor12170-math-0421 for each α (by Equation 6). Since urn:x-wiley:09696016:media:itor12170:itor12170-math-0422 is defined by the chord lengths urn:x-wiley:09696016:media:itor12170:itor12170-math-0423, this also means that urn:x-wiley:09696016:media:itor12170:itor12170-math-0424 varies continuously for x in some open interval urn:x-wiley:09696016:media:itor12170:itor12170-math-0425 (for some constant urn:x-wiley:09696016:media:itor12170:itor12170-math-0426). In turn, this implies that urn:x-wiley:09696016:media:itor12170:itor12170-math-0427 exists by continuity. There are two cases: either urn:x-wiley:09696016:media:itor12170:itor12170-math-0428 is at the upper extremum of I, or it is not.

  • (i)

    If urn:x-wiley:09696016:media:itor12170:itor12170-math-0429, then urn:x-wiley:09696016:media:itor12170:itor12170-math-0430 exists, its longest edge has length urn:x-wiley:09696016:media:itor12170:itor12170-math-0431, so, by elementary spherical geometry, the radius of the sphere circumscribed around urn:x-wiley:09696016:media:itor12170:itor12170-math-0432 is greater than urn:x-wiley:09696016:media:itor12170:itor12170-math-0433, i.e., greater than urn:x-wiley:09696016:media:itor12170:itor12170-math-0434. Thus urn:x-wiley:09696016:media:itor12170:itor12170-math-0435. We also have, however, that urn:x-wiley:09696016:media:itor12170:itor12170-math-0436, so by the intermediate value theorem there must be some urn:x-wiley:09696016:media:itor12170:itor12170-math-0437 with urn:x-wiley:09696016:media:itor12170:itor12170-math-0438.

  • (ii)

    Assume now urn:x-wiley:09696016:media:itor12170:itor12170-math-0439 and suppose urn:x-wiley:09696016:media:itor12170:itor12170-math-0440 is nonplanar. Then for each y in an arbitrary small neighborhood around urn:x-wiley:09696016:media:itor12170:itor12170-math-0441, urn:x-wiley:09696016:media:itor12170:itor12170-math-0442 must exist by continuity: in particular, there must be some urn:x-wiley:09696016:media:itor12170:itor12170-math-0443 where urn:x-wiley:09696016:media:itor12170:itor12170-math-0444 exists, which contradicts the definition of urn:x-wiley:09696016:media:itor12170:itor12170-math-0445. So urn:x-wiley:09696016:media:itor12170:itor12170-math-0446 is planar: this means that each geodesic is contained in the same plane, which implies that the geodesics are linear segments. It follows that the circumscribed sphere has infinite radius, or, equivalently, that urn:x-wiley:09696016:media:itor12170:itor12170-math-0447. Again, by urn:x-wiley:09696016:media:itor12170:itor12170-math-0448 and the intermediate value theorem, there must be some urn:x-wiley:09696016:media:itor12170:itor12170-math-0449 with urn:x-wiley:09696016:media:itor12170:itor12170-math-0450.

This concludes the proof of the claim.

So now let y be the fixed point of ϕ. The tetrahedron urn:x-wiley:09696016:media:itor12170:itor12170-math-0451 has side lengths urn:x-wiley:09696016:media:itor12170:itor12170-math-0452 for each urn:x-wiley:09696016:media:itor12170:itor12170-math-0453, and is circumscribed by a sphere σ with radius urn:x-wiley:09696016:media:itor12170:itor12170-math-0454. It follows that, on the sphere σ, the geodesics corresponding to the chords given by the tetrahedron sides have lengths urn:x-wiley:09696016:media:itor12170:itor12170-math-0455 (for urn:x-wiley:09696016:media:itor12170:itor12170-math-0456), as claimed. urn:x-wiley:09696016:media:itor12170:itor12170-math-0457

6.2. Gödel's devilish genius

Gödel's proof exhibits an unusual peak of devilish genius. At first sight, it is a one‐dimensional fixed‐point argument that employs a couple of elementary notions in spherical geometry. Underneath the surface, the fixed‐point argument eschews a misleading visual intuition.

T is a given tetrahedron in urn:x-wiley:09696016:media:itor12170:itor12170-math-0458 that is assumed to be nonplanar and circumscribed by a sphere of finite positive radius r (see Fig. 9, left).

Tetrahedron T (left) and the tetrahedron urn:x-wiley:09696016:media:itor12170:itor12170-math-0459 (right). Beware of this visual interpretation: it may yield misleading insights (see Section 6.2.).
The impact of Schoenberg's remarks far exceeds that of the original paper: this is something that happens quite often in mathematics.

The map τ sends a scalar x to the tetrahedron having as side lengths the chords subtending the geodesics of length urn:x-wiley:09696016:media:itor12170:itor12170-math-0460 (urn:x-wiley:09696016:media:itor12170:itor12170-math-0461) on a sphere of radius urn:x-wiley:09696016:media:itor12170:itor12170-math-0462 (see Fig. 9, right). The map τ is such that urn:x-wiley:09696016:media:itor12170:itor12170-math-0463 since for urn:x-wiley:09696016:media:itor12170:itor12170-math-0464 the radius is infinite, which means that the geodesics are equal to their chords. Moreover, the map ϕ sends x to the inverse of the radius of the sphere circumscribing urn:x-wiley:09696016:media:itor12170:itor12170-math-0465. Since every geodesic on the sphere is a portion of a great circle, it would appear from Fig. 9 (right) that the radius urn:x-wiley:09696016:media:itor12170:itor12170-math-0466 used to compute urn:x-wiley:09696016:media:itor12170:itor12170-math-0467 (urn:x-wiley:09696016:media:itor12170:itor12170-math-0468) is the same as the radius urn:x-wiley:09696016:media:itor12170:itor12170-math-0469 of the sphere circumscribing urn:x-wiley:09696016:media:itor12170:itor12170-math-0470, which would immediately yield urn:x-wiley:09696016:media:itor12170:itor12170-math-0471 for every x—making the proof trivial. There is something inconsistent, however, in the visual interpretation of Fig. 9: the given tetrahedron T corresponds to the case urn:x-wiley:09696016:media:itor12170:itor12170-math-0472, which happens when urn:x-wiley:09696016:media:itor12170:itor12170-math-0473, i.e., the radius of the sphere circumscribed around T is ∞. But this would yield T to be a planar tetrahedron, which is a contradiction with an assumption of the theorem. Moreover, if urn:x-wiley:09696016:media:itor12170:itor12170-math-0474 were equal to x for each x, this would yield urn:x-wiley:09696016:media:itor12170:itor12170-math-0475, another contradiction.

The misleading concept is hidden in Fig. 9 (right). It shows a tetrahedron inscribed in a sphere, and a spherical tetrahedron on the same vertices. This is not true in general, i.e., the spherical tetrahedron with the given curved side lengths urn:x-wiley:09696016:media:itor12170:itor12170-math-0476 cannot, in general, be embedded in the surface of a sphere of any radius. For example, the case urn:x-wiley:09696016:media:itor12170:itor12170-math-0477 yields geodesics with infinite curvatures (i.e., straight lines laying in a plane), but urn:x-wiley:09696016:media:itor12170:itor12170-math-0478, and there is no flat tetrahedron with the same distances as those of T. The sense of Gödel's proof is that the function urn:x-wiley:09696016:media:itor12170:itor12170-math-0479 simply transforms a set of geodesic distances into a set of linear distances, i.e., it maps scalars to scalars rather than geodesics to segments, whereas Fig. 9 (right) shows the special case where the geodesics are mapped to the corresponding segments, with intersections at the same points (namely the distances urn:x-wiley:09696016:media:itor12170:itor12170-math-0480 can be embedded on the particular sphere shown in the picture). More specifically, the geodesic curves may or may not be realizable on a sphere of radius urn:x-wiley:09696016:media:itor12170:itor12170-math-0481. Gödel's proof shows exactly that there must be some x for which urn:x-wiley:09696016:media:itor12170:itor12170-math-0482, i.e., the geodesic curves become realizable.

6.3. Existential versus constructive proofs

Like many existential proofs based on fixed‐point theorems, this proof is beautiful because it asserts the truth of the theorem without any certificates other than its own logical validity. An alternative, constructive proof of Theorem 6.1 is given in Theorem 3' of Schoenberg (1935). The tools used in that proof, Cayley–Menger determinants and positive semidefiniteness, are discussed in Section 7..

7. The equivalence of EDM and PSD matrices

Many fundamental innovations stem from what are essentially footnotes to apparently deeper or more important work. Isaac Schoenberg, better known as the inventor of splines (Schoenberg, 1946), published a paper in 1935 titled Remarks to Maurice Fréchet's article “Sur la définition axiomatique d'une classe d'espace distanciés vectoriellement applicable sur l'espace de Hilbert” (Schoenberg, 1935). The impact of Schoenberg's remarks far exceeds that of the original paper: these remarks encode what amounts to the basis of the well‐known MDS techniques for visualizing high‐dimensional data (Cox and Cox, 2001), as well as all the solution techniques for DG problems based on semidefinite programming (Alfakih et al., 1999; Man‐Cho So and Ye, 2007) (Fig. 10).

7.1. Schoenberg's problem

Schoenberg poses the following problem, relevant to Menger's treatment of DG (Menger, 1931, p. 737):

Given an urn:x-wiley:09696016:media:itor12170:itor12170-math-0483 symmetric matrix D, what are necessary and sufficient conditions such that D is a EDM corresponding to n points in urn:x-wiley:09696016:media:itor12170:itor12170-math-0484, with urn:x-wiley:09696016:media:itor12170:itor12170-math-0485 minimum?

Menger's solution is based on Cayley–Menger determinants; Schoenberg's solution is much simpler and more elegant, and rests upon the following theorem. Recall that a matrix is PSD if and only if all its eigenvalues are nonnegative.

Theorem 7.1. ([Schoenberg, 1935])The urn:x-wiley:09696016:media:itor12170:itor12170-math-0486 symmetric matrix urn:x-wiley:09696016:media:itor12170:itor12170-math-0487 is the EDM of a set of n points urn:x-wiley:09696016:media:itor12170:itor12170-math-0488 (with r minimum) if and only if the matrix urn:x-wiley:09696016:media:itor12170:itor12170-math-0489 is PSD of rank r.

Instead of providing Schoenberg's proof, we follow a more modern treatment, which also unearths the important link of this theorem with classical MDS (Cox and Cox, 2001, § 2.2.1), an approximate method for finding sets of points urn:x-wiley:09696016:media:itor12170:itor12170-math-0490 having EDM that approximates a given symmetric matrix. MDS is one of the cornerstones of the modern science of data analysis.

7.2. The proof of Schoenberg's theorem

Given a set urn:x-wiley:09696016:media:itor12170:itor12170-math-0491 of points in urn:x-wiley:09696016:media:itor12170:itor12170-math-0492, we can write x as an urn:x-wiley:09696016:media:itor12170:itor12170-math-0493 matrix having urn:x-wiley:09696016:media:itor12170:itor12170-math-0494 as ith column. The matrix urn:x-wiley:09696016:media:itor12170:itor12170-math-0495 having the scalar product urn:x-wiley:09696016:media:itor12170:itor12170-math-0496 as its urn:x-wiley:09696016:media:itor12170:itor12170-math-0497‐th component is called the Gram matrix or Gramian of x. The proof of Theorem 7.1 works by exhibiting a 1‐1 correspondence between squared EDMs and Gram matrices, and then by proving that a matrix is Gram if and only if it is PSD.

Without loss of generality, we can assume that the barycenter of the points in x is at the origin:

(7)
Now we remark that, for each urn:x-wiley:09696016:media:itor12170:itor12170-math-0499, we have
(8)

7.2.1. The Gram matrix in function of the EDM

We “invert” Equation 8 to compute the matrix urn:x-wiley:09696016:media:itor12170:itor12170-math-0501 in function of the matrix urn:x-wiley:09696016:media:itor12170:itor12170-math-0502. We sum Equation 8 over all values of urn:x-wiley:09696016:media:itor12170:itor12170-math-0503, obtaining

(9)
By Equation 7, the negative term in the right‐hand side of Equation 9 is zero. On dividing through by n, we have
(10)
Similarly for urn:x-wiley:09696016:media:itor12170:itor12170-math-0506, we obtain
(11)
We now sum Equation 10 over all j, getting
(12)
(the last equality in Equation 12 holds because the same quantity urn:x-wiley:09696016:media:itor12170:itor12170-math-0509 is being summed over the same range urn:x-wiley:09696016:media:itor12170:itor12170-math-0510, with the symbol k replaced by the symbol i first and j next). We then divide through by n to get
(13)

We now rearrange Equations 8, 11, 10 as follows:

(14)
(15)
(16)
and replace the left‐hand side terms of Equations 15 and 16 into Equation 14 to obtain
(17)
whence, on substituting the last term using Equation 13, we have
(18)

It turns out that Equation 18 can be written in a matrix form as

(19)
where urn:x-wiley:09696016:media:itor12170:itor12170-math-0518 and urn:x-wiley:09696016:media:itor12170:itor12170-math-0519.

Gram matrices are PSD matrices

Any Gram matrix urn:x-wiley:09696016:media:itor12170:itor12170-math-0520 derived by a point sequence (also called a realization) urn:x-wiley:09696016:media:itor12170:itor12170-math-0521 in urn:x-wiley:09696016:media:itor12170:itor12170-math-0522 for some non‐negative integer K has two important properties: (i) the rank of G is equal to the rank of x; and (ii) G is PSD, i.e., urn:x-wiley:09696016:media:itor12170:itor12170-math-0523 for all urn:x-wiley:09696016:media:itor12170:itor12170-math-0524. For simplicity, we only prove these properties in the case when urn:x-wiley:09696016:media:itor12170:itor12170-math-0525 is a urn:x-wiley:09696016:media:itor12170:itor12170-math-0526 matrix, i.e., urn:x-wiley:09696016:media:itor12170:itor12170-math-0527, and urn:x-wiley:09696016:media:itor12170:itor12170-math-0528 is a scalar for all urn:x-wiley:09696016:media:itor12170:itor12170-math-0529 (this is the case urn:x-wiley:09696016:media:itor12170:itor12170-math-0530 in Schoenberg's problem above).

  • (i)

    The ith column of G is the vector x multiplied by the scalar urn:x-wiley:09696016:media:itor12170:itor12170-math-0531, which means that every column of G is a scalar multiple of a single column vector, and hence that urn:x-wiley:09696016:media:itor12170:itor12170-math-0532;

  • (ii)

    For any vector y, urn:x-wiley:09696016:media:itor12170:itor12170-math-0533.

Moreover, G is a Gram matrix only if it is PSD. Let M be a PSD matrix. By spectral decomposition there is a unitary matrix Y such that urn:x-wiley:09696016:media:itor12170:itor12170-math-0534, where Λ is diagonal. By positive semidefiniteness, urn:x-wiley:09696016:media:itor12170:itor12170-math-0535 for each i, so urn:x-wiley:09696016:media:itor12170:itor12170-math-0536 exists. Hence urn:x-wiley:09696016:media:itor12170:itor12170-math-0537, which makes M the Gram matrix of the vector urn:x-wiley:09696016:media:itor12170:itor12170-math-0538. This concludes the proof of Theorem 7.1.

7.3. Finding the realization of a Gramian

Having computed the Gram matrix G from the EDM D in Section 7.2., we obtain the corresponding realization x as follows. This is essentially the same reasoning used above to show the equivalence of Gramians and PSD matrices, but we give a few more details.

Let urn:x-wiley:09696016:media:itor12170:itor12170-math-0539 be the urn:x-wiley:09696016:media:itor12170:itor12170-math-0540 matrix with the eigenvalues urn:x-wiley:09696016:media:itor12170:itor12170-math-0541 along the diagonal and zeroes everywhere else, and let Y be the urn:x-wiley:09696016:media:itor12170:itor12170-math-0542 matrix having the eigenvector corresponding to the eigenvalue urn:x-wiley:09696016:media:itor12170:itor12170-math-0543 as its jth column (for urn:x-wiley:09696016:media:itor12170:itor12170-math-0544), chosen so that Y consists of orthogonal columns. Then urn:x-wiley:09696016:media:itor12170:itor12170-math-0545. Since Λ is a diagonal matrix and all its diagonal entries are nonnegative (by positive semidefiniteness of G), we can write Λ as urn:x-wiley:09696016:media:itor12170:itor12170-math-0546, where urn:x-wiley:09696016:media:itor12170:itor12170-math-0547. Now, since urn:x-wiley:09696016:media:itor12170:itor12170-math-0548,

which implies that
(20)
is a realization of G in urn:x-wiley:09696016:media:itor12170:itor12170-math-0551.

7.4. Multidimensional scaling

MDS can be used to find realizations of approximate distance matrices urn:x-wiley:09696016:media:itor12170:itor12170-math-0552. As above, we compute urn:x-wiley:09696016:media:itor12170:itor12170-math-0553. Since urn:x-wiley:09696016:media:itor12170:itor12170-math-0554 is not a EDM, urn:x-wiley:09696016:media:itor12170:itor12170-math-0555 will probably fail to be a Gram matrix, and as such might have negative eigenvalues. But it suffices to let Y be the eigenvectors corresponding to the H positive eigenvalues urn:x-wiley:09696016:media:itor12170:itor12170-math-0556, to recover an approximate realization x of urn:x-wiley:09696016:media:itor12170:itor12170-math-0557 in urn:x-wiley:09696016:media:itor12170:itor12170-math-0558.

Another interesting feature of MDS is that the dimensionality H of the ambient space of x is actually determined by D (or urn:x-wiley:09696016:media:itor12170:itor12170-math-0559) rather than given as a problem input. In other words, MDS finds the “inherent dimensionality” of a set of (approximate) pairwise distances.

8. Conclusion

We presented what we feel are the most important and/or beautiful theorems in DG (Heron's, Cauchy's, Cayley's, Menger's, Gödel's, and Schoenberg's). Three of them (Heron's, Cayley's, and Menger's) are concerned with the volume of simplices given its side lengths, which appears to be the central concept in DG. We think Cauchy's proof is as beautiful as a piece of classical art, whereas Gödel's proof, though less important, is stunning. Last but not least, Schoenberg's theorem is the fundamental link between the history of DG and its contemporary treatment.

Acknowledgments

The first author (L.L.) worked on this paper while employed at IBM TJ Watson Research Center, and is very grateful to IBM for the freedom he was afforded. The second author (C.L.) is grateful to the Brazilian research agencies FAPESP and CNPq.

    Number of times cited: 4

    • , Clifford algebra and discretizable distance geometry, Mathematical Methods in the Applied Sciences, 41, 11, (4063-4073), (2017).
    • , Introduction, An Introduction to Distance Geometry applied to Molecular Geometry, 10.1007/978-3-319-57183-6_1, (1-4), (2017).
    • , A NOTE ON COMPUTING THE INTERSECTION OF SPHERES IN , The ANZIAM Journal, 59, 02, (271), (2017).
    • , Assigned and unassigned distance geometry: applications to biological molecules and nanostructures, 4OR, 14, 4, (337), (2016).