The Johnson-Lindenstrauss Lemma

The Johnson-Lindenstrauss Lemma, or JL Lemma, says that we can embed $n$ points from $\mathbb{R}^d$ into an $O(\frac{\log(n)}{\varepsilon^2})$-dimensional space while preserving all pairwise distances up to relative error $\varepsilon$. We prove this in the simplest case, where the embedding is a random Gaussian matrix.
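For a sense of scale (illustrative numbers, not from the original, and ignoring the constant hidden in the $O(\cdot)$): with $n = 10^6$ points and $\varepsilon = 0.1$, the target dimension is roughly

$$ k \approx \frac{\log(10^6)}{(0.1)^2} = \frac{6\ln(10)}{0.01} \approx 1{,}400 $$

regardless of the original dimension $d$. We start with a simple concentration result for a single vector $\mathbf{x}\in\mathbb{R}^{d}$: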

Lemma 1: Gaussians Preserve Norms

Fix $\varepsilon>0$, $\delta>0$, dimension $d$, and $k = \Omega(\frac1{\varepsilon^2}\log(\frac1\delta))$. Let $\boldsymbol{G}\in\mathbb{R}^{k \times d}$ be a matrix of iid $\mathcal{N}(0,1)$ entries. Then, for any fixed $\mathbf{x}\in\mathbb{R}^d$, with probability $1-\delta$ we have

$$ (1-\varepsilon)\|\mathbf{x}\|_2^2 \leq \|{\textstyle\frac{1}{\sqrt k}}\boldsymbol{G}\mathbf{x}\|_2^2 \leq (1+\varepsilon)\|\mathbf{x}\|_2^2 $$

Proof. Consider the $i^{th}$ entry of $\mathbf{w} := \boldsymbol{G}\mathbf{x}$. Letting $g_1,\ldots,g_d$ denote the entries of the $i^{th}$ row of $\boldsymbol{G}$, we have

$$ w_i = \sum_{j=1}^d g_j x_j \sim \sum_{j=1}^d \mathcal{N}(0,x_j^2) = \mathcal{N}(0,\|\mathbf{x}\|_2^2) $$

That is, $\mathbf{w}$ is just a vector of iid $\mathcal{N}(0,\|\mathbf{x}\|_2^2)$ random variables. So,

$$ \frac1k\|\mathbf{w}\|_2^2 = \frac1k \sum_{i=1}^k w_i^2 \sim \|\mathbf{x}\|_2^2 \cdot \frac1k\sum_{i=1}^k(\mathcal{N}(0,1))^2 $$

Fortunately, the concentration of the average of squared Gaussians is well understood. Equation 2.2.1 from Wainwright (2015) says that, with probability $1-\delta$, iid standard normal random variables $z_1,\ldots,z_k$ satisfy

$$ \left|\frac{1}{k}\sum_{i=1}^k z_i^2 - 1\right| \leq \sqrt{\frac{8}{k}\ln\left(\frac2\delta\right)} \leq \varepsilon $$

where our value of $k = \Omega(\frac1{\varepsilon^2}\log(\frac1\delta))$ guarantees this error term is at most $\varepsilon$. We conclude that

$$ \left|\|\mathbf{x}\|_2^2 - \|{\textstyle\frac{1}{\sqrt k}} \boldsymbol{G}\mathbf{x}\|_2^2\right| = \left|\|\mathbf{x}\|_2^2 - {\textstyle\frac{1}{k}}\|\mathbf{w}\|_2^2\right| = \|\mathbf{x}\|_2^2 \cdot \left|\frac{1}{k}\sum_{i=1}^k z_i^2 - 1\right| \leq \varepsilon \|\mathbf{x}\|_2^2 $$

which completes the proof.

$\blacksquare$
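As a sanity check, here is a minimal numerical sketch of Lemma 1 (not part of the original argument; it assumes NumPy and uses arbitrarily chosen $d$, $k$, and test vector):

```python
import numpy as np

rng = np.random.default_rng(0)

# Arbitrary illustrative parameters (not from the original).
d, k = 10_000, 2_000
x = rng.standard_normal(d)              # a fixed vector x in R^d

G = rng.standard_normal((k, d))         # iid N(0,1) entries
ratio = np.sum((G @ x / np.sqrt(k)) ** 2) / np.sum(x ** 2)

# ||(1/sqrt(k)) G x||^2 / ||x||^2 should be close to 1, deviating by
# roughly sqrt(8/k * ln(2/delta)) as in the concentration bound above.
print(f"squared-norm ratio: {ratio:.4f}")
```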

Given this simple concentration, we can state the Johnson-Lindenstrauss Lemma:

Lemma 2: Johnson-Lindenstrauss

Let $\mathbf{x}_1,\ldots,\mathbf{x}_n\in\mathbb{R}^{d}$. Fix $\varepsilon\in(0,1)$, $\delta>0$, and $k = \Omega(\frac1{\varepsilon^2}\log(\frac n\delta))$. Let $\boldsymbol{\Pi}\in\mathbb{R}^{k \times d}$ be a matrix of iid $\mathcal{N}(0,\frac1k)$ entries. Then, with probability $1-\delta$, for all pairs $i,j$ we have

$$ (1-\varepsilon) \|\mathbf{x}_i - \mathbf{x}_j\|_2 \leq \|\boldsymbol{\Pi}\mathbf{x}_i - \boldsymbol{\Pi}\mathbf{x}_j\|_2 \leq (1+\varepsilon) \|\mathbf{x}_i - \mathbf{x}_j\|_2 $$

Proof. For each pair $i,j$, consider $\mathbf{v}_{i,j} := \mathbf{x}_i - \mathbf{x}_j$. Apply Lemma 1 to each $\mathbf{v}_{i,j}$ with failure probability $\frac{\delta}{\binom n2}$, and union bound over all $\binom{n}{2} = O(n^2)$ vectors to find a matrix $\boldsymbol{G}$ which preserves the norms of every $\mathbf{v}_{i,j}$ simultaneously with probability $1-\delta$. This union bound requires $k = \Omega(\frac1{\varepsilon^2}\log(\frac{n^2}\delta)) = \Omega(\frac1{\varepsilon^2}\log(\frac n\delta))$, which is why $\log(\frac n\delta)$ appears in the requirement on $k$. Since $\boldsymbol{\Pi}$ is distributed as $\frac1{\sqrt k} \boldsymbol{G}$, we have

$$ \sqrt{1-\varepsilon}\, \|\mathbf{x}_i - \mathbf{x}_j\|_2 \leq \|\boldsymbol{\Pi}\mathbf{x}_i - \boldsymbol{\Pi}\mathbf{x}_j\|_2 \leq \sqrt{1+\varepsilon}\, \|\mathbf{x}_i - \mathbf{x}_j\|_2 $$

We then note that $\sqrt{1-\varepsilon} > 1-\varepsilon$ and $\sqrt{1+\varepsilon} < 1+\varepsilon$ for $\varepsilon\in(0,1)$, since $\sqrt{t} > t$ for $t\in(0,1)$ and $\sqrt{t} < t$ for $t > 1$, which completes the proof.

$\blacksquare$
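Similarly, here is a minimal sketch of Lemma 2, checking every pairwise distance under a scaled Gaussian projection (again NumPy with arbitrary illustrative parameters; not a claim from the original):

```python
import numpy as np
from itertools import combinations

rng = np.random.default_rng(1)

# Arbitrary illustrative parameters (not from the original).
n, d, k = 50, 5_000, 1_000
X = rng.standard_normal((n, d))                 # rows are the points x_1, ..., x_n

Pi = rng.standard_normal((k, d)) / np.sqrt(k)   # iid N(0, 1/k) entries
Y = X @ Pi.T                                    # row i is Pi x_i

# Worst relative distortion of a pairwise distance; should be small,
# on the order of sqrt(log(n) / k).
worst = max(
    abs(np.linalg.norm(Y[i] - Y[j]) / np.linalg.norm(X[i] - X[j]) - 1)
    for i, j in combinations(range(n), 2)
)
print(f"worst relative distortion over all pairs: {worst:.4f}")
```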

See Also

The proofs above are ubiquitous; they can be found, for example, in Wainwright (2015) and Musco (2018).

Here are some important papers in the area of JL:

  • Johnson Lindenstrauss (1984) is the original paper of Johnson and Lindenstrauss.

  • Musco (2018) has a nice short proof, which this page basically copies.

  • Larsen Nelson (2017) shows that no mapping, even a nonlinear one, can beat the rate of $k = \Omega(\frac{1}{\varepsilon^2} \log(n))$.

  • Ailon Chazelle (2009) gives the Fast JL Transform, which uses subsampled Fourier or Hadamard matrices, allowing faster computation of the matrix-vector product $\boldsymbol{\Pi}\mathbf{x}$.

  • Kane Nelson (2014) gives the sparsest known embedding matrix $\boldsymbol{\Pi}$.

  • Nelson Nguyễn (2013) lower bounds the necessary sparsity.

  • Let me know if anything is missing

Bibliography