primer

We are interested in 3 key properties of functions:

1. Lipschitzness

Defn: L-Lipschitz

A function F:RpR is said to be L-Lipschitz if x,yRp$$|F(x) - F(y)| \leq L \Vert x - y \Vert$$

2. Convexity

Defn: Convexity

A function F:RpR is said to be convex if x,yRp and θ[0,1] $$F(\theta x + (1-x)y) \leq \theta F(x) + (1-\theta)F(y)$$A function is said to be strictly convex if the above inequality holds in the strict sense.

Defn: Convexity

An at least once differentiable function F:RpR is convex if x,yRp$$F(x) + \nabla F(x)^{\top}(y-x) \leq F(y)$$

Defn: α-Strong Convexity

For some α>0, a function F:RpR is said to be α-strongly convex if we have that F(x)α2x2 is a convex function

Defn: α-Strong Convexity

An at least once differentiable function F:RpR is said to be α-strongly convex if for some α>0,(x,y)Rp we have$$F(y) \geq F(x) + \nabla F(x)^{\top}(y-x) + \frac{\alpha}{2} \cdot \Vert y - x \Vert^2$$

3. Smoothness

Defn: Lipschitz-Smoothness (β-Smooth or β- Lipschitz Smooth or β-Gradient Smooth)

For some β>0 and an at least once differentiable function F:RpR its gradients should be β-Lipschitz i.e. x,y$$\Vert \nabla F(x) - \nabla F(y) \Vert \leq \beta \Vert x - y \Vert$$