Convexity

woozins·2024년 7월 30일
0

Convex Sets.

  • Convex set

C is Convex set if
x,xCx, x' \in C then (1θ)x+θxC(1-\theta)x + \theta x' \in C for any 0θ10 \le \theta \le 1

Operations on Convex Sets.

  • Intersection :

If A and B are convex sets, then ABA \cap B is a convex set

Example 1 the set of positive-definite matrices Sn+S^+_n is convex because..

Hy={XyXy>0}H_y = \{X|y'Xy > 0\} is convex for all yy.
Sn+=yHyS_n^+= \cap_{y}H_y Thus Sn+S_n^+ is convex.

  • Affine transformation

If SRnS \in R^n is convex then f(S)={yy=Ax+b,xS}f(S) = \{y|y = Ax + b, x \in S \} is convex.
Similarly, f1(S)f^{-1}(S) is convex if f(x)=Ax+bf(x) = Ax+b is an affine map.

  • Projection

If S is convex then each projection π(S)={y(x,y)S}\pi(S) = \{y|(x,y) \in S \} is convex.

  • Separating Hyperplanes

If C and D are convex, and disjoint then there exists a separating hyperplane between them:

aTxba^Tx \ge b for all xCx \in C
aTxba^Tx \le b for all xDx \in D

  • Supporting Hyperplanes

Suppose xx is an element of the convex set C. and x0x_0 is in the boundary of C.

There is a Hyperplane H s.t. H={xaTx=aTx0}H = \{x | a^Tx = a^Tx_0\} and aTxaTx0a^Tx \ge a^Tx_0 for all xCx \in C

Convex Functions.

  • Some terminology
  • proper function : A function f=SRm[,]f = S \subseteq R^m \to [-\infty, \infty] is said to be proper if there is no xSx \in S with f(x)=f(x) = -\infty and there is some xx with f(x)f(x) \neq \infty.

  • Effective domain domfdom f : the set of points where ff is finite. i.e, domf={xSf(x)<}dom f = \{x \in S|f(x) < \infty \}

  • A proper convex function ff is closedclosed if it is lower semi-continuous; that is, in case for each α\alpha the set {x:f(x)>α}\{x:f(x) > \alpha\} is open.
  • 동치명제

    A function ff is convex if and only if the epigraphepigraph
    epif={(x,t)Rn+1f(x)t}epi f =\{(x,t) \in R^{n+1} | f(x) \leq t \}
    is a convex set.

  • How to show a function is convex?
    0차, 1차, 2차 미분의 관점에서 각각 보일 수 있다.

Zeroth-order characterization of convexity
:A function f:RnRf : R^n \to R is convex if and only if g(t)=f(x+tv)g(t) = f(x + tv) is convex for each xdomfx \in dom f and vRnv \in R^n

특징 : x에 대한 convexity를 t에 대한 convexity로 바꿔버린다.

Example : Log determinant
let f(A)=logdet(A)f(A) = log det(A) then let g(t)=f(A+tV)g(t) = f(A + tV).
g(t)=logdet(A+tV)=logdet(A)+logdet(I+tA1/2VA1/2)g(t) = log det(A + tV) = log det(A) + log det(I + tA^{-1/2}VA^{-1/2})
logdet(A)+ilog(1+tλi)log det(A) + \sum_{i}log(1 + t\lambda_i), where λi\lambda_i is ith eigenvalue of A1/2VA1/2A^{-1/2}VA^{-1/2}.
Therefore, f(A)f(A) is concave.

First-order characterization of convexity
: A differentiable function f:RnRf : R^n \to R is convex if and only if f(y)f(x)+f(x)T(yx)f(y) \ge f(x) + \bigtriangledown f(x)^T(y-x) is convex for every x,ydomfx,y \in dom f

특징 : 접선(평면) 성질에 의해 자명.

Second-order characterization of convexity
: Hessian 행렬의 non-posivite-definie 성질확인.

Example : log-sum-exp
f(x)=logiexp  xif(x) = log\sum_{i} exp \; x_i is convex.

Functions of Convex Functions

다음은 함수의 convexity를 보존하는 연산들이다.

  • Nonnegative weighted sum - trivial

  • Composition with an affine function
    - If f:RnRf : R^n \to R is convex and g : RmRnR^m \to R^n is affine, givne by g(x)=Ax+bg(x) = Ax + b, then the composition (f  o  g)(x)=f(Ax+b)(f\;o\; g)(x) = f(Ax+b) is convex.

  • pointwise maximum and supremum
    The maximum of convex functions is convex. Can be shown with the notion of epigraph. similary,
    g(x)=supyAf(x,y)g(x) = sup_{y \in A}f(x,y) is convex if ff is convex in xx for yAy \in A
    - Example : Largest eigenvalue
    For XSnX \in S^n a symmetric matrix.
    λmax(X)=supyTy=1yTXy\lambda_{max}(X) = sup_{y^Ty = 1}y^TXy
    X is linear for each y s.t. yty=1y^ty = 1 thus convex. Therefore, λmax(X)\lambda_{max}(X) is convex.

  • Composition
    - f  o  gf\;o\;g is convex if g is concave and f is convex and nondecreasing

    • f  o  gf\;o\;g is concave if g is concave and f is concave and nondecreasing
    • f  o  gf\;o\;g is concave if g is convex and f is concave and nonincreasing

Log-Convexity

: f(x)θf(y)1θf(θx+(1θ)y)f(x)^\theta f(y)^{1-\theta} \ge f(\theta x + (1-\theta)y)

Important properties

  • The product of log-concave functions is log-concave (trivial)

  • If f:Rn+mRf:R^{n+m} \to R is log-concave then the marginal
    g(x)=Rmf(x,y)dyg(x) = \int_{R^m} f(x,y) dy is log-concave.

    proof)proof) g(θx1+(1θ)x2)=Rmf(θx1+(1θ)x2,y)dyg(\theta x_1 + (1-\theta)x_2) = \int_{R^m}f(\theta x_1 + (1-\theta)x_2, y)dy
    Rmf(x1,y)θf(x2,y)1θdy\ge \int_{R_m}f(x_1,y)^\theta f(x_2,y)^{1-\theta}dy
    (Rmf(x1,y)dy)θ+(Rmf(x2,y)dy)1θ\ge (\int_{R_m}f(x_1,y)dy)^\theta + (\int_{R^m}f(x_2, y)dy)^{1-\theta} By Jensen's ineq

  • The convolution
    (fg)(x)=f(xy)g(y)dy(f \star g)(x) = \int f(x-y)g(y)dy of log-concave functions is log-concave.

Conjugate function

The conjugate function ff^* of a proper function ff is given by

f(y)=supxdomf{<x,y>f(x)}f^*(y) = sup_{x \in dom f}\{<x,y> - f(x)\}. The conjugate is always a convex function : trivial

  • If ff is differentiable, then y=f(x)y = \bigtriangledown f(x^*)

  • If f is strictly convex, we can write f1(y)=x\bigtriangledown f^{-1}(y) = x^*, so f(y)=<f1(y),yf(f1(y))f^*(y) = <\bigtriangledown f^{-1}(y), y - f(\bigtriangledown f^{-1}(y))

  • Fenchel's inequality
    From definition, f(x)+f(x)<x,y>f(x) + f(x^*) \ge <x,y>

  • Fenchel's duality

    Suppose that f:RnR{}f : R^n \to R \cup \{\infty\} and g:RmR{}g : R^m \to R \cup \{\infty\} are closed proper convex functions and ARmnA \in R^{mn} is an m x n matrix. Then

    infx{f(x)+g(Ax)}=supλ{f(ATλ)g(λ)}inf_{x} \{f(x) + g(Ax) \} = sup_{\lambda}\{-f^\star(A^T\lambda)-g^\star(-\lambda)\}

    proof : by Fenchel's inequality

    Question:

  • 127p-separating / supporting plane pf

  • epigraph convexity pf

  • 129p closed convex function??

  • 130p zeroth-order pf

  • 133p 134p log convexity properties : convolution 의미?

profile
통계학과 대학원생입니다.

0개의 댓글