Generalization of the Legendre transformation
In mathematics and mathematical optimization, the convex conjugate of a function is a generalization of the Legendre transformation which applies to non-convex functions. It is also known as the Legendre–Fenchel transformation, Fenchel transformation, or Fenchel conjugate (after Adrien-Marie Legendre and Werner Fenchel). The convex conjugate is widely used for constructing the dual problem in optimization theory, thus generalizing Lagrangian duality.
Let $X$ be a real topological vector space and let $X^{*}$ be the dual space to $X$. Denote by
$$\langle \cdot ,\cdot \rangle : X^{*} \times X \to \mathbb{R}$$
the canonical dual pairing, which is defined by $\left\langle x^{*}, x \right\rangle \mapsto x^{*}(x)$.
For a function $f : X \to \mathbb{R} \cup \{-\infty, +\infty\}$ taking values on the extended real number line, its convex conjugate is the function $f^{*} : X^{*} \to \mathbb{R} \cup \{-\infty, +\infty\}$ whose value at $x^{*} \in X^{*}$ is defined to be the supremum:
$$f^{*}\left(x^{*}\right) := \sup \left\{ \left\langle x^{*}, x \right\rangle - f(x) : x \in X \right\},$$
or, equivalently, in terms of the infimum:
$$f^{*}\left(x^{*}\right) := -\inf \left\{ f(x) - \left\langle x^{*}, x \right\rangle : x \in X \right\}.$$
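As a quick numerical illustration of the definition (a sketch added here, not part of the source article): the supremum can be approximated by maximizing over a finite grid of points. The helper name `conjugate` and the grid ranges below are illustrative assumptions.

```python
import numpy as np

def conjugate(f, xs):
    """Approximate f*(x*) = sup { <x*, x> - f(x) : x in X } over a finite grid xs."""
    fx = f(xs)
    return lambda x_star: np.max(x_star * xs - fx)

# For f(x) = x^2 the conjugate is f*(x*) = (x*)^2 / 4; at x* = 2 the
# supremum of 2x - x^2 is attained at x = 1 with value 1.
xs = np.linspace(-10, 10, 100001)
f_star = conjugate(lambda x: x**2, xs)
print(f_star(2.0))  # ≈ 1.0
```

The grid must be wide enough to contain the maximizer, otherwise the discrete supremum underestimates the true one.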
This definition can be interpreted as an encoding of the convex hull of the function's epigraph in terms of its supporting hyperplanes.[1]
For more examples, see § Table of selected convex conjugates.
The convex conjugate of an affine function $f(x) = \langle a, x \rangle - b$ is
$$f^{*}\left(x^{*}\right) = \begin{cases} b, & x^{*} = a \\ +\infty, & x^{*} \neq a. \end{cases}$$
The convex conjugate of a power function $f(x) = \tfrac{1}{p}|x|^{p}$, $1 < p < \infty$, is $f^{*}\left(x^{*}\right) = \tfrac{1}{q}|x^{*}|^{q}$, $1 < q < \infty$, where $\tfrac{1}{p} + \tfrac{1}{q} = 1$.
The convex conjugate of the absolute value function $f(x) = |x|$ is
$$f^{*}\left(x^{*}\right) = \begin{cases} 0, & |x^{*}| \leq 1 \\ \infty, & |x^{*}| > 1. \end{cases}$$
The convex conjugate of the exponential function $f(x) = e^{x}$ is
$$f^{*}\left(x^{*}\right) = \begin{cases} x^{*} \ln x^{*} - x^{*}, & x^{*} > 0 \\ 0, & x^{*} = 0 \\ \infty, & x^{*} < 0. \end{cases}$$
The convex conjugate and Legendre transform of the exponential function agree, except that the domain of the convex conjugate is strictly larger, as the Legendre transform is only defined for positive real numbers.
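The exponential case can be checked numerically (a discretized sketch under assumed grid parameters, not from the source): the supremum of $x^{*}x - e^{x}$ is attained at $x = \ln x^{*}$, recovering $x^{*}\ln x^{*} - x^{*}$ for $x^{*} > 0$.

```python
import numpy as np

# Discrete supremum of x*·x - e^x over a grid, compared with the
# closed-form conjugate x* ln(x*) - x* for several x* > 0.
xs = np.linspace(-20, 20, 400001)

def exp_conjugate(x_star):
    return np.max(x_star * xs - np.exp(xs))

for s in (0.5, 1.0, 3.0):
    exact = s * np.log(s) - s
    assert abs(exp_conjugate(s) - exact) < 1e-4
```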
Connection with expected shortfall (average value at risk)
See this article for an example.
Let F denote the cumulative distribution function of a random variable X. Then (integrating by parts),
$$f(x) := \int_{-\infty}^{x} F(u)\,du = \operatorname{E}\left[\max(0, x - X)\right] = x - \operatorname{E}\left[\min(x, X)\right]$$
has the convex conjugate
$$f^{*}(p) = \int_{0}^{p} F^{-1}(q)\,dq = (p-1)F^{-1}(p) + \operatorname{E}\left[\min(F^{-1}(p), X)\right] = pF^{-1}(p) - \operatorname{E}\left[\max(0, F^{-1}(p) - X)\right].$$
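A concrete check of this conjugate pair (an illustrative sketch, not from the source): for X uniform on [0, 1], $F^{-1}(q) = q$, so the formula gives $f^{*}(p) = p^{2}/2$ on $[0, 1]$, which should agree with the direct definition of the conjugate.

```python
import numpy as np

# For X ~ Uniform(0, 1): F(u) = u on [0, 1], so f(x) = ∫_{-∞}^x F(u) du
# equals 0 for x ≤ 0, x²/2 on [0, 1], and x - 1/2 for x ≥ 1.
def f(x):
    return np.where(x <= 0, 0.0, np.where(x >= 1, x - 0.5, 0.5 * x**2))

# Direct definition f*(p) = sup_x { p·x - f(x) } versus ∫_0^p F^{-1} = p²/2.
xs = np.linspace(-2.0, 3.0, 50001)
for p in (0.25, 0.5, 0.9):
    direct = np.max(p * xs - f(xs))
    assert abs(direct - p**2 / 2) < 1e-4
```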
A particular interpretation has the transform
$$f^{\text{inc}}(x) := \arg\sup_{t}\, t \cdot x - \int_{0}^{1} \max\{t - f(u), 0\}\,du,$$
as this is a nondecreasing rearrangement of the initial function f; in particular, $f^{\text{inc}} = f$ for f nondecreasing.
The convex conjugate of a closed convex function is again a closed convex function. The convex conjugate of a polyhedral convex function (a convex function with polyhedral epigraph) is again a polyhedral convex function.
Declare that $f \leq g$ if and only if $f(x) \leq g(x)$ for all $x$. Then convex-conjugation is order-reversing, which by definition means that if $f \leq g$ then $f^{*} \geq g^{*}$.
For a family of functions $\left(f_{\alpha}\right)_{\alpha}$ it follows from the fact that supremums may be interchanged that
$$\left(\inf_{\alpha} f_{\alpha}\right)^{*}\left(x^{*}\right) = \sup_{\alpha} f_{\alpha}^{*}\left(x^{*}\right),$$
and from the max–min inequality that
$$\left(\sup_{\alpha} f_{\alpha}\right)^{*}\left(x^{*}\right) \leq \inf_{\alpha} f_{\alpha}^{*}\left(x^{*}\right).$$
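The first identity can be sanity-checked on a grid (a numerical sketch with an assumed two-member family of shifted parabolas):

```python
import numpy as np

# Check (inf_α f_α)* = sup_α f_α* for f_1(x) = (x-1)², f_2(x) = (x+1)².
xs = np.linspace(-10, 10, 20001)
f1, f2 = (xs - 1)**2, (xs + 1)**2

def conj(h, s):                      # discrete sup { s·x - h(x) }
    return np.max(s * xs - h)

for s in (-2.0, 0.0, 1.5):
    lhs = conj(np.minimum(f1, f2), s)       # conjugate of the pointwise infimum
    rhs = max(conj(f1, s), conj(f2, s))     # supremum of the conjugates
    assert abs(lhs - rhs) < 1e-9
```

On a finite grid the identity even holds exactly, since $\sup_x \max_\alpha$ and $\max_\alpha \sup_x$ coincide term by term.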
The convex conjugate of a function is always lower semi-continuous. The biconjugate $f^{**}$ (the convex conjugate of the convex conjugate) is also the closed convex hull, i.e. the largest lower semi-continuous convex function with $f^{**} \leq f$. For proper functions $f$, $f = f^{**}$ if and only if $f$ is convex and lower semi-continuous, by the Fenchel–Moreau theorem.
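The hull property of the biconjugate can be seen numerically (a discretized sketch under assumed grid parameters): for a nonconvex double well, $f^{**}$ drops below $f$ on the nonconvex region.

```python
import numpy as np

# f(x) = min((x-1)², (x+1)²) is nonconvex; its biconjugate f** is the
# closed convex hull, which equals 0 on [-1, 1] while f(0) = 1.
xs = np.linspace(-4, 4, 2001)
f  = np.minimum((xs - 1)**2, (xs + 1)**2)

slopes = np.linspace(-10, 10, 2001)
f_star = np.max(slopes[:, None] * xs[None, :] - f[None, :], axis=1)   # f*
f_bi   = np.max(xs[:, None] * slopes[None, :] - f_star[None, :], axis=1)  # f**

i = len(xs) // 2                       # index of x = 0 on the grid
assert abs(f[i] - 1.0) < 1e-12         # f(0) = 1
assert abs(f_bi[i]) < 1e-6             # f**(0) ≈ 0, the convex hull value
```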
Fenchel's inequality
For any function f and its convex conjugate f*, Fenchel's inequality (also known as the Fenchel–Young inequality) holds for every $x \in X$ and $p \in X^{*}$:
$$\left\langle p, x \right\rangle \leq f(x) + f^{*}(p).$$
Furthermore, the equality holds only when $p \in \partial f(x)$.
The proof follows from the definition of the convex conjugate:
$$f^{*}(p) = \sup_{\tilde{x}} \left\{ \langle p, \tilde{x} \rangle - f(\tilde{x}) \right\} \geq \langle p, x \rangle - f(x).$$
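The inequality is easy to exercise numerically (an illustrative sketch with an assumed quadratic example): for $f(x) = x^{2}/2$ the conjugate is $f^{*}(p) = p^{2}/2$, and the inequality reduces to $px \leq (x^{2} + p^{2})/2$, with equality exactly when $p = f'(x) = x$.

```python
import numpy as np

# Fenchel's inequality <p, x> ≤ f(x) + f*(p) for f(x) = x²/2, f*(p) = p²/2.
f      = lambda x: 0.5 * x**2
f_star = lambda p: 0.5 * p**2

rng = np.random.default_rng(0)
for x, p in rng.normal(size=(100, 2)):
    assert p * x <= f(x) + f_star(p) + 1e-12

# Equality at p = x, i.e. p in the subdifferential of f at x.
assert abs(3.0 * 3.0 - (f(3.0) + f_star(3.0))) < 1e-12
```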
For two functions $f_{0}$ and $f_{1}$ and a number $0 \leq \lambda \leq 1$ the convexity relation
$$\left((1 - \lambda)f_{0} + \lambda f_{1}\right)^{*} \leq (1 - \lambda)f_{0}^{*} + \lambda f_{1}^{*}$$
holds. The ${*}$ operation is a convex mapping itself.
Infimal convolution
The infimal convolution (or epi-sum) of two functions $f$ and $g$ is defined as
$$\left(f \mathbin{\Box} g\right)(x) = \inf \left\{ f(x - y) + g(y) \mid y \in \mathbb{R}^{n} \right\}.$$
Let $f_{1}, \ldots, f_{m}$ be proper, convex and lower semicontinuous functions on $\mathbb{R}^{n}$. Then the infimal convolution is convex and lower semicontinuous (but not necessarily proper),[2] and satisfies
$$\left(f_{1} \mathbin{\Box} \cdots \mathbin{\Box} f_{m}\right)^{*} = f_{1}^{*} + \cdots + f_{m}^{*}.$$
The infimal convolution of two functions has a geometric interpretation: the (strict) epigraph of the infimal convolution of two functions is the Minkowski sum of the (strict) epigraphs of those functions.[3]
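The conjugation identity above can be sanity-checked on a grid (a discretized sketch; the helper names `inf_conv` and `conj` are illustrative, and grid truncation limits the accuracy). For $f(x) = x^{2}$ and $g(x) = 2x^{2}$ one has $f \mathbin{\Box} g = \tfrac{2}{3}x^{2}$, and both sides of the identity equal $3s^{2}/8$.

```python
import numpy as np

# Check (f □ g)* = f* + g* for f(x) = x² and g(x) = 2x² on a grid.
xs = np.linspace(-20, 20, 2001)
f, g = xs**2, 2 * xs**2

def inf_conv(x):                       # discrete inf_y { f(x - y) + g(y) }
    shifted = np.interp(x - xs, xs, f, left=np.inf, right=np.inf)
    return np.min(shifted + g)

def conj(h, s):                        # discrete sup { s·x - h(x) }
    return np.max(s * xs - h)

epi_sum = np.array([inf_conv(x) for x in xs])
for s in (-1.0, 0.0, 2.0):
    assert abs(conj(epi_sum, s) - (conj(f, s) + conj(g, s))) < 1e-2
```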
Maximizing argument
If the function $f$ is differentiable, then its derivative is the maximizing argument in the computation of the convex conjugate:
$$f'(x) = x^{*}(x) := \arg\sup_{x^{*}} \langle x, x^{*} \rangle - f^{*}\left(x^{*}\right)$$
and
$$f^{*\prime}\left(x^{*}\right) = x\left(x^{*}\right) := \arg\sup_{x} \langle x, x^{*} \rangle - f(x);$$
hence
$$x = \nabla f^{*}\left(\nabla f(x)\right),$$
$$x^{*} = \nabla f\left(\nabla f^{*}\left(x^{*}\right)\right),$$
and moreover
$$f''(x) \cdot f^{*\prime\prime}\left(x^{*}(x)\right) = 1,$$
$$f^{*\prime\prime}\left(x^{*}\right) \cdot f''\left(x\left(x^{*}\right)\right) = 1.$$
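These relations can be illustrated with the exponential pair from the examples (a sketch, not from the source): for $f(x) = e^{x}$ with $f^{*}(x^{*}) = x^{*}\ln x^{*} - x^{*}$, the gradients $\nabla f(x) = e^{x}$ and $\nabla f^{*}(s) = \ln s$ are mutually inverse, and the second-derivative products equal 1.

```python
import numpy as np

# Gradient maps of the conjugate pair f(x) = eˣ, f*(s) = s ln s - s.
grad_f      = lambda x: np.exp(x)      # ∇f
grad_f_star = lambda s: np.log(s)      # ∇f*

for x in (-1.0, 0.0, 2.5):
    # x = ∇f*(∇f(x)) and f''(x)·f*''(eˣ) = eˣ·(1/eˣ) = 1
    assert abs(grad_f_star(grad_f(x)) - x) < 1e-12
    assert abs(np.exp(x) * (1.0 / np.exp(x)) - 1.0) < 1e-12
for s in (0.5, 1.0, 3.0):
    # s = ∇f(∇f*(s))
    assert abs(grad_f(grad_f_star(s)) - s) < 1e-12
```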
If for some $\gamma > 0$, $g(x) = \alpha + \beta x + \gamma \cdot f\left(\lambda x + \delta\right)$, then
$$g^{*}\left(x^{*}\right) = -\alpha - \delta\,\frac{x^{*} - \beta}{\lambda} + \gamma \cdot f^{*}\left(\frac{x^{*} - \beta}{\lambda\gamma}\right).$$
Let $A : X \to Y$ be a bounded linear operator. For any convex function $f$ on $X$,
$$\left(Af\right)^{*} = f^{*}A^{*}$$
where
$$(Af)(y) = \inf\{f(x) : x \in X,\ Ax = y\}$$
is the preimage of $f$ with respect to $A$ and $A^{*}$ is the adjoint operator of $A$.[4]
A closed convex function $f$ is symmetric with respect to a given set $G$ of orthogonal linear transformations,
$$f(Ax) = f(x) \quad \text{for all } x \text{ and all } A \in G,$$
if and only if its convex conjugate $f^{*}$ is symmetric with respect to $G$.
Table of selected convex conjugates
The following table provides Legendre transforms for many common functions as well as a few useful properties.[5]
| $g(x)$ | $\operatorname{dom}(g)$ | $g^{*}(x^{*})$ | $\operatorname{dom}(g^{*})$ |
| --- | --- | --- | --- |
| $f(ax)$ (where $a \neq 0$) | $X$ | $f^{*}\left(\frac{x^{*}}{a}\right)$ | $X^{*}$ |
| $f(x + b)$ | $X$ | $f^{*}(x^{*}) - \langle b, x^{*} \rangle$ | $X^{*}$ |
| $af(x)$ (where $a > 0$) | $X$ | $af^{*}\left(\frac{x^{*}}{a}\right)$ | $X^{*}$ |
| $\alpha + \beta x + \gamma \cdot f(\lambda x + \delta)$ | $X$ | $-\alpha - \delta\,\frac{x^{*} - \beta}{\lambda} + \gamma \cdot f^{*}\left(\frac{x^{*} - \beta}{\gamma\lambda}\right) \quad (\gamma > 0)$ | $X^{*}$ |
| $\frac{\lvert x \rvert^{p}}{p}$ (where $p > 1$) | $\mathbb{R}$ | $\frac{\lvert x^{*} \rvert^{q}}{q}$ (where $\frac{1}{p} + \frac{1}{q} = 1$) | $\mathbb{R}$ |
| $\frac{-x^{p}}{p}$ (where $0 < p < 1$) | $\mathbb{R}_{+}$ | $\frac{-(-x^{*})^{q}}{q}$ (where $\frac{1}{p} + \frac{1}{q} = 1$) | $\mathbb{R}_{--}$ |
| $\sqrt{1 + x^{2}}$ | $\mathbb{R}$ | $-\sqrt{1 - (x^{*})^{2}}$ | $[-1, 1]$ |
| $-\log(x)$ | $\mathbb{R}_{++}$ | $-(1 + \log(-x^{*}))$ | $\mathbb{R}_{--}$ |
| $e^{x}$ | $\mathbb{R}$ | $\begin{cases} x^{*}\log(x^{*}) - x^{*} & \text{if } x^{*} > 0 \\ 0 & \text{if } x^{*} = 0 \end{cases}$ | $\mathbb{R}_{+}$ |
| $\log\left(1 + e^{x}\right)$ | $\mathbb{R}$ | $\begin{cases} x^{*}\log(x^{*}) + (1 - x^{*})\log(1 - x^{*}) & \text{if } 0 < x^{*} < 1 \\ 0 & \text{if } x^{*} = 0, 1 \end{cases}$ | $[0, 1]$ |
| $-\log\left(1 - e^{x}\right)$ | $\mathbb{R}_{--}$ | $\begin{cases} x^{*}\log(x^{*}) - (1 + x^{*})\log(1 + x^{*}) & \text{if } x^{*} > 0 \\ 0 & \text{if } x^{*} = 0 \end{cases}$ | $\mathbb{R}_{+}$ |
^ "Legendre Transform". Retrieved April 14, 2019.
^ Phelps, Robert (1993). Convex Functions, Monotone Operators and Differentiability (2nd ed.). Springer. p. 42. ISBN 0-387-56715-1.
^ Bauschke, Heinz H.; Goebel, Rafal; Lucet, Yves; Wang, Xianfu (2008). "The Proximal Average: Basic Theory". SIAM Journal on Optimization. 19 (2): 766. CiteSeerX 10.1.1.546.4270. doi:10.1137/070687542.
^ Ioffe, A.D.; Tichomirov, V.M. (1979). Theorie der Extremalaufgaben. Deutscher Verlag der Wissenschaften. Satz 3.4.3.
^ Borwein, Jonathan; Lewis, Adrian (2006). Convex Analysis and Nonlinear Optimization: Theory and Examples (2nd ed.). Springer. pp. 50–51. ISBN 978-0-387-29570-1.