KKT Operator and KKT conditions

Shuvomoy Das Gupta

September 27, 2024

This is a short blog to show the connection between the zeros of the

KKT (Karush–Kuhn–Tucker) operator and the KKT conditions.

Contents

Optimization problem in consideration. 1

KKT conditions. 1

KKT operator. 2

Zeros of the KKT operator are the KKT points. 2

Optimization problem in consideration.

Consider the minimization problem

Notation and notions.

• R

= {x ∈ R

| x ⪰ 0} = {x ∈ R

∀

i∈{1,2,...,n}

≥ 0}.

• Indicator function. Indicator function of

a set S is deﬁned by:

(x) =

(

0, if x ∈ S ,

∞, else.

• Normal cone. The subdifferential set of

is called the normal cone of S, and is

denoted by N

= ∂δ

(x). It can be shown

easily that,

(x) =

(

{u | sup

z∈S

⟨

u | z − x

⟩

≤ 0}, if x ∈ S

∅, if x /∈ S .

Furthermore, N

(x) = {0} for any

x ∈ interior(S), so the normal cone takes

interesting values only at the boundary of

the set.

• Set addition. If A, B are two sets in R

then A + B = {x + y | x ∈ A, y ∈ B}.

Also, A + ∅ = ∅.







minimize

x∈R

(x)

subject to f

(x) ≤ 0, i = 1, 2, . . . , m, ▷dual variable λ

≥ 0

(x) = 0, i = 1, 2, . . . , p, ▷dual variable ν : free







(P)

where w f

, f

, . . . , f

, h

, . . . , h

: R

→ R ∪ {∞} are proper, closed,

and differentiable functions. We assume that the problem is feasible

and has a ﬁnite optimal solution.

KKT conditions.

For problem (P), the Lagrangian function is deﬁned as,

L(x, λ, ν) = f

(x) +

∑

i=1

(x) +

∑

i=1

(x),

where λ ⪰ 0. The points ( x, λ, ν) are called KKT points if they satisfy

the following four conditions:

1. Primal feasibility. f

(x) ≤ 0, for all i = 1, 2, . . . , m, and h

(x) = 0, for

all i = 1, 2, . . . , p.

2. Dual feasibility. λ

≥ 0 for all i = 1, 2, . . . , m.

3. Vanishing gradient of Lagrangian at primal variable. ∇

(x) +

∑

i=1

∇

(x) +

∑

i=1

∇

(x) = 0.

4. Complementary slackness. λ

(x) = 0 for all i = 1, 2, . . . , m.

The four conditions above are called the KKT conditions.

kkt operator and kkt conditions 2

KKT operator.

Calculation for (a).

Note that

∂



ext

(x, λ, ν)



=∂

(x) +

∑

i=1

(x) +

∑

i=1

(x) − δ

(λ)

=∇

(x) +

∑

i=1

∇

(x) +

∑

i=1

∇

(x) −

z }| {

∂

(λ)

=∇

(x) +

∑

i=1

∇

(x) +

∑

i=1

∇

(x),

and

∂

(λ,ν)



−L

ext

(x, λ, ν)







∂



− f

(x) − λ

⊤

F(x) − ν

⊤

H(x) + δ

(λ)



∂



− f

(x) − λ

⊤

F(x) − ν

⊤

H(x) + δ

(λ)













−

z }| {

∇

(x) −

F(x)

z }| {

∇

⊤

F(x) −

z }| {

∇

⊤

H(x) +

(λ)

z }| {

∂

(λ)

− ∇

(x)

| {z }

− ∇

⊤

F(x)

| {z }

− ∇

⊤

H(x)

| {z }

H(x)

+ ∂

(λ)

| {z }







using ∂( f

(x) + f

(x)) = ∂ f

(x) + ∇ f

(x),

where f

subdifferentiable and f

differentiable



−F(x) + N

(λ)

−H(x)



The KKT operator of (P) is deﬁned based on the extended La-

grangian function. The extended Lagrangian function is deﬁned as:

ext

(x, λ, ν) = f

(x) +

∑

i=1

(x) +

∑

i=1

(x) − δ

(λ)

= f

(x) + λ

⊤

F(x) + ν

⊤

H(x) − δ

(λ),

where in the second line use the notation:

F(x) =







(x)







, and H(x) =







(x)







The KKT operator associated with L

ext

(x, λ, ν) is deﬁned as:

T(x, λ, ν) =

∂

(

ext

(x, λ, ν)

)

∂

(λ,ν)

(

−L

ext

(x, λ, ν)

)

(a)







∇

(x) +

∑

i=1

∇

(x) +

∑

i=1

∇

(x)

−F(x) + N

(λ)

−H(x)







, (1)

where the calculation of (a) is shown in the sidenote.

Zeros of the KKT operator are the KKT points.

Normal cone of R

The normal cone of R

is given by:

(y) =

(

∅, if y /∈ R

{

u ∈ R

| u ⪯ 0,

⟨

u | y

⟩

= 0

}

if y ∈ R

(2)

In other words, if y ∈ R

then u ∈ N

(y) is

deﬁned by:

(

= 0, if y

> 0

≤ 0, if y

≥ 0

means for those y

= 0, we set the associated

≤ 0 and for those y

> 0 we set the

associated u

= 0.

Explanation ( b).

The inclusion 0 ∈ −F(x) + N

(λ) is

the same as saying that there is some u ∈

(λ) such that −F(x) + N

(λ) = 0.

We cannot have N

(λ) = ∅ as otherwise

−F(x) + N

(λ) = ∅, but we know that it

at least contains 0. Hence, we must also have

λ ⪰ 0.

We now claim that if 0 ∈ T(x, λ, ν) if and only if (x, λ, ν) are the

KKT points satisfying the KKT conditions. Clearly, the ﬁrst row of

T(x, λ, ν) being zero is the same as the third KKT condition: vanishing

gradient of Lagrangian at primal variable. The third row of T(x, λ, ν)

being zero is the same as second part of the ﬁrst KKT conditions:

primal feasibility of the equality constraints. Now, we show that the

second row of T(x, λ, ν) being zero is equivalent to the rest of the

KKT conditions as follows.

We start with

− F(x) + N

(λ) ∋ 0

see

Explanation (b)

⇐⇒







∃

u∈N

(λ)

− F(x) + u = 0,

λ ⪰ 0.

⇔











− f

(x) + u

= 0, i ∈ {1, 2, . . . , m},

u ∈ N

(λ),

λ ⪰ 0.

kkt operator and kkt conditions 3

⇔











(x) = u

, i ∈ {1, 2, . . . , m},

u ⪯ 0,

⟨

u | λ

⟩

= 0, /

using (2)

λ ⪰ 0.

⇔











(x) = u

i ∈ {1, 2, . . . , m},

≤ 0, λ

≥ 0, i ∈ {1, 2, . . . , m},

⟨

u | λ

⟩

∑

i=1

(x) = 0,

⇔











(x) = u

≤ 0 i ∈ {1, 2, . . . , m},

≥ 0, i ∈ {1, 2, . . . , m},

∑

i=1

(x) = 0.

⇔











(x) ≤ 0 i ∈ {1, 2, . . . , m},

≥ 0, i ∈ {1, 2, . . . , m},

(x) = 0, i ∈ {1, 2, . . . , m}, /

addition of nonpositive summands is zero

if and only if each summand is zero.

where in the last line, the ﬁrst inequality is primal feasibility of the

inequality constraints, the second inequality is dual feasibility, and

the last line is complementary slackness of the KKT conditions.