Solving Mixed-Integer Optimization Problems using

Benders Decomposition

Shuvomoy Das Gupta

September 17, 2024

This is a blog on how to solve mixed-integer optimization problems us-

ing Benders decomposition algorithm, due to Jacques F. Benders. The

formulation presented here closely follows [Garﬁnkel and Nemhauser,

1972, Section 4.8].

Figure 1: Jacques F. Benders

(1925-2017).

Contents

Things to recall. 1

Mixed-integer optimization problem to solve. 2

Problem (P) for ﬁxed x

′

∈ Z

. 2

Dual of the minimization problem associated with p

⋆

sub

′

). 3

Relationship between (P), (P(x

′

)) and (D(x

′

)). 4

Representation of (P) via extreme points of S. 5

Setup for the decomposition algorithm. 6

Decomposition algorithm. 6

Beneﬁts of the algorithm. 7

Things to recall.

We recall the following facts that we will use to develop the algo-

rithm.

Notation.

• The symbol ⪯ or ⪰ corresponds to

element-wise greater than or smaller than

type inequlaity.

• Consider the polyhedron P = {x ∈ R

Ax ⪰ b} where

A =







⊤

. . .

⊤







∈ R

5×4

, b =







. . .







∈ R

Consider a feasible point

x ∈ P, where

⊤

x = b

for i ∈ {1, 3, 5} and a

⊤

x > b

for

i ∈ {2, 4}. At

x, we have

active





⊤





∈ R

3×4

, A

nonactive



⊤



∈ R

2×4

• Fact 1: Strong duality in linear programming problems (LPs).

Strong duality does not hold in LPs if and only if both primal and

dual problems are infeasible.

• Fact 2: Extreme point of a polyhedron. Consider the polyhedron

P = {x | Ax ⪰ b} where A ∈ R

m×n

, b ∈ R

with m ≥ n. A point

x ∈ P is an extreme point of P if rank(A

active

) = n.

• Fact 3: Extreme ray of a recesion cone and a polyhedron. Con-

sider the recesion cone C = {x | Ax ⪰ 0}, where A ∈ R

m×n

b ∈ R

with m ≥ n. A point

r ∈ C is an extreme ray of C if

rank(A

active

) = n − 1. An extreme ray of C is also called the the

extreme ray of the underlying polyhedron {x | Ax ⪰ b}.

solving mixed-integer optimization problems using benders decomposition 2

• Fact 4: Optimal cost of an LP. Consider the LP





minimize

x∈R

⊤

subject to Ax ⪰ b.





where A ∈ R

m×n

, b ∈ R

with the feasible set {x | Ax ⪰ b} having

at least one extreme point. Then,

– [Bertsimas and Tsitsiklis, 1997, Theorem 4.14] The polyhedron

{x | Ax ⪰ b} can have only a ﬁnite number of extreme points

and extreme rays.

– [Bertsimas and Tsitsiklis, 1997, Theorem 4.14] The optimal cost

of this problem is −∞ if and only if some extreme ray r of {x |

Ax ⪰ b} satisﬁes c

⊤

r < 0 . In other words, the optimal cost

of this problem is ﬁnite if and only if for every extreme ray r of

{x | Ax ⪰ b}, we have c

⊤

r ≥ 0.

– [Bertsimas and Tsitsiklis, 1997, Theorem 2.7] If the optimal cost

of this problem is ﬁnite, then there exists an optimal solution

which is an extreme point of {x | Ax ⪰ b}.

Mixed-integer optimization problem to solve.

We want to solve the following mixed-integer optimization problem

(MIP):

⋆







maximize

x∈Z

, v∈R

⊤

x + c

⊤

subject to A

x + A

v ⪯ b,

x ⪰ 0, v ⪰ 0,







(P)

where A

∈ R

m×n

, A

∈ R

m×p

, b ∈ R

, c

∈ R

, c

∈ R

. We will

assume that p

⋆

is ﬁnite and denote the optimal solution by x

⋆

, v

⋆

Note that we are not saying anything about the boundedness of the

feasible set.

Problem (P) for ﬁxed x

′

∈ Z

Let us consider some x

′

∈ Z

, i.e., some ﬁxed integer-valued x

′

⪰ 0,

and deﬁne:

⋆

′

) =







maximize

v∈R

⊤

′

+ c

⊤

subject to A

′

+ A

v ⪯ b,

v ⪰ 0.







= c

⊤

′







maximize

v∈R

⊤

subject to A

v ⪯ b − A

′

v ⪰ 0.







solving mixed-integer optimization problems using benders decomposition 3

= c

⊤

′

−







minimize

v∈R

−c

⊤

subject to A

v ⪯ b − A

′

, ▷ d.v. u ⪰ 0

−v ⪯ 0. ▷ d.v. λ ⪰ 0







{z }

⋆

sub

′

)

= c

⊤

′

− p

⋆

sub

′

). (P(x

′

))

Any x ∈ Z

such that p

⋆

(x) is ﬁnite, is called an admissible solu-

tion. As p

⋆

is ﬁnite, we can write

⋆

= max

x∈Z

x:admissible

⋆

(x). (1)

Clearly the optimal x

⋆

to (P) is an admissible solution.

Dual of the minimization problem associated with p

⋆

sub

′

We will follow the style of [Boyd and Vandenberghe, 2004, Chapter

5]. The Lagrangian of the minimization problem associated with

⋆

sub

′

) is:

L(v, u, λ) = −c

⊤

v + u

⊤



v − (b − A

′

)



− v

⊤



−c

+ A

⊤

u − λ



⊤

v − u

⊤

(b − A

′

with the dual function:

g(u, λ) = inf

L(v, u, λ) =







−u

⊤

(b − A

′

), if − c

+ A

⊤

u − λ = 0,

−∞, else.

As a result, the dual problem of the minimization problem associ-

ated with p

⋆

sub

′

) is:

⋆

sub

′

) =







maximize

u∈R

, λ∈R

−(b − A

′

)

⊤

subject to −c

+ A

⊤

u − λ = 0,

u ⪰ 0, λ ⪰ 0.













maximize

u∈R

, λ∈R

−(b − A

′

)

⊤

subject to −c

+ A

⊤

u = λ ⪰ 0,

u ⪰ 0.













maximize

u∈R

−(b − A

′

)

⊤

subject to A

⊤

u ⪰ c

u ⪰ 0.







Now, due to weak duality, we always have d

⋆

sub

′

) ≤ p

⋆

sub

′

). For

solving mixed-integer optimization problems using benders decomposition 4

notational convenience, deﬁne:

⋆

′

) = c

⊤

′







minimize

u∈R

(b − A

′

)

⊤

subject to A

⊤

u ⪰ c

u ⪰ 0.







(D(x

′

))

So, the relationship between (P(x

′

)) and (D(x

′

)) is as follows.

⋆

′

) = c

⊤

′

− p

⋆

sub

′

)

≤ c

⊤

′

− d

⋆

sub

′

)

= c

⊤

′

−







maximize

u∈R

−(b − A

′

)

⊤

subject to A

⊤

u ⪰ c

u ⪰ 0.







= c

⊤

′







minimize

u∈R

(b − A

′

)

⊤

subject to A

⊤

u ⪰ c

u ⪰ 0.







= d

⋆

′

) (2)

Relationship between (P), (P(x

′

)) and (D(x

′

)).

We can have two possible relationships

between (P), (P(x

′

)) and

Note that, the case that (D(x

′

)) does not

have a feasible solution for some x

′

∈ Z

cannot happen, which we can see as follows.

Suppose (D(x

′

)) does not have a feasible

solution some x

′

∈ Z

, i.e.,

∃

′

∈Z

⋆

′

) = ∞.

This means that the feasible set of

(D(x

′

)), which is the polyhedron

S ≜



u | A

⊤

u ⪰ c

, u ⪰ 0



= ∅ (which

does not contain x

′

), is empty. This also

means that, d

⋆

′

) = ∞ for some x

′

∈ Z

also

implies d

⋆

(

x) = ∞ for every

x, be it real or

integer valued.

Recalling Fact 1, in Case 1, for every

(P (

x)) will be either infeasible, i.e., p

⋆

(

x) =

∞ or unbounded, i.e., p

⋆

(

x) = −∞ [Bertsimas

and Tsitsiklis, 1997, Table 4.2, Page 151]. In

other words, (P ) itself will be infeasible or

unbounded. This is a contradiction to our

assumption that p

⋆

is ﬁnite for some (x

⋆

, v

⋆

(D(x

′

)).

• Case 1. (D(x

′

)) is unbounded for some x

′

∈ Z

. If (D(x

′

)) is

unbounded for some x

′

∈ Z

, i.e.,

∃

′

∈Z

⋆

′

) = −∞,

then strong duality will hold due to Fact 1, and we will have

∃

′

∈Z

⋆

′

) = −∞,

which means that (P(x

′

)) will be infeasible for that x

′

∈ Z

. This

can happen.

• Case 2. (D(x

′

)) has a ﬁnite optimal solution for some x

′

∈ Z

. If

(D(x

′

)) has a ﬁnite optimal solution for some x

′

∈ Z

, then strong

duality will hold for that x

′

So, combining both cases above, we have for every x

′

∈ Z

⋆

≥ d

⋆

′

) = p

⋆

′

), (3)

where the inequality follows from (1).

solving mixed-integer optimization problems using benders decomposition 5

Representation of (P) via extreme points of S.

Let us denote the extreme points and extreme rays of the polyhedron

S by E and R, respectively.

Using Fact 4, from 1, we can construct the following integer linear

program, which we then will use to construct the decomposition

algorithm:

⋆







maximize

x∈Z

x:admissible

⋆

(x)













maximize

x∈Z

x:admissible











⊤

x +







minimize

u∈R

(b − A

⊤

subject to A

⊤

u ⪰ c

u ⪰ 0.





























maximize

x∈Z

∀

r∈R

(b−A

⊤

r≥0











⊤

x +







minimize

u∈R

(b − A

⊤

subject to A

⊤

u ⪰ c

u ⪰ 0.























▷ because x being admissible is the

same as ∀

r∈R

(b − A

⊤

r ≥ 0 due to

Fact 4.







maximize

x∈Z

∀

r∈R

(b−A

⊤

r≥0











⊤

x +







minimize

u∈R

(b − A

⊤

subject to A

⊤

u ⪰ c

u ⪰ 0.



























maximize

x∈Z

∀

r∈R

(b−A

⊤

r≥0



⊤

x + min

u∈E



(b − A

⊤









▷ because we are enforcing the admis-

sibility, an optimal solution to the inner

LP can be found just by searching over

E (due to Fact 4).







maximize

⊤

x + min

u∈E



(b − A

⊤



subject to ∀

r∈R

(b − A

⊤

r ≥ 0,

x ∈ Z













maximize

x,t

subject to t ≤ min

u∈E



⊤

x + (b − A

⊤



∀

r∈R

(b − A

⊤

r ≥ 0,

x ∈ Z













maximize

x,t

subject to ∀

u∈E

t ≤ c

⊤

x + (b − A

⊤

∀

r∈R

(b − A

⊤

r ≥ 0,

x ∈ Z







enum

)

One may think that now we can ﬁnd the optimal solution via

enumeration in (P

enum

). Of course, doing that is impractical, as we

need to know all the extreme points and extreme rays of S, which can

be an astronomically large number. However, we can use (P

enum

) to

construct a decomposition algorithm as follows.

solving mixed-integer optimization problems using benders decomposition 6

Setup for the decomposition algorithm.

• Notation. At k-th iteration (k ∈ {1, 2, 3, . . .}) of the decomposition

algorithm, the subset of E is denoted by E(k) and the subset R is

denoted by R(k).

• Master problem. At iteration k, denote the master problem by

⋆

(

E(k), R(k)

)







maximize

x,t

subject to ∀

u∈E(k)

t ≤ c

⊤

x + (b − A

⊤

∀

r∈R(k)

(b − A

⊤

r ≥ 0,

x ∈ Z







(M(k))

with its solution denoted by x

(k)

, t

(k)

. As (M(k)) is a relaxation of

enum

), we have

⋆

≤ p

⋆

(

E(k), R(k)

)

. (4)

• Subproblem. Denote the subproblem by

⋆

(k)

) = c

⊤

(k)







minimize

u∈R

(b − A

(k)

)

⊤

subject to A

⊤

u ⪰ c

u ⪰ 0,







(S(k))

with its solution denoted by u

(k)

Decomposition algorithm.

• Step 1: Initialization. Here k = 1. If the E(1) or R(1) is nonempty,

then go to step 2. If E(1) = R(1) = ∅, let d

⋆

(k) be arbitrarily large

and x

(1)

be any non-negative integer vector and go to Step 3.

• Step 2: Solve the master problem. Solve (M(k)) with the current

value of E(k) and R(k) . There are two possibilities

Note that p

⋆

(k) = −∞ cannot happen. If

⋆

(k) = −∞, then (M(k)) is infeasible, and

because (M(k)) is a relaxation of (P

enum

), we

must have (P

enum

) infeasible, which violates

our assumption that p

⋆

is ﬁnite.

– If p

⋆

(

E(k), R(k)

)

= ∞, then then (M(k)) is not bounded above.

Set t

(k)

to be an arbitrarily large number and compute an x

(k)

that is a feasible solution to (M(k)) for the ﬁxed value of t

(k)

Proceed to step 3. (Note that this case would not happen, if

x ∈ {0, 1}

– If p

⋆

(

E(k), R(k)

)

is ﬁnite, then we collect x

(k)

, t

(k)

and proceed

to step 3.

Note that for both the cases, we must have

⋆

≤ p

⋆

(

E(k), R(k)

)

≤ p

⋆

(

E(k − 1), R(k − 1)

)

as M(k − 1) is a relaxation of M(k) and M(k) is a relaxation of (P)

(or (P

enum

)).

solving mixed-integer optimization problems using benders decomposition 7

• Step 3: Solve the subproblem. Solve (S(k)) for the current value

of x

(k)

, t

(k)

. There are two possibilities again:

– If d

⋆

(k)

) = −∞, then we have found one extreme point u

(k)

∈

E and one extreme ray r

(k)

∈ R such that d

⋆

(k)

) can decrease

as much as we vary θ ≥ 0 in u

(k)

+ θr

(k)

. We set E(k + 1)

E(k) ∪ {u

(k)

} and R(k + 1)

= R(k) ∪ {r

(k)

}, i.e., we add the

following constraints lazily to (M(k) ):

t ≤ c

⊤

x + (b − A

⊤

(k)

(b − A

′

)

⊤

(k)

≥ 0,

Then we set k

= k + 1 and go to Step 2.

– If d

⋆

(k)

) ﬁnite, then we have found one optimal solution

(k)

∈ E, i.e. u

(k)

is an extreme point of S. Using (3) and (4)

we have:

⋆

(k)

) ≤ p

⋆

≤ p

⋆

(

E(k), R(k)

)

. (5)

In this d

⋆

(k)

) ﬁnite case, one of two things can happen:

If d

⋆

(k)

) =

(

E(k), R(k)

)

, then from (5) we must have, p

⋆

(k)

) =

(

E(k), R(k)

)

. So we have found the optimal value

and x

⋆

= x

(k)

is an optimal solution to (P). By solving



P(x

(k)

)



, we can compute an optimal v

⋆

= v

(k)

to (P).

If d

⋆

(k)

) <

(

E(k), R(k)

)

, then we E(k + 1)

= E(k) ∪ {u

(k)

i.e, we add the following constraints lazily to (M(k))

t ≤ c

⊤

x + (b − A

⊤

(k)

Then we set k

= k + 1 and proceed to Step 2.

In worst case, the number of iterations of this algorithm is bounded

by |E| + |R|.

Beneﬁts of the algorithm.

When S(k) has an optimal solution with ﬁnite objective value, after

step 3, we have a feasible solution to (P) along with a measure of

suboptimality p

⋆

(

E(k), R(k)

)

− d

⋆

(k)

) from (5). So, in practice, even

if we terminate early, we can have a good enough feasible solution

with small suboptimality gap.

References

Dimitris Bertsimas and John N Tsitsiklis. Introduction to linear opti-

mization, volume 6. Athena Scientiﬁc Belmont, MA, 1997.

solving mixed-integer optimization problems using benders decomposition 8

Stephen Boyd and Lieven Vandenberghe. Convex optimization. Cam-

bridge university press, 2004.

R. Garﬁnkel and G. Nemhauser. Integer Programming. John Wiley and

Sons, New York, 1972.