MAX-CUT

The MAX-CUT problem is quite famous [1]. It can be stated as follows:

Given an undirected graph $G=(V,E)$, split the nodes into two sets. Maximize the number of edges that have one node in each set.

Here is an example using a random sparse graph:

MAX CUT, visualization using [2]

Here the two sets of nodes are colored green and blue. The red arcs connect a green node and a blue node; these are the arcs we count in the objective. The remaining arcs are grey. The optimal solution has a large number of red arcs.

There are some extensions:

The graph can be sparse (as in the figure above): not every pair of nodes has an arc.
We can use weights on the arcs and maximize the weighted sum of red arcs. We can think of the example as using weights $\color{darkblue}w_{i,j}=1$. In this post I assume $\color{darkblue}w_{i,j}\ge 0$.
We can use a directed version: the maximum directed cut.
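To make the weighted problem statement concrete, here is a minimal brute-force sketch in Python. The function names and the 4-node graph are made up for illustration, and enumeration like this is only feasible for tiny graphs; it is not how the models below are solved.

```python
from itertools import product

def cut_weight(weights, x):
    """Total weight of edges whose endpoints are in different sets."""
    return sum(w for (i, j), w in weights.items() if x[i] != x[j])

def brute_force_max_cut(n, weights):
    """Enumerate all 2^n assignments; only practical for tiny graphs."""
    best = max(product((0, 1), repeat=n), key=lambda x: cut_weight(weights, x))
    return cut_weight(weights, best), best

# hypothetical example: a 4-cycle plus one chord, all weights equal to 1
weights = {(0, 1): 1.0, (1, 2): 1.0, (2, 3): 1.0, (0, 3): 1.0, (0, 2): 1.0}
value, x = brute_force_max_cut(4, weights)
print(value, x)
```

The chord creates a triangle, so not all five edges can be cut at the same time.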

Max-Cut formulations

To indicate whether a node is on team green or team blue, we use a binary variable: \[\color{darkred}x_i = \begin{cases}1 & \text{if node $i$ is a green node}\\ 0 & \text{otherwise}\end{cases}\] One way to maximize the number of red arcs is to use a quadratic formulation:

Unconstrained quadratic model
\[\begin{align}\max&\sum_{(i,j)\in A} \color{darkblue}w_{i,j}\cdot \left(\color{darkred}x_i-\color{darkred}x_j \right)^2 \\ & \color{darkred}x_i\in\{0,1\} \end{align}\]

Here $A$ is the set of arcs. This is non-convex. However, a solver like Cplex will linearize this model automatically for us. We can see this in the log:

Classifier predicts products in MIQP should be linearized.

I.e. Cplex will solve this as a MIP instead of a non-convex MIQP.
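For reference, a textbook way to linearize this objective is sketched below. The auxiliary variable $y_{i,j}$ is my own notation, and I do not know whether Cplex uses exactly this reformulation internally.

```latex
\begin{align}
(x_i - x_j)^2 &= x_i^2 - 2 x_i x_j + x_j^2 = x_i + x_j - 2 x_i x_j && \text{since } x_i^2 = x_i \text{ for } x_i \in \{0,1\} \\
y_{i,j} &\le x_i, \quad y_{i,j} \le x_j, \quad y_{i,j} \ge x_i + x_j - 1, \quad y_{i,j} \ge 0 && \text{so that } y_{i,j} = x_i x_j
\end{align}
```

After substituting $y_{i,j}$ for the product, the objective is linear in $x$ and $y$.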

We can force Cplex to solve as a quadratic model using the option qtolin=0. After this Cplex will automatically change the problem a bit to make it convex:

Repairing indefinite Q in the objective.
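A standard trick for such a repair, shown here as an assumption about the general idea rather than a description of what Cplex actually does, is to shift the diagonal of $Q$. This does not change the objective at binary points:

```latex
\begin{align}
x^T Q x &= x^T (Q - \mu I) x + \mu \sum_i x_i^2 \\
        &= x^T (Q - \mu I) x + \mu \sum_i x_i && \text{since } x_i^2 = x_i \text{ for } x_i \in \{0,1\}
\end{align}
```

When $\mu$ is at least the largest eigenvalue of $Q$, the matrix $Q - \mu I$ is negative semidefinite, so the maximization objective becomes concave and the relaxations are convex problems.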

Instead of using a quadratic objective, we can also use slightly different objectives, such as: \[\max\>\sum_{(i,j)\in A} \color{darkblue}w_{i,j}\cdot \left| \color{darkred}x_i-\color{darkred}x_j \right|\] or \[\max\>\sum_{(i,j)\in A} \color{darkblue}w_{i,j}\cdot \left(\color{darkred}x_i \>{\bf xor }\> \color{darkred}x_j \right)\] Here $\bf xor$ is the "exclusive or" operation, which can be defined by a truth table:

$x$	$y$	$x\>{\bf xor}\>y$
0	0	0
0	1	1
1	0	1
1	1	0
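A quick way to confirm that the three objective terms coincide for binary values is to enumerate the four cases. This is a small illustrative check; the function name is hypothetical.

```python
from itertools import product

def edge_terms(x, y):
    """Return the three candidate objective terms for binary x and y."""
    return (x - y) ** 2, abs(x - y), x ^ y

for x, y in product((0, 1), repeat=2):
    square, absolute, xor = edge_terms(x, y)
    # all three expressions agree on binary inputs
    assert square == absolute == xor
    print(x, y, xor)
```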

$z =x\>{\bf xor}\>y$ can also be written as a system of linear inequalities: \[\begin{align}&z \le x+y\\ & z \ge x-y \\ & z \ge y-x \\ & z \le 2-x-y\end{align}\] Because we maximize $z$ with nonnegative weights, the objective already pushes $z$ upward, so we can drop the $\ge$ conditions. Of course, we can also interpret the two remaining inequalities directly: \[\begin{align} & z \le x+y: && x=y=0 \Rightarrow z=0 \\ & z \le 2-x-y: && x=y=1 \Rightarrow z=0\end{align}\]
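The inequalities can be checked by enumeration. The sketch below (with a hypothetical helper name) verifies that the full system admits only $z = x\>{\bf xor}\>y$, and that with only the two $\le$ inequalities the largest feasible $z$ is still the xor value.

```python
from itertools import product

def feasible_z(x, y, use_lower_bounds=True):
    """Binary z values satisfying the linear system for given binary x, y."""
    result = []
    for z in (0, 1):
        ok = z <= x + y and z <= 2 - x - y
        if use_lower_bounds:
            ok = ok and z >= x - y and z >= y - x
        if ok:
            result.append(z)
    return result

for x, y in product((0, 1), repeat=2):
    # all four inequalities pin z to exactly x xor y
    assert feasible_z(x, y) == [x ^ y]
    # with only the <= inequalities, the largest feasible z is still x xor y
    assert max(feasible_z(x, y, use_lower_bounds=False)) == x ^ y
```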

So a hand-crafted linear MIP model can look like:

Linear MIP model
\[\begin{align}\max&\sum_{(i,j)\in A} \color{darkblue}w_{i,j}\cdot \color{darkred} e_{i,j}\\ &\color{darkred} e_{i,j} \le\color{darkred}x_i+\color{darkred}x_j && \forall (i,j)\in A\\ & \color{darkred} e_{i,j} \le 2 - \color{darkred}x_i-\color{darkred}x_j && \forall (i,j)\in A \\ & \color{darkred}x_i,\color{darkred}e_{i,j}\in\{0,1\} \end{align}\]

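If we want, we can relax $\color{darkred}e_{i,j}$ to be continuous between 0 and 1: for binary $\color{darkred}x$ the right-hand sides of the constraints are integer, so there is always an optimal solution with $\color{darkred}e_{i,j}$ equal to 0 or 1. In practice, good solvers often prefer binary variables in cases like this.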

Here are some results of my experiments.

----    145 PARAMETER results  

                  miqp  miqp/nolin         mip   mip/relax   mip/extra

|i|             70.000      70.000      70.000      70.000      70.000
|a|            514.000     514.000     514.000     514.000     514.000
variables       71.000      71.000     585.000     585.000     585.000
 discrete       70.000      70.000     584.000      70.000     584.000
equations        1.000       1.000    1029.000    1029.000    1029.000
status         Optimal     Optimal     Optimal     Optimal     Optimal
obj            186.232     186.232     186.232     186.232     186.232
time             4.094      23.547      32.157      31.969      31.844
nodes          364.000 1990995.000   32172.000   32172.000   32172.000
iterations  152218.000 4908384.000 4765712.000 4765712.000 4765712.000

The columns are:

miqp: MIQP model with default settings, linearized by Cplex. I noticed that Cplex may or may not linearize the same model depending on the data. To be sure linearization is on, use the option qtolin.
miqp/nolin: MIQP without linearization.
mip: linear model.
mip/relax: linear model with $\color{darkred}e$ variables relaxed.
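mip/extra: add the constraint: \[\sum_i \color{darkred}x_i \le \sum_i (1-\color{darkred}x_i)\] I.e. at most as many selected nodes as unselected ones. This removes some symmetry, as swapping the two sets gives the same cut. It does not seem to make a difference. Note, however, that this column reports the same 1029 equations as mip, which suggests the extra constraint was not part of the model that was actually solved.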
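The symmetry behind the mip/extra constraint can be illustrated on a tiny example. In the sketch below the data is hypothetical. It checks that complementing all $x_i$ leaves the cut unchanged, and that restricting the search to $\sum_i x_i \le \sum_i (1-x_i)$ therefore does not lose the optimum.

```python
from itertools import product

def cut_weight(weights, x):
    """Total weight of edges whose endpoints are in different sets."""
    return sum(w for (i, j), w in weights.items() if x[i] != x[j])

# hypothetical data: a triangle 0-1-2 plus a pendant edge 2-3
weights = {(0, 1): 0.3, (1, 2): 0.9, (0, 2): 0.5, (2, 3): 0.7}
n = 4

all_x = list(product((0, 1), repeat=n))

# complementing every x(i) swaps the two sets but leaves the cut unchanged
for x in all_x:
    flipped = tuple(1 - v for v in x)
    assert cut_weight(weights, x) == cut_weight(weights, flipped)

# restricting to sum(x) <= n - sum(x) therefore keeps an optimal solution
best_all = max(cut_weight(weights, x) for x in all_x)
best_restricted = max(cut_weight(weights, x) for x in all_x if sum(x) <= n - sum(x))
assert best_all == best_restricted
```

The constraint only removes part of the symmetry: when exactly half of the nodes are selected, both a solution and its complement remain feasible.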

Interestingly the quadratic formulation (automatically linearized) works best. I am not sure what Cplex does here that makes it so fast.

Running model and generating interactive plot

Max Directed Cut

Here we have a directed graph. We want to maximize the number of arcs $i \rightarrow j$ such that $i \in S$ and $j \notin S$. An example of an optimal solution is:

Here we see that each arc is directed. The curvy arcs indicate there is an arc $i \rightarrow j$ and an arc $j \rightarrow i$. Note that red arcs always start in a blue node (the set $S$) and end in a green node.

This problem can be modeled as:

Unconstrained quadratic model
\[\begin{align}\max&\sum_{(i,j)\in A} \color{darkblue}w_{i,j}\cdot \color{darkred}x_i \cdot \left(1-\color{darkred}x_j \right) \\ & \color{darkred}x_i\in\{0,1\} \end{align}\]
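As an illustration of this objective, the following sketch evaluates $x_i \cdot (1-x_j)$ over all assignments of a tiny, made-up digraph. The names and data are hypothetical, and enumeration only works for very small instances.

```python
from itertools import product

def dicut_weight(weights, x):
    """Weight of arcs i->j with x[i]=1 (i in S) and x[j]=0 (j not in S)."""
    return sum(w * x[i] * (1 - x[j]) for (i, j), w in weights.items())

# hypothetical 3-node digraph; note that both 0->1 and 1->0 are present
weights = {(0, 1): 1.0, (1, 0): 2.0, (1, 2): 1.0, (2, 0): 1.0}
best = max(product((0, 1), repeat=3), key=lambda x: dicut_weight(weights, x))
print(best, dicut_weight(weights, best))
```

Unlike the undirected case, complementing $x$ generally changes the objective, because an arc $i \rightarrow j$ only counts when it leaves $S$.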

A linearization of this quadratic model can look like:

Linear MIP model
\[\begin{align}\max&\sum_{(i,j)\in A} \color{darkblue}w_{i,j}\cdot \color{darkred}e_{i,j} \\ &\color{darkred}e_{i,j}\le \color{darkred}x_i && \forall (i,j) \in A \\ & \color{darkred}e_{i,j}\le 1-\color{darkred}x_j && \forall (i,j) \in A \\ & \color{darkred}x_i,\color{darkred}e_{i,j}\in\{0,1\} \end{align}\]
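To see that the two constraints reproduce the product, we can enumerate the binary cases. This is a small check with a hypothetical helper name. The largest $e_{i,j}$ allowed by the constraints equals $x_i \cdot (1-x_j)$, and the maximization pushes $e_{i,j}$ to that value when the weight is positive.

```python
from itertools import product

def largest_e(xi, xj):
    """Largest e in {0,1} with e <= xi and e <= 1 - xj."""
    return max(e for e in (0, 1) if e <= xi and e <= 1 - xj)

for xi, xj in product((0, 1), repeat=2):
    # the tightest upper bound on e equals the quadratic term
    assert largest_e(xi, xj) == xi * (1 - xj)
```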

The results are:

----    132 PARAMETER results  

                  miqp  miqp/nolin         mip   mip/relax

|i|             60.000      60.000      60.000      60.000
|a|            877.000     877.000     877.000     877.000
variables       61.000      61.000     938.000     938.000
 discrete       60.000      60.000     937.000      60.000
equations        1.000       1.000    1755.000    1755.000
status         Optimal     Optimal     Optimal     Optimal
obj            161.603     161.603     161.603     161.603
time             2.500       0.125       4.797       4.891
nodes           23.000    5989.000    6451.000    6451.000
iterations   16405.000   14100.000  739951.000  739951.000

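Interestingly, here we should not linearize but rather let Cplex work on the quadratic model directly: miqp/nolin needs 0.125 seconds versus 2.5 seconds for the automatically linearized version.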

Conclusions

MAX-CUT and MAX-DICUT can be written as relatively simple quadratic or linear models. But there are a few small surprises on the way.
Cplex may reformulate quadratic integer models into linear ones. It bases this decision on a machine learning model. Downside: it is hard to predict whether or not this reformulation will be applied.
Cplex may also reformulate non-convex quadratic models into convex ones.
This means we can actually use quadratic formulations more often than in the past.
Solutions are difficult to interpret without visualization tools.

References

[1] Maximum cut, https://en.wikipedia.org/wiki/Maximum_cut
[2] Cytoscape.js, Graph theory (network) library for visualisation and analysis, https://js.cytoscape.org/

Appendix: GAMS model for undirected max cut problem

$ontext

   MAX CUT (undirected graphs)

   Random graphs

$offtext

* allow all cores to be used
option threads=0;

*---------------------------------------------------------
* undirected graph
*---------------------------------------------------------

set
i 'nodes'/node1*node70/
a(i,i) 'arcs'
;
alias (i,j);

* sparse undirected network
a(i,j)$(ord(i)<ord(j)) = uniform(0,1)<0.2;

parameter w(i,j) 'weights';
w(a) = uniform(0,1);

display$(card(i)<=50) i,a,w;

*------------------------------------------------------------------
* reporting macros
*------------------------------------------------------------------

parameter results(*,*);

acronym Optimal;

* macros for reporting
$macro report(m,label) \
    results('|i|',label) = card(i); \
    results('|a|',label) = card(a); \
    results('variables',label) = m.numvar; \
    results(' discrete',label) = m.numdvar; \
    results('equations',label) = m.numequ; \
    results('status',label)= m.solvestat; \
    results('status',label)$(m.solvestat=1) = Optimal; \
    results('obj',label) = z.l; \
    results('time',label) = m.resusd; \
    results('nodes',label) = m.nodusd; \
    results('iterations',label) = m.iterusd; \
    display results;

*---------------------------------------------------------
* maxcut model 1: MIQP
*---------------------------------------------------------

binaryvariables
   x(i) 'node is in S'
;

variable z 'objective';

equations
   obj1 'quadratic objective'
;

obj1.. z =e= sum(a(i,j),w(i,j)*sqr(x(i)-x(j)));

model maxcut1 /obj1/;
option miqcp=cplex;
solve maxcut1 maximizing z using miqcp;

parameter ecut(i,j) 'maximum cut';
ecut(a(i,j)) = w(i,j)*sqr(x.l(i)-x.l(j));

option x:0;
display x.l,ecut,z.l;

report(maxcut1,"miqp")

*---------------------------------------------------------
* as maxcut model 1 but prevent automatic linearization
*---------------------------------------------------------

$onecho > cplex.opt
qtolin 0
$offecho

maxcut1.optfile=1;
solve maxcut1 maximizing z using miqcp;
report(maxcut1,"miqp/nolin")

*---------------------------------------------------------
* maxcut model 2: linear MIP
*---------------------------------------------------------

binaryvariables
   e(i,i) 'arc e is in cut'
;

equations
   obj2     'linear objective'
   e1(i,j) 'x(i)=x(j)=0 ==> e(i,j)=0'
   e2(i,j) 'x(i)=x(j)=1 ==> e(i,j)=0'
;

obj2.. z =e= sum(a,w(a)*e(a));
e1(a(i,j)).. e(i,j)=l=x(i)+x(j);
e2(a(i,j)).. e(i,j)=l=2-x(i)-x(j);

model maxcut2 /obj2,e1,e2/;
solve maxcut2 maximizing z using mip;

option e:0;
display x.l,e.l,z.l;

report(maxcut2,"mip")

*---------------------------------------------------------
* as model 2 but relax variables e(i,j)
*---------------------------------------------------------

e.prior(a) = +inf;
solve maxcut2 maximizing z using mip;
report(maxcut2,"mip/relax")
e.prior(a) = 1;

*---------------------------------------------------------
* as model 2 but add: number of x(i) <= n/2
*---------------------------------------------------------

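equation extra 'at most half of the nodes are selected';
extra.. sum(i, x(i)) =l= sum(i, 1-x(i));

model maxcut3 /maxcut2,extra/;
* solve maxcut3, the model that contains the extra constraint
solve maxcut3 maximizing z using mip;

option e:0;
display x.l,e.l,z.l;

report(maxcut3,"mip/extra")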

Appendix: GAMS model for max directed cut

$ontext

   MAX DIRECTED CUT (MAX CUT for directed graphs)

$offtext

* allow all cores to be used
option threads=0;

*---------------------------------------------------------
* directed graph
*---------------------------------------------------------

set
i 'nodes'/node1*node60/
a(i,i) 'arcs'
;
alias (i,j);

* sparse graph
a(i,j) = uniform(0,1)<0.25;

parameter w(i,j) 'weights';
w(a) = uniform(0,1);

display$(card(i)<=50) i,a,w;

*------------------------------------------------------------------
* reporting macros
*------------------------------------------------------------------

parameter results(*,*);

acronym Optimal;

* macros for reporting
$macro report(m,label) \
    results('|i|',label) = card(i); \
    results('|a|',label) = card(a); \
    results('variables',label) = m.numvar; \
    results(' discrete',label) = m.numdvar; \
    results('equations',label) = m.numequ; \
    results('status',label)= m.solvestat; \
    results('status',label)$(m.solvestat=1) = Optimal; \
    results('obj',label) = z.l; \
    results('time',label) = m.resusd; \
    results('nodes',label) = m.nodusd; \
    results('iterations',label) = m.iterusd; \
    display results;

*---------------------------------------------------------
* maxcut model 1
*---------------------------------------------------------

binaryvariables
   x(i) 'node is in set S'
;

variable z 'objective';

equations
   obj1 'Quadratic objective'
;

obj1.. z =e= sum(a(i,j),w(i,j)*x(i)*(1-x(j)));

model maxcut1 /obj1/;
option miqcp=cplex;
solve maxcut1 maximizing z using miqcp;

parameter ecut(i,j) 'maximum cut';
ecut(a(i,j)) = x.l(i)*(1-x.l(j));

option x:0,ecut:0;
display x.l,ecut,z.l;

report(maxcut1,"miqp")

*---------------------------------------------------------
* as maxcut model 1 but prevent automatic linearization
*---------------------------------------------------------

$onecho > cplex.opt
qtolin 0
$offecho

maxcut1.optfile=1;
solve maxcut1 maximizing z using miqcp;
report(maxcut1,"miqp/nolin")

*---------------------------------------------------------
* maxcut model 2
*---------------------------------------------------------

binaryvariables
   e(i,i) 'arc e is part of cut'
;

equations
   obj2    'linear objective'
   e1(i,j) 'x(i)=0 ==> e(i,j)=0'
   e2(i,j) 'x(j)=1 ==> e(i,j)=0'
;

obj2.. z =e= sum(a,w(a)*e(a));
e1(a(i,j)).. e(i,j)=l=x(i);
e2(a(i,j)).. e(i,j)=l=1-x(j);

model maxcut2 /obj2,e1,e2/;
solve maxcut2 maximizing z using mip;

option e:0;
display x.l,e.l,z.l;

report(maxcut2,"mip")

*---------------------------------------------------------
* as maxcut model 2 but relax e to be continuous
*---------------------------------------------------------

e.prior(a) = +inf;
solve maxcut2 maximizing z using mip;
report(maxcut2,"mip/relax")
