I, ME AND MYSELF !!!: Maximum Matching (Hopcroft)

Friday, May 7, 2010

Maximum Matching (Hopcroft)

Hopcroft-Karp is one of the fastest algorithm that finds the maximum cardinality matching on a bipartite graph. It has the best known worst case time complexity. More details can be found here [courtesy of Wikipedia].

C++ Source Code:


#define MAX 100001
#define NIL 0
#define INF (1<<28)

vector< int > G[MAX];
int n, m, match[MAX], dist[MAX];
// n: number of nodes on left side, nodes are numbered 1 to n
// m: number of nodes on right side, nodes are numbered n+1 to n+m
// G = NIL[0] ∪ G1[G[1---n]] ∪ G2[G[n+1---n+m]]

bool bfs() {
    int i, u, v, len;
    queue< int > Q;
    for(i=1; i<=n; i++) {
        if(match[i]==NIL) {
            dist[i] = 0;
            Q.push(i);
        }
        else dist[i] = INF;
    }
    dist[NIL] = INF;
    while(!Q.empty()) {
        u = Q.front(); Q.pop();
        if(u!=NIL) {
            len = G[u].size();
            for(i=0; i<len; i++) {
                v = G[u][i];
                if(dist[match[v]]==INF) {
                    dist[match[v]] = dist[u] + 1;
                    Q.push(match[v]);
                }
            }
        }
    }
    return (dist[NIL]!=INF);
}

bool dfs(int u) {
    int i, v, len;
    if(u!=NIL) {
        len = G[u].size();
        for(i=0; i<len; i++) {
            v = G[u][i];
            if(dist[match[v]]==dist[u]+1) {
                if(dfs(match[v])) {
                    match[v] = u;
                    match[u] = v;
                    return true;
                }
            }
        }
        dist[u] = INF;
        return false;
    }
    return true;
}

int hopcroft_karp() {
    int matching = 0, i;
    // match[] is assumed NIL for all vertex in G
    while(bfs())
        for(i=1; i<=n; i++)
            if(match[i]==NIL && dfs(i))
                matching++;
    return matching;
}

The implementation is quite straight forward as the algorithm on Wikipedia page. I am looking for some optimizations.

47 comments:

AnonymousMay 13, 2010 at 12:17 PM
This works? Do you have a testing program also? please post it because i just can't figure where the problem is. Maybe at me :)
ReplyDelete
Replies
johnMay 13, 2010 at 5:34 PM
hello. i would like to know what the input data is. thank you in advance!
ReplyDelete
Replies
Nguyễn Cảnh ToànMay 14, 2010 at 7:40 PM
Hi ,i'm from Vietnam.
ReplyDelete
Replies
Zobayer HasanMay 15, 2010 at 11:34 PM
Yes this works, I think. I used it in a problem:
MATCHING @ SPOJ
https://www.spoj.pl/problems/MATCHING

Sorry I don't have any testing program yet.

I am learning matching for only a few days, so may be this is not a source which will solve any problem of matching, but I think it could be modified accordingly.

Thanks :)
ReplyDelete
Replies
AnonymousJanuary 5, 2011 at 10:48 PM
But what is NIL node ? Is it an extra node that must be added in the graph before we launch the algorithm ? In this case how are NIL node edges defined ? NIL node has no edges ?
ReplyDelete
Replies
Zobayer HasanJanuary 6, 2011 at 3:32 AM
NIL node is nothing but just a reference node, it is added to the graph prior to applying the algorithm. As you see, it has no particular use in the algorithm, except used as a marking item, i.e. whether some node has some match or not. G[0] is left blank (no edge) and NIL has the value 0, so, we are calling Node with number 0 to be a NIL node. I tried to stick to the original algorithm presented at wikipedia, the actual implementations are more straight forward, but those are specific problem dependent.
ReplyDelete
Replies
AnonymousSeptember 15, 2011 at 7:36 PM
I tried the sample input

5 4 6
5 2
1 2
4 3
3 1
2 2
4 4

for the following main():

int main() {
int A, B;

scanf("%d %d %d", &N, &M, &P);
while (P--) {
scanf("%d %d", &A, &B);
G[A].push_back(B);

}

printf("%d\n", hopcroft_karp());

return 0;
}

It gives 2, correct answer is 3.

What's wrong with the main? or what's missing?
ReplyDelete
Replies
AnonymousSeptember 15, 2011 at 7:40 PM
I wrote the following main() to work with your code:

int main() {
int A, B, P;

scanf("%d %d %d", &N, &M, &P);
while (P--) {
scanf("%d %d", &A, &B);
G[A].push_back(B);

}

printf("%d\n", hopcroft_karp());

return 0;
}

It gives 2, correct answer is 3.

What's wrong with the main() function? or what's missing?
ReplyDelete
Replies
Zobayer HasanSeptember 15, 2011 at 9:21 PM
Sorry, can't say without reading the whole code. However, this implementation is well tested as well. Although the right portion of the graph is not necessary.
ReplyDelete
Replies
AnonymousSeptember 15, 2011 at 9:46 PM
Yes, it's my mistake, in the main function. Thanks.
ReplyDelete
Replies
adi the bohrMay 2, 2012 at 10:24 PM
would you please help me...i'm trying to solve this problem https://www.spoj.pl/problems/MATCHING and i got WA
could you please give me some more test case...thanks.. :)
ReplyDelete
Replies
AnonymousNovember 3, 2012 at 7:11 AM
I made such a program based on Your code: http://ideone.com/nlschW and as You can see it prints 2 instead of 3. Did I make a mistake or Your code is incorrect? I wouldn't ask but having copied exactly Your code and main from the comment I got 2 as well.
ReplyDelete
Replies
Thilipan CrooseNovember 6, 2012 at 12:19 AM
hey. plz tell me how get matched nodes. this for above test we got there are matches. should we get matched node.
whether
3 1
2 2
4 4
are the matched three. or not plz explain.
ReplyDelete
Replies
Zobayer HasanNovember 6, 2012 at 1:57 AM
match[u] = v means u is matched with node v.
ReplyDelete
Replies
AnonymousMarch 1, 2013 at 12:53 PM
Hi, I'm new in c++.
What should be the input of G looks like? I for the vector.
when 1 is matched to 3, is it means that G[1].push_back(3)?

besides, should I also input the match[] array?what should I put inside?
ReplyDelete
Replies
UnknownJune 3, 2013 at 7:33 PM
@Zobayer : You can make it more faster by checking the condition for dist[NIL]!=INF inside the bfs.It will make faster (My code on writing that line made it pass in only 2.06 sec whereas the code in which i wrote it at the end passed in 3.31 sec)
ReplyDelete
Replies
NikhilJuly 4, 2013 at 1:35 PM
@Zobayer : Do you have any implementation for min cost maximum matching in a bipartite graph ? I have been following your blog since long.. So it would be great if you could upload a blog on the same.
Thanks in advance.
ReplyDelete
Replies
AnonymousDecember 30, 2013 at 1:06 PM
Why are you checking (dist[NIL]!=INF) and returning its value?
even when you are never modifying dist[NIL].

What does BFS return??
ReplyDelete
Replies
UnknownSeptember 29, 2014 at 12:22 AM
why do we need to check dis[match[v]]==dis[u]+1 in dfs() function.How its helping and what if i dont check.
I am not understanding
ReplyDelete
Replies
AnonymousMarch 15, 2015 at 5:31 PM
how do you use "return (dist[NIL]!=INF);" in bfs?
ReplyDelete
Replies
ShupgupDecember 7, 2015 at 1:43 PM
Incase someone hasn't mentioned it, you only need adj[u].push_back[v]. The graph is essentially for the vertices to the left and you only need to add the edges corresponding from the left to right.

I have solved MATCHING in 0.12 seconds, faster than most. Here is the code is used: https://github.com/foreverbell/acm-icpc-cheat-sheet/blob/master/code/hopcroft-karp.cpp
ReplyDelete
Replies
Paranoid.soulDecember 15, 2017 at 1:01 AM
In some problems you may need an additional dfs for bicoloring to decide the value of n and m. More precisely, you may often do some extra work to decide which nodes go to left and which nodes go to right and you may need extra bfs/dfs for bicoloring. There is a pretty similar implementation of the above algorithm which takes care of this problem. Also, it encapsulates all methods and classes used by the algorithm in a separate class which makes it a little simpler to use the algorithm as template.
https://pastebin.com/pvPRuGXr
Tested on
http://codeforces.com/contest/468/problem/B
Happy Coding
ReplyDelete
Replies
Paranoid.soulDecember 15, 2017 at 4:32 AM
Sorry brother, I was wrong. Please ignore my previous message.
ReplyDelete
Replies

Add comment