Number of nodes reachable from each node in a directed graph

→ Pay attention

Before contest
Codeforces Round 1006 (Div. 3)
2 days
Register now »

→ Top rated

#	User	Rating
1	tourist	3856
2	jiangly	3747
3	orzdevinwang	3706
4	jqdai0815	3682
5	ksun48	3591
6	gamegame	3477
7	Benq	3468
8	Radewoosh	3462
9	ecnerwala	3451
10	heuristica	3431

Countries | Cities | Organizations

View all →

→ Top contributors

#	User	Contrib.
1	cry	167
2	-is-this-fft-	162
3	Dominater069	160
4	Um_nik	158
5	atcoder_official	157
6	Qingyu	156
7	djm03178	151
7	adamant	151
9	luogu_official	150
10	awoo	147

View all →

→ Find user

→ Recent actions

Detailed →

vaibnak7's blog

Number of nodes reachable from each node in a directed graph

By vaibnak7, history, 5 years ago, In English

Given a directed graph, suppose we want to find the number of reachable nodes from each node of the graph then what is the best way to solve this problem ??

One obvious way to solve it is doing dfs from every node of the graph and counting how many nodes are getting visited, but the problem with this approach is that it is O(n^2) where n is the number of nodes in the graph

Then i thought of maybe if we can store at each node how many nodes are reachable and when queried give the answer based on the values of the neighbouring nodes but this will not be able to handle the case of overcounting as in the graph below.

So how to solve this ?

#graph, directed, #dfs

vaibnak7
5 years ago
23

Comments (20)

Show archived | Write comment?

tfg

5 years ago, # |

As far as I know the question of "is it possible to solve that faster than quadratic time" is an open problem.

→ Reply

yh11

20 months ago, # ^ |

← Rev. 2 →

-9

If the graph is a DAG, you can just topsort and do dp.
If not just use Kosaraju Algorithm to condense SCCs, then use topsort + dp.

This should be linear. Or am I missing something?

→ Reply

vgtcross

20 months ago, # ^ |

How would you solve the problem on a DAG using dp?

→ Reply

tfg

20 months ago, # ^ |

Such dp counts the number of paths starting from some vertex, but sadly there might be more than one path with the same endpoints.

→ Reply

yh11

20 months ago, # ^ |

I see. Thanks.

→ Reply

BohdanPastuschak

5 years ago, # |

You can optimize $$$O(n^2)$$$ with bitsets. If doing straightforward, this probably will give MLE (if n = $$$10^5$$$, ML = 256/512MB), but you can try to divide all vertices on groups, and for each group G runs separate dfs: for each vertex V count how many of vertices U(from G) are reachable from V.

→ Reply

just_try_again

4 years ago, # |

Spoj Problem DAGCNT2 is similar to this

→ Reply

vaibnak7

4 years ago, # ^ |

Can you also tell about the correct approach to this problem

→ Reply

just_try_again

4 years ago, # ^ |

By using Bitsets overcounting of nodes can be prevented. Then it can be solved by toposort.

Here is the code with explanation

→ Reply

vaibnak7

4 years ago, # ^ |

Is there any use of topological sorting in this algorithm, or by using simple dfs also you can maintain the reach for every node

→ Reply

horiacool

2 years ago, # |

← Rev. 3 →

This can be done by first finding Strongly Connected Components (SCC), which can be done in O(|V|+|E|). Then, build a new graph, G', where each SCC is a node in the graph and each node has value which is the sum of the nodes in that SCC.

Given a graph G(V, E), we build G'(V', E') where:

V' = { U1, U2, ..., Uk | U_i is a SCC of the graph G }

E' = { (U, W) | there is node u in U and w in W such that (u, w) is in E }

This graph, G', is a DAG and the question becomes similar with finding the number of nodes reachable from each node in a DAG, which can be made easily via DFS:

int DFS(node v) {
    vis[v] = true
    reachable[v] = v.scc_size() // nodes reachable from that SCC, including themselves

    for u in v.children() {
        // nodes already visited were added via previously visited nodes
        if (vis[u] == false) {
            reachable[v] += DFS(u)
        }
    }

    return reachable[v]
}

for v in V'  '{
    if (indegree(v) == 0) {
        DFS(v)
    }
}

So for the original nodes from G we get very easily the number of reachable nodes:

for v in V {
    reachable_G[v] = reachable[containing_scc(v)]
}

Thus the final complexity is linear O(|V| + |E|) .

→ Reply

lrvideckis

2 years ago, # ^ |

It double counts

→ Reply

horiacool

2 years ago, # ^ |

← Rev. 2 →

Oh, yeah, sorry about that, it seems it doesn't cover all the cases, my bad :///

→ Reply

lrvideckis

2 years ago, # ^ |

All good, Note in the case that G is a DAG, this code will calculate reachable[v] instead as the number of paths starting at node v, which can grow exponentially

→ Reply

afylers

20 months ago, # |

Basically for every node it will be n^2. So total time complexity becomes n^3 right? @author

→ Reply

Abito

20 months ago, # ^ |

No, you are doing dfs for each node so it's $$$O(n^2)$$$

→ Reply

afylers

20 months ago, # ^ |

Doing dfs for a single node is n^2 in the worst case where we have all nodes connected with each other. So if we do dfs for all nodes, doesn’t it make it n*n^2 = n^3?

→ Reply

Bosscoder

20 months ago, # ^ |

dfs from one node is (n+e) afaik . multiplying by n gives of n^2

→ Reply

afylers

20 months ago, # ^ |

But that e can be n^2 in worst case right. So basically if there are 5 nodes, every node can have 4 edges. So e is 20 which is close to n^2 right?

→ Reply

Abito

20 months ago, # ^ |

+14

Yes but reasonable graph problems usually have m=n , but you are right. Exact time complexity is $$$O(n(n+m))$$$

→ Reply