How to find no. of occurences of a given substring of a string in the string? - Codeforces

→ Pay attention

Before contest
Codeforces Round 1006 (Div. 3)
2 days
Register now »

→ Top rated

#	User	Rating
1	tourist	3856
2	jiangly	3747
3	orzdevinwang	3706
4	jqdai0815	3682
5	ksun48	3591
6	gamegame	3477
7	Benq	3468
8	Radewoosh	3462
9	ecnerwala	3451
10	heuristica	3431

Countries | Cities | Organizations

→ Top contributors

#	User	Contrib.
1	cry	167
2	-is-this-fft-	162
3	Dominater069	160
4	Um_nik	158
5	atcoder_official	157
6	Qingyu	156
7	djm03178	152
7	adamant	152
9	luogu_official	150
10	awoo	147

View all →

→ Find user

→ Recent actions

Detailed →

oobxw9zf0lm7ez's blog

How to find no. of occurences of a given substring of a string in the string?

By oobxw9zf0lm7ez, 4 years ago, In English

In English

Hello community!

How can we find no. of occurences of a given substring t of string s in the string s? There are two parts of the problem :

non-overlapping -> for eg. aa in aaa, Ans: 1
overlapping -> Ans: 2

I know many good people here know this.

Thanks in advance.

+1

oobxw9zf0lm7ez
4 years ago
6

Comments

Comments (3)

Show archived | Write comment?

»

4 years ago, # |

← Rev. 3 →

Vote: I like it

+1

Vote: I do not like it

we can do sliding window of length of substring t on our string 's' , we will store the start index for each substring which is equal to 't' , total count of such strings is our overlapping ones and for non overlapping we will see if the difference between start index is >= length of t .

→ Reply

»

4 years ago, # |

Vote: I like it

0

Vote: I do not like it

Use KMP algorithm to find starting index of all occurences of given substring in original string.

Further, u can divide them in overlapping/non-overlapping based on difference between 2 consecutive starting points.

Overall Time Complexity : O(N)

Check out this for understanding KMP Algorithm:

KMP

→ Reply

»

4 years ago, # |

Vote: I like it

+8

Vote: I do not like it

A probably simpler solution using the z-function is as follows:

The z-function of the string $$$a[1..n]$$$ is defined as an array $$$z[1..n]$$$ such that $$$z[i]$$$ is the length of the maximum common prefix of the string $$$a[1..n]$$$ and $$$a[i..n]$$$. This can be constructed in $$$O(n)$$$ using an algorithm similar to what is described here.

Consider the string $$$t + \Phi + s$$$ where $$$ \Phi $$$ is a character that doesn't appear in either of $$$s$$$ or $$$t$$$. Note that the z-function of this array for characters after $$$\Phi$$$ will give the longest prefix of $$$t$$$ that matches a string starting from that character in $$$s$$$, so all we need to do is to count the number of indices after the index of $$$\Phi$$$ where the value of the z-function equals the length of $$$t$$$.

→ Reply