About kmp optimization

→ Pay attention

Before contest
CodeTON Round 9 (Div. 1 + Div. 2, Rated, Prizes!)
29:55:45
Register now »

*has extra registration

→ Top rated

#	User	Rating
1	tourist	4009
2	jiangly	3823
3	Benq	3738
4	Radewoosh	3633
5	jqdai0815	3620
6	orzdevinwang	3529
7	ecnerwala	3446
8	Um_nik	3396
9	ksun48	3390
10	gamegame	3386

Countries | Cities | Organizations

View all →

→ Top contributors

#	User	Contrib.
1	cry	167
2	Um_nik	163
3	maomao90	162
3	atcoder_official	162
5	adamant	159
6	-is-this-fft-	158
7	awoo	157
8	TheScrasse	154
9	Dominater069	153
9	nor	153

View all →

→ Find user

→ Recent actions

Detailed →

Monster_Nerd's blog

About kmp optimization

By Monster_Nerd, history, 4 years ago, In English

Recently i encountered TLE on this problem E. Tree-String Problem
using decent dfs + kmp. Later i came to know that there is an optimized version of kmp. I listed the code below which i had taken from Arpa 's code.

int k = 0;
    for(int i = 1; i < p.size(); i++){
	while(k && p[i] != p[k]) k = f[k];
	if(p[i] == p[k])  k++;
	f[i + 1] = k;
    }

for(int i = 0; i < p.size(); i++)
	for(int j = 0; j < z; j++)
	    nxt[i][j] = p[i] - 'a' == j ? i + 1 : bool(i) * nxt[ f[i] ][j];

now my question is, can i use it always [if memory limit is ok] ? Does it consistent and can anyone just briefly explain what does it do?

Monster_Nerd
4 years ago
6

Comments (6)

Write comment?

Monster_Nerd

4 years ago, # |

Auto comment: topic has been updated by Monster_Nerd (previous revision, new revision, compare).

→ Reply

Monster_Nerd

4 years ago, # |

Auto comment: topic has been updated by Monster_Nerd (previous revision, new revision, compare).

→ Reply

Arpa

4 years ago, # |

Yes, it's consistent. Simplify the code, it's easy to understand.

→ Reply

Monster_Nerd

4 years ago, # ^ |

Thanks's a lot. Yep i am iterating!!

→ Reply

clyring

4 years ago, # |

← Rev. 4 →

Your original code TLEs on this problem because its worst-case runtime is $$$\Omega(|t| + n \cdot \min (|t|, \sum |s_v|))$$$. The failing testcase is probably a larger version of the following:

10
1 aaaaaaaaaa
2 x
2 x
2 x
2 x
2 x
2 x
2 x
2 x
aaaaaaaaaay

Your original code performs the backtracking part of KMP at match-time. When parsing a single string, this doesn't affect asymptotic complexity and doesn't have a big impact on the constant factor because each character added to a partial match is removed at most once. But in the tree setting, the partial match characters may be removed $$$n-2$$$ times in a test case like the one above. Arpa's version of KMP gets around this by preprocessing all of the potential backtracking steps so that they may be performed in (un-amortized) $$$O(1)$$$ at match time, restoring a more reasonable $$$O(n + |\sigma | |t| + \sum |s_v|)$$$ time complexity.

EDIT: Complexities were rewritten a bit more precisely.

→ Reply

Monster_Nerd

4 years ago, # ^ |

Thanks a lot buddy. I had tried a lot to figure out the test case and was quite dumb to understand what's wrong going on. now i completely got it. Thanks, and at last i understood why optimization was required. Thanks a lot.

→ Reply