Algorithm Wildcard Searching with *

#	User	Rating
1	tourist	3856
2	jiangly	3747
3	orzdevinwang	3706
4	jqdai0815	3682
5	ksun48	3591
6	gamegame	3477
7	Benq	3468
8	Radewoosh	3462
9	ecnerwala	3451
10	heuristica	3431

#	User	Contrib.
1	cry	167
2	-is-this-fft-	162
3	Dominater069	160
4	Um_nik	158
5	atcoder_official	157
6	Qingyu	156
7	adamant	151
7	djm03178	151
7	luogu_official	151
10	awoo	146

I am thinking about an effient algorithm for wildcard searching with * representing any characters with any length.↵
↵
aa*c, she*he, *she*he↵
↵
↵
Example:↵
↵
1.   caa find aa*c↵
↵
     return "DOES NOT EXIST"↵
↵
2.   hesherheshe find she*he↵
     return 2↵
↵
     Because sherheshe begins at index 2↵
3. sherheshe find *she*he*↵
     return 0↵
     Because the whole string↵
When I am supposed **to return, say,  the beginning index of the first matching instance**.↵
↵
↵
Say the pattern is of length M and the document is of length N, and the pattern has K '*' signs. I can think of a solution that first uses AC Automation to find all occurences of each chunk in O(N + M), with bitmask.↵
↵
While converting the bitmask to indexes takes O(N * K)↵
↵
Then binary search for the last possible beginning positions for each chunk. This could take O (K log N)↵
↵
↵
↵
So the overall time complexity is still O (N * K), any way to do better?↵
↵
↵
↵
↵
**References**↵
↵
https://codeforces.net/blog/entry/111380↵
↵
https://codeforces.net/problemset/problem/1023/A↵
↵
https://codeforces.net/blog/entry/127169↵

Rev.	By	When	Δ	Comment
en7	cardcounter	2024-12-21 23:01:06	18
en6	cardcounter	2024-12-21 22:58:46	229
en5	cardcounter	2024-12-21 12:46:01	44	Tiny change: 'm/1023/A\n' -> 'm/1023/A\n\nhttps://codeforces.net/blog/entry/127169\n'
en4	cardcounter	2024-12-21 12:41:11	2	Tiny change: 'y/111380\nhttps://' -> 'y/111380\n\nhttps://'
en3	cardcounter	2024-12-21 12:40:45	108
en2	cardcounter	2024-12-21 12:32:51	7
en1	cardcounter	2024-12-21 12:30:21	746	Initial revision (published)

Rev.

Lang.

When

Comment

en7