How many different substrings does a string of n letters have

→ Обратите внимание

До соревнования
2024-2025 ICPC, NERC, Southern and Volga Russian Regional Contest (Unrated, Online Mirror, ICPC Rules, Preferably Teams)
14:47:48
Зарегистрироваться »

→ Лидеры (рейтинг)

№	Пользователь	Рейтинг
1	tourist	4009
2	jiangly	3823
3	Benq	3738
4	Radewoosh	3633
5	jqdai0815	3620
6	orzdevinwang	3529
7	ecnerwala	3446
8	Um_nik	3396
9	ksun48	3390
10	gamegame	3386

Страны | Города | Организации

Всё →

→ Лидеры (вклад)

№	Пользователь	Вклад
1	cry	166
2	maomao90	163
2	Um_nik	163
4	atcoder_official	161
5	adamant	160
6	-is-this-fft-	158
7	awoo	157
8	TheScrasse	154
9	nor	153
9	Dominater069	153

Всё →

→ Найти пользователя

→ Прямой эфир

Детальнее →

Блог пользователя jmrh_1

How many different substrings does a string of n letters have

Автор jmrh_1, история, 5 лет назад, По-английски

Hello codeforces Let's say an F(string s) as the number of different substrings that s has. Which is the maximum F(string x) such that x at most has 100000 characters and a dictionary of 26 characters.

jmrh_1
5 лет назад
13

Комментарии (11)

Показать архивные | Написать комментарий?

Sturdy

5 лет назад, # |

Is there a problem link

→ Ответить

Bojack

5 лет назад, # |

-20

https://cp-algorithms.com/string/string-hashing.html in this tutorial you can learn how to find the number of distinct substrings using hashing.

→ Ответить

Sturdy

5 лет назад, # ^ |

It's complexity is O(N^2).

→ Ответить

Kallaseldor

5 лет назад, # |

-14

To solve this you can build a suffix tree of the original string S and count the amount of nodes on the tree. The complexity of this solution is O(N) if you use Ukkonen's algorithm to build the suffix tree.

→ Ответить

kronos

5 лет назад, # |

-12

Problem link: http://acm.timus.ru/problem.aspx?space=1&num=1590

→ Ответить

kronos

5 лет назад, # ^ |

-12

another possible solution: https://www.geeksforgeeks.org/count-distinct-substrings-string-using-suffix-array/

→ Ответить

starboy_jb

5 лет назад, # |

-15

here is the problem link

and here is the solution by me.

→ Ответить

jmrh_1

5 лет назад, # |

← Rev. 2 →

+16

I know that there is a solution in O (N log N) with Suffix Array and LCP, but the real question is the maximum value that the F () function could reach in any string of at most 100,000 characters with a 26 letter dictionary.

→ Ответить

jmrh_1

5 лет назад, # |

I think that the more characters the string has, the less possibility there is that the function F () defined for every string of at most 100000 and with a 26-letter dictionary reaches its maximum value but I am not sure of the maximum value of the function F (). Which would suit me very well because I think I solve many problems depending on the maximum value of F ().

→ Ответить

theodor.moroianu

5 лет назад, # |

← Rev. 2 →

I wrote the Suffix Array solution, and generated a few random strings of length 100.000.

The maximal value of the number of substrings is N * (N - 1) / 2 = 5000050000, and all the random strings i generated had around 4999700000 distinct substrings. It's almost 0.99995% of the maximal value we could get.

If we think about it it's quite obvious why: as the string is random, there is virtually no chance that two different substrings of length >= 10 match (the probability of a colision is 1/26^10 = 7e-15 ~= 0). So except for the small substrings of length 1, 2, ... 9 (which are 9 * N = 900000), all the remaining 4999500000 are distinct.

So if your hope was that there are few such distinct substrings, sorry to break it down :(

→ Ответить

nmakeenkov

5 лет назад, # ^ |

+14

Moreover, there's an obvious statement that there're no more than $$$26$$$ distinct substrings of size 1, $$$26^2$$$ of size 2 and $$$26^3$$$ of size 3. N * (N - 1) // 2 - (N - 26) - (N - 1 - 26 * 26) - (N - 2 - 26 * 26 * 26) equals 4999668281. You get almost that (if not exactly).

→ Ответить

Соревнования по программированию 2.0

Время на сервере: 17.11.2024 22:47:14 (i1).

Десктопная версия, переключиться на мобильную.

При поддержке