Help with Rolling hashes

→ Обратите внимание

До соревнования
Codeforces Round 1006 (Div. 3)
2 дня
Зарегистрироваться »

→ Лидеры (рейтинг)

№	Пользователь	Рейтинг
1	tourist	3856
2	jiangly	3747
3	orzdevinwang	3706
4	jqdai0815	3682
5	ksun48	3591
6	gamegame	3477
7	Benq	3468
8	Radewoosh	3462
9	ecnerwala	3451
10	heuristica	3431

Страны | Города | Организации

Всё →

→ Лидеры (вклад)

№	Пользователь	Вклад
1	cry	167
2	-is-this-fft-	162
3	Dominater069	160
4	Um_nik	158
5	atcoder_official	157
6	Qingyu	156
7	djm03178	152
7	adamant	152
9	luogu_official	151
10	awoo	147

Всё →

→ Найти пользователя

→ Прямой эфир

Детальнее →

Блог пользователя vinayaka

Help with Rolling hashes

Автор vinayaka, история, 16 месяцев назад, По-английски

Hi,

My knowledge on String algorithms is poor so I am studying it now.

I solved 271D - Good Substrings using a Trie, but I am trying to solve it using hashing.

I am able to calculate a polynomial rolling hash and compare substrings based on the hash, but my solution is getting WA on test 8 (230797737). I am thinking this is because of collisions, since the expected answer is also pretty high.

I read a topic on double polynomial rolling hash to reduce probability of collisions. I understand that if we use two hashes of order 10^9 then probability will be less because it is equivalent to using a 10^18 modulo.

I am unable to understand how to implement it.

Can anyone help?

Please point me towards an article/blog/submission that has code on implementing it.

Thank you.

string algorithms, roll hash

vinayaka
16 месяцев назад
8

Комментарии (5)

Показать архивные | Написать комментарий?

adityagamer

16 месяцев назад, # |

Just hash it with two different primes. You can check my submission: https://codeforces.net/contest/271/submission/230803346

→ Ответить

vinayaka

16 месяцев назад, # ^ |

I was confused on how to use the two hashes, now I understand we need to store them in a set, and they both combined point to a substring, this was the knowledge gap! Thank you!

And..what does this code mean?

struct hash_pair {
    template <class T1, class T2>
    size_t operator()(const pair<T1, T2>& p) const
    {
        auto hash1 = hash<T1>{}(p.first);
        auto hash2 = hash<T2>{}(p.second);
 
        if (hash1 != hash2) {
            return hash1 ^ hash2;              
        }
         
        // If hash1 == hash2, their XOR is zero.
          return hash1;
    }
};

unordered_set<pair<int,int>, hash_pair> countSet;

→ Ответить

adityagamer

16 месяцев назад, # ^ |

Earlier you were hashing with one prime and storing in hash set. Now you are hashing with two different primes so you will have to store it in hash set of pair. C++ does not have built in hash for pair, this part is just hashing the pair

→ Ответить

wakaranai

16 месяцев назад, # |

Using two different bases works as well, which has the added benefit that you can randomize them at runtime to avoid getting hacked.

→ Ответить

LetterC67

16 месяцев назад, # |

← Rev. 2 →

You can just avoid modulo operator entirely. Instead, allow them to overflow, which is essentially behaves like modulo $$$2^{64}$$$ (if you use 64-bit integers).

→ Ответить

Соревнования по программированию 2.0

Время на сервере: 23.02.2025 11:44:53 (k1).

Десктопная версия, переключиться на мобильную.

При поддержке