KACTL's ModMul says that it runs around 2x faster than the naive
(__int128_t)a * b % M
When I ran my benchmarks with -O2, both versions took about the same time. Am I mistaken?
This is only one mod operation, try doing more operations and benchmarking.
I did that. I did 1e5 runs and their running time was basically the same.
1e5 operations is not very many. That should take around 1 millisecond. Try something like 1e10 of them to spot a consistent difference. Also make sure the compiler can't optimize it out.
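For what it's worth, here is a rough sketch of how such a benchmark could be set up. The `modmul_kactl` body is the long double trick as I remember it from KACTL's ModMulLL, so check it against the actual source; it assumes `a, b < M`, `M` below roughly 7.2e18, and a platform where `long double` has at least a 64-bit mantissa. The names `modmul_naive`, `chain` and `time_ms` are my own, not from KACTL. Each multiplication feeds into the next one, so the compiler can't drop or reorder the work, and the final value is handed back so it can be printed.

```cpp
#include <chrono>

using ull = unsigned long long;
using ll = long long;

// Naive version from the post: widen to 128 bits, then take the remainder.
ull modmul_naive(ull a, ull b, ull M) {
    return (ull)((__uint128_t)a * b % M);
}

// Long double trick as recalled from KACTL (assumption: verify against the source).
// Estimates the quotient in floating point, then corrects an off-by-one error.
ull modmul_kactl(ull a, ull b, ull M) {
    ll ret = a * b - M * ull(1.L / M * a * b);
    return ret + M * (ret < 0) - M * (ret >= (ll)M);
}

// Runs `iters` dependent multiplications: each result is the next input,
// so no iteration can be optimized away as long as the result is used.
template <ull (*Mul)(ull, ull, ull)>
ull chain(ull x, ull M, long long iters) {
    ull acc = x % M;
    x %= M;
    for (long long i = 0; i < iters; i++) acc = Mul(acc, x, M);
    return acc;
}

// Times one chain in milliseconds and stores the final value in `out`.
template <ull (*Mul)(ull, ull, ull)>
double time_ms(ull x, ull M, long long iters, ull &out) {
    auto t0 = std::chrono::steady_clock::now();
    out = chain<Mul>(x, M, iters);
    auto t1 = std::chrono::steady_clock::now();
    return std::chrono::duration<double, std::milli>(t1 - t0).count();
}
```

To use it, read `x`, `M` and `iters` from input (so the modulus is not a compile-time constant, otherwise the `%` can get replaced by cheaper code), call `time_ms<modmul_naive>` and `time_ms<modmul_kactl>` with the same arguments, and print both times and both `out` values. The two `out` values should match. Keep in mind that a dependent chain like this measures latency; a loop over independent values may give a different ratio.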