Hi, can anyone tell if there's a way to stress test solution to an interactive problem by running it over hundreds of test cases ?
# | User | Rating |
---|---|---|
1 | jiangly | 4039 |
2 | tourist | 3841 |
3 | jqdai0815 | 3682 |
4 | ksun48 | 3590 |
5 | ecnerwala | 3542 |
6 | Benq | 3535 |
7 | orzdevinwang | 3526 |
8 | gamegame | 3477 |
9 | heuristica | 3357 |
10 | Radewoosh | 3355 |
# | User | Contrib. |
---|---|---|
1 | cry | 169 |
2 | -is-this-fft- | 165 |
3 | atcoder_official | 160 |
3 | Um_nik | 160 |
5 | djm03178 | 158 |
6 | Dominater069 | 156 |
7 | adamant | 153 |
8 | luogu_official | 152 |
9 | awoo | 151 |
10 | TheScrasse | 147 |
Hi, can anyone tell if there's a way to stress test solution to an interactive problem by running it over hundreds of test cases ?
Name |
---|
It’s quite difficult and perhaps someone out there has a catch-all way, but for me I tend to take a copy of my code, and loop over it for a series of predefined tests, with a naive function which replaces the queries and responses. In this naive function I would deterministically calculate the ‘responses’ to my own queries instead of receiving them as inputs.
Limitations:
1) naive functions usually only work on smaller inputs so it probably wouldn’t be able to test larger test cases
2) there may be bugs in the naive function, or other problems introduced by slightly modifying the code
3) this only works for interactive problems where there actually is a deterministic answer. Sometimes (especially in contests like Google Code Jam) you may find that heuristics are appropriate (e.g. if a 90% success rate is the target), or the response has a random element to it. This is much trickier
Google actually develops an ‘interactive runner’ for each of its interactive problems, which runs some cases for you. You could also look to modify that for your specific purposes, but the code is not trivial.
I don't do this during short contests, but on a long challenge I once used Polygon to run stress tests.
This is so smart, WOW
I think if you've had a prewritten task for it the process would be nearly as fast as regular stress-testing.
First generate your testcases based on the hack format in the problem. Then write an interactor. Finally replace all your io with functions which you can then replace with function calls to your interactor.