Anti-Instinct: Mistakes Made and Ignored by Many on Stage 32, Ucup Season 3

Sorry for such a title. But I believe this topic will interest a lot of people. (When writing this post, I am informed that this stage would be unrated. Nevertheless, I decide to post this. After all, I paid a lot of effort in it.)

This passage is about the Problem J — Hand Cricket on this stage. My team believe that EVERY "passed" submission in this problem is wrong and we can offer a hacking case for these submissions (and probably for the standard solution).

If you haven't seen the problem description, here it is:

Hacking Case

If you just wonder what is our hacking case, here it is:

If you try some passed submissions on this case, they all give $$$371894957$$$ as their answer, which is $$$\frac{100}{51}\bmod 998244353$$$ . The strategy chosen by these submissions are actually $$$(p_1,p_2,p_3)=(\frac{1}{102},\frac{1}{102},\frac{100}{102}), (q_1,q_2,q_3)=(\frac{1}{102},\frac{1}{102},\frac{100}{102})$$$ .

However, there's clearly a better solution: For Alice, she can choose $$$(p_1,p_2,p_3)=(\frac{1}{2},\frac{1}{2},0)$$$ . In that way, no matter what strategy Bob chooses, Alice is guaranteed to gain at least $$$50$$$ points, which is far larger than the answer mentioned above. Actually, the equilibrium here is $$$(p_1,p_2,p_3)=(\frac{1}{2},\frac{1}{2},0),(q_1,q_2,q_3)=(\frac{1}{2},\frac{1}{2},0)$$$ , rather than the answer provided by the "correct answer".

What happened?

We ignore the $$$K_i$$$ in the problem statement and let it be $$$0$$$, and we try to answer just a single query.

In that way, we just need to figure out what is the best strategies of Alice and Bob when the subarray is fixed.

Let's say the elements of this subarray are $$$[v_1,v_2,\dots,v_k]$$$ , and Alice chooses his strategy as $$$(p_1,p_2,\dots,p_k)$$$ , Bob chooses his strategy as $$$(q_1,q_2,\dots,q_k)$$$ .

Alice tries to solve this optimization problem: $$$\max_{(p_1,p_2,\dots,p_k)}\sum\limits_{i=1}^k v_ip_i(1-q_i)$$$

Bob tries to solve this optimization problem: $$$\min_{(q_1,q_2,\dots,q_k)}\sum\limits_{i=1}^k v_ip_i(1-q_i)$$$

Both equations are linear, so it seems quite easy.

For Alice, if she knew about $$$(q_1,q_2,\dots,q_k)$$$, she could find the maximum value of $$$v_i(1-q_i)$$$ and distribute her probability on those $$$i'$$$-s such that $$$v_{i'}(1-q_{i'})$$$ reaches this maximum. (Statement 1)

Same for Bob. If he knew about $$$(p_1,p_2,\dots,p_k)$$$, she could find the minimum value of $$$v_ip_i$$$ and distribute his probability on those $$$i'$$$-s such that $$$v_{i'}p_{i'}$$$ reaches this minimum. (Statement 2)

So far, so good, right? But we don't actually know which $$$i'$$$-s to consider . (And the hypothesis made by a lot of teams is that probability should be distributed in each $$$i=1,2,\dots,k$$$, which is just not true.)

Let's say in the final equilibrium, we distribute the probability over a subset $$$i_1,i_2,\dots,i_t\ (t\geq 1)$$$ . Then we can say: $$$p_{i_1}\gt 0, p_{i_2}\gt 0, \dots, p_{i_t}\gt 0$$$ .

Then, from what we have discussed (Statement 1), $$$v_{i_1}(1-q_{i_1}),v_{i_2}(1-q_{i_2}),\dots, v_{i_2}(1-q_{i_2})$$$ all reach maximum, and are therefore the same. So $$$v_{i_1}(1-q_{i_1})=v_{i_2}(1-q_{i_2})=\dots=v_{i_t}(1-q_{i_t})$$$ .

And, for other $$$i$$$ not in $$${i_1,i_2,\dots,i_t}$$$ , Bob has no reason to give probability on any of them, because Alice will never choose them, so $$$q_{i_1}+q_{i_2}+\dots+q_{i_t}=1$$$ , that is, $$$(1-q_{i_1})+(1-q_{i_2})+\dots+(1-q_{i_t})=t-1$$$ .

From these equations, we can get that $$$1-q_{i_k}=(t-1)\frac{\frac{1}{v_k}}{\sum\limits_{j=1}^t\frac{1}{v_j}}\gt 0\ (1\leq k\leq t)$$$ .

Therefore, from what we have discussed (Statement 2), $$$v_{i_1}p_{i_1},v_{i_2}p_{i_2},\dots,v_{i_t}p_{i_t}$$$ reach maximum. So we have $$$v_{i_1}p_{i_1}=v_{i_2}p_{i_2}=\dots=v_{i_t}p_{i_t}$$$ . And we have $$$p_{i_1}+p_{i_2}+\dots+p_{i_t}=1$$$ 。

From these equations, we can get that $$$p_{i_k}=\frac{\frac{1}{v_k}}{\sum\limits_{j=1}^t\frac{1}{v_j}}\ (1\leq k\leq t)$$$ .

And we use these results to get the expectation of the final gain for Alice:

$$$\sum\limits_{k=1}^t \left(v_{i_k}\times\frac{\frac{1}{v_k}}{\sum\limits_{j=1}^t\frac{1}{v_j}}\times(t-1)\frac{\frac{1}{v_k}}{\sum\limits_{j=1}^t\frac{1}{v_j}}\right)=\frac{t-1}{\sum\limits_{j=1}^t\frac{1}{v_j}}$$$ .

So, we should choose the largest $$$t$$$ $$$v_j$$$-s to maximize this value. However, $$$t$$$ is yet to be determined, and it doesn't seem to have an easy way to find the best $$$t$$$ . And comparisons between results of different $$$t$$$-s without precision loss are seemingly quite difficult for me.

Some comments

There is one way to make this problem solvable: Reduce the problem to one query and return a real value for each query, not something about modulo. Then, we can iterate over the number of numbers to choose and we add those small numbers.

Our team spent quite long hours trying to get a glympse of this problem's solution, only to find the answers of others are just wrong. It is quite upset for us.

But there are more things about this problem. Here, instinct tells us that we don't have much reason to ignore some options. But logic tells something different. So careful analysis can be the key.

So, maybe, Anti-Instinct is yet another AI we should never ignore.

Комментарии (5)

Написать комментарий?

xxh1999

17 часов назад, # |

← Rev. 2 →

Very good article. If it weren't for problem J, I would have finished all the other problems (perhaps

→ Ответить

AksLolCoding

7 часов назад, # |

-13

Maybe tourist can solve the original problem?

Little_Sheep_Yawn

3 часа назад, # |

Auto comment: topic has been updated by Little_Sheep_Yawn (previous revision, new revision, compare).

2 часа назад, # |

Actually it might be an NP problem since we are not sure how to do increment. Here's the example I found during the contest: Consider the case

What is the optimal strategy? I'm sure in the incorrect solution, everyone thinks we shall make increment to the smallest element. But in this case, the last element won't be choose initially. Shall we make it 2 to bring it back to game or shall we just add one to the first four elements? Calculation tells us it's better to make it 3 2 2 2 1. It's different in the case below:

In this case, it's better to make it 2 2 2. This is in the case where we use actual fractions for comparison instead of taking the modulus, otherwise we can't even handle the case $$$k=0$$$. With large $$$K (1e8)$$$, I can not imagine the optimal strategy.

112 минут назад, # ^ |

If we "decide" how many elements to choose, we can always increment the smallest ones in these elements.

So if there is only one query and we can compare between different strategies effectively (or we don't need to compare them without precision loss), we can iterate over the count of elements to choose, and for each set of elements, we just need to add those smaller elements.

№	Пользователь	Рейтинг
1	jiangly	3845
2	tourist	3798
3	orzdevinwang	3706
4	jqdai0815	3682
5	ksun48	3589
6	Ormlis	3532
7	Benq	3468
8	Radewoosh	3463
9	ecnerwala	3451
10	Um_nik	3450

№	Пользователь	Вклад
1	cry	165
2	Qingyu	160
3	-is-this-fft-	159
4	atcoder_official	157
5	Dominater069	156
6	adamant	154
7	djm03178	151
8	luogu_official	149
9	awoo	147
10	TheScrasse	145

Блог пользователя Little_Sheep_Yawn

Hacking Case

What happened?

Some comments