Expected size of a set consisting of pairwise AND from two random sets

Пожалуйста, прочтите новое правило об ограничении использования AI-инструментов. ×

→ Обратите внимание

До соревнования
Codeforces Round 976 (Div. 2) and Divide By Zero 9.0
14:07:34
Зарегистрироваться »

*есть доп. регистрация

→ Трансляции

Codeforces Round 976 (Div 2) - Solution Discussion

Shayan

До начала 16:12:33

Всё →

→ Лидеры (рейтинг)

№	Пользователь	Рейтинг
1	tourist	4009
2	jiangly	3839
3	Radewoosh	3646
4	jqdai0815	3620
4	Benq	3620
6	orzdevinwang	3612
7	Geothermal	3569
7	cnnfls_csy	3569
9	ecnerwala	3494
10	Um_nik	3396

Страны | Города | Организации

Всё →

→ Лидеры (вклад)

№	Пользователь	Вклад
1	Um_nik	164
2	maomao90	160
3	-is-this-fft-	159
4	atcoder_official	158
4	awoo	158
4	cry	158
7	adamant	155
8	nor	154
9	TheScrasse	152
10	maroonrk	151

Всё →

→ Найти пользователя

→ Прямой эфир

Детальнее →

Блог пользователя hellman_

Expected size of a set consisting of pairwise AND from two random sets

Автор hellman_, 8 лет назад, По-английски

Hello everyone!

Here's an interesting problem. Sounds easy at first but I could not find a solution that matches experimental results. Maybe I am missing some simple observation.

You are given two integers 1 ≤ l₁, l₂ ≤ 2ⁿ. Consider two random subsets of n-bit vectors $\text{[math]}$ of sizes l₁, l₂ respectively. Let

$\text{[math]}$ , where & is a bitwise AND.

Compute the expected size of R (in polynomial in n time). Computing maximum possible size of R is interesting too.

An easier version is when bitwise AND is replaced by bitwise XOR, it is also interesting.

NOTE: I am not sure that this problem has a solution.

bit manipulation, and, xor, probability, expectation

hellman_
8 лет назад
13

Комментарии (13)

Написать комментарий?

geniucos

8 лет назад, # |

+18

It really sounds very interesting so I'm going to think about it for a while. Just for the record, are you sure it is solvable? (like have you got it frome somewhere or have you just asked yourself whether you can compute it or not?)

→ Ответить

hellman_

8 лет назад, # ^ |

No, I am not sure if it's solvable. This problem appeared in a research.

→ Ответить

majk

8 лет назад, # |

← Rev. 4 →

+13

Best I can do with the "xor" case is $\text{[math]}$ .

First, let's select a fixed n-bit vector r. There are exactly 2ⁿ pairs of {x, y}, such that $\text{[math]}$ . Note that all x_i (resp. y_i) are distinct. Define the following function: $\text{[math]}$ and observe that $\text{[math]}$ . In other words, to prevent r from being in R, we have to select whole S₂ outside of f(S₁).

There are l₁ elements in S₁ and l₂ elements in S₂. For any set S₁, there are exactly $\text{[math]}$ ways of choosing the set S₂, such that the above intersection is empty, and thus $\text{[math]}$ . There are exactly $\text{[math]}$ ways of selecting the set S₂.

Combined together, we have $\text{[math]}$ .

This yield an intermediate result, that (perhaps unsurprisingly), the expected size of R is 2ⁿ if l₁ + l₂ > 2ⁿ.

Otherwise, by linearity of expectation: $\text{[math]}$ . Now I would speculate that you can only calculate that in time $\text{[math]}$ (you can of course use Stirling's formula to get some upper and lower bounds faster).

→ Ответить

majk

8 лет назад, # ^ |

The "and" case is more complicated in the sense that the cardinality of set f(S₁) is not constant, and I don't yet see a simple way of expressing it, but maybe someone will.

→ Ответить

bicsi

8 лет назад, # ^ |

← Rev. 4 →

Ignore my post. I was wrong. :)

→ Ответить

hellman_

8 лет назад, # ^ |

+10

Linearity of expectation does not care about dependency actually.

I think majk meant l₁ + l₂ > 2ⁿ, becase in case of equality the numerator is equal to 1, not 0.

sage: n = 3; l1 = 4; l2 = 4; 1 - binomial(2**n-l1, l2)/binomial(2**n, l2)
69/70
sage: n = 3; l1 = 5; l2 = 4; 1 - binomial(2**n-l1, l2)/binomial(2**n, l2)
1

→ Ответить

bicsi

8 лет назад, # ^ |

I just checked that from wikipedia. You are right. Good to know linearity of expectation applies even when the variables are not independent.

→ Ответить

majk

8 лет назад, # ^ |

Thanks for correction hellman_, the inequality shall indeed be strict.

Exactly. You however need independence for linearity of variance.

→ Ответить

bicsi

8 лет назад, # |

← Rev. 3 →

+15

First of all, the expected size of the set can be stated as: $\text{[math]}$ (by linearity of expectation [please confirm]).

Let's now fix an element $\text{[math]}$ . Then $\text{[math]}$ .

Now, the real answer can be computed via dynamic programming in O(3ⁿ) or O(2ⁿ * n). I now conjecture that you can reduce this to polynomial in n by making the rather intuitive observation that the answer depends solely on the number of set bits of m.

→ Ответить

hellman_

8 лет назад, # ^ |

Could you elaborate on the DP part? Let n = 3. Then first we compute

$\text{[math]}$

How now to compute let's say $\text{[math]}$ ?

By your formula we can compute $\text{[math]}$

To get $\text{[math]}$ we also need to know $\text{[math]}$

(I am using the equation $\text{[math]}$ )

→ Ответить

bicsi

8 лет назад, # ^ |

← Rev. 2 →

Going from the computed values to the required ones might be harder using probability notion. The good thing is you can always transform it into a counting problem (and vice-versa), and then normalize it accordingly.

Seeing it as a counting problem, you can just use a simple inclusion-exclusion principle and, by induction from DP, all the values of greater masks will be correct.

→ Ответить

geniucos

8 лет назад, # ^ |

← Rev. 4 →

To reduce it to a polynomial problem you only need an easy observation: the probability for a number depends only on its number of 1s, letting you compute the dynamic programming in O(N). You would also need binomial coefficients to find out the answer which can be computed in O(N) as well so I guess that's all, isn't it?

PS: I consider the variant in which you have to compute the answer modulo a big prime number(like it's of the form P / Q and print P * Q ^ (-1)) because otherwise it still works but asking for precision for N a little bigger then 20 wouldn't work and with N small, you can backtracking for half of the values and it's much more interesting if you set N as 10^6

LE:The observation is right indeed, but the reccurence is not that simple actually so ignore most of what I said(except for the observation which should be relevant in any solution that uses those probabilities to compute the answer)

→ Ответить

bicsi

8 лет назад, # ^ |

Does the original problem ask to compute the answer modulo some prime? Because, if not, computing the combinatorial probabilities might be pretty painful without some Gaussian approximations. If so, then a rather small modulo would solve the problem of computing large binomial coefficients. (remember that binomials in this problem contain a (2ⁿ)! term).

I suppose some maths theorem would be helpful for a bigger prime...

→ Ответить

Соревнования по программированию 2.0

Время на сервере: 29.09.2024 04:27:27 (h1).

Десктопная версия, переключиться на мобильную.

При поддержке