Hello Codeforces! (Long Post!)
Introduction
I've recently solved my first ever $$$\ge$$$ 3000 rated problem (3300) on Codeforces: Codeforces Round #773 (Div. 1) Problem E — Special Positions. Since the solution I came up with is different from the one presented in the official editorial, I thought it would be valuable to the community if I share my approach.
Note that problem solving is a complicated & creative process. In the editorial, I show clean steps to reach the final solution, but the jump between consecutive steps is not always trivial (and sometimes hard!). Reaching this solution involved a large amount of drawings, observations, calculations & mistakes. There's no magic trick (at least to my knowledge :) ).
Alright, enough introduction, let's get started!
(click "The Editorial" below this line in order to be able to view the editorial)
From this point, I assume that you've at least read the problem, so if you haven't, click here in order to read the statement of the problem.
Note that in the editorial, I always use 0-based indexing.
We are given a non-empty subset $$$P$$$ of indexes and we wish to find the expected value of some scary expression dependent on $$$T$$$, where we randomly select $$$T$$$ out of all non-empty subsets of $$$P$$$ (with equal probability). Note that in this editorial, when I index $$$P$$$ or $$$T$$$, I refer to them as a sorted set. For instance, $$$P_0$$$ is the smallest element in $$$P$$$.
Recall the expression of interest: $$$\displaystyle\sum\limits_{i=0}^{n-1}{(a_i \cdot \min\limits_{j \in T}{|j-i|})}$$$ ($$$a$$$ is the given array, $$$T$$$ is the chosen subset of indexes out of all non-empty subsets of the given subset). This expression is essentially the sum of each element multiplied by some coefficient. When we talk about "the coefficient of an element", we refer to the value multiplied by the element in the summation.
For convenience when making arguments, we'll solve a version of the problem where $$$T$$$ can also be empty, i.e. a random subset of $$$P$$$ is selected with equal probability, and if it's empty we simply say that the resulting value is $$$0$$$. The reason this makes things more comfortable is that now the probability that a certain element $$$i$$$ of $$$P$$$ is selected to be in $$$T$$$ is exactly $$$\frac{1}{2}$$$; this wasn't the case before, because the empty set wasn't allowed. Further note that now, the probability that $$$k$$$ fixed elements of $$$P$$$ are in a specific state (i.e., for each of the $$$k$$$ elements, whether it's selected or not) is $$$\frac{1}{2^k}$$$.
We can recover the answer to the original problem using the fact that $$$\mathbb{E}[\text{the expression}] = \mathbb{E}[\text{the expression} \mid T \neq \emptyset] \cdot p(T \neq \emptyset)$$$ (the expression is $$$0$$$ when $$$T$$$ is empty), namely $$$\mathbb{E}[\text{the expression} \mid T \neq \emptyset] = \mathbb{E}[\text{the expression}] / p(T \neq \emptyset)$$$.
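Concretely, since each of the $$$m$$$ elements of $$$P$$$ is in $$$T$$$ independently with probability $$$\frac{1}{2}$$$, we have $$$p(T \neq \emptyset) = 1 - \frac{1}{2^m}$$$, so $$$\mathbb{E}[\text{the expression} \mid T \neq \emptyset] = \mathbb{E}[\text{the expression}] \cdot \frac{2^m}{2^m - 1}$$$ (if the answer is requested modulo a prime, the division is done via a modular inverse).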
Note: In the editorial, we sometimes make use of powers of $$$2$$$ or $$$\frac{1}{2}$$$ between the $$$0$$$th & the $$$m$$$th power. These can be calculated in $$$O(1)$$$, after pre-processing all of them in $$$O(m)$$$ and keeping them in some array.
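Here's a minimal sketch of this preprocessing (assuming the answer is requested modulo the NTT-friendly prime $$$998244353$$$; swap in the problem's actual modulus. The names `pw`, `init_pows` & `p2` are mine, and the later snippets in this post reuse them):

```cpp
#include <bits/stdc++.h>
using namespace std;
using ll = long long;

const ll MOD = 998244353; // assumption: NTT-friendly prime; adjust to the problem's modulus

// modular exponentiation: b^e mod m
ll pw(ll b, ll e, ll m) {
    ll r = 1; b %= m;
    for (; e > 0; e >>= 1, b = b * b % m)
        if (e & 1) r = r * b % m;
    return r;
}

// pw2[i] = 2^i mod MOD, ipw2[i] = 2^{-i} mod MOD, preprocessed in O(LIM)
vector<ll> pw2, ipw2;
void init_pows(int LIM) {
    pw2.assign(LIM + 1, 1);
    ipw2.assign(LIM + 1, 1);
    ll inv2 = pw(2, MOD - 2, MOD); // inverse of 2 by Fermat's little theorem
    for (int i = 1; i <= LIM; i++) {
        pw2[i] = pw2[i - 1] * 2 % MOD;
        ipw2[i] = ipw2[i - 1] * inv2 % MOD;
    }
}

// 2^e mod MOD in O(1), for possibly negative e
ll p2(ll e) { return e >= 0 ? pw2[e] : ipw2[-e]; }
```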
Let the distance between two indexes be the absolute value of their difference.
Notice that the coefficient of $$$a_i$$$ in the expression is the distance from its index to the closest index which is in $$$T$$$. Furthermore, observe that this index is either the closest index from the left or the closest index from the right, namely, the biggest index in $$$T$$$ which is $$$< i$$$, or the smallest index in $$$T$$$ which is $$$> i$$$.
Initially when solving this problem, after making some observations on the structure of the problem, one could try to "solve the problem for each suffix" (you can arrive at attempting this by noticing that if you select some index $$$P_x$$$ to be in $$$T$$$, then the values of the coefficients of everything to the right of it are completely independent of whether anything to the left of $$$P_x$$$ is in $$$T$$$ or not, as such indexes are necessarily farther from everything to the right of $$$P_x$$$ than $$$P_x$$$ itself is).
Namely, you can sort of do something like letting $$$f(j)$$$ be the answer to the problem where $$$P_j$$$ is given to be in $$$T$$$ and the sum is calculated over the array $$$a_{P_j} .. a_{n-1}$$$ (i.e., the "world" of the problem is only the suffix starting at $$$P_j$$$). Note that the answer to the whole problem can be expressed by summing, for each element of $$$P$$$, the probability that it's the first index chosen to be in $$$T$$$ (i.e., all the indexes in $$$P$$$ smaller than it are not chosen to be in $$$T$$$), multiplied by the expected value given that, which is exactly $$$f$$$ of that index plus the cost of the prefix up to the first chosen index.

The probability that $$$P_i$$$ is the first chosen element is $$$\frac{1}{2^{i+1}}$$$, so the answer to the full problem is $$$\displaystyle\sum\limits_{i=0}^{m-1}{(\frac{1}{2^{i+1}} \cdot (f(i) + \text{cost of the prefix } 0..P_{i} \text{, where only } P_{i} \text{ is selected}))}$$$, and the cost of the prefix is $$$\displaystyle\sum\limits_{j=0}^{P_i}{(P_i-j) \cdot a_j}$$$.

So, how do we calculate $$$f(j)$$$? Suppose the first index chosen after $$$P_j$$$ is $$$P_k$$$ ($$$P_{j+1}$$$ .. $$$P_{k-1}$$$ are all not chosen). The probability of this happening is $$$\frac{1}{2^{k-j}}$$$, and given this, the expected value of everything from $$$P_k$$$ onwards is exactly $$$f(k)$$$! Note that with probability $$$\frac{1}{2^{m-1-j}}$$$ nothing after $$$P_j$$$ is chosen, and in this case the value is easy to calculate. Before writing down an expression for $$$f(j)$$$, let's introduce some helper functions.
Suppose index $$$i$$$ is chosen, and the next index chosen after it in the array is index $$$j$$$ (no other index is chosen in between). Then the "value" of this part of the array is independent of whether indexes $$$< i$$$ or $$$> j$$$ are in $$$T$$$ or not, and is precisely equal to $$$0 \cdot a_i + 1 \cdot a_{i+1} + 2 \cdot a_{i+2} + .. + \lfloor \frac{j-i}{2} \rfloor \cdot a_{\lfloor \frac{i+j}{2} \rfloor} + .. + 2 \cdot a_{j-2} + 1 \cdot a_{j-1} + 0 \cdot a_{j}$$$. The value goes up by $$$1$$$ each time you "step" forward in the region where $$$i$$$ is closer to the index than $$$j$$$ is, and symmetrically for the other side. Formally, this is due to the coefficient of $$$a_k$$$, where $$$i \le k \le j$$$, being $$$\min(k-i, j-k)$$$ and the way this function behaves. Let $$$cost(i,j)$$$ be this value. Let $$$pcost(i)$$$ be the "value" of the prefix $$$[0,i]$$$ where only $$$i$$$ is in $$$T$$$. Similarly, let $$$scost(i)$$$ be the "value" of the suffix $$$[i,n-1]$$$ where only $$$i$$$ is in $$$T$$$. The formulas for $$$pcost$$$ and $$$scost$$$ are simply, for each element, its value multiplied by its distance from $$$i$$$.
To summarize: $$$cost(i,j) = \displaystyle\sum\limits_{k=i}^{j}{\min(k-i, j-k) \cdot a_k}$$$
$$$pcost(i) = \displaystyle\sum\limits_{k=0}^{i}{(i-k) \cdot a_k}$$$
$$$scost(i) = \displaystyle\sum\limits_{k=i}^{n-1}{(k-i) \cdot a_k}$$$
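As a reference, here are direct, naive transcriptions of these three formulas, $$$O(n)$$$ per call (`a` holds the array values already reduced modulo `MOD`; the `_naive` names are mine):

```cpp
// cost(i, j): coefficient of a_k is min(k-i, j-k)
ll cost_naive(const vector<ll>& a, int i, int j) {
    ll s = 0;
    for (int k = i; k <= j; k++)
        s = (s + (ll)min(k - i, j - k) * a[k]) % MOD;
    return s;
}
// pcost(i): value of the prefix [0, i] when only i is chosen
ll pcost_naive(const vector<ll>& a, int i) {
    ll s = 0;
    for (int k = 0; k <= i; k++)
        s = (s + (ll)(i - k) * a[k]) % MOD;
    return s;
}
// scost(i): value of the suffix [i, n-1] when only i is chosen
ll scost_naive(const vector<ll>& a, int i) {
    ll s = 0;
    for (int k = i; k < (int)a.size(); k++)
        s = (s + (ll)(k - i) * a[k]) % MOD;
    return s;
}
```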
By utilizing all of our observations & these helper functions, we can now write a clean formula for $$$f(j)$$$:
$$$f(j) = \displaystyle\sum\limits_{k=j+1}^{m-1}{(\frac{1}{2^{k-j}} \cdot (cost(P_j,P_k) + f(k)))} + \frac{1}{2^{m-1-j}} \cdot scost(P_j)$$$
Calculate all the $$$f$$$ values in decreasing order of $$$j$$$, and without any extra work we now have an $$$O(m^2 \cdot n)$$$ solution. We need to do much better.
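Here's a sketch of that baseline (purely illustrative, not the final solution; it reuses `p2` and the naive helpers from the earlier sketches, and `P` holds the sorted special positions):

```cpp
// f[j] for j = m-1 .. 0, each in O(m) terms with an O(n) cost per term
vector<ll> compute_f(const vector<ll>& a, const vector<int>& P) {
    int m = P.size();
    vector<ll> f(m, 0);
    for (int j = m - 1; j >= 0; j--) {
        // with probability 1/2^{m-1-j} nothing after P_j is chosen
        ll v = p2(-(m - 1 - j)) * scost_naive(a, P[j]) % MOD;
        for (int k = j + 1; k < m; k++) // P_k is the first index chosen after P_j
            v = (v + p2(-(k - j)) * ((cost_naive(a, P[j], P[k]) + f[k]) % MOD)) % MOD;
        f[j] = v;
    }
    return f;
}

// answer of the relaxed problem (empty T allowed), summing over the first chosen index
ll baseline_answer(const vector<ll>& a, const vector<int>& P) {
    vector<ll> f = compute_f(a, P);
    ll ans = 0;
    for (int i = 0; i < (int)P.size(); i++)
        ans = (ans + p2(-(i + 1)) * ((f[i] + pcost_naive(a, P[i])) % MOD)) % MOD;
    return ans;
}
```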
Claim: with $$$O(n)$$$ of preprocessing, $$$cost$$$, $$$pcost$$$ & $$$scost$$$ can be calculated in $$$O(1)$$$.
How? Prefix sums! (a bunch of them)
As mentioned before, the coefficients of the elements in $$$cost$$$ have an increasing part & a decreasing part.
Let $$$inc(l,r) = \displaystyle\sum\limits_{i=l}^{r}{((i-l) \cdot a_i)}$$$ (each element multiplied by its distance from the left end: the first by $$$0$$$, the second by $$$1$$$, ...).
Let $$$dec(l,r) = \displaystyle\sum\limits_{i=l}^{r}{((r-i) \cdot a_i)}$$$ (each element multiplied by its distance from the right end: the last by $$$0$$$, the one before last by $$$1$$$, ...).
For convenience, if $$$l > r$$$, or if at least one of them isn't a valid index, let $$$inc$$$ & $$$dec$$$ both be $$$0$$$.
Observe that based on the way we analyzed $$$cost$$$ before, $$$cost(l,r) = inc(l, \lfloor \frac{l+r}{2} \rfloor) + dec(\lfloor \frac{l+r}{2} \rfloor + 1, r)$$$
Let's find a way to use some prefix sums for a fast calculation of $$$inc$$$ & $$$dec$$$.
Let $$$g(i)$$$ be the sum of the first $$$i+1$$$ elements. Notice that $$$g(0) = a_0$$$, and $$$g(i) = a_i + g(i-1)$$$ for $$$i > 0$$$.
Let $$$g_{u}(i)$$$ be the summation of each of the first $$$i+1$$$ elements multiplied by its index. Notice that $$$g_{u}(0) = 0$$$, and $$$g_{u}(i) = i \cdot a_i + g_{u}(i-1)$$$ for $$$i > 0$$$.
Let $$$g_{d}(i)$$$ be the summation of each element in the suffix $$$[i, n-1]$$$ multiplied by $$$n-1$$$ minus its index. Notice that $$$g_{d}(n-1) = 0$$$, and $$$g_{d}(i) = (n - 1 - i) \cdot a_i + g_{d}(i+1)$$$ for $$$i < n - 1$$$.
Each of these functions' values over all indexes can be pre-processed using their recurrence relations.
For convenience, define the value of each of these functions at numbers which are not a valid index to be $$$0$$$.
Let $$$sum(l, r)$$$ be the sum of the elements in the subarray $$$a_l .. a_r$$$. Notice that $$$sum(l, r) = g(r) - g(l-1)$$$.
Claim: $$$inc(l, r) = g_{u}(r) - g_{u}(l-1) - l \cdot sum(l, r)$$$, $$$dec(l, r) = g_{d}(l) - g_{d}(r+1) - (n - 1 - r) \cdot sum(l, r)$$$. You can see why these claims are true visually, or quickly prove them formally by playing with the summation expressions.
Substituting into the $$$cost$$$ formula: $$$cost(l, r) = g_{u}(\lfloor \frac{l+r}{2} \rfloor) - g_{u}(l-1) - l \cdot sum(l, \lfloor \frac{l+r}{2} \rfloor) + g_{d}(\lfloor \frac{l+r}{2} \rfloor + 1) - g_{d}(r+1) - (n - 1 - r) \cdot sum(\lfloor \frac{l+r}{2} \rfloor + 1, r)$$$
$$$cost(l, r) = g_{u}(\lfloor \frac{l+r}{2} \rfloor) - g_{u}(l-1) - l \cdot (g(\lfloor \frac{l+r}{2} \rfloor) - g(l-1)) + g_{d}(\lfloor \frac{l+r}{2} \rfloor + 1) - g_{d}(r+1) - (n - 1 - r) \cdot (g(r) - g(\lfloor \frac{l+r}{2} \rfloor))$$$
Note that with similar logic: $$$pcost(i) = dec(0, i)$$$, $$$scost(i) = inc(i, n-1)$$$
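Here's a sketch of the whole $$$O(n)$$$ preprocessing together with the resulting $$$O(1)$$$ helpers (out-of-range accesses return $$$0$$$, matching the conventions above; the accessor names are mine, and `MOD` is from the first snippet):

```cpp
int n;                    // array length
vector<ll> g, gu, gd;     // the three prefix/suffix sums

void init_prefix(const vector<ll>& a) {
    n = a.size();
    g.assign(n, 0); gu.assign(n, 0); gd.assign(n, 0);
    g[0] = a[0];
    for (int i = 1; i < n; i++) g[i] = (g[i - 1] + a[i]) % MOD;
    for (int i = 1; i < n; i++) gu[i] = (gu[i - 1] + (ll)i * a[i]) % MOD;
    for (int i = n - 2; i >= 0; i--) gd[i] = (gd[i + 1] + (ll)(n - 1 - i) * a[i]) % MOD;
}

// accessors that return 0 outside [0, n-1]
ll g_at(int i)  { return (i < 0 || i >= n) ? 0 : g[i]; }
ll gu_at(int i) { return (i < 0 || i >= n) ? 0 : gu[i]; }
ll gd_at(int i) { return (i < 0 || i >= n) ? 0 : gd[i]; }

ll SUM(int l, int r) { return ((g_at(r) - g_at(l - 1)) % MOD + MOD) % MOD; }
ll INC(int l, int r) {
    if (l > r) return 0;
    return ((gu_at(r) - gu_at(l - 1) - (ll)l * SUM(l, r)) % MOD + MOD) % MOD;
}
ll DEC(int l, int r) {
    if (l > r) return 0;
    return ((gd_at(l) - gd_at(r + 1) - (ll)(n - 1 - r) * SUM(l, r)) % MOD + MOD) % MOD;
}
ll COST(int l, int r) { int mid = (l + r) / 2; return (INC(l, mid) + DEC(mid + 1, r)) % MOD; }
ll PCOST(int i) { return DEC(0, i); }
ll SCOST(int i) { return INC(i, n - 1); }
```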
So now $$$f(j)$$$ can be calculated in $$$O(m)$$$ (as we can calculate $$$scost$$$ & $$$cost$$$ in $$$O(1)$$$ after $$$O(n)$$$ preprocessing), so our new best complexity is $$$O(m^2 + n)$$$. This is still not good enough.
So at this point one can continue with the $$$f(j)$$$ idea, and perhaps maintain some weighted sums of terms dependent on $$$l$$$ and terms dependent on $$$r$$$, but $$$f(k)$$$ & the terms dependent on $$$\lfloor \frac{l+r}{2} \rfloor$$$ make it hard.
We need to look at the problem from a new perspective.
For a specific chosen subset $$$T$$$, if we think about it, we gain some combination of $$$cost$$$ values, a $$$pcost$$$ value & a $$$scost$$$ value. Specifically, we get the $$$pcost$$$ of the smallest index chosen, the $$$scost$$$ of the largest index chosen, and $$$cost$$$ values for all consecutive pairs in $$$T$$$ (if we look at $$$T$$$ as a sorted list of indexes).
For example: — — — — X — — — — X — — — — X — — X — — —
(X represents a selected spot)
In this example, we gain $$$pcost(4)$$$, $$$cost(4, 9)$$$, $$$cost(9, 14)$$$, $$$cost(14, 17)$$$, $$$scost(17)$$$
The idea is that the expected value of the expression should be equal to the sum, over each $$$pcost$$$/$$$cost$$$/$$$scost$$$ value, of the value multiplied by the probability that its "block" appears in a picture like the one in the above example.
Let's introduce this idea more formally by using linearity of expectation.
Note: I'm being extra formal here; I suggest you simply take a pen & some paper and draw to understand the above claim. What I essentially try to show formally in the explanation below is that the expected value can be expressed as the sum of the "expected values" of each "block" ($$$cost$$$/$$$pcost$$$/$$$scost$$$), where the value of a "block" is $$$0$$$ if it doesn't appear, or its value if it does.
Let $$$C_{i,j}(T)$$$ be $$$cost(P_i,P_j)$$$ if $$$P_i$$$ & $$$P_j$$$ are in $$$T$$$ and no index between them is in $$$T$$$, or $$$0$$$ otherwise.
Let $$$PC_i(T)$$$ be $$$pcost(P_i)$$$ if $$$P_i$$$ is the smallest index in $$$T$$$, or $$$0$$$ otherwise.
Let $$$SC_i(T)$$$ be $$$scost(P_i)$$$ if $$$P_i$$$ is the largest index in $$$T$$$, or $$$0$$$ otherwise.
The expression we desire to find the expected value of can simply be expressed as the sum of all of these variables, because each variable represents a unique "block", and is $$$0$$$ if that block doesn't appear in the picture above, so only the values of relevant blocks are counted.
So $$$\displaystyle\sum\limits_{i=0}^{n-1}{(a_i \cdot \min\limits_{j \in T}{|j-i|})} = \displaystyle\sum\limits_{i=0}^{m-1}{(\displaystyle\sum\limits_{j=i+1}^{m-1}{(C_{i,j}(T))})} + \displaystyle\sum\limits_{i=0}^{m-1}{(PC_i(T) + SC_i(T))}$$$
Apply linearity of expectation:
$$$\mathbb{E}[\text{the expression}] = \displaystyle\sum\limits_{i=0}^{m-1}{(\displaystyle\sum\limits_{j=i+1}^{m-1}{(\mathbb{E}[C_{i,j}])})} + \displaystyle\sum\limits_{i=0}^{m-1}{(\mathbb{E}[PC_i] + \mathbb{E}[SC_i])}$$$
The probability of $$$C_{i,j}$$$ being "active" (the block appears) is $$$\frac{1}{2^{j-i+1}}$$$ (we essentially ask for the probability of a certain fixed state of the $$$j-i+1$$$ elements of $$$P$$$ which are within the block's region).
Therefore, $$$\mathbb{E}[C_{i,j}] = \frac{1}{2^{j-i+1}} \cdot cost(P_i, P_j)$$$.
By applying similar logic: $$$\mathbb{E}[PC_i] = \frac{1}{2^{i+1}} \cdot pcost(P_i)$$$, $$$\mathbb{E}[SC_i] = \frac{1}{2^{m-1-i+1}} \cdot scost(P_i)$$$
Substituting the results:
$$$\mathbb{E}[\text{the expression}] = \displaystyle\sum\limits_{i=0}^{m-1}{(\displaystyle\sum\limits_{j=i+1}^{m-1}{(\frac{1}{2^{j-i+1}} \cdot cost(P_i, P_j))})} + \displaystyle\sum\limits_{i=0}^{m-1}{(\frac{1}{2^{i+1}} \cdot pcost(P_i) + \frac{1}{2^{m-1-i+1}} \cdot scost(P_i))}$$$
As we have $$$O(1)$$$ expressions for $$$cost$$$, $$$pcost$$$ & $$$scost$$$ (after preprocessing in $$$O(n)$$$), this can be calculated in $$$O(m^2)$$$. So, we have a different $$$O(m^2 + n)$$$ solution.
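A sketch of this quadratic solution (it computes the expectation of the relaxed problem; as discussed at the beginning, dividing by $$$p(T \neq \emptyset)$$$ recovers the requested value):

```cpp
// O(m^2 + n): evaluate the expectation formula directly with the O(1) helpers
ll expectation_quadratic(const vector<int>& P) {
    int m = P.size();
    ll ans = 0;
    for (int i = 0; i < m; i++) {
        ans = (ans + p2(-(i + 1)) * PCOST(P[i])) % MOD;   // E[PC_i]
        ans = (ans + p2(-(m - i)) * SCOST(P[i])) % MOD;   // E[SC_i]; note m-1-i+1 = m-i
        for (int j = i + 1; j < m; j++)                   // E[C_{i,j}]
            ans = (ans + p2(-(j - i + 1)) * COST(P[i], P[j])) % MOD;
    }
    return ans;
}
```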
We can calculate the $$$\displaystyle\sum\limits_{i=0}^{m-1}{(\frac{1}{2^{i+1}} \cdot pcost(P_i) + \frac{1}{2^{m-1-i+1}} \cdot scost(P_i))}$$$ in $$$O(n)$$$, so the main challenge is to calculate $$$\displaystyle\sum\limits_{i=0}^{m-1}{(\displaystyle\sum\limits_{j=i+1}^{m-1}{(\frac{1}{2^{j-i+1}} \cdot cost(P_i, P_j))})}$$$ fast.
Recall the $$$cost$$$ formula we found:
$$$cost(l, r) = g_{u}(\lfloor \frac{l+r}{2} \rfloor) - g_{u}(l-1) - l \cdot (g(\lfloor \frac{l+r}{2} \rfloor) - g(l-1)) + g_{d}(\lfloor \frac{l+r}{2} \rfloor + 1) - g_{d}(r+1) - (n - 1 - r) \cdot (g(r) - g(\lfloor \frac{l+r}{2} \rfloor))$$$
let's group together terms that depend on the same thing:
$$$cost(l, r) = - g_{u}(l-1) + l \cdot g(l-1) - g_{d}(r+1) - (n - 1 - r) \cdot g(r) + g_{u}(\lfloor \frac{l+r}{2} \rfloor) + g_{d}(\lfloor \frac{l+r}{2} \rfloor + 1) + (n - 1 - (r + l)) \cdot g(\lfloor \frac{l+r}{2} \rfloor)$$$
Let $$$A(x) = - g_{u}(x-1) + x \cdot g(x-1)$$$
Let $$$B(x) = - g_{d}(x+1) - (n - 1 - x) \cdot g(x)$$$
Let $$$W(x) = g_{u}(\lfloor \frac{x}{2} \rfloor) + g_{d}(\lfloor \frac{x}{2} \rfloor + 1) + (n - 1 - x) \cdot g(\lfloor \frac{x}{2} \rfloor)$$$
Then $$$cost(l,r) = A(l) + B(r) + W(l+r)$$$
This expression seems much more friendly! Note that the values of $$$A(x)$$$, $$$B(x)$$$ & $$$W(x)$$$ for all valid $$$x$$$ can be pre-processed in $$$O(n)$$$ (they all depend on functions which we have shown can be pre-processed in $$$O(n)$$$).
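A sketch of this preprocessing (the array names `Aarr`, `Barr` & `Warr` are mine; note that $$$W$$$ is needed for $$$x$$$ up to $$$2n-2$$$, since $$$P_i + P_j \le 2n-2$$$):

```cpp
vector<ll> Aarr, Barr, Warr;

void init_ABW() {
    Aarr.assign(n, 0); Barr.assign(n, 0); Warr.assign(2 * n - 1, 0);
    for (int x = 0; x < n; x++) {
        // A(x) = -g_u(x-1) + x * g(x-1)
        Aarr[x] = (((ll)x * g_at(x - 1) % MOD - gu_at(x - 1)) % MOD + MOD) % MOD;
        // B(x) = -(g_d(x+1) + (n-1-x) * g(x))
        ll t = (gd_at(x + 1) + (ll)(n - 1 - x) * g_at(x)) % MOD;
        Barr[x] = (MOD - t) % MOD;
    }
    for (int x = 0; x <= 2 * n - 2; x++) {
        // W(x) = g_u(x/2) + g_d(x/2 + 1) + (n-1-x) * g(x/2); n-1-x may be negative here
        ll c = (((ll)(n - 1 - x)) % MOD + MOD) % MOD;
        Warr[x] = (gu_at(x / 2) + gd_at(x / 2 + 1) + c * g_at(x / 2)) % MOD;
    }
}
```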
Substituting in the value that we are still looking for: $$$\displaystyle\sum\limits_{i=0}^{m-1}{(\displaystyle\sum\limits_{j=i+1}^{m-1}{(\frac{1}{2^{j-i+1}} \cdot cost(P_i, P_j))})} = \displaystyle\sum\limits_{i=0}^{m-1}{(\displaystyle\sum\limits_{j=i+1}^{m-1}{(\frac{A(P_i) + B(P_j) + W(P_i+P_j)}{2^{j-i+1}})})}$$$
Let $$$S_A = \displaystyle\sum\limits_{i=0}^{m-1}{(\displaystyle\sum\limits_{j=i+1}^{m-1}{(\frac{A(P_i)}{2^{j-i+1}})})}$$$
Let $$$S_B = \displaystyle\sum\limits_{i=0}^{m-1}{(\displaystyle\sum\limits_{j=i+1}^{m-1}{(\frac{B(P_j)}{2^{j-i+1}})})}$$$
Let $$$S_W = \displaystyle\sum\limits_{i=0}^{m-1}{(\displaystyle\sum\limits_{j=i+1}^{m-1}{(\frac{W(P_i+P_j)}{2^{j-i+1}})})}$$$
So, the value we are still looking for is exactly $$$S_A + S_B + S_W$$$. If we are able to calculate the value of each of the 3 terms fast, we win.
Let's begin with $$$S_A$$$.
$$$S_A = \displaystyle\sum\limits_{i=0}^{m-1}{(\displaystyle\sum\limits_{j=i+1}^{m-1}{(\frac{A(P_i)}{2^{j-i+1}})})} = \displaystyle\sum\limits_{i=0}^{m-1}{(\frac{A(P_i)}{2^{-i+1}} \cdot \displaystyle\sum\limits_{j=i+1}^{m-1}{(\frac{1}{2^j})})}$$$
Let's compute the sum by iterating from $$$i=m-1$$$ down to $$$i=0$$$. During the calculation of the terms, we'll maintain a helper variable $$$H_1 = \displaystyle\sum\limits_{j=i+1}^{m-1}{(\frac{1}{2^j})}$$$, which is the value we need to multiply $$$\frac{A(P_i)}{2^{-i+1}}$$$ by.
At first, before iterating over the $$$i$$$ values, set $$$H_1 = 0$$$, $$$S_A = 0$$$. When we are at some $$$i$$$, add $$$\frac{A(P_i)}{2^{-i+1}} \cdot H_1$$$ to $$$S_A$$$. Then, add $$$\frac{1}{2^i}$$$ to $$$H_1$$$, to be used in the following iterations.
If you are uncomfortable with this approach, you can also pre-process $$$H_1(i) = \displaystyle\sum\limits_{j=i+1}^{m-1}{(\frac{1}{2^j})}$$$ like a DP (like pre-processing array prefix/suffix sums), then use these values to calculate $$$S_A$$$ without maintaining an extra helper variable during the calculation.
To summarize, we have demonstrated a way to calculate $$$S_A$$$ in $$$O(m+n)$$$ (the $$$+n$$$ part is due to preprocessing the $$$A(x)$$$ values).
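A sketch of the sweep with the single helper variable (`Aarr` & `p2` are from the earlier sketches):

```cpp
// S_A in O(m): sweep i from m-1 down to 0, maintaining H1 = sum_{j=i+1}^{m-1} 2^{-j}
ll compute_SA(const vector<int>& P) {
    int m = P.size();
    ll SA = 0, H1 = 0;
    for (int i = m - 1; i >= 0; i--) {
        // A(P_i) / 2^{-i+1} = A(P_i) * 2^{i-1}
        SA = (SA + Aarr[P[i]] * p2(i - 1) % MOD * H1) % MOD;
        H1 = (H1 + p2(-i)) % MOD;
    }
    return SA;
}
```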
One down, two left.
Let's tackle $$$S_B$$$.
$$$S_B = \displaystyle\sum\limits_{i=0}^{m-1}{(\displaystyle\sum\limits_{j=i+1}^{m-1}{(\frac{B(P_j)}{2^{j-i+1}})})} = \displaystyle\sum\limits_{i=0}^{m-1}{(\frac{1}{2^{-i+1}} \cdot \displaystyle\sum\limits_{j=i+1}^{m-1}{(\frac{B(P_j)}{2^{j}})})}$$$
We can use exactly the same kind of trick. Let $$$H_2(i) = \displaystyle\sum\limits_{j=i+1}^{m-1}{(\frac{B(P_j)}{2^{j}})}$$$ (the part which is multiplied by $$$\frac{1}{2^{-i+1}}$$$).
We can either maintain $$$H_2$$$ as a helper variable, iterate over the indexes backwards and add it multiplied by $$$\frac{1}{2^{-i+1}}$$$ to the result, or calculate it like a DP backwards (like prefix/suffix sums) then calculate $$$S_B$$$ with $$$O(1)$$$ per term.
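The same sketch, adapted for $$$S_B$$$:

```cpp
// S_B in O(m): the same backward sweep, with H2 = sum_{j=i+1}^{m-1} B(P_j) / 2^j
ll compute_SB(const vector<int>& P) {
    int m = P.size();
    ll SB = 0, H2 = 0;
    for (int i = m - 1; i >= 0; i--) {
        SB = (SB + p2(i - 1) * H2) % MOD;
        H2 = (H2 + Barr[P[i]] * p2(-i)) % MOD;
    }
    return SB;
}
```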
2/3 done, now for the hardest one — $$$S_W$$$.
Our trick won't work this time, we need to think harder.
$$$S_W = \displaystyle\sum\limits_{i=0}^{m-1}{(\displaystyle\sum\limits_{j=i+1}^{m-1}{(\frac{W(P_i+P_j)}{2^{j-i+1}})})} = \displaystyle\sum\limits_{i=0}^{m-1}{(\displaystyle\sum\limits_{j=i+1}^{m-1}{(\frac{1}{2^{-i+1}} \cdot \frac{1}{2^{j}} \cdot W(P_i+P_j))})}$$$.
This expression feels like polynomial multiplication!
Well, almost.
If we have a polynomial $$$U(x)$$$ of degree $$$u$$$, where the coefficient of $$$x^i$$$ is $$$s_i$$$ and a polynomial $$$V$$$ of degree $$$v$$$, where the coefficient of $$$x^i$$$ is $$$t_i$$$, then:
$$$U(x) = \displaystyle\sum\limits_{i=0}^{u}{(s_i \cdot x^{i})}$$$
$$$V(x) = \displaystyle\sum\limits_{i=0}^{v}{(t_i \cdot x^{i})}$$$
$$$U(x) \cdot V(x) = \displaystyle\sum\limits_{i=0}^{u}{(\displaystyle\sum\limits_{j=0}^{v}{(s_i \cdot x^{i} \cdot t_j \cdot x^{j})})} = \displaystyle\sum\limits_{i=0}^{u}{(\displaystyle\sum\limits_{j=0}^{v}{(s_i \cdot t_j \cdot x^{i+j})})}$$$
So, $$$U(x) \cdot V(x) = \displaystyle\sum\limits_{0 \le i \le u, 0 \le j \le v}{(s_i \cdot t_j \cdot x^{i+j})}$$$
Two polynomials can be multiplied in $$$O((u+v) \cdot \log(u+v))$$$ via the Fast Fourier Transform (FFT) or the Number Theoretic Transform (NTT). Important note: you do NOT need to understand how FFT/NTT works internally in order to understand this editorial. Just suppose you have a black box which can multiply two polynomials in the specified complexity.
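For completeness, here is one standard NTT sketch of such a black box for the prime $$$998244353$$$ (any template you trust works just as well; `pw` & `MOD` are from the first snippet):

```cpp
const ll GEN = 3; // primitive root of 998244353

// in-place NTT; inv = true computes the inverse transform
void ntt(vector<ll>& a, bool inv) {
    int sz = a.size(); // must be a power of two
    for (int i = 1, j = 0; i < sz; i++) { // bit-reversal permutation
        int bit = sz >> 1;
        for (; j & bit; bit >>= 1) j ^= bit;
        j ^= bit;
        if (i < j) swap(a[i], a[j]);
    }
    for (int len = 2; len <= sz; len <<= 1) {
        ll w = pw(GEN, (MOD - 1) / len, MOD);
        if (inv) w = pw(w, MOD - 2, MOD);
        for (int i = 0; i < sz; i += len) {
            ll wn = 1;
            for (int j = 0; j < len / 2; j++) {
                ll u = a[i + j], v = a[i + j + len / 2] * wn % MOD;
                a[i + j] = (u + v) % MOD;
                a[i + j + len / 2] = (u - v + MOD) % MOD;
                wn = wn * w % MOD;
            }
        }
    }
    if (inv) {
        ll sinv = pw(sz, MOD - 2, MOD);
        for (ll& x : a) x = x * sinv % MOD;
    }
}

// the promised black box: product of two polynomials
vector<ll> multiply(vector<ll> a, vector<ll> b) {
    int res = a.size() + b.size() - 1, sz = 1;
    while (sz < res) sz <<= 1;
    a.resize(sz); b.resize(sz);
    ntt(a, false); ntt(b, false);
    for (int i = 0; i < sz; i++) a[i] = a[i] * b[i] % MOD;
    ntt(a, true);
    a.resize(res);
    return a;
}
```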
Recall that $$$S_W = \displaystyle\sum\limits_{i=0}^{m-1}{(\displaystyle\sum\limits_{j=i+1}^{m-1}{(\frac{1}{2^{-i+1}} \cdot \frac{1}{2^{j}} \cdot W(P_i+P_j))})}$$$.
First, let's convert the problem into a problem of finding a polynomial. Let's replace $$$W(P_i + P_j)$$$ with $$$x^{P_i+P_j}$$$, namely, let's define a polynomial $$$G(x) = \displaystyle\sum\limits_{i=0}^{m-1}{(\displaystyle\sum\limits_{j=i+1}^{m-1}{(\frac{1}{2^{-i+1}} \cdot \frac{1}{2^{j}} \cdot x^{P_i+P_j})})}$$$. Let $$$g_i$$$ be the coefficient of $$$x^i$$$ in $$$G$$$. Now, notice that $$$S_W = \displaystyle\sum\limits_{i=0}^{2 \cdot n - 2}{(g_i \cdot W(i))}$$$, because $$$g_i$$$ is exactly the sum of all terms multiplied by $$$x^i$$$, which is exactly the sum of all terms multiplied by $$$W(i)$$$ in $$$S_W$$$.
So, if we can quickly find the coefficients of the polynomial $$$G(x)$$$, we win.
Since $$$x^{P_i+P_j}$$$ "gains" $$$\frac{1}{2^{-i+1}} \cdot \frac{1}{2^{j}}$$$, we are motivated to define two polynomials: one with $$$\frac{1}{2^{-i+1}}$$$ as the coefficient of $$$x^{P_i}$$$ for all $$$i$$$, and one with $$$\frac{1}{2^{j}}$$$ as the coefficient of $$$x^{P_j}$$$ for all $$$j$$$. Formally:
Let $$$Y(x) = \displaystyle\sum\limits_{i=0}^{n-1}{y_i \cdot x^i}$$$, where $$$y_{P_i} = \frac{1}{2^{-i+1}}$$$ for all $$$i$$$, and the rest of the $$$y$$$ values are $$$0$$$.
Let $$$K(x) = \displaystyle\sum\limits_{i=0}^{n-1}{k_i \cdot x^i}$$$, where $$$k_{P_i} = \frac{1}{2^{i}}$$$ for all $$$i$$$, and the rest of the $$$k$$$ values are $$$0$$$.
Notice that $$$G(x) = \displaystyle\sum\limits_{i=0}^{m-1}{(\displaystyle\sum\limits_{j=i+1}^{m-1}{(\frac{1}{2^{-i+1}} \cdot \frac{1}{2^{j}} \cdot x^{P_i+P_j})})} = \displaystyle\sum\limits_{i=0}^{m-1}{(\displaystyle\sum\limits_{j=i+1}^{m-1}{(y_{P_i} \cdot k_{P_j} \cdot x^{P_i+P_j})})} = \displaystyle\sum\limits_{0 \le i < j \le n - 1}{(y_i \cdot k_j \cdot x^{i+j})}$$$ (the last equality holds because all coefficients at positions outside $$$P$$$ are $$$0$$$, and $$$P$$$ is sorted, so $$$i < j$$$ over indexes of $$$P$$$ corresponds to $$$P_i < P_j$$$).
$$$Y(x) \cdot K(x) = \displaystyle\sum\limits_{0 \le i,j \le n-1}{(y_i \cdot k_j \cdot x^{i+j})}$$$
So, simply multiplying $$$Y(x)$$$ & $$$K(x)$$$ gives us exactly the expression we need, but with $$$0 \le i,j \le n-1$$$ instead of $$$0 \le i < j \le n-1$$$.
A vague informal idea: if we multiply all elements from the "first half" of $$$Y$$$ with all elements from the "second half" of $$$K$$$, then all of the pairs contributing to the coefficients of the resulting polynomial are pairs we need (as everything in the "second half" of $$$K$$$ has a strictly larger power of $$$x$$$ than everything in the "first half" of $$$Y$$$, and we look for pairs where $$$i < j$$$). Then we recursively calculate the contribution of elements from the "first half" of $$$Y$$$ combined with elements from the "first half" of $$$K$$$, and of elements from the "second half" of $$$Y$$$ combined with elements from the "second half" of $$$K$$$. (Obviously, the contribution of elements from the "second half" of $$$Y$$$ combined with elements from the "first half" of $$$K$$$ is $$$0$$$, as everything in the "second half" of $$$Y$$$ has a strictly larger power of $$$x$$$ than everything in the "first half" of $$$K$$$, and we look for pairs where $$$i < j$$$.)
Namely, we want a divide & conquer algorithm.
Define the first-half-polynomial of a polynomial $$$Z(x)$$$ of degree $$$d$$$, where $$$z_i$$$ is the coefficient of $$$x^i$$$, to be $$$Z_0 = \displaystyle\sum\limits_{i=0}^{\lfloor \frac{d}{2} \rfloor}{(z_i \cdot x^i)}$$$.
Define the second-half-polynomial of a polynomial $$$Z(x)$$$ of degree $$$d$$$, where $$$z_i$$$ is the coefficient of $$$x^i$$$, to be $$$Z_1 = \displaystyle\sum\limits_{i=\lfloor \frac{d}{2} \rfloor+1}^{d}{(z_i \cdot x^i)}$$$.
Let $$$U(x), V(x)$$$ both be polynomials of degree $$$d$$$, where the coefficients of $$$U(x)$$$ are $$$s_i$$$ and the coefficients of $$$V(x)$$$ are $$$t_i$$$. Define $$$U(x) \oplus V(x) = \displaystyle\sum\limits_{0 \le i < j \le d}{(s_i \cdot t_j \cdot x^{i+j})}$$$.
Note that based on this definition, $$$G(x) = Y(x) \oplus K(x)$$$.
Claim: Let $$$U(x), V(x)$$$ be polynomials defined exactly as before. For $$$d \ge 1$$$, $$$U(x) \oplus V(x) = U_0(x) \cdot V_1(x) + U_0(x) \oplus V_0(x) + U_1(x) \oplus V_1(x)$$$. This claim is true because for all valid pairs $$$i, j$$$, namely pairs where $$$i < j$$$: either $$$x^i$$$ is in $$$U_0$$$ & $$$x^j$$$ is in $$$V_1$$$, and these values we gain by regularly multiplying $$$U_0$$$ & $$$V_1$$$; or $$$x^i$$$ is in $$$U_0$$$ & $$$x^j$$$ is in $$$V_0$$$, in which case we can use the same function we use to calculate $$$U(x) \oplus V(x)$$$ recursively; or $$$x^i$$$ is in $$$U_1$$$ & $$$x^j$$$ is in $$$V_1$$$, in which case we can (again) recurse. Note that the only option we did not mention is $$$x^i$$$ being in $$$U_1$$$ & $$$x^j$$$ being in $$$V_0$$$, but this option is impossible, as it implies $$$i > j$$$.
Multiplying $$$U_0$$$ with $$$V_1$$$ takes $$$O(d \cdot \log(d))$$$ (as mentioned before, by using FFT/NTT). We recurse twice: once with $$$U_0$$$ & $$$V_0$$$, which are polynomials of degree $$$\lfloor \frac{d}{2} \rfloor$$$, and once with $$$U_1$$$ & $$$V_1$$$, which are polynomials of degree... $$$d$$$? This is clearly bad, but the coefficients of their first $$$\lfloor \frac{d}{2} \rfloor + 1$$$ powers of $$$x$$$ are $$$0$$$! So instead of recursing with $$$U_1(x)$$$ & $$$V_1(x)$$$, we can recurse with $$$\frac{U_1(x)}{x^{\lfloor \frac{d}{2} \rfloor + 1}}$$$ and $$$\frac{V_1(x)}{x^{\lfloor \frac{d}{2} \rfloor + 1}}$$$, which are polynomials of degree $$$d - \lfloor \frac{d}{2} \rfloor - 1 \le \lfloor \frac{d}{2} \rfloor$$$, then multiply the result by $$$x^{2 \cdot (\lfloor \frac{d}{2} \rfloor + 1)}$$$ to undo the effect of the divisions.
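A sketch of the recursion (`multiply` is the NTT black box from before; the index shifts implement exactly the $$$x$$$-power bookkeeping described above):

```cpp
// U ⊕ V = sum over i<j of U[i]*V[j]*x^{i+j}; U and V must have the same size (degree d)
vector<ll> oplus(const vector<ll>& U, const vector<ll>& V) {
    int d = (int)U.size() - 1;
    vector<ll> res(max(1, 2 * d + 1), 0);
    if (d <= 0) return res; // a single coefficient admits no pair i < j
    int mid = d / 2;
    vector<ll> U0(U.begin(), U.begin() + mid + 1), U1(U.begin() + mid + 1, U.end());
    vector<ll> V0(V.begin(), V.begin() + mid + 1), V1(V.begin() + mid + 1, V.end());
    // 1) U_0 * V_1: every exponent in U_0 is <= mid < every exponent in V_1.
    //    Slicing V_1 already divided it by x^{mid+1}, so shift the product back up.
    vector<ll> cross = multiply(U0, V1);
    for (int s = 0; s < (int)cross.size(); s++)
        res[s + mid + 1] = (res[s + mid + 1] + cross[s]) % MOD;
    // 2) U_0 ⊕ V_0: pairs entirely inside the first halves, no shift needed
    vector<ll> left = oplus(U0, V0);
    for (int s = 0; s < (int)left.size(); s++)
        res[s] = (res[s] + left[s]) % MOD;
    // 3) U_1 ⊕ V_1: both halves were divided by x^{mid+1}, so shift by x^{2(mid+1)}
    vector<ll> right = oplus(U1, V1);
    for (int s = 0; s < (int)right.size(); s++)
        res[s + 2 * (mid + 1)] = (res[s + 2 * (mid + 1)] + right[s]) % MOD;
    return res;
}
```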
With our improvement, for degree $$$d$$$ we do $$$O(d \cdot \log(d))$$$ work & recurse twice with degree $$$\lfloor \frac{d}{2} \rfloor$$$, hence the total complexity of the operation (by standard divide-and-conquer analysis) is $$$O(d \cdot \log^2(d))$$$.
To summarize, we can find the coefficients of the polynomial $$$G(x)$$$ by calculating $$$Y(x) \oplus K(x)$$$ via the method we've presented in $$$O(n \cdot \log^2(n))$$$, and we can use them to calculate $$$S_W$$$, so we have (finally!) arrived at an algorithm that finds the expected value we desire in $$$O(n \cdot \log^2(n))$$$.
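Finally, a sketch tying all the pieces together (the driver name `solve` is mine; it returns the answer to the original problem, undoing the "empty $$$T$$$ allowed" relaxation from the beginning of the post):

```cpp
// a: the array (values reduced mod MOD); P: the sorted special positions
ll solve(const vector<ll>& a, const vector<int>& P) {
    int m = P.size();
    init_pows(2 * (int)a.size() + 1);
    init_prefix(a);   // g, gu, gd
    init_ABW();       // A, B, W
    // Y and K as coefficient vectors: y_{P_i} = 2^{i-1}, k_{P_i} = 2^{-i}
    vector<ll> y(n, 0), k(n, 0);
    for (int i = 0; i < m; i++) {
        y[P[i]] = p2(i - 1);
        k[P[i]] = p2(-i);
    }
    vector<ll> Gc = oplus(y, k); // coefficients of G(x), degree 2n-2
    ll SW = 0;
    for (int e = 0; e <= 2 * n - 2; e++)
        SW = (SW + Gc[e] * Warr[e]) % MOD;
    ll E = (compute_SA(P) + compute_SB(P) + SW) % MOD;
    for (int i = 0; i < m; i++) { // the pcost / scost part, O(m)
        E = (E + p2(-(i + 1)) * PCOST(P[i])) % MOD;
        E = (E + p2(-(m - i)) * SCOST(P[i])) % MOD;
    }
    // divide by p(T nonempty) = 1 - 2^{-m} to recover the original expectation
    ll pne = ((1 - p2(-m)) % MOD + MOD) % MOD;
    return E * pw(pne, MOD - 2, MOD) % MOD;
}
```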
$$$\blacksquare$$$