[HELP] What is the proof of this greedy solution ? - Codeforces

→ Pay attention

Before contest
Codeforces Round 1006 (Div. 3)
2 days
Register now »

→ Top rated

#	User	Rating
1	tourist	3856
2	jiangly	3747
3	orzdevinwang	3706
4	jqdai0815	3682
5	ksun48	3591
6	gamegame	3477
7	Benq	3468
8	Radewoosh	3462
9	ecnerwala	3451
10	heuristica	3431

Countries | Cities | Organizations

→ Top contributors

#	User	Contrib.
1	cry	167
2	-is-this-fft-	162
3	Dominater069	160
4	Um_nik	158
5	atcoder_official	157
6	Qingyu	156
7	djm03178	152
7	adamant	152
9	luogu_official	151
10	awoo	147

View all →

→ Find user

→ Recent actions

Detailed →

abcdqwerty12345's blog

[HELP] What is the proof of this greedy solution ?

By abcdqwerty12345, history, 3 years ago, In English

In English

Basically you are given an array and you have to keep removing any two unequal elements until the size of the array is minumum.

In the editorial it's said:- We get the following greedy solution — each time we take two characters with maximal occurrences number and delete them.

Can anyone plz tell why this solution always give the optimal answer ? Like why everytime we are selecting the elements with maximum frequency?

Thx a lot !!!

+9

abcdqwerty12345
3 years ago
4

Comments

Comments (2)

Show archived | Write comment?

»

3 years ago, # |

← Rev. 2 →

Vote: I like it

+3

Vote: I do not like it

I have come across this lemma as a subproblem in harder problems so I think its nice to write down a formal proof of this fact.

First this problem is equivalent to the following problem:

Given an array $$$a[]$$$, find the maximum number of disjoint pairs of indices $$$(i, j)$$$ such that $$$a[i]\neq a[j]$$$ and each index is in at-most one pair.

Let $$$f[i]$$$ denote the frequency of the value $$$i$$$ in the array. Let's sort $$$f[]$$$ in increasing (non-decreasing) order. There are two cases:

Case 1: $$$f[n]\geq f[1]+f[2]+...+f[n-1]$$$

For this case, first of all note that the maximum number of possible pairs is $$$f[1]+f[2]+...+f[n-1]$$$. It cannot exceed that because in each pair, elements need to be distinct. So we need at least one of $$$f[1], f[2],....f[n-1]$$$ in each pair. And moreover, just by pairing off $$$f[n]$$$ with others in each pair, we can achieve this maximum. If we keep pairing the maximums, easy to see that $$$f[n]$$$ always remains as the maximum and we pair off each $$$f[i]$$$ from $$$1$$$ to $$$n-1$$$ with $$$f[n]$$$.

Case 2: $$$f[n]< f[1]+f[2]+...+f[n-1]$$$

Here, I claim the maximum is $$$floor((f[1]+f[2]+..f[n])/2)$$$.

It is true, since the maximum cannot exceed this, as this is the case where every element gets divided into pairs (except possibly $$$1$$$, if total number of elements is odd).

Moreover, let's prove that the greedy method of pairing off two largest finds this maximum:

we proceed by induction on sum of elements.

We have $$$f[n]<=f[1]+f[2]+...+f[n-1]$$$ -(A)

Inductive hypothesis: if above inequality holds true, we have the number of pairs = $$$floor(sum/2)$$$ by the greedy approach.

Lets pair off $$$1$$$ $$$f[n]$$$ and $$$1$$$ $$$f[n-1]$$$.

Case 1: $$$f[n]$$$ remains the maximum

The statement (A) still holds as $$$f[n]-1<=f[1]+f[2]+...f[n-1]-1$$$ and the sum of elements has decreased by $$$2$$$. We can thus apply inductive hypothesis to get the maximum number of elements as $$$floor((sum-2)/2)+1$$$ which is $$$floor(sum/2)$$$.

Case 2: $$$f[n-2]$$$ becomes the maximum.

This is possible only if $$$f[n]=f[n-1]=f[n-2]$$$. In that case, if $$$f[n]>=2$$$, $$$f[n-2]<=f[n-1]-1+f[n]-1$$$ which means $$$f[n-2]<=f[1]+f[2]+...+f[n-1]-1+f[n]-1$$$. Again we can apply inductive hypothesis as above and get the result.

→ Reply

»

3 years ago, # |

Vote: I like it

+6

Vote: I do not like it

There is definitely a lack of good resources on greedy algorithms.

It seems that people consider that the solution is obvious, but often it is not.

Hope someone can fix this issue.

→ Reply