Tomorrow (January 23rd) at 10:00 AM PST / 21:00 MSK there will be the second round of the Facebook Hacker Cup. Don't miss it!
I still don't know how to get to the dashboard from the official page (http://facebook.com/hackercup) through the links. Can one of the officials who spend time on Codeforces teach me how to get there in a deterministic manner? Maybe a video tutorial? Or maybe you could just make the design human-friendly and put a big link to it somewhere on the main page? After all, it is the most important link people want to see on the competition page :(
For those having the same headache getting there, here is a link to the list of all rounds: link.
Let's discuss problems here.
"10:00 AM PST / 22:00 MSK"
Nice trick, but I'm sure it should be 21:00 MSK.
Hmm, looks like you are right. The first link that came up when I googled "PST to MSK" is incorrect, showing Moscow time as UTC+4 :( Thanks!
It would also be better to have a link to the dashboard in the e-mail.
Also, the top 200 will advance to Round 3 and the top 500 will receive an official T-shirt.
Getting to the dashboard is part of the challenge. It's like a round 1.5.
We aim to always keep you on your toes.
I have a question about the rules. Is it fine to submit publicly available code authored by others? More specifically, I'm interested in code snippets from e-maxx.ru
Any code written before a round starts is fair game. I would treat it the same as codebook code.
Definitely nobody should be pointing people to public code during a round, since that constitutes communication which is disallowed. But certainly anything preexisting that you're aware of or discover for yourself during a round is OK.
If you have any concerns about any submissions, feel free to message me.
Round 2 link : https://www.facebook.com/hackercup/round/807323692699472/
This link has always had the current round's dashboard accessible.
Yes, I'm also using it and I've provided it in the post body.
Oops, misread that
The past rounds page didn't have Round 2 when I posted the direct link.
But if you click on Instruction, it'll get you there.
Oh ok, I didn't know that!
Sad, I wasted too much time on C. Only 30 minutes left for A, and I couldn't solve it in time :( Was it greedy for A?
I won't say much before the results are announced, but I used DP on it: a DP asking how many times you need to recolour if you start from letter i.
C was difficult to code — especially since even small subtasks were problems in themselves (like calculating the sum of squared differences over all pairs with N = 200,000), and I am still not good at quickly producing tree structures :(
I don't know whether it is correct, but I think I did it in O(n log n) by storing the sum, the sum of squares, and the number of equal heights found so far, and by pruning whenever a ladder of greater height came, all within a map. It seems to work correctly for all the random test cases that I made.
Edit: It's AC :D
True, you don't really need a tree there. It should be faster to code.
I solved it using stack, it's easy to code.
Now I see that it works. S#$@ happens, sometimes the first idea is much more complex than needed. :(
Know that feel bro. :(
I also used a stack. Though the hardest part for me was finding a way to calculate the sum of squared differences over all pairs of n numbers in less than O(n^2) time.
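For reference (not from the thread itself): since the differences are squared, the absolute value doesn't matter and the sum expands to a closed form, sum over i < j of (a_i - a_j)^2 = n * sum(a_i^2) - (sum a_i)^2, which is a single O(n) pass. A minimal sketch (names are mine):

```cpp
#include <vector>

// Sum of (a[i] - a[j])^2 over all pairs i < j in O(n), using
// n * sum(a[i]^2) - (sum a[i])^2.  __int128 guards against overflow
// of the squared total sum for large inputs.
__int128 sum_of_squared_pair_diffs(const std::vector<long long>& a) {
    __int128 s = 0, sq = 0;
    for (long long x : a) {
        s += x;
        sq += static_cast<__int128>(x) * x;
    }
    return static_cast<__int128>(a.size()) * sq - s * s;
}
```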
If you iterate over the middle split (the point that Jack & Jill never cross -- there is one), then it's greedy from that point outwards for both players -- once you encounter a difference you have to paint it over. However, you need to implement that fast enough, as it is O(N^2) naively.
273 full scores? I hope there's going to be a lot of fails.
UPD: 177. With an average 1 hour submission time for passing.
Do your part first :)
The (probably correct, as I've compared them with some other contestants') solutions and answers for all problems. In each directory, 02 is a test and 02.r is the answer for it.
Link.
So, what is the intended solution for D? It's not straightforward min-cost max-flow over all O(nc^2) transitions (which makes the total running time O(nc^5)), is it?
Using the Hungarian algorithm instead of MCMF results in O(nk^4).
Sure, but there must be some better solution than applying MCMF while doing the DP.
First of all, it is O(nc^4), because you shouldn't run an assignment-problem solution on a vertex whose degree is larger than c (so the worst test is one where there are n / c vertices of degree c, and the running time will be O(nc^4)).
Second, you can use the Hungarian algorithm instead of min-cost flow. That makes the solution almost impossible to fail, since I don't see a way to make an anti-test for the Hungarian algorithm appear often by adjusting the input data in this problem.
OK, so no extra tricks — I had all of these written. Looks like I implemented both of them terribly badly. It takes about 10-15 seconds per test on my not-so-outdated PC. 8 cores still solve the problem, but I'm not happy about that approach.
The expectation that my code would run slowly and be borderline on the time limit failed me hard. I waited the full 6 minutes for the output, and because I was piping the output I didn't realize it was stuck even on the 1st case, so I didn't notice the initialization bug, which I then couldn't find by manual tests in time.
Lesson learned: never pipe.
http://man7.org/linux/man-pages/man1/tee.1.html
You can pipe into the tee command, like this: python solution.py | tee output.txt. You will see the output in the terminal as it is computed, while it is also written to output.txt.
Because of this I always print the number of solved tests to stderr after each test.
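For what it's worth, a tiny sketch of that habit (my own illustration, not anyone's actual contest code): answers go to stdout, which may be piped or redirected, while progress goes to stderr so it is always visible in the terminal.

```cpp
#include <cstdio>

int main() {
    int t;
    if (std::scanf("%d", &t) != 1) return 1;
    for (int tc = 1; tc <= t; ++tc) {
        // ... read this test case, solve it, print the answer to stdout ...
        std::printf("Case #%d: ...\n", tc);            // goes into the output file
        std::fprintf(stderr, "solved %d/%d\n", tc, t); // progress stays on screen
    }
    return 0;
}
```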
So, again I had a 'passing' solution in my mind, but didn't write it because the calculated complexity seemed quite high (well, even O(nc^4) seems like a lot).
Min-cost matchings and flows never cease to amaze me :)
Why nc^2? You only need the color of the parent, and you can solve odd and even vertices separately.
0_0
How are odd and even vertices related to this problem?
Because odd and even vertices can be colored independently. By odd and even, I mean, if you 2-color the tree, odd vertices are the vertices of one color and even are the vertices of the other color.
Very cool! So the complexity of a solution becomes O(nc^3), right?
I think that if the solution is optimized well enough, the complexity would be O(nk^2): the worst case is n / k vertices of degree k, each processed in O(k^3) time. To process a vertex: first, use the Hungarian algorithm to find an optimal assignment of colors to children in O(k^3). Then, it is necessary, for each color, to find an assignment which doesn't use that color. To do this, start with an optimal assignment and remove the vertex which corresponds to that color. After that, one child may have no color assigned to it. In this case, it is necessary to find a single augmenting path, which can be done (I think) in O(k^2) time, so the overall complexity of this step is also O(k^3).
Yes, I always have trouble with the speed of assignment problems too. Luckily this time the testcases were small but my solution will fail if the testcase is 30 * max case.
What do you call the max case? I've generated a test with T = 1 that is probably the worst for my solution (it is called 43 in the archive I provided above), and it runs in 0.1 s on my laptop.
Edges between i+1 and i/30*30. My solution took 50s for this case, but I know my MCMF is terribly slow.
I've generated edges between i and i / 29 + 1 for all i >= 2. My laptop processes both my test and yours 30 times in ~10 s.
Could someone explain the idea behind D?
Standard tree DP — one choice for states is "edge, colours of its endpoints -> minimum cost for assigning the colours in the subtree of its lower endpoint" (can be optimised by processing odd and even vertices independently, so one of the colours doesn't matter), where you can always try paying P and summing up the minima of its sons, or avoid paying P — in which case you have the problem of assigning distinct colours to sons which minimise the sum of costs (computed by that DP). That's mincost maxflow and can be solved with Hungarian algorithm. See above for optimisations.
How do you correctly compute the probability, without getting -nan(ind)?
I used logarithms.
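Presumably something like the following (my guess at what "logarithms" means here, not the commenter's actual code): keep each term C(n, k) * p^k * (1-p)^(n-k) as a logarithm, using lgamma for the factorials, and exponentiate only at the very end, when the value is back in a safe range.

```cpp
#include <cmath>

// log( C(n, k) * p^k * (1 - p)^(n - k) ) computed entirely in log space,
// so neither the binomial coefficient nor the tiny powers over/underflow.
// Assumes 0 < p < 1; handle p == 0 and p == 1 separately to avoid log(0).
double log_binom_prob(int n, int k, double p) {
    return std::lgamma(n + 1.0) - std::lgamma(k + 1.0) - std::lgamma(n - k + 1.0)
         + k * std::log(p) + (n - k) * std::log(1.0 - p);
}
```

The probability itself is exp(log_binom_prob(n, k, p)), which is safe to take at the end because the final value lies in [0, 1].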
Define P[n][k] as the probability of getting exactly k "heads" when throwing n coins. Now you can do an O(N^2) DP like this:
- if n == 1, P[1][0] = 1 - p and P[1][1] = p
- else, P[n][k] = p * P[n-1][k-1] + (1-p) * P[n-1][k]
The remaining part of the problem can be solved using linearity of expectations.
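A minimal sketch of that recurrence (the names and the P[0][0] = 1 base case, which reproduces the n == 1 case above, are mine):

```cpp
#include <vector>

// P[n][k] = probability of exactly k heads in n tosses of a coin that lands
// heads with probability p.  O(N^2) time; keep only two rows if memory matters.
std::vector<std::vector<double>> heads_probabilities(int N, double p) {
    std::vector<std::vector<double>> P(N + 1, std::vector<double>(N + 1, 0.0));
    P[0][0] = 1.0;                       // gives P[1][0] = 1 - p, P[1][1] = p
    for (int n = 1; n <= N; ++n)
        for (int k = 0; k <= n; ++k) {
            P[n][k] = (1.0 - p) * P[n - 1][k];
            if (k > 0) P[n][k] += p * P[n - 1][k - 1];
        }
    return P;
}
```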
D was quite a bit of work; I submitted a few minutes before the end but doubted it was correct (edit: yay, it passed). I used dynamic programming, where I had to do weighted matching multiple times at each vertex to get the answer. Did anyone have a nicer solution?
If you think that's hard to code, you must not have seen the "Gaussian elimination and matrix exponentiation inside a segment tree" stunts that they pulled last year. :P
How do you solve problem B? We can use BigDecimal to calculate binomial coefficients and powers of p and (1-p), but it is too slow (TLE).
You can calculate expressions of the form C[n][k] = C(n, k) * p^k * (1 - p)^(n - k) using the recursive formula C[n][k] = C[n - 1][k - 1] * p + C[n - 1][k] * (1 - p). This solution doesn't contain any divisions, and all values stay in [0, 1], so it won't lead to overflow.
I did this, but I still worry about repeatedly adding big numbers to small ones. Maybe it's not an issue for 3,000 items.
I used logarithms (stored in double variables).
You can also use dynamic programming: if P(A, B) is the probability of getting exactly A heads in B throws, where 0 ≤ A ≤ B, then $P(A, B) = p \cdot P(A - 1, B - 1) + (1 - p) \cdot P(A, B - 1)$ with $P(0, 0) = 1$, where $P(x, y) = 0$ if x > y or x < 0 or y < 0, etc.
According to the scoreboard, 273 people submitted all four problems. I wonder how many of them will actually get full score.
UPD: results are available, 177 people have full score.
No more than 272. :D
That was a really horrible contest, in fact: 3 goddamn simple problems, a long double solution passed in B, and "copy-paste the Hungarian algorithm from e-maxx" worked for the straightforward D.
nah, even double works for B...
Almost forgot: an O(n^2) solution works for C.
Solution: http://pastebin.com/3Jxp0wbW
It runs in 2:30 on my laptop with 1 thread.
There are only 3 tests with n=200000, so no surprise.
That's the cost of changing the rules in the middle of the contest. Minus one T-shirt. I finished 567th, and I'm 100% sure that if the rules hadn't been changed and only 560 people had participated in Round 2, I would easily have gotten into the top 500.
I agree it is fairer this way, but... changing the rules in the middle of a competition is not fair anyway.
Strange response from you guys. If you think that my statement is factually incorrect — it is correct, and very easy to prove if it's not obvious. If you are not happy that somebody was hurt by the change in rules — sorry, it might be a revelation, but when somebody gains, somebody loses. If you think that getting into the top 500 in Round 1 means less than getting into the top 500 in Round 2 — that's fine for you to think. But it's not about what you think, it's about what the rules are, and the rules were changed in the middle of the contest.
As I remember, the Round 1 rules were changed before Round 1 started. Seems fair.
Besides that, you can't know what place you would have gotten in Round 1 if the rules hadn't been changed — I guess a lot of people solved no more tasks than they needed to proceed according to the new rules. So you could easily have not passed to Round 2 at all.
I agree that on the 22nd the people who were the strongest on the 22nd won — no arguing about that. But below the few top places there is actually a lot of randomness involved, so modifying the rules does modify the end result.
The argument about a lot of people who decided to solve just 2 easy problems (1, 2, 3) or one hard problem in Round 1 to get through — I don't buy it. The smallest quirk can kick you out of the competition, so why risk it? Even for those people who are absolutely sure that they don't have even a minimal mistake in their solution, it would take only another 15 minutes to solve a third problem "just in case". So I don't think there were more than a handful of people following that strategy (solving fewer than three problems, because that's enough to proceed).
Because the problems are not that interesting?
Why risk it? Because one doesn't care that much and doesn't have that much time/energy?
I was one of the "handful of people" who followed that strategy. I figured that if I wouldn't be able to solve problem D in Round 1 flawlessly in one attempt, then I wouldn't be worthy of Round 2. Now, I didn't participate in Round 2 anyway (because of lack of time and motivation), but still, the rules this year were way more balanced than last year's.
Last year, you were more or less guaranteed a T-shirt if you managed to qualify for Round 2, which (largely) only required enough time to verify thoroughly that your Round 1 solutions were correct. So if we want to reward people who can solve problems (quickly and often correctly), rather than people with a lot of time to spare, then a lower threshold in Round 1 is definitely the way to go. (And seriously, a Round 2 with 600-700-something participants, where 500 (actually 550 last year) get a T-shirt, is not balanced in any way.)
I figured that if I wouldn't be able to solve problem D in Round 1 flawlessly in one attempt, then I wouldn't be worthy of Round 2.
That's a bit too harsh, mistakes happen. Just look at how many excellent contestants got eliminated in Round 2 by having one problem fail. I have also seen red coders fail as many as 3 problems in the same round...
I failed D because I ran DFS from vertex 1 instead of 0 to test if the answer stays the same, then forgot to change it back. Then I had all cases correct except for the single-vertex case :c
It does mean less — in Round 1 you passed on points, so if you wanted to pass you could just start the contest ~20 hours after it began, do the first two problems and call it a day — which I assume a lot of people did. Now that your actual ranking matters, these people will start on time and do all the problems, so it makes sense that it's harder to get into the top 500.
Test cases on B may be weak; I passed by computing all binomials and powers of the probability, p. Of course, even though I stored these values in a long double, I would still expect precision problems to arise.
I think that before you go around claiming that the testcases are weak, you should find a testcase where long doubles actually are not precise enough. Saying the testcases are weak because you 'expect' them to be is a bit strange.
3000 choose 1500 exceeds the range of long double. Also I said testcases might be weak.
I don't see how the fact that 3000 choose 1500 exceeds the range of a long double immediately guarantees that there exists some testcase where using long doubles causes an error of 10^-6 or more.
Also, I'm sorry, I didn't see the "may" earlier.
I checked and it doesn't exceed long double on my machine.
I was going by these standards. Why would the range of long double vary from computer to computer?
Visual C++ thinks that long double is the same as double; GCC does not — it uses the 80-bit extended format (as used internally by the CPU). AFAIK, the standard does not require long double to be really "long", just not shorter than double.
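A quick way to see which flavour of long double a compiler gives you (just an illustration, not from the thread):

```cpp
#include <iostream>
#include <limits>

int main() {
    // C(3000, 1500) is roughly 10^901: far beyond a 64-bit double (max ~1.8e308),
    // but comfortably inside GCC's 80-bit long double (max ~1.2e4932).
    std::cout << "sizeof(long double) = " << sizeof(long double) << '\n';
    std::cout << "double max          = " << std::numeric_limits<double>::max() << '\n';
    std::cout << "long double max     = " << std::numeric_limits<long double>::max() << '\n';
    return 0;
}
```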
Ok, classic T-shirt giveaway. First reply = free T-shirt.
jk i'm getting one
Well done. PM me your address please.
I meant I won one in the contest. I don't really want it, I just felt like being first (sorry). I like the Prodigy picture, by the way.
So it's still on, I guess?
A t-shirt in exchange for an address? Deftly done, Bruce Schneier.
++
Congratz, you know what to do next :)
first?
me
First
Internet Explorer?
C was a nice problem, but the test input is weak... After the contest, I implemented a simple brute force solution and noticed that it solved the input in 3 minutes, because almost all test cases were small.
In Gym: 2016 Facebook Hacker Cup, Round 2
Tests in D have been split to contain at most 5 test cases.
Brutal round :( Solving at least 3 problems (including the last one) is needed to advance :'(
I implemented the intended idea for problem D, but my program only fits into 6 minutes if I split it into 3 processes on a 5-year-old computer. I only fixed a bug with 2 minutes left, and the program didn't finish before the end of the contest.
I think the MCMF implementation I used is reasonably fast (http://ideone.com/OREX0W); I don't understand why the limits were so tight.
I think the fastest MCMF is still much slower than Hungarian. My code runs in 20 s on my laptop (~4 times faster than Codeforces).
I didn't have Hungarian code at hand, I will check it out!
You were right. I only had time to try the Hungarian algorithm now, and switching just my MCMF to Hungarian makes my program solve the input in 13 s as opposed to several minutes. I didn't expect such a huge difference.
Well, you can speed up your solution K times by noticing that you can actually color vertices at even and odd distances from the root separately, and thus eliminate one of the parameters — the color of the current node.
Yeah, I've seen that idea in Egor's comment.
My 'complaint' is that the editorial explains the solution the way I did it, with the current label as one of the state parameters.
Well, if you put it that way... Your code runs in one and a half minutes on my notebook. It can be reduced by a further half a minute by replacing the set with a priority queue. And did you compile it with optimisations? Without them it indeed runs in 7-8 minutes.
Yes, I use -O2. I picked this version from my library because it's shorter code and has been efficient enough for contests.
I didn't think my laptop was that slow. Ouch!
There were only 1666 participants in this round. Why doesn't Facebook use an online judging system, at least from this round on? It could help people like mogers.
It gives people the freedom to use whatever tools they prefer.
The context was your slow laptop, and I'm showing the benefits of judging all participants' code on the same judging system. Nobody would feel unlucky then, despite having written a correct solution.
There are two main reasons Hacker Cup and GCJ use the run-the-code-yourself format:
1) People can use basically any language they want, and any libraries they want. This affords a lot of flexibility that encourages more participation.
2) We don't have to maintain a judging system. Ask MikeMirzayanov about running Codeforces. It's a ton of work. Hacker Cup and GCJ are side projects that exist largely as labours of love.
The main disadvantage of this format is that people have different hardware and different internet connections. We try to err on the side of giving more than sufficient time. For the problem in question, we have a judge solution in Java that runs in 6 seconds on a 3-year-old mid-level laptop. I think it does O(NK) passes of flow, and we have another solution with O(NK^2) passes of flow that takes about 30 seconds.
I just checked how my code would behave with jqdai0815's MCMF code. It solves the input data in 50 s on my computer.
I would have qualified for Round 3 with this. Lesson learned.
MCMF can be used to solve weighted bipartite matching? Do you have any article about that? I thought Hungarian was the only way?
Yes. Maximum Flow solves the unweighted version and MCMF the weighted one. The topcoder tutorial on MCMF explains this application.
I do not have any article, but here is the general idea.
You have a bipartite graph with costs assigned to the edges. Let's also assign capacity equal to 1 to all of these edges. Let's create two fictitious vertices — a source and a sink. Connect the source to all of the vertices of the left part of the graph using edges with capacity = 1, cost = 0. Connect all of the vertices of the right part to the sink using edges with capacity = 1, cost = 0. Find the MCMF between the source and the sink.
All of the edges in this description are directed.
That's it.
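A sketch of that construction in code (assuming a hypothetical add_edge(u, v, cap, cost) helper from whatever MCMF template you use, which also creates the reverse edge with capacity 0 and cost -cost):

```cpp
#include <vector>

// Assumed to come from your MCMF template: adds a directed edge u -> v with the
// given capacity and cost, plus the usual reverse edge (capacity 0, cost -cost).
void add_edge(int u, int v, int cap, long long cost);

// Weighted bipartite matching as MCMF, following the construction above.
// Left vertices are 0 .. nL-1, right vertices are nL .. nL+nR-1,
// the source is nL+nR and the sink is nL+nR+1.
void build_assignment_network(int nL, int nR,
                              const std::vector<std::vector<long long>>& cost) {
    const int source = nL + nR, sink = nL + nR + 1;
    for (int i = 0; i < nL; ++i) add_edge(source, i, 1, 0);      // source -> left
    for (int j = 0; j < nR; ++j) add_edge(nL + j, sink, 1, 0);   // right -> sink
    for (int i = 0; i < nL; ++i)
        for (int j = 0; j < nR; ++j)
            add_edge(i, nL + j, 1, cost[i][j]);                  // matching edges
    // Running min-cost max-flow from source to sink now yields the
    // minimum-cost matching of maximum size.
}
```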
Thank you both, mogers and Fcdkbear. So my understanding is: we create a flow graph where the capacity of each edge is one, the cost from left to right is the assignment cost, and the cost from right to left is the negative of the assignment cost; then we run Dijkstra instead of a normal BFS to compute the MCMF. Is that correct?
That's not completely right.
1) You've forgotten about the source and sink vertices.
2) The capacity of the reversed edges (edges from right to left in your terminology) should be zero (as in a usual flow network).
3) You can't just run Dijkstra's algorithm, because it doesn't work with negative weights. Some people use modifications of Dijkstra's algorithm. I usually use the Bellman-Ford algorithm (an implementation based on a queue) for this.
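For reference, a minimal sketch of the queue-based Bellman-Ford (often called SPFA) mentioned above; the names and interface are illustrative only, and it assumes no negative cycle is reachable from the source, which holds for the residual graphs in successive-shortest-path MCMF.

```cpp
#include <limits>
#include <queue>
#include <utility>
#include <vector>

// Queue-based Bellman-Ford (SPFA): shortest distances from src in a graph that
// may contain negative edge costs but no negative cycles reachable from src.
// adj[u] holds pairs (v, cost of edge u -> v).
std::vector<long long> spfa(int n, int src,
                            const std::vector<std::vector<std::pair<int, long long>>>& adj) {
    const long long INF = std::numeric_limits<long long>::max() / 4;
    std::vector<long long> dist(n, INF);
    std::vector<char> in_queue(n, 0);
    std::queue<int> q;
    dist[src] = 0;
    q.push(src);
    in_queue[src] = 1;
    while (!q.empty()) {
        int u = q.front(); q.pop();
        in_queue[u] = 0;
        for (const auto& [v, w] : adj[u]) {
            if (dist[u] + w < dist[v]) {
                dist[v] = dist[u] + w;                 // relax the edge
                if (!in_queue[v]) { q.push(v); in_queue[v] = 1; }
            }
        }
    }
    return dist;
}
```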
Yup, thanks Fcdkbear, you cleared up all my doubts, thanks a lot :)
Oh no, I did C ALMOST correctly, I just forgot one last mod. However, even if I had gotten that, I would still not have gotten a T-shirt (around rank 560 if C had been correct). This year there are too many contestants (and too many excellent contestants); I read that last year you just needed 1 problem to be in the top 500.
Anyway, great problems, and great experience :D
Damn. I totally forgot about Hacker Cup. I wish they would send a mail 3-4 hours before the round starts :(
Did anybody receive their tshirt yet?
Still no one?
Australia has it by now so I can only conclude you live on the moon.
I got it about two months ago; maybe you should check with wjomlex.