Why so mainstream? featuring data structures

№	Пользователь	Рейтинг
1	tourist	4009
2	jiangly	3823
3	Benq	3738
4	Radewoosh	3633
5	jqdai0815	3620
6	orzdevinwang	3529
7	ecnerwala	3446
8	Um_nik	3396
9	ksun48	3390
10	gamegame	3386

№	Пользователь	Вклад
1	cry	167
2	Um_nik	163
3	maomao90	162
3	atcoder_official	162
5	adamant	159
6	-is-this-fft-	158
7	awoo	157
8	TheScrasse	154
9	Dominater069	153
9	nor	153

Seriously, why so serious? And why so mainstream? Competitive programming is all about fun, therefore...

Cutting to the chase

I'll talk about something that goes like that: everyone talks about it, few really know how to do it, everyone thinks everybody else knows how to do it, so everyone claims they know how to do it.Not teenage sex, but understanding data structures. In the sense of the metaphor before, I am close to a virgin but I still can give you some insight as there are some structures that are my type.

Don't get me wrong, I am not talking about mainstream stuff like segment trees, but not so usual data structure that tend to be implemented and modeled exactly the same way each time or even not implemented(cause STL has them). So, we're gonna talk about tries and heaps.

The good ol' dictionary

We'll try to implement a structure that can handle the following queries:

add(w) — add an apparition of word w in the data structure
delete(w) — delete an apparition of word w in the data structure
ask(w) — get the number of apparition of word w in the list
lcp(w) — get a word from the data structure that has the longest common prefix with string w

Let's ignore the 4th type of query.

Now what's the simplest way to do this? Keep a list an loop through it. Nothing wrong with that, but we need faster stuff. The obvious thing that a programmer would think at is giving the words some order so that the search is faster. This is exactly what STL's unordered_map does. So a map would to the business. Supposing you want to implement it yourself, you should find a nice hashing function the problem might be solved. OK, I'll go home, we're done.

Give it a trie

At this moment we have a decent data structure but we did not solved the 4th query. Also, we did not use one information: we are working with strings. Now we need some observation to go on: if some strings have a common prefix, the prefix should be stored only once and we would need to somehow point to more places from that position.

Now the observation, even if pertinent, does not give any simplification because we are talking about prefixes, so chunks of strings. A simple man, like me, would start with baby steps.

For starters, we'll make from each letter a node and we will bound it to the next word in the string. Suppose we introduced words "awesome" and "awful".

[a] -> [w] -> [e] -> [s] -> [o] -> [m] -> [e]
[a] -> [w] -> [f] -> [u] -> [l]

Now put the common prefix together and we have:

[a] -> [w]  -> [e] -> [s] -> [o] -> [m] -> [e]
            -> [f] -> [u] -> [l]

That tree structure looks quite nice, because if we will have something like that and a nice method of accessing each element we could solve this in something close to O(length * accessing) where O(accessing) is the time of the access method we choose.

Let's choose how we store data and access it. Store a root node that has no character in it — this will be the start point in the tree. Say we have only 26 chars, then we can keep a vector of pointers and the current char in each node. That's how our model looks now:

Your text to link here...

The complexities are O(length) for each operation due to the, but the memory is quite big as it can go to maximum O(total_length * SIGMA) where SIGMA is the total number of chars.

Rev.	Кто	Когда	Δ	Комментарий
en13	DanAlex	2016-02-07 15:37:39	5	Tiny change: 's on it._ \n\n### Th' -> 's on it._ [cut]\n\n### Th'
en12	DanAlex	2016-01-25 01:09:21	2	Tiny change: ' string:\n- $s$ > ' -> ' string:\n\n- $s$ > '
en11	DanAlex	2016-01-25 00:49:36	0	Tiny change: 'u enjoyed!' -> 'u enjoyed!\n\n### ' (published)
en10	DanAlex	2016-01-25 00:47:19	480
en9	DanAlex	2016-01-25 00:42:53	482
en8	DanAlex	2016-01-25 00:37:21	903
en7	DanAlex	2016-01-25 00:27:54	966
en6	DanAlex	2016-01-25 00:17:31	1867
en5	DanAlex	2016-01-25 00:14:16	3	Tiny change: 'm $O(total_length * SI' -> 'm $O(totalLength * SI'
en4	DanAlex	2016-01-25 00:13:47	25	Tiny change: ' now: \n\n[Your text to link here...](http://w' -> ' now: \n\n![ ](http://w'
en3	DanAlex	2016-01-25 00:13:20	1449
en2	DanAlex	2016-01-24 23:59:10	12
en1	DanAlex	2016-01-24 23:58:48	2148	Initial revision (saved to drafts)

Cutting to the chase

The good ol' dictionary

Give it a trie

Trie harder

История