Any optimzations to this problem ? - Codeforces

→ Обратите внимание

До соревнования
Rayan Programming Contest 2024 - Selection (Codeforces Round, Div. 1 + Div. 2)
5 дней
Зарегистрироваться »

*есть доп. регистрация

→ Лидеры (рейтинг)

№	Пользователь	Рейтинг
1	tourist	3993
2	jiangly	3743
3	orzdevinwang	3707
4	Radewoosh	3627
5	jqdai0815	3620
6	Benq	3564
7	Kevin114514	3443
8	ksun48	3434
9	Rewinding	3397
10	Um_nik	3396

Страны | Города | Организации

→ Лидеры (вклад)

№	Пользователь	Вклад
1	cry	167
2	Um_nik	163
3	maomao90	162
3	atcoder_official	162
5	adamant	159
6	-is-this-fft-	158
7	awoo	156
8	TheScrasse	154
9	Dominater069	153
10	nor	152

Всё →

→ Найти пользователя

→ Прямой эфир

Детальнее →

Блог пользователя d4rkc0de

Any optimzations to this problem ?

Автор d4rkc0de, история, 9 лет назад, По-английски

По-английски

Hello CF's . I was trying to find some optimizations to the this problem

So we have n strings , we are trying to find all rare subset of letters of theses strings.

a rares subset is a subset of letters x of a string s, such that the number of appearances of x in all strings is strictly lower than a number min_appear.

example :

let's say we have 3 strings and min_appear = 2

abcd abc cde

than the answer is e,ad,bd,ce,de,abc,abd,bcd,abcd,cde (because the number of their appearances = 1 < 2)

these are all the possible subset of letters and their number of appearances { a=2 ,b=2 ,c=3 ,d=2 ,e=1 ,ab=2 ,ac=2 ,ad=1 ,bc=2 ,bd=1 ,cd=2 ,ce=1 ,de=1 ,abc=2 ,abd=1,bcd=1,abcd=1,cde=1}

The best solution i come with have a n*2^(length of taller string) , i'm wondering if there is a solution with a better complexity.

UDP : Sorry i forgot about subsets of length 3 and 4 . also each two different letters from each string are distinct.

Теги

optimization

0

d4rkc0de
9 лет назад
4

Комментарии

Комментарии (4)

Написать комментарий?

»

9 лет назад, # |

Проголосовать: нравится

0

Проголосовать: не нравится

We need some more details about this task. Either you're not asked to report all subsets (which I understood as subsequence?) but count them or it must have some pretty low constraints, such as in string length, number of different characters, or min_appear.

Suppose the case where there's only one string s = "abc...xyz". There are $\text{[math]}$ distinct subsequences of length k. Let's also say min_appear = |s| + 1.

Then reporting all this subsequences would take $\text{[math]}$ characters which for |s| = 26 is about 872 million. A bit too much, don't you think?

→ Ответить

»

»

9 лет назад, # ^ |

← Rev. 2 →

Проголосовать: нравится

+5

Проголосовать: не нравится

I think that by subsets, he means a subset of letters. In this case we have 2²⁶ subsets, and every string can be represented simply with a mask indicating whether it contains each letter or not.

A simple solution involving bitmasks would have complexity O(N * 2²⁶), but we would need to know more constraints to see if that's enough (I doubt it).

Anyway, I still don't understand why he didn't mention other possible subsets for that example test case, like "abc", "cde" or "abcd".

→ Ответить

»

»

»

9 лет назад, # ^ |

← Rev. 4 →

Проголосовать: нравится

0

Проголосовать: не нравится

I added the missing subsets , thanks for pointing it .

By a subset i mean a subset of letters .

I know about the O(N * 2^26) solution . i'm wondering if there is a better complexity ?

→ Ответить

»

9 лет назад, # |

← Rev. 4 →

Проголосовать: нравится

+3

Проголосовать: не нравится

It's obvious that if x is rare than all substrings that contain x are rare ( in your example e is rare so ce,de,cde are also rare ) , this is can be helpful not to generate all possible substrings in such case , so one we have found the first rare we can generate all it's ascendants, right ??

→ Ответить

Codeforces (c) Copyright 2010-2024 Михаил Мирзаянов

Соревнования по программированию 2.0

Время на сервере: 25.11.2024 17:28:26 (h1).

Десктопная версия, переключиться на мобильную.

При поддержке