#	User	Rating
1	tourist	3856
2	jiangly	3747
3	orzdevinwang	3706
4	jqdai0815	3682
5	ksun48	3591
6	gamegame	3477
7	Benq	3468
8	Radewoosh	3462
9	ecnerwala	3451
10	heuristica	3431

#	User	Contrib.
1	cry	167
2	-is-this-fft-	162
3	Dominater069	160
4	Um_nik	158
5	atcoder_official	156
6	Qingyu	155
7	djm03178	152
7	adamant	152
9	luogu_official	150
10	awoo	147

SPyofgame's blog

Can somebody hack my brute-forces solution ?

By SPyofgame, history, 3 years ago, In English

The Statement

Given an integer $$$n$$$ and a binary table $$$A[n][n]$$$

We are asked to find maximum $$$x$$$ that exist a binary table $$$B\left[\left \lceil \frac{n}{x} \right \rceil \right]\left[\left \lceil \frac{n}{x} \right \rceil \right]$$$

That this equation is satisfied $$$B\left[\left \lceil \frac{i}{x} \right \rceil \right]\left[\left \lceil \frac{j}{x} \right \rceil \right]$$$ $$$= A[i][j]\ \forall\ 1 \leq i, j \leq n$$$ (sorry but I dont know why the latex is broken like that)

The solution

So my approach is to check for all $$$v | n$$$ if there is exist such table.

Let call a square $$$(lx, ly)$$$ is a subtable $$$A[lx..rx][ly..ry]$$$ for

$$$\begin{cases} 1 \leq lx \leq rx \leq n\\ 1 \leq ly \leq ry \leq n\\ rx - lx + 1 = v\\ ry - ly + 1 = v\\ v\ |\ rx\\ v\ |\ ry\\ \end{cases}$$$

So there are $$$\frac{n}{v} \times \frac{n}{v}$$$ such subtables

Binary Table $$$B[][]$$$ will exist if and only if $$$A[x][y] = A[u][v]$$$ for all subtables $$$(lx, ly)$$$ and $$$lx \leq x, u \leq rx$$$ and $$$ly \leq y, v \leq ry$$$

My solution is just check for all $$$x\ |\ n$$$ from $$$n$$$ downto $$$1$$$.

If $$$x$$$ is satisfied then just simply output it.

The checking can simply be done using partial sum matrix.

The complexity will be $$$\underset{x|n}{\Large \Sigma} \left(\frac{n}{x} \right)^2 = n^2 \times \underset{x|n}{\Large \Sigma} \left(\frac{1}{x}\right)^2 = n^2 \times \frac{\pi^2}{6} = O(n^2)$$$

This give us a simple solution that run in $$$373 ms$$$.

https://codeforces.net/contest/1107/submission/138112085

int n;
bool a[LIM][LIM];
int b[LIM][LIM];
 
bool check(int x, int y, int u, int v) /// Check if this square is valid
{
    int sum = b[u][v] - b[u][y - 1] - b[x - 1][v] + b[x - 1][y - 1];
    return (sum == 0) || (sum == (u - x + 1) * (v - y + 1));
}
 
bool check(int scale) /// Check if the scaling is valid
{
    for (int lx = 1, rx = scale; lx <= n; lx += scale, rx += scale)
        for (int ly = 1, ry = scale; ly <= n; ly += scale, ry += scale)
            if (!check(lx, ly, rx, ry)) return false;
 
    return true;
}
 
int main()
{
    ios::sync_with_stdio(NULL);
    cin.tie(NULL);
 
    /// Input
    cin >> n;
    for (int i = 1; i <= n; ++i)
    {
        for (int j = 1; j <= n; j += 4)
        {
            char c;
            cin >> c;
 
            /// Decode a[i][j]
            int v = (c <= '9') ? c - '0' : c + 10 - 'A';
            a[i][j + 0] = v >> 3 & 1;
            a[i][j + 1] = v >> 2 & 1;
            a[i][j + 2] = v >> 1 & 1;
            a[i][j + 3] = v >> 0 & 1;
        }
    }
 
    /// Constructing partialsum table
    for (int i = 0; i <= n; ++i) b[i][0] = b[0][i] = 0;
    for (int i = 1; i <= n; ++i)
        for (int j = 1; j <= n; ++j)
            b[i][j] = b[i - 1][j] + b[i][j - 1] - b[i - 1][j - 1] + a[i][j];
 
    /// Find answer
    for (int x = n; x >= 1; --x)
        if (n % x == 0 && check(x))
            return cout << x, 0;
 
    return 0;
}

My Hacking Question

For brute-force version, I check it by comparing each position one by one.

So the complexity seems to be at most $$$O(d(n) \times n^2)$$$ and atleast $$$O(n^2)$$$.

It should fail but it didnt, it passed in $$$1216 ms$$$

https://codeforces.net/contest/1107/submission/138111659


int n;
bool a[LIM][LIM];
bool check(int scale) /// Check if this scaling is valid
{
    for (int lx = 1, rx = scale; lx <= n; lx += scale, rx += scale)
        for (int ly = 1, ry = scale; ly <= n; ly += scale, ry += scale)
            for (int x = lx; x <= rx; ++x)
                for (int y = ly; y <= ry; ++y)
                    if (a[x][y] != a[lx][ly])
                        return false;
 
    return true;
}
 
int main()
{
//    file("Test");
    ios::sync_with_stdio(NULL);
    cin.tie(NULL);
 
    /// Input
    cin >> n;
    for (int i = 1; i <= n; ++i)
    {
        for (int j = 1; j <= n; j += 4)
        {
            char c;
            cin >> c;
 
            /// Decode a[i][j]
            int v = (c <= '9') ? c - '0' : c + 10 - 'A';
            a[i][j + 0] = v >> 3 & 1;
            a[i][j + 1] = v >> 2 & 1;
            a[i][j + 2] = v >> 1 & 1;
            a[i][j + 3] = v >> 0 & 1;
        }
    }
 
    /// Find result
    for (int x = n; x >= 1; --x)
        if (n % x == 0 && check(x))
            return cout << x, 0;
 
    return 0;
}

Can someone construct a test hack for it, since I failed to find such test that it will fail.

Full text and comments »

SPyofgame
3 years ago
5

Count a, b, c satisfy a + b + c <= S and a * b * c <= T for large S, T

By SPyofgame, history, 4 years ago, In English

Statement

This question is based on bonus of this problem.

We need to count such non-negative integer triple $$$(a, b, c)$$$ that satisfy $$$(0 \leq a + b + c \leq S)$$$ and $$$(0 \leq a \times b \times c \leq T)$$$.

Since the result may be very big, you can either use bignum or modulo $$$10^9 + 7$$$ for convention

Notice that:

$$$(0, 0, 1) \neq (0, 1, 0) \neq (1, 0, 0)$$$

Constraint:

$$$0 \leq S, T \leq 10^{18}$$$
$$$0 \leq a, b, c$$$

No Time Limit. But expect to be 10 seconds

Memory Limit: 1Gb

Input:

A single line contain only two positive 60-bit integers $$$S$$$ and $$$T$$$ ($$$0 \leq S, T \leq 10^{18}$$$)

Output:

Print a single integer, the number of positive tuple satisfy mathematical condition

Example:

Example 0

Input:
0 0

Output:
1

Explain:
(0, 0, 0)

Example 1

Input:
1 1

Output:
4

Explain:
(0, 0, 0)
(0, 0, 1)
(0, 1, 0)
(1, 0, 0)

Example 2

Input:
10 10

Output:
213

Example 3

Input:
100 100

Output:
16616

Example 4

Input:
1000 1000

Output:
1530920

Example 5

Input:
10000 10000

Output:
150511618

Example 6

Input:
100000 100000

Full Output:
15007668845

Output Modulo 1e9 + 7:
7668740

Example 7

Input:
1000000 1000000

Full Output:
1500107530589

Output Modulo 1e9 + 7:
107520089

Example 8

Input:
10000000 10000000

Full Output:
150001436760246

Output Modulo 1e9 + 7:
435710239

Example 9

Input:
100000000 100000000

Full Output:
15000018512473629

Output Modulo 1e9 + 7:
407473503

Example 10

Input:
1000000000 1000000000

Full Output:
1500000231875375222

Output Modulo 1e9 + 7:
375373675

Example 11

Input:
10000000000 10000000000

Full Output:
?????????????????????

Output Modulo 1e9 + 7:
786369931

Example 12

Input:
100000000000 100000000000

Full Output:
??????????????????????

Output Modulo 1e9 + 7:
72345276

Example 13

Input:
1000000000000 1000000000000

Full Output:
???????????????????????

Output Modulo 1e9 + 7:
173128245

Example 14

Input:
10000000000000 10000000000000

Full Output:
???????????????????????

Output Modulo 1e9 + 7:
144115209

Example 15

Input:
100000000000000 100000000000000

Full Output:
???????????????????????

Output Modulo 1e9 + 7:
607035370

After many hours, the answer for $$$f(10^{18}, 10^{18})$$$ is reached but not confirmed therefore I wont add in the example

Current Research

When $$$S \leq \lfloor \frac{S}{3} \rfloor \times \lfloor \frac{S + 1}{3} \rfloor \times \lfloor \frac{S + 2}{3} \rfloor \leq T$$$. The condition $$$a \times b \times c \leq T$$$ is satisfied, therefore the result is $$$\frac{(S+1)(S+2)(S+3)}{6}$$$

When $$$T = 0$$$, at least one of them must be zero, therefore the result will be $$$\frac{3S(S-1)}{2} + 1$$$

When $$$S = 0$$$, there is only one triple satisfied $$$(0, 0, 0)$$$

When $$$S = T$$$, the function $$$f(S, T) \approx 1.5 S^2$$$ (Tested with $$$10^7$$$ integers $$$n \leq 10^{12}$$$

Without depend on $$$O(f(S))$$$, the best current known algorithm is $$$O(T^{5/9})$$$

Without depend on $$$O(f(T))$$$, the best current known algorithm is $$$O(S^2)$$$ (can be further optimized but not on researched)

Sadly, there were no papers, documentaries, integer sequences or math formulas found to apply this function.

Math discussion for Proof

Used this Paper

Reference Code

Current best known algorithm: O(T^(5/9)) - Used modulo for large result

Query for $$$(0 \leq S \leq 10^{18}, 0 \leq T \leq 10^{12})$$$ in under $$$830ms$$$

Update: Fixed the bug when calculate C++std::cbrt($$$n$$$) for $$$n$$$ = int($$$x$$$)$$$^3$$$

Code

/// @title: Count non negative integer tuple (a, b, c) satisfied (sum <= S) and (product <= T)
/// @testing: for large S <= 1e18, T <= 1e12
/// > in O(T^(5/9)) time complexity = 830ms codeforces = 310ms ideone
/// > in O(const) memory complexity
/// > in O(3700Mb) code memory in 200 lines, 4444 characters
///
/// @date:    23/08/2021
/// @link:    https://ideone.com/yMi8tu
/// @author:  SPyofgame
/// @lisence: free lisence

#pragma GCC target ("avx2")
#pragma GCC optimization ("O3")
#pragma GCC optimization ("unroll-loops")

#include <iostream>
#include <cmath>

typedef long long ll;
const int MOD = 1e9 + 7;

///====*====*====*====*====*====*====*====*====*====*====*====*====*====*====*====*====*====*====*====

/// Utility function
ll min(ll a, ll b) { return a < b ? a : b; }
ll max(ll a, ll b) { return a > b ? a : b; }
ll square(ll x) { return x * x; }

/// Sum of (1 + 2 + ... + x)
ll natural_sum(ll x)
{
    return (x <= 0) ? 0 : x * (x + 1) / 2;
}

/// Sum of (l + (l + 1) + ... + r)
ll natural_sum(ll l, ll r)
{
    return (l > r) ? 0 : natural_sum(r) - natural_sum(l - 1);
}

/// Sigma(p = y -> x) floor(n / p) in O(cbrt(n))
/// @link: https://arxiv.org/pdf/1206.3369.pdf
/// @author: Richard Sladkey
ll fastsumdiv(ll y, ll x, ll n)
{
    if (y > x) return 0;

    ll S = 0;
    ll B = n / (x + 1);
    ll E = n % (x + 1);
    ll D = n / x - B;
    ll G = B - x * D;
    ll d = 0;

    for (; x >= y; --x)
    {
        E += G;
        if (E >= x)
        {
            D += 1, G -= x, E -= x;
            if (E >= x)
            {
                D += 1, G -= x, E -= x;
                if (E >= x) break; /// not likely to happen more
            }
        }
        else if (E < 0)
        {
            D -= 1, G += x, E += x;
        }

        G += 2 * D, B += D, S += B;
    }

    E = n % (x + 1);
    D = n / x - B;
    G = B - x * D;
    for (; x >= y; --x)
    {
        E += G;
        d = E / x;
        D += d;
        E -= x * d;
        G += 2 * D - x * d, B += D, S += B;
    }

    for (; x >= y; --x)
    {
        S += n / x;
    }

    return S;
}

/// you can modify to -> Bignum / Modulo / Overflow
void add(ll &res, ll val)
{
    val %= MOD;
    res += val;
    if (res >= MOD) res -= MOD;
}

/// you can modify to -> Bignum / Modulo / Overflow
void sub(ll &res, ll val)
{
    val %= MOD;
    res -= val;
    if (res < 0) res += MOD;
}

/// Count (a, b, c) satisfied
/// { min(a, b, c) >= 0
/// { a + b + c <= S
/// { a * b * c <= T
/// O(cbrt(T)^2) complexity
///
/// @proof: https://math.stackexchange.com/questions/4230187/faster-algorithm-for-counting-non-negative-tripplea-b-c-satisfied-a-b-c
/// @author: SPyofgame
ll solve_ABC(ll S, ll T)
{
    ll cbT = cbrt(T); /// Not very accuracy
    while (cbT * cbT * cbT < T) ++cbT;
    while (cbT * cbT * cbT > T) --cbT;



/// [1] 0 <= a < b < c && a <= cbrt(T) -> cnt1 * 6
    ll cnt1 = 0;
    add(cnt1, (S / 2) * S - 2 * natural_sum(S / 2)); /// a = 0

    ll k = 1;
    for (ll a = 1, upa = min(S, cbT); a <= upa; ++a) /// a > 0 -> count(b < c)
    {
        ll SSS = S - a;
        ll TTT = T / a;
        ll KKK = min(SSS / 2, min((ll)sqrt(TTT), TTT / 2 + 1));

        /// Binary search for max k satisfy (S - a - k) <= (T / a / k)
        ll k = a;
        for (ll l = a + 1, r = KKK; l <= r; )
        {
            ll m = (l + r) >> 1;
            ll v = TTT / m;
            if (SSS - m <= v)
            {
                k = m;
                l = m + 1;
            }
            else
            {
                r = m - 1;
            }
        }

        sub(cnt1, natural_sum(a + 1, KKK));
        add(cnt1, SSS * (k - a) - natural_sum(a + 1, k));
        add(cnt1, fastsumdiv(k + 1, KKK, TTT));
    }



/// [2] 0 <= a < b = c && a <= cbrt(T) -> cnt2 * 3
    ll cnt2 = 0;
    add(cnt2, S / 2); /// a = 0
    for (ll a = 1, upa = min(S, cbT); a <= upa; ++a) /// a > 0 -> count(b = c)
    {
        add(cnt2, max(0LL, min((S - a) / 2, ll(floor(sqrt(T / a)))) - a));
    }



/// [3] 0 <= a = b < c && a <= cbrt(T) -> cnt3 * 3
    ll cnt3 = 0;
    add(cnt3, S); /// a = 0
    for (ll a = 1, upa = min(S / 2, cbT); a <= upa; ++a) /// a > 0 -> count(a < c)
    {
        add(cnt3, max(0LL, min(S - a - a, T / a / a) - a));
    }



/// [4] 0 <= a = b = c && a <= cbrt(T) -> cnt4 * 1
    ll cnt4 = 0;
    add(cnt4, min(S / 3, cbT) + 1); /// a = b = c >= 0



/// Final result: total counting
    ll res = 0;
    add(res, cnt1 * 6 + cnt2 * 3 + cnt3 * 3 + cnt4 * 1);
    return res;
}

/// Main function
int main()
{
	/// Input any number 0 <= S, T <= 1e18
    std::cout << solve_ABC(1000000000000000000LL, 1000000000000LL);
    return 0;
}

Note: It is now A347221

Full text and comments »

#math, #algebra, #combinatorics

SPyofgame
4 years ago
25

Suggestion: How can I not load all the comments when not need to read all of them

By SPyofgame, history, 4 years ago, In English

The problem

It is kinda annoying when we just want to solve some problems in EDU Codeforces but we have to load a bunch of comments that we even not want to read that time.

Or sometimes you just need to read the editorials and dont want to see the comments

Or sometimes you just want to see what are there in the blogs, relearn things in there but not the comments and you still need to load those comments for nothing

My Suggestions

Why dont we make something like hide all comments and only load it when we click on show all comments or show first/latest [x] comments in any kind of blogs

Or something like separates comments into sections where each section only have maximumly X comments, and we can edit value X in our setting

Or something like show some few high contributed (most upvotes) comments, and we only see all comments when we click See All

Or something like show some few comments, and we see next 10 comments or deeper comments (reply-comment) by clicking See more

Or if it is not possible to do it right now then are there any kind of tricks or apps you known that can prevent from loading huge amount of comments when unneeded to read ?

(Sorry if there is such topic about this before but I cant find it anywhere though I search google and codeforces that I write this blog)

Full text and comments »

SPyofgame
4 years ago
2

Lexicographically Minimal String Rotation

By SPyofgame, history, 4 years ago, In English

Table of content

Teleporter	Description
I. The Problem	Describe about the problem
II. Bruteforce Approach	Simple way to do the problem
III. Hashing Approach	Reduce circular substring comparing complexity
IV. Sqrt Decomposition	Divide into parts, solve each part then solve the whole part
V. KMP Approach	Magical Linear Booth-KMP Algorithm
VI. Lyndon Factorization Approach	Incredible Linear Duval-Lyndon Algorithm
VII. Suffix Array Approach	Hard for constructing but simple to find result
VIII. Suffix Automation Approach	Complex Linear Construction but simple to find result
IX. Elimination Tournament Algorithm	From initial candidates, eliminate worst candidates until find one final champion
...................................................................	..........................................................................................................................

I. The Problem

A) The problem:

1) Statement:

Sometimes being as a coder, you will find some real life problems about strings. Some strings are just simple an array of characters with 2 ended. Sometimes you will face with circular strings, the one which circular around. Let take you to the biological laboratory, dont worry, I just teleport you without requiring any money. In the lab, you will find many interesting bacteria. How could you detect which one is belonged to the same group or not ? Let simplify the problem that two bacteria are in the same group if their circular encoded-DNA strings are the same. So how can you compare two randomly encoded-DNA ? One of the effectively way to do that is convert all strings into Minimal Lexicographical Acyclic Strings (a.k.a Lexicographically Minimal String Rotation) and then hash them for faster comparison.

This bring down to a much simpler problem. Let define a right-rotation of a string is that putting the leftmost character to the rightmost position. Given circular DNA string $$$S$$$ of size $$$n$$$. Find the minimum right-rotation to make it Minimal Lexicographical for all such rotations.

2) Question:

Given a string of latin words $$$S$$$ of size $$$N$$$
A right-rotation of $$$S$$$ is that $$$S_2 + S_3 + ... + S_{|S|} + S_1$$$ where ('+' notation mean concatenation)
Find the least right-rotation to make $$$S$$$ become the Lexicographically Minimal

3) Constraint:

$$$S$$$ is a string of lower latin (you can expand the problem to $$$A$$$ is an array of integers)
$$$|S| \leq 10^5$$$ (you can also expand the limit to $$$10^6$$$ or even higher)

4) Example:

Example 1

Input:  a
Output: 0
String: a

Example 2

Input:  ba
Output: 1
String: ab

Example 3

Input:  aaaaaa
Output: 0
String: aaaaaa

Example 4

Input:  aaaaab
Output: 0
String: aaaaab

Example 5

Input:  aaaaba
Output: 5
String: aaaaab

Example 6

Input:  aaabaa
Output: 4
String: aaaaab

Example 7

Input:  aabaaa
Output: 3
String: aaaaab

Example 8

Input:  abaaaa
Output: 2
String: aaaaab

Example 9

Input:  baaaaa
Output: 1
String: aaaaab

Example 10

Input:  baaaab
Output: 1
String: aaaabb

Example 11

Input:  abbbba
Output: 5
String: aabbbb

Example 12

Input:  baabaa
Output: 1
String: aabaab

Example 13

Input:  abaabaaabaababaaabaaababaab
Output: 14
String: aaabaaababaababaabaaabaabab

All Examples As Once

Input:  a ba aaaaaa aaaaab aaaaba aaabaa aabaaa abaaaa baaaaa baaaab abbbba baabaa abaabaaabaababaaabaaababaab
Output: 0  1 0      0           5     4     3     2     1      1          5  1                   14 
Output: 0 1 0 0 5 4 3 2 1 1 5 1 14 
String: a ab aaaaaa aaaaab aaaaab aaaaab aaaaab aaaaab aaaaab aaaabb aabbbb aabaab aaabaaababaababaabaaabaabab

B) Real Life Applications

1) Finger print identification:

We can encode the finger print into many detailed circular strings. How to search such finger print again from those in very huge data base ? Circular comparision using lyndon factorization is requried.

2) Biological genetics:

In some cases, we need to know whether these two group's genetics are belonged to the same bacteria, virus.

3) Games:

Well, ofcourse we can apply the algorithm in some language/words-related games

C) Practice Links

1) CSES — Minimal Rotation

Require: Output least rotation string with $$$|S| \leq 10^6$$$

2) SPOJ — Minimum Rotations

Require: Output least rotation number with $$$|S| \leq 10^5$$$

3) UVA — Glass Beads

Require: Queries output least rotation number with $$$|S| \leq 10^4$$$

II. Bruteforce Apprach

A) Sorting and Searching:

1) Idea:

We will take all possible circular substring in $$$S$$$
For every possible rotation of the string, we add it with its number of rotations needed
Then we sort all the array by their string (if they are equal, we take one with its smaller index).
The answer will be the smallest string (minimal lexicographical) with its lowest index.

2) Complexity:

Compare two random strings $$$S$$$ and $$$T$$$ cost $$$O(min(|S|, |T|)) = O(n)$$$ (since $$$|S| = |T| = n$$$)
The complexity of sorting all array is $$$O(n log n)$$$
Hence the total complexity is $$$O(n^2 log n)$$$

3) Implementations:

Vector Sorting Bruteforce Solution - O(n^2 log(n)) time - O(n^2) auxiliary space

#include <algorithm>
#include <iostream>
#include <vector>

using namespace std;
    
int min_cyc(string s)
{
    vector<pair<string, int> > V; 
    for (int i = 0; i < s.size(); ++i)
    {
        V.push_back(make_pair(s, i));
        rotate(s.begin(), s.begin() + 1, s.end());
    }

    sort(V.begin(), V.end());
    return V.front().second;
}

int main()
{
    string s;
    cin >> s;
    cout << min_cyc(s);
    return 0;
}

Map Sorting Bruteforce Solution - O(n^2 log(n)) time - O(n^2) auxiliary space

#include <algorithm>
#include <iostream>
#include <map>

using namespace std;
    
int min_cyc(string s)
{
    map<string, int> M;
    for (int i = 0; i < s.size(); ++i)
    {
        if (!M.count(s)) M[s] = i;
        rotate(s.begin(), s.begin() + 1, s.end());
    }
    return (M.begin() -> second);
}

int main()
{
    string s;
    cin >> s;
    cout << min_cyc(s);
    return 0;
}

B) Loop over all rotations:

1) Idea:

Why should we store then sort anyway, we just need to loop over all rotations and comparing to select the smaller

2) Implementations:

For convention, you can just duplicate the string $$$T = S + S$$$
Then $$$S$$$ at the $$$ith$$$ rotations ($$$0 \leq i < n$$$) is the string $$$T[i \dots i + n]$$$

Bruteforce Solution - O(n^2) time - O(n) auxiliary space

#include <iostream>

using namespace std;

int min_cyc(string s)
{
    int n = s.size();
    s += s;
    
    int t = 0;
    for (int i = 1; i < n; ++i)
        if (s.substr(i, n) < s.substr(t, n))
            t = i;

    return t;
}

int main()
{
    string s;
    cin >> s;
    cout << min_cyc(s);
    return 0;
}

3) Optimization:

By quickly break when the two strings arent equal, this can optimize the function a little bit
Instead of making duplicated string, you can also define $$$s[x] = s[x\ mod\ |s|]$$$ since $$$s$$$ itself is a circular string
Instead of taking modulo, since $$$0 \leq x < 2 * n$$$, we just need reduce $$$x$$$ by $$$n$$$ whenever it passed $$$n$$$

Optimized Bruteforce Solution - O(n^2) time - O(1) auxiliary space

#include <iostream>

using namespace std;

int min_cyc(const string &s)
{
    int n = s.size();

    int t = 0;
    for (int i = 1; i < n; ++i)
    {
        int cmp = 0; /// EQUAL
        for (int p = n, l = t, r = i; p > 0; --p, ++l, ++r)
        {
            if (l == n) l = 0;
            if (r == n) r = 0;
            if (s[l] < s[r]) { cmp = -1; break; } ///  LESS 
            if (s[l] > s[r]) { cmp = +1; break; } /// GREATER
        }

        if (cmp == +1) t = i;
    }
    
    return t;
}

int main()
{
    string s;
    cin >> s;
    cout << min_cyc(s);
    return 0;
}

C) Substring Based Elimination

1) Idea:

From inital set of candidates, keep comparing substrings and eliminate the bigger one until we get the final champion

2) Algorithm:

First, let $$$mn$$$ / $$$mx$$$ is the minimal / maximal character of $$$s$$$

Define: $$$mn = \underset{c \in S}{min}(c)$$$ and $$$mx = \underset{c \in S}{max}(c)$$$

Pre-elimination Round: We take the position $$$p$$$ that $$$s[p] = mn$$$. Since all other positions wont provide Minimal Lexicographical Circular String

Define: $$$candidate = $$$ { $$$p\ \ |\ \ p \in $$$ { $$$0, 1, \dots, |s| - 1$$$ } $$$ \cap s[p] = mn$$$ }

Then we take maximumly $$$n - 1$$$ Rounds from $$$d = 1$$$ to $$$d = n - 1$$$. For all current candidate, add the next character to it. Find the minimal substring, and eliminater unsatisfied others.

2) Optimization

Optimization: At the $$$p$$$ round we will find $$$c = $$$ minimal character among all candidate's $$$pth$$$-next character, then traverse again and eliminate the one whose $$$pth$$$-nxt character is greater than $$$c$$$
Define: $$$s[x] = s[x\ mod\ |s|]$$$ and $$$c = \underset{p \in\ candidate}{min}(s[p + d])$$$ and $$$next\ candidate = $$$ { $$$p\ \ |\ \ p \in candidate \cap s[p + d] = c$$$ }

3) Example:

$$$S = $$$"$$$abaabaaabaababaaabaaababaab$$$"

Pre-elimination - 26 candidates

Candidate: 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26
Number:    012345678901234567890123456
String:    abaabaaabaababaaabaaababaab
Remain:    abaabaaabaababaaabaaababaab
Candidate Substring:
0 - a 
1 - b
2 - a
3 - a
4 - b
5 - a
6 - a
7 - a
8 - b
9 - a
10 - a
11 - b
12 - a
13 - b 
14 - a
15 - a
16 - a
17 - b
18 - a
19 - a
20 - a
21 - b
22 - a
24 - a
25 - a
26 - b
Winner: a*

Round I - 18 candidates

Candidate: 0 2 3 5 6 7 9 10 12 14 15 16 18 19 20 22 24 25 
Number:    012345678901234567890123456
String:    abaabaaabaababaaabaaababaab
Remain:    a_aa_aaa_aa_a_aaa_aaa_a_aa_
Candidate Substring:
0 - ab
2 - aa
3 - ab
5 - aa
6 - aa
7 - ab
9 - aa
10 - ab
12 - ab 
14 - aa
15 - aa
16 - ab
18 - aa
19 - aa
20 - ab
22 - ab
24 - aa
25 - ab
Winner: aa*

Round II - 9 candidates

Candidate: 2 5 6 9 14 15 18 19 24 
Number:    012345678901234567890123456
String:    abaabaaabaababaaabaaababaab
Remain:    __aa_aa__a____aa__aa____a__
Candidate Substring:
2 - aab
5 - aaa
6 - aab
9 - aab
14 - aaa
15 - aab
18 - aaa
19 - aab
24 - aab
Winner: aaa*

Round III - 3 candidates

Candidate: 5 14 18 
Number:    012345678901234567890123456
String:    abaabaaabaababaaabaaababaab
Remain:    _____a________a___a________
Candidate Substring:
5 - aaab
14 - aaab
18 - aaab
Winner: aaab*

Round IV - 3 candidates

Candidate: 5 14 18 
Number:    012345678901234567890123456
String:    abaabaaabaababaaabaaababaab
Remain:    _____a________a___a________
Candidate Substring:
5 - aaaba
14 - aaaba
18 - aaaba
Winner: aaaba*

Round V - 3 candidates

Candidate: 5 14 18 
Number:    012345678901234567890123456
String:    abaabaaabaababaaabaaababaab
Remain:    _____a________a___a________
Candidate Substring:
5 - aaabaa
14 - aaabaa
18 - aaabab
Winner: aaabaa*

Round VI - 2 candidates

Candidate: 5 14
Number:    012345678901234567890123456
String:    abaabaaabaababaaabaaababaab
Remain:    _____a________a____________
Candidate Substring:
5 - aaabaab
14 - aaabaaa
Winner: aaabaaa*

Champion

Winner:   14 - VI
Number:   012345678901234567890123456
String:   abaabaaabaababaaabaaababaab
Remain:   ______________a____________
Champion: aaabaaa*

4) Notice

After all eliminations, if there are more than one candidate, select the one with lowest index

5) Complexity

If the string is $$$k$$$-repeated strings (each substring size $$$d$$$) or similar then there will be atleast $$$k$$$ candidate after $$$d$$$ eliminations. About $$$O(k \times d)$$$ in the best cases
Best case to say, like a string where all characters are unique, then it will be Linear $$$O(n)$$$
Worst case to say, like a string where all characters are the same, then it will be $$$O(n \times n) = O(n^2)$$$

6) Implementations

For convention, let not use obviously-not-optimized code

Detail Substring Based Elimination - O(n^2) time - O(n) auxiliary space

#include <algorithm>
#include <iostream>
#include <vector>

using namespace std;

/// Finding minimal string right-rotation to make string minimal lexicographical
int min_cyc(string s)
{
    int n = s.size(); /// For convention: n = |s|
    s += s;

    /// For convention
    char mx = *max_element(s.begin(), s.end());
    char mn = *min_element(s.begin(), s.end());

    /// Candidate list
    vector<int> a;

    /// Pre-elimination
    for (int i = 0; i < n; ++i)
        if (s[i] == mn)
            a.push_back(i);

    /// Doing dth elimination
    for (int d = 1; d < n; ++d)
    {
        /// Minimal character among substrings
        char c = mx;
        for (int x : a) c = min(c, s[x + d]);

        /// New candidate list
        vector<int> b;

        /// Elimination
        for (int x : a)
            if (s[x + d] == c)
                b.push_back(x);

        swap(a, b);
        if (a.size() == 1) break; /// Found final candidate
    }

    return a.front(); /// The one with smallest index
}

int main()
{
    /// Input
    string s;
    cin >> s;

    /// Output
    cout << min_cyc(s);
    return 0;
}

Noncomment Substring Based Elimination - O(n^2) time - O(n) auxiliary space

#include <algorithm>
#include <iostream>
#include <vector>

using namespace std;

int min_cyc(string s)
{
    int n = s.size();
    s += s;

    char mx = *max_element(s.begin(), s.end());
    char mn = *min_element(s.begin(), s.end());
    vector<int> a;
    for (int i = 0; i < n; ++i)
        if (s[i] == mn)
            a.push_back(i);

    for (int d = 1; d < n; ++d)
    {
        char c = mx;
        for (int x : a) c = min(c, s[x + d]);

        vector<int> b;
        for (int x : a)
            if (s[x + d] == c)
                b.push_back(x);

        swap(a, b);
        if (a.size() == 1) break;
    }

    return a.front();
}

int main()
{
    string s;
    cin >> s;
    cout << min_cyc(s);
    return 0;
}

III. Hashing Apprach

A) Loop over all rotations

1) Bruteforces Algorithm:

Iterating over all rotations and comparing for finding the smaller

If they are fully equal, then that position not exceed, then we take the one which smaller index
Else find the first position of character that they are difference, which character is smaller, the which of it is also smaller

Bruteforce Approach - O(n^2 log n) time - O(n) auxiliary space

#include <iostream>
#include <vector>

using namespace std;

int min_cyc(string s)
{
    int n = s.size();
    s += s;

    int t = 0;
    for (int i = 1; i < n; ++i)
    {
        int lcp = 0;
        for (int l = 0, r = n - 1; l <= r; )
        {
            int m = (l + r) >> 1;
            if (s.substr(t, m) == s.substr(i, m))
            {
                lcp = m;
                l = m + 1;
            }
            else 
            {
                r = m - 1;
            }
        }

        if (lcp == n) continue; /// Equal
        if (s[t + lcp] > s[i + lcp]) t = i;
    }

    return t;
}

int main()
{
    string s;
    cin >> s;
    cout << min_cyc(s);
    return 0;
}

2) Hashing Improvement:

We reduce the complexity by using binary search to find the first different position of two strings:
Let $$$A$$$ is the circular substring of $$$s$$$ with the starting point $$$l$$$
Let $$$B$$$ is the circular substring of $$$s$$$ with the starting point $$$r$$$
Let $$$lcp$$$ (longest common prefix) is the last position that $$$A$$$ and $$$B$$$ equal
Let $$$t = s + s$$$, for convention of circular string
For every $$$p \leq lcp$$$, we have $$$t[l + p - 1] = t[r + p - 1]$$$
For every $$$p > lcp$$$, we have $$$t[l + p - 1] \neq t[r + p - 1]$$$
Hence we can use binary search to find $$$lcp$$$
Fully equal case is that $$$lcp = n$$$
If they are difference, compare $$$t[l + lcp]$$$ with $$$t[r + lcp]$$$

3) Implementations:

Single Hashing Approach - O(n log(MOD)) precalculation - O(n log n) time - O(n) auxiliary space

#include <iostream>
#include <vector>

using namespace std;

int MOD = 1e9 + 7;
int BASE = 123;
int powMOD(int x, int n)
{
    if (n == 0) return 1;
    if (n & 1) return (1LL * powMOD(x, n - 1) * x) % MOD;
    return (1LL * powMOD(x, n / 2) * powMOD(x, n / 2)) % MOD;
}

vector<int> P;
vector<int> H;
void init(const string &s)
{
    int n = s.size();
    int b = 1;
    
    P.assign(n, 0);
    H.assign(n + 1, 0);
    for (int i = 0; i < n; ++i)
    {
        H[i + 1] = (H[i] + 1LL * b * (s[i] - 'a' + 1)) % MOD;
        P[i] = powMOD(b, MOD - 2);
        b = (1LL * b * BASE) % MOD;
    }
}

int query(int l, int r)
{
    return (1LL * (H[r] - H[l] + MOD) * P[l]) % MOD;
}

int min_cyc(string s)
{
    int n = s.size();
    s += s;

    init(s);
    int t = 0;
    for (int i = 1; i < n; ++i)
    {
        int lcp = 0;
        for (int l = 0, r = n - 1; l <= r; )
        {
            int m = (l + r) >> 1;
            if (query(t, t + m) == query(i, i + m))
            {
                lcp = m;
                l = m + 1;
            }
            else 
            {
                r = m - 1;
            }
        }

        if (lcp == n - 1) continue; /// Equal
        if (s[t + lcp] > s[i + lcp]) t = i;
    }

    return t;
}

int main()
{
    string s;
    cin >> s;
    cout << min_cyc(s);
    return 0;
}

Multiple Hashing Approach - O(n Sigma(log(MOD))) precalculation - O(n log n * k) time - O(n * k) space

#include <iostream>
#include <vector>

using namespace std;

template <const int MOD, const int BASE>
struct Hash 
{
    int powMOD(int x, int n)
    {
        if (n == 0) return 1;
        if (n & 1) return (1LL * powMOD(x, n - 1) * x) % MOD;
        return (1LL * powMOD(x, n / 2) * powMOD(x, n / 2)) % MOD;
    }

    vector<int> P;
    vector<int> H;
    void init(const string &s)
    {
        int n = s.size();
        int b = 1;
     
        P.assign(n, 0);
        H.assign(n + 1, 0);
        for (int i = 0; i < n; ++i)
        {
            H[i + 1] = (H[i] + 1LL * b * (s[i] - 'a' + 1)) % MOD;
            P[i] = powMOD(b, MOD - 2);
            b = (1LL * b * BASE) % MOD;
        }
    }

    int query(int l, int r)
    {
        return (1LL * (H[r] - H[l] + MOD) * P[l]) % MOD;
    }
};

struct Pack
{
    int a, b, c, d, e, f;
    Pack (int a = 0, int b = 0, int c = 0, int d = 0, int e = 0, int f = 0)
    : a(a), b(b), c(c), d(d), e(e), f(f) {}

    bool operator == (const Pack &o)
    {
        return (a == o.a)
            && (b == o.b)
            && (c == o.c)
            && (d == o.d)
            && (e == o.e)
            && (f == o.f);
    }
};

struct Multihash
{
    Hash <1000000007, 123> A;
    Hash <1000000009, 1234> B;
    Hash <1000000021, 12345> C;
    Hash <1000000033, 123456> D;
    Hash <1000000087, 1234567> E;
    Hash <1000000093, 12345678> F;
    void init(const string &s)
    {
        A.init(s);
        B.init(s);
        C.init(s);
        D.init(s);
        E.init(s);
        F.init(s);
    }

    Pack query(int l, int r)
    {
        return Pack(
            A.query(l, r),
            B.query(l, r),
            C.query(l, r),
            D.query(l, r),
            E.query(l, r),
            F.query(l, r)
        );
    }
};

Multihash H;
int min_cyc(string s)
{
    int n = s.size();
    s += s;

    H.init(s);
    int t = 0;
    for (int i = 1; i < n; ++i)
    {
        int lcp = 0;
        for (int l = 0, r = n - 1; l <= r; )
        {
            int m = (l + r) >> 1;
            if (H.query(t, t + m) == H.query(i, i + m))
            {
                lcp = m;
                l = m + 1;
            }
            else 
            {
                r = m - 1;
            }
        }

        if (lcp == n - 1) continue; /// Equal
        if (s[t + lcp] > s[i + lcp]) t = i;
    }

    return t;
}

int main()
{
    string s;
    cin >> s;
    cout << min_cyc(s);
    return 0;
}

4) Optimization:

Hash are so heavy of hidden constant, obviously most by modulo operators, but you can have some tricks to solve it
Significant improvement: Declare $$$MOD,\ BASE,\ LIM$$$ as const or constexpr
In Single Hash, you can use overflow modulo for significant faster but it is also dangerous in same cases (especially hacking)
Replace vector with pre-declared array
Replace recursive power function by iterative one
Improve time to calculate inverse modulo. You can do it linear but cost more space and modulo operators, so it is better to do like below.

Optimized Bruteforce Approach - O(n^2 log n) time - O(n) auxiliary space

#include <iostream>

using namespace std;

int min_cyc(const string &s)
{
    int t = 0;
    int n = s.size();
    for (int i = 1; i < n; ++i)
    {
        int lcp = 0;
        for (int l = t, r = i; lcp < n; ++lcp, ++l, ++r)
        {
            if (l == n) l = 0;
            if (r == n) r = 0;
            if (s[l] != s[r]) break;
        }

        if (lcp == n) continue; /// Equal
        if (s[t + lcp] > s[i + lcp]) t = i;
    }

    return t;
}

int main()
{
    string s;
    cin >> s;
    cout << min_cyc(s);
    return 0;
}

Optimized Single Hashing Approach - O(n + log(MOD)) precalculation - O(n log n) time - O(n) space

#include <iostream>
#include <cstring>

using namespace std;

const int LIM = 2e5 + 25;
const int MOD = 1e9 + 7;
const int BASE = 123;
int powMOD(int x, int n)
{
    int res = 1;
    for (; n > 0; n >>= 1)
    {
        if (n & 1) res = (1LL * res * x) % MOD;
        x = (1LL * x * x) % MOD;
    }

    return res;
}

int H[LIM];
int P[LIM];
void init(const string &s)
{
    int n = s.size();
    int b = 1;
    
    H[0] = 0;
    for (int i = 0; i < n; ++i)
    {
        H[i + 1] = (H[i] + 1LL * b * (s[i] - 'a' + 1)) % MOD;
        b = (1LL * b * BASE) % MOD;
    }

    P[n] = powMOD(b, MOD - 2);
    for (int i = n - 1; i >= 0; --i)
        P[i] = (1LL * P[i + 1] * BASE) % MOD;
}

int query(int l, int r)
{
    return (1LL * (H[r] - H[l] + MOD) * P[l]) % MOD;
}

int min_cyc(string s)
{
    int n = s.size();
    s += s;

    init(s);
    int t = 0;
    for (int i = 1; i < n; ++i)
    {
        int lcp = 0;
        for (int l = 0, r = n - 1; l <= r; )
        {
            int m = (l + r) >> 1;
            if (query(t, t + m) == query(i, i + m))
            {
                lcp = m;
                l = m + 1;
            }
            else 
            {
                r = m - 1;
            }
        }

        if (lcp == n - 1) continue; /// Equal
        if (s[t + lcp] > s[i + lcp]) t = i;
    }

    return t;
}

int main()
{
    string s;
    cin >> s;
    cout << min_cyc(s);
    return 0;
}

Optimized Multiple Hashing Approach - O(n Sigma(log((MOD))) precalculation - O(n log n * k) time - O(n * k) space

#include <iostream>

using namespace std;

template <const int LIM, const int MOD, const int BASE>
struct Hash 
{
    int powMOD(int x, int n)
    {
        int res = 1;
        for (; n > 0; n >>= 1)
        {
            if (n & 1) res = (1LL * res * x) % MOD;
            x = (1LL * x * x) % MOD;
        }

        return res;
    }

    int H[LIM];
    int P[LIM];
    void init(const string &s)
    {
        int n = s.size();
        int b = 1;
     
        H[0] = 0;
        for (int i = 0; i < n; ++i)
        {
            H[i + 1] = (H[i] + 1LL * b * (s[i] - 'a' + 1)) % MOD;
            b = (1LL * b * BASE) % MOD;
        }

        P[n] = powMOD(b, MOD - 2);
        for (int i = n - 1; i >= 0; --i)
            P[i] = (1LL * P[i + 1] * BASE) % MOD;
    }

    int query(int l, int r)
    {
        return (1LL * (H[r] - H[l] + MOD) * P[l]) % MOD;
    }
};

struct Pack
{
    int a, b, c, d, e, f;
    Pack (int a = 0, int b = 0, int c = 0, int d = 0, int e = 0, int f = 0)
    : a(a), b(b), c(c), d(d), e(e), f(f) {}

    bool operator == (const Pack &o)
    {
        return (a == o.a)
            && (b == o.b)
            && (c == o.c)
            && (d == o.d)
            && (e == o.e)
            && (f == o.f);
    }
};

struct Multihash
{
    Hash <200200, 1000000007, 123> A;
    Hash <200200, 1000000009, 1234> B;
    Hash <200200, 1000000021, 12345> C;
    Hash <200200, 1000000033, 123456> D;
    Hash <200200, 1000000087, 1234567> E;
    Hash <200200, 1000000093, 12345678> F;
    void init(const string &s)
    {
        A.init(s);
        B.init(s);
        C.init(s);
        D.init(s);
        E.init(s);
        F.init(s);
    }

    Pack query(int l, int r)
    {
        return Pack(
            A.query(l, r),
            B.query(l, r),
            C.query(l, r),
            D.query(l, r),
            E.query(l, r),
            F.query(l, r)
        );
    }
};

Multihash H;
int min_cyc(string s)
{
    int n = s.size();
    s += s;

    H.init(s);
    int t = 0;
    for (int i = 1; i < n; ++i)
    {
        int lcp = 0;
        for (int l = 0, r = n - 1; l <= r; )
        {
            int m = (l + r) >> 1;
            if (H.query(t, t + m) == H.query(i, i + m))
            {
                lcp = m;
                l = m + 1;
            }
            else 
            {
                r = m - 1;
            }
        }

        if (lcp == n - 1) continue; /// Equal
        if (s[t + lcp] > s[i + lcp]) t = i;
    }

    return t;
}

int main()
{
    string s;
    cin >> s;
    cout << min_cyc(s);
    return 0;
}

2) Logarithm Decomposition

1) Algorithm:

Let a candidate to say a starting position that might leed to lexicographically minimum string rotation. Hence we have the initial candidates are $$$0, 1, \dots, |S| - 1$$$
Let divide the candidates list into parts of size $$$\approx K$$$ (the final part might have much less). There will be $$$\lceil \frac{N}{K} \rceil$$$ parts.
Small iteration: For each parts, we find one smallest circular substring of size $$$K$$$, among all equal circular substrings, pick the one with smallest starting position. Each part will produce only one candidate
Big iteration: For $$$\lceil \frac{N}{K} \rceil$$$ next candidates we will find one smallest circular substring of size $$$N$$$, among all equal circular substrings, pick the one with smallest starting position. This will give you the answer

2) Proofs:

Let $$$A, B$$$ are the circular substrings start from candidates positions, that $$$|A| = |B|$$$. And let $$$X, Y$$$ are the prefixes of $$$A, B$$$, that $$$|X| = |Y|$$$
Since we are comparing in lexicographical order, if $$$X < Y$$$ then it will lead to $$$A < B$$$. Hence by using small iteration we sieved all unoptimal candidates, and reduce $$$N$$$ candidates to $$$\lceil \frac{N}{K} \rceil$$$ candidates only.

3) Complexity:

For small iteration, there are $$$N$$$ candidates, and each candidate are compared with $$$K$$$ lengthed circular substrings using hash. Hence $$$O(N \times log(K))$$$
For big iteration, there are $$$K$$$ candidates, and each candidate are compared with $$$N$$$ lengthed circular substrings using hash. Hence $$$O(\lceil \frac{N}{K} \rceil \times log(N))$$$
The total time complexity is $$$\approx O(N log(K) + \frac{N}{K} log(N))$$$
For optimal purpose, let $$$K \approx \log{N}$$$ the complexity therefore will be only $$$O(N log(log N))$$$

4) Notice:

We are comparing string in circular, you can either use modulo position or duplicated string
Becareful that the real size of string and the duplicated one

5) Implementations:

For convention, let just ignore those obvious not a good code (ugly — complex — stupid code worth nothing to say)

Single Hashing Approach - O(n * log((MOD)) precalculation - O(n log(log n)) time - O(n) space

#include <iostream>
#include <cmath>

using namespace std;

const int LIM = 1e6 + 16;
const int MOD = 1e9 + 7;
const int BASE = 123;
const int LESS    = -1;
const int EQUAL   =  0;
const int GREATER = +1;

/// Calculate (x ^ n) % MOD
int powMOD(int x, int n)
{
    int res = 1;
    for (; n > 0; n >>= 1)
    {
        if (n & 1) res = (1LL * res * x) % MOD;
        x = (1LL * x * x) % MOD;
    }
    
    return res;
}

int H[LIM];
int P[LIM];
/// Calculate hash of (S)
void init(const string &s)
{
    int n = s.size();

    H[0] = 0;
    int b = 1;
    for (int i = 0; i < n; ++i)
    {
        H[i + 1] = (H[i] + 1LL * b * (s[i] - 'a' + 1)) % MOD;
        b = (1LL * b * BASE) % MOD;
    }

    P[n] = powMOD(b, MOD - 2);
    for (int i = n; i >= 1; --i)
        P[i - 1] = (1LL * P[i] * BASE) % MOD;
}

/// Hash value of s[l..r]
int query(int l, int r)
{
    return (1LL * (H[r] - H[l] + MOD) * P[l]) % MOD;
}

/// Compare s[l..l+d-1] vs s[r..r+d-1]
int cmp(const string &s, int i, int j, int d)
{
    int lcp = 0;
    for (int l = 0, r = d - 1; l <= r; )
    {
        int m = (l + r) >> 1;
        if (query(i, i + m) == query(j, j + m))
        {
            lcp = m;
            l = m + 1;
        }
        else 
        {
            r = m - 1;
        }
    }

    if (lcp == d - 1 || s[i + lcp] == s[j + lcp]) return EQUAL;
    return (s[i + lcp] < s[j + lcp]) ? LESS : GREATER;
}

/// Finding minimal string right-rotation to make string minimal lexicographical
int min_cyc(string s)
{
    int n = s.size();
    int k = log(n) + 1;
    s += s;
    init(s);

    int big = 0;
    int upper = n / k + 1;
    for (int part = 0; part < upper; ++part)
    {
        int small = k * part;
        int upper = min(n, k * (part + 1));
        for (int i = k * part; i < upper; ++i)
            if (cmp(s, small, i, k) == GREATER)
                small = i;

        if (cmp(s, big, small, n) == GREATER) 
            big = small;
    }

    return big;
}

int main()
{
    string s;
    cin >> s;
    cout << min_cyc(s);
    return 0;
}

Multiple Hashing Approach - O(n * Sigma(log((MOD))) precalculation - O(n log(log n) * k) time - O(n * k) space

#include <iostream>
#include <cmath>

using namespace std;

const int LIM = 1e6 + 16;
const int LESS    = -1;
const int EQUAL   =  0;
const int GREATER = +1;

template<const int MOD, const int BASE>
struct Hash
{
    /// Calculate (x ^ n) % MOD
    int powMOD(int x, int n)
    {
        int res = 1;
        for (; n > 0; n >>= 1)
        {
            if (n & 1) res = (1LL * res * x) % MOD;
            x = (1LL * x * x) % MOD;
        }
        
        return res;
    }

    int H[LIM];
    int P[LIM];
    /// Calculate hash of (S)
    void init(const string &s)
    {
        int n = s.size();

        H[0] = 0;
        int b = 1;
        for (int i = 0; i < n; ++i)
        {
            H[i + 1] = (H[i] + 1LL * b * (s[i] - 'a' + 1)) % MOD;
            b = (1LL * b * BASE) % MOD;
        }

        P[n] = powMOD(b, MOD - 2);
        for (int i = n; i >= 1; --i)
            P[i - 1] = (1LL * P[i] * BASE) % MOD;
    }

    /// Hash value of s[l..r]
    int query(int l, int r)
    {
        return (1LL * (H[r] - H[l] + MOD) * P[l]) % MOD;
    }
};

struct Pack 
{
    int a, b, c, d, e, f;
    Pack (int a = 0, int b = 0, int c = 0, int d = 0, int e = 0, int f = 0)
    : a(a), b(b), c(c), d(d), e(e), f(f) {}

    bool operator == (const Pack &o)
    {
        return (a == o.a)
            && (b == o.b)
            && (c == o.c)
            && (d == o.d)
            && (e == o.e)
            && (f == o.f);
    }
};

struct Multihash 
{
    Hash<1000000007, 123> A;
    Hash<1000000009, 1234> B;
    Hash<1000000021, 12345> C;
    Hash<1000000033, 123456> D;
    Hash<1000000087, 1234567> E;
    Hash<1000000093, 12345678> F;

    void init(const string &s)
    {
        A.init(s);
        B.init(s);
        C.init(s);
        D.init(s);
        E.init(s);
        F.init(s);
    }

    Pack query(int l, int r)
    {
        return Pack(
            A.query(l, r), 
            B.query(l, r),
            C.query(l, r),
            D.query(l, r),
            E.query(l, r),
            F.query(l, r)
        );
    }
};

Multihash H;

/// Compare s[l..l+d-1] vs s[r..r+d-1]
int cmp(const string &s, int i, int j, int d)
{
    int lcp = 0;
    for (int l = 0, r = d - 1; l <= r; )
    {
        int m = (l + r) >> 1;
        if (H.query(i, i + m) == H.query(j, j + m))
        {
            lcp = m;
            l = m + 1;
        }
        else 
        {
            r = m - 1;
        }
    }

    if (lcp == d - 1 || s[i + lcp] == s[j + lcp]) return EQUAL;
    return (s[i + lcp] < s[j + lcp]) ? LESS : GREATER;
}

/// Finding minimal string right-rotation to make string minimal lexicographical
int min_cyc(string s)
{
    int n = s.size();
    int k = log(n) + 1;
    s += s;
    H.init(s);

    int big = 0;
    int upper = n / k + 1;
    for (int part = 0; part < upper; ++part)
    {
        int small = k * part;
        int upper = min(n, k * (part + 1));
        for (int i = k * part; i < upper; ++i)
            if (cmp(s, small, i, k) == GREATER)
                small = i;

        if (cmp(s, big, small, n) == GREATER) 
            big = small;
    }

    return big;
}

int main()
{
    string s;
    cin >> s;
    cout << min_cyc(s);
    return 0;
}

IV. Sqrt Decomposition

1) Divide candidate list into many parts

1) Algorithm:

Let a candidate to say a starting position that might leed to lexicographically minimum string rotation. Hence we have the initial candidates are $$$0, 1, \dots, |S| - 1$$$
Let divide the candidates list into parts of size $$$\approx K$$$ (the final part might have much less). There will be $$$\lceil \frac{N}{K} \rceil$$$ parts.
Small iteration: For each parts, we find one smallest circular substring of size $$$K$$$, among all equal circular substrings, pick the one with smallest starting position. Each part will produce only one candidate
Big iteration: For $$$\lceil \frac{N}{K} \rceil$$$ next candidates we will find one smallest circular substring of size $$$N$$$, among all equal circular substrings, pick the one with smallest starting position. This will give you the answer

2) Proofs:

Let $$$A, B$$$ are the circular substrings start from candidates positions, that $$$|A| = |B|$$$. And let $$$X, Y$$$ are the prefixes of $$$A, B$$$, that $$$|X| = |Y|$$$
Since we are comparing in lexicographical order, if $$$X < Y$$$ then it will lead to $$$A < B$$$. Hence by using small iteration we sieved all unoptimal candidates, and reduce $$$N$$$ candidates to $$$\lceil \frac{N}{K} \rceil$$$ candidates only.

3) Complexity:

For small iteration, there are $$$N$$$ candidates, and each candidate are compared with $$$K$$$ lengthed circular substrings. Hence $$$O(N \times K)$$$
For big iteration, there are $$$K$$$ candidates, and each candidate are compared with $$$N$$$ lengthed circular substrings. Hence $$$O(\lceil \frac{N}{K} \rceil \times N)$$$
The total time complexity is $$$O(N \times (K + \lceil \frac{N}{K} \rceil))$$$
For optimal purpose, let $$$K \approx \sqrt{N}$$$ the complexity therefore will be only $$$O(N \sqrt{N})$$$

4) Notice:

We are comparing string in circular, you can either use modulo position or duplicated string
Becareful that the real size of string and the duplicated one

5) Implementations:

For convention, let just ignore those obvious not a good code (ugly — complex — stupid code worth nothing to say)

Detail Sqrt Decomposition Solution - O(N√N) time - O(N) auxiliary space

#include <iostream>
#include <cmath>

using namespace std;

int min_cyc(string s)
{
    int n = s.size(); /// real size of string
    int k = sqrt(n); /// optimal choice
    s += s; /// for convention

    int big = 0; /// One random candidate
    /// Big iteration: Compare string of size K
    for (int part = 0; part < n / k + 1; ++part) 
    {
        int small = part * k; /// One random candidate
        /// Small iteration: Compare string of size K
        for (int i = part * k; i < min(n, (part + 1) * k); ++i) 
        {
            /// Local comparision: Circular Prefixstring starts from here
            if (s.substr(small, k) > s.substr(i, k)) 
            {
                /// Assign new minimal point
                small = i; 
            }
        }

        /// Global comparision: Circular String starts from here
        if (s.substr(big, n) > s.substr(small, n)) 
        {
            /// Assign new minimal point
            big = small;
        }
    }

    /// The final winner
    return big;
}

int main()
{
    /// Input
    string s;
    cin >> s;

    /// Output
    cout << min_cyc(s);
    return 0;
}

Noncomment Sqrt Decomposition Solution - O(N√N) time - O(N) auxiliary space

#include <iostream>
#include <cmath>

using namespace std;

int min_cyc(string s)
{
    int n = s.size(); 
    int k = sqrt(n);
    s += s; 

    int big = 0; 
    for (int part = 0; part < n / k + 1; ++part) 
    {
        int small = part * k; 
        for (int i = part * k; i < min(n, (part + 1) * k); ++i) 
            if (s.substr(small, k) > s.substr(i, k)) 
                small = i; 

        if (s.substr(big, n) > s.substr(small, n)) 
            big = small;
    }

    return big;
}

int main()
{
    string s;
    cin >> s;
    cout << min_cyc(s);
    return 0;
}

Optimized Sqrt Decomposition Solution - O(N√N) time - O(1) auxiliary space

#include <iostream>
#include <cmath>

using namespace std;

int min_cyc(const string &s)
{
    int n = s.size(); 
    int k = sqrt(n);

    int big = 0; 
    int upper = n / k + 1;
    for (int part = 0; part < upper; ++part) 
    {
        int small = part * k; 
        int upper = min(n, (part + 1) * k);
        for (int i = part * k; i < upper; ++i) 
        {
            for (int p = 0, l = small, r = i; p < k; ++p, ++l, ++r)
            {
                if (l == n) l = 0;
                if (r == n) r = 0;
                if (s[l]  > s[r]) small = i;
                if (s[l] != s[r]) break;
            }
        }

        for (int p = 0, l = big, r = small; p < n; ++p, ++l, ++r)
        {
            if (l == n) l = 0;
            if (r == n) r = 0;
            if (s[l]  > s[r]) big = small;
            if (s[l] != s[r]) break;
        }
    }

    return big;
}

int main()
{
    string s;
    cin >> s;
    cout << min_cyc(s);
    return 0;
}

2) Hash Improvement

1) Idea:

By using hash for comparing two known substrings $$$X$$$, $$$Y$$$ of equal length $$$D$$$. We can compare them from $$$O(S) \rightarrow O(log(S))$$$ by finding their LCP — Longest Common Prefix and comparing $$$X[lcp]$$$ with $$$Y[lcp]$$$

2) Complexity:

The total time complexity is $$$O(N \times log(K) + \lceil \frac{N}{K} \rceil \times log(N)) \approx O(N \times \sqrt{N})$$$ but significant smaller constant (About 2 or/to 3 times faster). But the trade off is more complex code and more space is taken.

3) Optimization:

Like the above Hashing approach
Use constant modulo
Replace vector with array
Replace recursive with iterative
Quick calculating inverse modulo

4) Implementations:

Single Hash Solution - O(N + log(MOD)) preprocessing - O(N√N) time - O(N) auxiliary space

#include <iostream>
#include <cmath>

using namespace std;

const int LIM = 2e5 + 25;
const int MOD = 1e9 + 7;
const int BASE = 123;
const int LESS    = -1;
const int EQUAL   =  0;
const int GREATER = +1;

/// Calculate (x ^ n) % MOD
int powMOD(int x, int n)
{
    int res = 1;
    for (; n > 0; n >>= 1)
    {
        if (n & 1) res = (1LL * res * x) % MOD;
        x = (1LL * x * x) % MOD;
    }
    
    return res;
}

int H[LIM];
int P[LIM];
/// Calculate hash of (S)
void init(const string &s)
{
    int n = s.size();

    H[0] = 0;
    int b = 1;
    for (int i = 0; i < n; ++i)
    {
        H[i + 1] = (H[i] + 1LL * b * (s[i] - 'a' + 1)) % MOD;
        b = (1LL * b * BASE) % MOD;
    }

    P[n] = powMOD(b, MOD - 2);
    for (int i = n; i >= 1; --i)
        P[i - 1] = (1LL * P[i] * BASE) % MOD;
}

/// Hash value of s[l..r]
int query(int l, int r)
{
    return (1LL * (H[r] - H[l] + MOD) * P[l]) % MOD;
}

/// Compare s[l..l+d-1] vs s[r..r+d-1]
int cmp(const string &s, int i, int j, int d)
{
    int lcp = 0;
    for (int l = 0, r = d - 1; l <= r; )
    {
        int m = (l + r) >> 1;
        if (query(i, i + m) == query(j, j + m))
        {
            lcp = m;
            l = m + 1;
        }
        else 
        {
            r = m - 1;
        }
    }

    if (lcp == d - 1 || s[i + lcp] == s[j + lcp]) return EQUAL;
    return (s[i + lcp] < s[j + lcp]) ? LESS : GREATER;
}

/// Finding minimal string right-rotation to make string minimal lexicographical
int min_cyc(string s)
{
    int n = s.size();
    int k = sqrt(n);
    s += s;
    init(s);

    int big = 0;
    int upper = n / k + 1;
    for (int part = 0; part < upper; ++part)
    {
        int small = k * part;
        int upper = min(n, k * (part + 1));
        for (int i = k * part; i < upper; ++i)
            if (cmp(s, small, i, k) == GREATER)
                small = i;

        if (cmp(s, big, small, n) == GREATER) 
            big = small;
    }

    return big;
}

int main()
{
    string s;
    cin >> s;
    cout << min_cyc(s);
    return 0;
}

Single Hash Solution - O(N + Sigma(log(MOD))) preprocessing - O(N√N * K) time - O(N * K) auxiliary space

#include <iostream>
#include <cmath>

using namespace std;

const int LIM = 2e5 + 25;
const int LESS    = -1;
const int EQUAL   =  0;
const int GREATER = +1;

template<const int MOD, const int BASE>
struct Hash
{
    /// Calculate (x ^ n) % MOD
    int powMOD(int x, int n)
    {
        int res = 1;
        for (; n > 0; n >>= 1)
        {
            if (n & 1) res = (1LL * res * x) % MOD;
            x = (1LL * x * x) % MOD;
        }
        
        return res;
    }

    int H[LIM];
    int P[LIM];
    /// Calculate hash of (S)
    void init(const string &s)
    {
        int n = s.size();

        H[0] = 0;
        int b = 1;
        for (int i = 0; i < n; ++i)
        {
            H[i + 1] = (H[i] + 1LL * b * (s[i] - 'a' + 1)) % MOD;
            b = (1LL * b * BASE) % MOD;
        }

        P[n] = powMOD(b, MOD - 2);
        for (int i = n; i >= 1; --i)
            P[i - 1] = (1LL * P[i] * BASE) % MOD;
    }

    /// Hash value of s[l..r]
    int query(int l, int r)
    {
        return (1LL * (H[r] - H[l] + MOD) * P[l]) % MOD;
    }
};

struct Pack 
{
    int a, b, c, d, e, f;
    Pack (int a = 0, int b = 0, int c = 0, int d = 0, int e = 0, int f = 0)
    : a(a), b(b), c(c), d(d), e(e), f(f) {}

    bool operator == (const Pack &o)
    {
        return (a == o.a)
            && (b == o.b)
            && (c == o.c)
            && (d == o.d)
            && (e == o.e)
            && (f == o.f);
    }
};

struct Multihash 
{
    Hash<1000000007, 123> A;
    Hash<1000000009, 1234> B;
    Hash<1000000021, 12345> C;
    Hash<1000000033, 123456> D;
    Hash<1000000087, 1234567> E;
    Hash<1000000093, 12345678> F;

    void init(const string &s)
    {
        A.init(s);
        B.init(s);
        C.init(s);
        D.init(s);
        E.init(s);
        F.init(s);
    }

    Pack query(int l, int r)
    {
        return Pack(
            A.query(l, r), 
            B.query(l, r),
            C.query(l, r),
            D.query(l, r),
            E.query(l, r),
            F.query(l, r)
        );
    }
};

Multihash H;

/// Compare s[l..l+d-1] vs s[r..r+d-1]
int cmp(const string &s, int i, int j, int d)
{
    int lcp = 0;
    for (int l = 0, r = d - 1; l <= r; )
    {
        int m = (l + r) >> 1;
        if (H.query(i, i + m) == H.query(j, j + m))
        {
            lcp = m;
            l = m + 1;
        }
        else 
        {
            r = m - 1;
        }
    }

    if (lcp == d - 1 || s[i + lcp] == s[j + lcp]) return EQUAL;
    return (s[i + lcp] < s[j + lcp]) ? LESS : GREATER;
}

/// Finding minimal string right-rotation to make string minimal lexicographical
int min_cyc(string s)
{
    int n = s.size();
    int k = sqrt(n);
    s += s;
    H.init(s);

    int big = 0;
    int upper = n / k + 1;
    for (int part = 0; part < upper; ++part)
    {
        int small = k * part;
        int upper = min(n, k * (part + 1));
        for (int i = k * part; i < upper; ++i)
            if (cmp(s, small, i, k) == GREATER)
                small = i;

        if (cmp(s, big, small, n) == GREATER) 
            big = small;
    }

    return big;
}

int main()
{
    string s;
    cin >> s;
    cout << min_cyc(s);
    return 0;
}

V. KMP Approach

A) Normal KMP

1) Algorithm:

In this problem, there is no effeciency using normal KMP over all pair substring comparing, even if we divide into queries and optimize it. It still have no improvement compare to all other implementations
But if you want to compare two strings by KMP, then here it is
We find their first different position using KMP of two strings or KMP on concatenation string ($$$s$$$ + $$$c$$$ + $$$t$$$) — for such $$$c$$$ is smaller than any character in $$$s$$$ and $$$t$$$.

2) Implementations:

My Simple KMP Implementations - O(n) time - O(1) auxiliary space

void KMP(const string &s, vector<int> &K)
{
    K.assign(s.size(), 0);
    for (int l = 1, r = 1; r < s.size(); ++r)
    {
        for (l = K[r - 1]; l && s[l] != s[r]; l = K[l - 1]);
        K[r] = l + (s[l] == s[r]);
    }
}

Bruteforces using KMP - O(n^2) time - O(n) auxiliary space

#include <algorithm>
#include <iostream>
#include <vector>

using namespace std;

vector<int> KMP(const string &s)
{
    vector<int> K(s.size());
    for (int l = 1, r = 1; r < s.size(); ++r)
    {
        for (l = K[r - 1]; l && s[l] != s[r]; l = K[l - 1]);
        K[r] = l + (s[l] == s[r]);
    }
    return K;
}

int min_cyc(string s)
{
    int n = s.size();
    s += s;

    int p = 0;
    for (int i = 1; i < n; ++i)
    {
        vector<int> K = KMP(s.substr(p, n) + "#" + s.substr(i, n));
        int k = 0; /// First difference character
        while (k < n && K[k + n] + 1 == K[k + n + 1]) ++k;
        if (k < n && s[p + k] > s[i + k]) p = i;
    }

    return p;
}

int main()
{
    string s;
    cin >> s;
    cout << min_cyc(s);
    return 0;
}

B) Booth Algorithm

1) Idea:

Remember when you compare circular substrings using hashing ? We find first difference character $$$k$$$ by binary search and compare only at that position to know whether one is greater, equal or less than other. But this give you huge hidden constant that might lead you TLE (the Optimized version doesnt provides that much but its constant is not that small too). While we can take the advantage of $$$KMP$$$ to find its first difference too, in a much smaller constant ofcourse. But we have to do some clever tricks to ignore its high complexity.

2) Code Explanation:

For convention let duplicate the initial string $$$S + S$$$. We can easily find the prefix function for the string. We have to find that at position $$$i > n$$$, which position $$$p$$$ that the initial one and the current rotation differ, then compare the two rotations. However it can only provides comparision between inital rotation and the current rotation, if you want for all rotations you have to merge all $$$N$$$ circular substrings. But, since we only care about the minimal lexicographical, we can just immediately eliminate the worse rotation when you find a better rotation. Then we treat initial rotation by this new better a rotation, but by how can we do that ? The important is that we cut the relationship of the current prefix function with the previous one, by assign the previous as $$$-1$$$, hence we are comparing the prefix function for this current rotation. This allow the algorithm complexity drop from normal KMP $$$O(n^2)$$$ to Booth-KMP $$$O(n)$$$ and way much smaller constant than Hashing $$$O(n log n)$$$.

3)* Implementations:

Detail Booth Algorithm - O(n) time - O(n) auxiliary space

#include <iostream>
#include <vector>

using namespace std;

/// Finding minimal string right-rotation to make string minimal lexicographical
int min_cyc(string s)
{
    /// Prefix Function, not for the inital but the current one
    vector<int> f(s.size(), -1); /// -1 means KMP function stop right there
    s += s; /// Duplicating for convention

    int p = 0; /// The current rotation we are compare with
    for (int l = 1, r = 1; r < s.size(); ++r) 
    {
        /// KMP function
        ///
        /// l = f[r - p - 1]      <=       l = K[K[r] - 1]
        /// Latest KMP value              Latest KMP value
        ///
        ///           l != 0 && s[l] != s[r]
        ///    non-ended KMP and non-equal character 
        ///            V    V    V    V    V
        ///       l != -1 && s[p + l + 1] != s[r]          
        /// non-ended KMP and non-equal character at pth rotation            
        ///
        ///  l = f[l]              <=        l = K[l - 1]
        /// Previous KMP                     Previous KMP 
        /// For convention we just made (l) difference a little bit
        ///

        for (l = f[r - p - 1]; l != -1 && s[p + l + 1] != s[r]; l = f[l])
        {
            if (s[l + p + 1] > s[r]) /// pth rotation is bigger than current rotation
            {  
                p = r - l - 1;
            }
        }

        /// KMP not ended here and this is the first different character between pth rotation and current rotation
        if (l == -1 && s[l + p + 1] != s[r])
        {
            if (s[p + l + 1] > s[r]) /// pth rotation is bigger than current rotation
            {
                p = r;
            }
                           /// f[r - p] <=> K[K[r]]
            f[r - p] = -1; /// KMP must ended here since we compared the rotation
        }
        else
        {
            f[r - p] = l + 1; /// Normal KMP where "(s[l] = s[r]) = (1)"
        }
    }
    
    /// Return the rotation
    return p;
}
    
int main()
{
    /// Input
    string s;
    cin >> s;

    /// Output
    cout << min_cyc(s);
    return 0;
}

Non-comment Booth Algorithm - O(n) time - O(n) auxiliary space

#include <iostream>
#include <vector>

using namespace std;

int min_cyc(string s)
{
    int p = 0;
    s += s;

    vector<int> f(s.size(), -1);
    for (int l = 1, r = 1; r < s.size(); ++r) 
    {
        for (l = f[r - p - 1]; l != -1 && s[p + l + 1] != s[r]; l = f[l])
            if (s[l + p + 1] > s[r])
                p = r - l - 1;

        if (l == -1 && s[p + l + 1] != s[r])
        {
            if (s[p + l + 1] > s[r])
                p = r;

            f[r - p] = -1;
        }
        else
        {
            f[r - p] = l + 1;
        }
    }
    
    return p;
}
    
int main()
{
    string s;
    cin >> s;
    cout << min_cyc(s);
    return 0;
}

VI. Lyndon Factorization Approach

A) Approach

1) Definitions:

For detail definitions, proves, applications, implementations, you can read Simple Linear and Effectively Duval Algorithm for Lyndon Factorization
Lyndon word is nonempty string that is strictly smaller in lexicographic order than all of its rotations.
Lyndon factorization is a unique way to split the string into many lyndon words in such a way that the words in the sequence are in non-increasing order. That means we factorize $$$s = s_1 + s_2 + \dots + s_k$$$ where $$$s_1, s_2, \dots, s_k$$$ are lyndon words and in non-increasing order ($$$s1 \geq s2 \geq \dots \geq s_k$$$)
From the above definition, we con conclude that the last factor in lyndon factorization of the string itself is minimal lexicographical.

2) Duval Algorithm:

In $$$1983$$$, Duval provides an effecient algorithm for listing the Lyndon words of length at most $$$n$$$ with a given alphabet size $$$s$$$ in lexicographic order in $$$O(n)$$$

For string $$$S$$$ of length $$$n$$$
For each new word $$$x$$$ from $$$S$$$ that $$$\forall i$$$ we have $$$x[i] = s[i\ mod\ n]$$$ (mean $$$x$$$ is a sub-string of some cyclic strings that shifted from initial $$$S$$$)
While the last symbol of $$$x$$$ is in sorted order, remove it to make a shorter word
Replace the last remained symbol of $$$x$$$ by its successor in sorted order.

3) Examples:

$$$S = $$$"$$$abaabaaabaababaaabaaababaab$$$"

Lyndon Factorization

4) Notice:

Wait wait wait, are you going to take the head position of last factor of string factorization and it will be the answer ?
Nope, becaure the string are circular, you must duplicate the string before doing so, else there might exist such string that start from from right but connect with left to form a minimal lexicographical string rotation.

5) Implementations:

Detail Duval Solution - O(n) Time - O(n) Auxiliary Space

The idea is that when we factorize duplicated string $$$t = s + s$$$
Then the answer will be a substring of maximum starting position $$$p$$$ not exceed $$$|s|$$$
The proves is already inside the code

#include <algorithm>
#include <iostream>

using namespace std;

/// Find starting position of minimum acyclic string in (s)
int min_cyc(string s)
{
    int n = s.size(); /// the real size of the string
    s += s; /// for convention since we are deadling with acyclic

    ///
    /// s = s1 + s2 + s3
    /// s1 = s[1..l-1] is handled
    /// s2 = s[l..r]   is handling
    /// s3 = s[p..n]   is going to be handled
    /// 

    int res = 0; /// minimum acyclic string
    /// while (s2) is a lyndon word, try to add s2 with s[p]
    for (int l = 0; l < n; )
    {
        ///
        /// - Case 1: 
        ///     If (s) is fully ordered, then return 0
        ///     Surely will this loop make [l..r] = [0..n-1]
        ///     Ans it is currently that (l = 0) 
        ///     => res = l is a correct answer
        ///
        /// - Case 2:
        ///     Minimum acyclic string s' = s[l..r] that 0 <= l < n <= r < 2n
        ///     Also if s2 is s', then the loop will extend its (r >= n)
        ///     Since l < n, the latest (l) will create s'    
        ///     => res = l is a correct answer
        /// 
        /// Hence in both cases, res = last(l) will return a correct answer
        ///
        res = l;

        /// Extend as much as possible lyndon word s2 = s[l..r]
        int r = l, p = l + 1;
        while (p < s.size())
        {
            /// (s2 + s[p]) is not a lyndon word
            if (s[r] > s[p]) 
            {
                break;
            }

            /// (s2 + s[p]) is stil a lyndon word, hence extend s2
            if (s[r] == s[p]) 
            {
                ++r;
                ++p;
                continue;
            }
            
            /// (s2 + s[p]) is a lyndon word, but it may be a repeated string
            if (s[r] < s[p]) 
            {
                r = l;
                ++p;
                continue;
            }
        }

        /// The lyndon word may have the form of s2 = sx + sx + .. + sx like "12312123"
        while (l <= r) 
        {
            /// s[l..l + p - r] is sx
            l += p - r;
        }
    }

    /// Dont forget to return the value ;)
    return res;
}

/// If you wanna know about that minimum acyclic string
string cyc(string s, int k)
{
    rotate(s.begin(), s.begin() + k, s.end());
    return s;
}

int main()
{
    /// Input 
    string s;
    cin >> s;

    /// Output
    cout << min_cyc(s) << '\n';
//  cout << cyc(s, min_cyc(s));
    return 0;
}

None-comment Duval Solution - O(n) Time - O(n) Auxiliary Space

#include <iostream>

using namespace std;

int min_cyc(string s)
{
    int n = s.size();
    s += s;

    int res = 0;
    for (int l = 0; l < n; )
    {
        res = l;
        int r = l, p = l + 1;
        for (; p < s.size() && s[r] <= s[p]; ++r, ++p)
            if (s[r] < s[p]) r = l - 1;

        while (l <= r) l += p - r;
    }

    return res;
}

int main()
{
    string s;
    cin >> s;
    cout << min_cyc(s) << '\n';
    return 0;
}

6) Optimization:

We dont need to duplicate the string
We dont need to continue the function when $$$S_2$$$ has the size of $$$n$$$ (or when starting point of $$$S_2$$$ $$$< n \leq$$$ ending point of $$$S_2$$$)
We just skip those cases $$$S_2 = s_x + s_x + \dots + s_x + s_y$$$ but jump directly to the next character

Optimized Duval Solution - O(n) Time - O(1) Auxiliary Space

#include <iostream>
    
using namespace std;
    
int min_cyc(const string &s) 
{
    int n = s.size();
    int res = 0;
    for (int l = 0; l < n; )
    {
        res = l;
        int r = l, p = l + 1;
        for (; r < n; ++r, ++p) /// If there is such string found, then its length wont exceed |s|
        {
            char c = (p < n) ? s[p] : s[p - n]; /// to avoid modulo
            if (s[r] > c) break;
            if (s[r] < c) r = l - 1;
        }        
        l = max(r, l + p - r); /// just skip those (s2 = sx + sx + ... + sx + sy) cases
    }
    return res;
}

int main()
{
    string s;
    cin >> s;
    cout << min_cyc(s);
    return 0;
}

VII. Suffix Array Approach

A) Approach

1) Wrong Idea:

Store all suffix of $$$S$$$ (with its index) into an array then sort it (in lexicographical order).

Let $$$S = $$$"$$$baabaa$$$" then there are such suffixes: "$$$a$$$", "$$$aa$$$", "$$$baa$$$", "$$$abaa$$$", "$$$aabaa$$$", "$$$baabaa$$$"

Since the first having smallest lexicographical order among all suffixes then its index must be the answer ?

2) Prove it wrong:

We are dealing with circular string, and the normal suffix doesnt have enough information.
Let $$$A, B$$$ are some different suffixes, and $$$A < B$$$, but if you extend both by size $$$n$$$ then ($$$A < B$$$) might no longer be true, but because of the lexicographical sorting order and all suffixes here having different size, therefore the one with smaller size is considered as "smaller".

3) Correct idea:

Sort all suffixes of $$$S + S$$$ with its index
Find smallest suffix whose index in $$$0 \dots |s| - 1$$$
Compare with other suffixes to find one with smaller index

4) Algorithm:

For convention

Let smaller / biggers means in lexicographical order
Let equal means two suffixes have their first $$$n$$$ characters equal (first means prefix of suffix)
Let $$$T = S + S + c$$$ ($$$c$$$ is smaller than any character in $$$S$$$)

First we store all suffixes of $$$T$$$ (with its index) into an array then sort it.
We can easily find the first suffix $$$X$$$ whose index $$$t$$$ is between ($$$0 \dots n - 1$$$) (those suffixes whose length are at least $$$n$$$)
Since there might be some other equal suffixes of size $$$n$$$ whose index is smaller, we compare strings and find it

5) Optimizations:

Since all suffixes are sorted and we only need to care smallest suffixes (all among them we select the one with smallest index)
Hence we only need to care about consecutive suffixes with $$$X$$$ (they are either bigger than $$$X$$$ or equal to $$$X$$$ but with smaller index)

Let $$$Y$$$ is the current suffix (consecutive to right of $$$X$$$ of course)
If $$$X = Y$$$ then $$$Y$$$ must have smaller index, hence we update the result
Otherwise since $$$Y > X$$$, hence we break

6) Examples:

Let $$$SA[]$$$ (Suffix Array) is the array which stores order of suffixes

$$$SA[i] < SA[j]$$$ means suffix start from $$$i$$$ is smaller than suffix start from $$$j$$$

Let $$$LCP[]$$$ (Longest Common Prefix) is the array with store the longest common prefix between two consecutive suffixes in the order of $$$SA[]$$$

$$$LCP[x] = k$$$ means suffix start from $$$SA[x]$$$ and $$$SA[x + 1]$$$ have their longest common prefix is $$$k$$$

For $$$S = $$$"$$$aaba$$$"

Example

At (0): SA[i] = 8 | LCP[i] = 0 | Suffix we care: <p>Unable to parse markup [type=CF_MATHJAX]</p>$$$ | Real suffix = $$$\$$
At (1): SA[i] = 7 | LCP[i] = 0 | Suffix we care: a$   | Real suffix = a$
At (2): SA[i] = 3 | LCP[i] = 1 | Suffix we care: aaab | Real suffix = aaaba$
At (3): SA[i] = 4 | LCP[i] = 2 | Suffix we care: aaba | Real suffix = aaba$
At (4): SA[i] = 0 | LCP[i] = 0 | Suffix we care: aaba | Real suffix = aabaaaba$
At (5): SA[i] = 5 | LCP[i] = 1 | Suffix we care: aba$ | Real suffix = aba$
At (6): SA[i] = 1 | LCP[i] = 3 | Suffix we care: abaa | Real suffix = abaaaba$
At (7): SA[i] = 6 | LCP[i] = 0 | Suffix we care: ba$  | Real suffix = ba$
At (8): SA[i] = 2 | LCP[i] = 2 | Suffix we care: baaa | Real suffix = baaaba$

t = 2
answer = 3

7) Notices:

With $$$T = S + S$$$ instead of $$$T = S + S + c$$$, there are actually have plentiful implementations to make algorithm right, even only $$$T = S$$$ there are ways to solve the problem too. But they might much more complex, slow or just correct for this problem that you should be careful.
Smallest lexicographical suffix might not have smallest index

8) Implementations:

There are many ways to implement suffix array (I wont go deeper)

Spoiler

$$$O(n^2 log n)$$$ by storing and sorting by the whole suffix
$$$O(n^2)$$$ by improving the sorting
$$$O(n log^2(n))$$$ by only provide positions and sort by the $$$2^0, 2^1, \dots, 2^k \approx n$$$ character
$$$O(n log(n))$$$ by using radix sort instead
$$$O(n log(log(n)))$$$ by improving how you sort further deep
$$$O(n)$$$ by using DC3 algorithm

Simple Suffix Array Construction - O(n^2 log n) time - O(n) space

When you have little time less to code

#include <algorithm>
#include <iostream>
#include <numeric>
#include <vector>
 
using namespace std;

#define all(x) (x).begin(), (x).end()
vector<int> cal_SA(const string &s)
{
    vector<int> p(s.size());
    iota(all(p), 0);
    sort(all(p), [&](int x, int y) { return s.substr(x, s.size() - x) < s.substr(y, s.size() - y); });
    return p;
}

int main()
{
    string s;
    cin >> s;
    s += '$';

    vector<int> SA = cal_SA(s);
    for (int x : SA) 
    {
        string suffix = s.substr(x);
        cout << x << ' ' << suffix << endl;
    }

    return 0;
}

Simple Improved Suffix Array Construction - O(n log^2(n)) time - O(n) space

When you have little time less to code but not to TLE

#include <algorithm>
#include <iostream>
#include <vector>
 
using namespace std;

typedef long long ll;
#define all(x) (x).begin(), (x).end()
vector<int> cal_SA(const string &s)
{
    const ll n = s.size();
    vector<int> p(n);
    vector<ll> c(n), d(n);
    for (int i = 0; i < n; ++i) p[i] = i, c[i] = s[i];

    for (int k = 0; k < n; k = !k ? 1 : k <<= 1)
    {
        for (int i = 0; i < n; ++i) d[i] = n * c[i] + c[(i + k) % n];
        sort(all(p), [&](int i, int j) { return d[i] < d[j]; });
 
        c[p[0]] = 0;
        for (int i = 1; i < n; ++i)
            c[p[i]] = c[p[i - 1]] + (d[p[i]] != d[p[i - 1]]);
    }
 
    return p;
}
int main()
{
    string s;
    cin >> s;
    s += '$';
    
    vector<int> SA = cal_SA(s);
    for (int x : SA) 
    {
        string suffix = s.substr(x);
        cout << x << ' ' << suffix << endl;
    }

    return 0;
}

Optimized Suffix Array Construction - O(n log(n)) time - O(n) space

#include <algorithm>
#include <iostream>
#include <numeric>
#include <vector>
 
using namespace std;

typedef long long ll;
#define all(x) (x).begin(), (x).end()
vector<int> cal_SA(const string &s)
{
    int n = s.size();
    vector<int> f(2 * n), p(n), c(n);
    for (int i = 0; i < n; ++i)
        f[i] = f[i + n] = i;
        
    { /// k = 0
        /// sort one character
        iota(all(p), 0);
        stable_sort(all(p), [&](int i, int j) { return s[i] < s[j]; });

        /// assign initial order
        c[p[0]] = 0;
        for (int i = 1; i < n; ++i)
            c[p[i]] = c[p[i - 1]] + (s[p[i]] != s[p[i - 1]]);
    }
 
    vector<int> cn(n), pn(n), cnt(n + 1);
    for (int k = 1; k < n; k <<= 1)
    {
        /// radix sort
        for (int &x : p) x = f[x - k + n];
        fill(all(cnt), 0);
        for(int x : c) cnt[x + 1]++;
        partial_sum(all(cnt), cnt.begin());
        for(int x : p) pn[cnt[c[x]]++] = x;
        swap(p, pn);
 
        /// assign new order
        cn[p[0]] = 0;
        for (int i = 1; i < n; ++i)
        {
            int l = p[i - 1], r = p[i];
            cn[r] = cn[l] + (c[l] != c[r] || c[f[l + k]] != c[f[r + k]]);
        }
        swap(c, cn);
    }
    
    return p;
}

int main()
{
    string s;
    cin >> s;
    s += '$';
    
    vector<int> SA = cal_SA(s);
    for (int x : SA) 
    {
        string suffix = s.substr(x);
        cout << x << ' ' << suffix << endl;
    }

    return 0;
}

There are many ways to implement longest common prefix: (I wont go deeper)

Spoiler

Here are main implementations

Simple Optimized Implementation - O(n log n) SA - O(n) LCP - O(n log n) total time - O(n) auxiliary space

#include <algorithm>
#include <iostream>
#include <numeric>
#include <vector>
 
using namespace std;
 
#define all(x) (x).begin(), (x).end()
vector<int> cal_SA(const string &s)
{
    int n = s.size();
    vector<int> f(2 * n), p(n), c(n);
    for (int i = 0; i < n; ++i)
        f[i] = f[i + n] = i;
        
    { /// k = 0
        iota(all(p), 0);
        stable_sort(all(p), [&](int i, int j) { return s[i] < s[j]; });

        c[p[0]] = 0;
        for (int i = 1; i < n; ++i)
            c[p[i]] = c[p[i - 1]] + (s[p[i]] != s[p[i - 1]]);
    }
 
    vector<int> cn(n), pn(n), cnt(n + 1);
    for (int k = 1; k < n; k <<= 1)
    {
        for (int &x : p) x = f[x - k + n];
        fill(all(cnt), 0);
        for(int x : c) cnt[x + 1]++;
        partial_sum(all(cnt), cnt.begin());
        for(int x : p) pn[cnt[c[x]]++] = x;
        swap(p, pn);
 
        cn[p[0]] = 0;
        for (int i = 1; i < n; ++i)
        {
            int l = p[i - 1], r = p[i];
            cn[r] = cn[l] + (c[l] != c[r] || c[f[l + k]] != c[f[r + k]]);
        }
        swap(c, cn);
    }
    
    return p;
}
 
vector<int> cal_LCP(const string &s, const vector<int> &p)
{
    int n = s.size();
    vector<int> LCP(n), c(n);
    for (int i = 0; i < n; i++) c[p[i]] = i;
    for (int i = 0, k = 0; i < n; ++i) if (p[i] != n - 1)
    {
        for (int j = p[c[i] - 1]; s[i + k] == s[j + k]; ++k);
        LCP[c[i]] = k;
        if (k) --k;
    }
 
    return LCP;
}
 
int min_cyc(const string &s)
{
    if (count(all(s), s[0]) == s.size()) return 0;
    int n = s.size();
    vector<int> SA = cal_SA(s + s + '$');
    vector<int> LCP = cal_LCP(s + s + '$', SA);

    for (int i = 0; i < 2 * n + 1; ++i) if (SA[i] < n)
    {
        while (i <= 2 * n && SA[i] > SA[i + 1] && LCP[i + 1] >= n) ++i;
        return SA[i];
    }
}
 
int main()
{
    string s;
    cin >> s;
    cout << min_cyc(s);
}

VIII. Suffix Automation Approach

A) Approach

1) Algorithm:

Construct Suffix Automaton of $$$s + s$$$. The automaton itself is a path graph of all string rotations
Inorder to get lexicographically minimal string rotation, we traverse from initial state $$$|s|$$$ times by smallest edge (in lexicographical order), Let this state is $$$X$$$
For inding its index, we find the first occurence of minal strings, equivalence to first occurence of state $$$X$$$. We construct occurences of each state. Let first occurence of state $$$P$$$ is $$$f(P)$$$
Let not go deeper in construction here. We have least rotation $$$r = f(X) - |s| + 1$$$

2) Implementations:

Suffix Automaton Solution - O(n log(alphabet)) time - O(n) auxiliary space

#include <iostream>
#include <vector>
#include <map>

using namespace std;

struct state 
{
    // leng : longest lengh of all strings in the pth class
    // minp : minimum position of occurence
    // link : provide you the parent of sa_size
    // edge : labeled edge from node sa_size

    int leng;
    int minp;
    int link;
    map<char, int> edge;
};

const int LIM = 2e6 + 26;
state sa[LIM * 2];
int sa_size;
int last;

void construct(const string &s) 
{
    /// Initialization
    last = 0;
    sa[0].leng = 0;
    sa[0].minp = -1;
    sa[0].link = -1;
    sa_size = 1;

    /// Extend suffix automaton
    for (char c : s)
    {
        /// Make new state
        int cur = sa_size++;
        sa[cur].leng = sa[last].leng + 1;
        sa[cur].minp = sa[cur].leng - 1;

        int u = last;
        /// Find such state (u) linked to (last)
        for (; u != -1 && !sa[u].edge.count(c); u = sa[u].link)
            sa[u].edge[c] = cur;

        last = cur;
        if (u == -1) continue; /// (last) is linked with inital state only
        
        int v = sa[u].edge[c];
        if (sa[u].leng + 1 == sa[v].leng) /// we dont split state (v) here
        {
            sa[cur].link = v;
            continue;
        } 
         
        /// Split state (v) by making its clone (v')
        int nxt = sa_size++;
        sa[nxt].leng = sa[u].leng + 1;
        sa[nxt].minp = sa[v].minp;
        sa[nxt].link = sa[v].link;
        sa[nxt].edge = sa[v].edge;

        /// Find such state linked with it
        sa[v].link = sa[cur].link = nxt;
        for (; u != -1 && sa[u].edge[c] == v; u = sa[u].link)
            sa[u].edge[c] = nxt;
    }
}

int min_cyc(const string &s)
{
    /// Construct Suffix Automata of all string rotations
    construct(s + s);

    /// Find lexicographically minimal string rotation
    int root = 0;
    for (int i = 0; i < s.size(); ++i)
    {
   /// cout << sa[root].edge.begin() -> first; /// if you want to output such string
        root = sa[root].edge.begin() -> second;
    }

    /// Find its first occurence
    return sa[root].minp - s.size() + 1;
}

int main()
{
    string s;
    cin >> s;
    cout << min_cyc(s);
    return 0;
}

IX. Elimination Tournament Algorithm

So here are my other approachs for this problem. I dont know whether there are names for these algorithm. I just got the ideas from soccer elimination and apply to this problem. By self-researching and computing, finaly I archive linear algorithm
About the algorithm name, if one know any paper about it before, please tag me in and I will tag the first authurs of those/these algorithms
The simplest idea is that we have a list of candidate, and keep doing elimination until we have the final champion

A) Dual Elimination

1) Idea:

So the ideas is that for the initial candidate list, we will select 2 consecutive candidates, then compare two circular strings of the length equal to the minimum gap of the two candidates

2) Algorithm:

Let $$$t = s + s$$$, and the selected candidates are $$$l$$$ and $$$r$$$. For convention, let $$$l < r$$$, then we will compare $$$A$$$ and $$$B$$$, for which $$$A = t[l \dots l+(r-l)-1]$$$ and $$$B = t[r \dots r+(r-l)-1]$$$.

Case 1: $$$A < B$$$, we will select the starting point of $$$A$$$ become the next candidate, it is $$$l$$$
Case 2: $$$A > B$$$, we will select the starting point of $$$B$$$ become the next candidate, it is $$$r$$$
Case 3: $$$A = B$$$, we will select the smaller starting point of $$$A$$$ and $$$B$$$, it is $$$min(l, r)$$$

3) Example:

$$$S = $$$"$$$abaabaaabaababaaabaaababaab$$$"

Round I - 27 candidates

Candidate: 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26
Number:    012345678901234567890123456
String:    abaabaaabaababaaabaaababaab
Remain:    abaabaaabaababaaabaaababaab
Dual:      AABBCCDDEEFFGGHHIIJJKKLLMM?
Compare:   
A: a < b 
B: a = a
C: b > a
D: a = a 
E: b > a 
F: a < b 
G: a < b 
H: a = b
I: a < b
J: a = b
K: a < b
L: a < b 
M: a = b
?: b ?

Round II - 13 candidates

Candidate: 0 2 5 6 9 12 14 16 18 20 22 24 26
Number:    012345678901234567890123456
String:    abaabaaabaababaaabaaababaab
Remain:    a_a__aa__a__a_a_a_a_a_a_a_b
Dual:      A A  BB  C  C D D E E F F ?
Compare:   
A: ab > aa 
B: aa = aa
C: aa < ab
D: aa < ab
E: aa < ab
F: ab > aa
?: b ?

Round III - 7 candidates

Candidate: 2 5 9 14 18 24 26
Number:    012345678901234567890123456
String:    abaabaaabaababaaabaaababaab
Remain:    __a__a___a____a___a_____a_b
Dual:        A  A   B    B   C     C ?
Compare:   
A: aab > aaa 
B: aabab > aaaba
C: aaabab < aababa
?: b ?

Round IV - 4 candidates

Candidate: 5 14 18 26
Number:    012345678901234567890123456
String:    abaabaaabaababaaabaaababaab
Remain:    _____a________a___a_______b
Dual:           A        A   B       B
Compare:   
A: aaabaab > aaabaaa 
B: aaabab < babaab

Round V - 2 candidates

Candidate: 14 18
Number:    012345678901234567890123456
String:    abaabaaabaababaaabaaababaab
Remain:    ______________a___a________
Dual:                    A   A
Compare:   
A: aaab = aaab

Champion


Winner:  14 - V
Number:  012345678901234567890123456
String:  abaabaaabaababaaabaaababaab
Remain:  ______________a____________
Dual:                  *   
Compare: Stopped

4) Notice:

We are comparing circular string !

5) Complexity:

Normal Version: Stable Complexity
Optimized Version: Linear when characters are unique
At the $$$k$$$ elimination, the number of participants will be about $$$O(\frac{n}{2^k})$$$
At the $$$k$$$ elimination, each string will be compared $$$O(2^k)$$$ times
Hence the total complexity is $$$O(\overset{\lceil log_n \rceil}{\underset{k = 1}{\Sigma}}(\frac{n}{2^k} \times 2^k))$$$ $$$=$$$ $$$O(n \times 1 + \frac{n}{2} \times 2 + \dots + \frac{n}{2^{\lceil log_2(n) \rceil}} \times \lceil log_2(n) \rceil)$$$ $$$=$$$ $$$O(n \times \lceil log_2(n) \rceil)$$$ $$$=$$$ $$$O(n\ log\ n)$$$

6) Implementations:

Detail Dual Elimination Algorithm - O(n log n) time - O(n) auxiliary space

#include <iostream>
#include <vector>

using namespace std;

/// Finding minimal string right-rotation to make string minimal lexicographical
int min_cyc(string s)
{
    int n = s.size(); /// For convention: n = |s|

    /// Candidate list
    vector<int> a(n);
    for (int i = 0; i < n; ++i)
        a[i] = i; /// Acyclic string start from (i) might the one with minimal lexicographical

    /// For convention
    s += s;

    /// Keep doing dual-elimination until we have the final champion
    while (a.size() > 1)
    {
        /// Next Candidate List
        vector<int> b;
        for (int i = 0; i + 1 < a.size(); i += 2)
        {
            /// Notice that for convention, l < r
            int l = a[i];
            int r = a[i + 1];

            /// Compare substring of size d = r - l
            /// -  Left string: A = s[l...l+d]
            /// - Right string: B = s[r...r+d]
            ///
            /// # Case (A < B): Left string is smaller 
            ///   > We will pick (a[i]) as a winner
            ///
            /// # Case (A > B): Right string is smaller
            ///   > We will pick (a[i + 1]) as a winner
            /// 
            /// # Case (A = B): Both string are equal
            ///   > We will pick one whose starting point is smaller
            ///

            if (s.substr(l, r - l) <= s.substr(r, r - l))
                b.push_back(l);
            else 
                b.push_back(r);
        }

        /// The remain candidate that not join the elimination
        if (a.size() & 1) b.push_back(a.back());
        
        /// Do the next elimination
        swap(a, b);
    }

    /// The final winner
    return a.front();
}

int main()
{
    /// Input
    string s;
    cin >> s;

    /// Output
    cout << min_cyc(s);
    return 0;
}

Noncomment Dual Elimination Algorithm - O(n log n) time - O(n) auxiliary space

#include <iostream>
#include <vector>

using namespace std;

int min_cyc(string s)
{
    int n = s.size();
    vector<int> a(n);
    for (int i = 0; i < n; ++i)
        a[i] = i;

    s += s;
    while (a.size() > 1)
    {
        vector<int> b;
        for (int i = 0; i + 1 < a.size(); i += 2)
        {
            int l = a[i];
            int r = a[i + 1];
            if (s.substr(l, r - l) <= s.substr(r, r - l))
                b.push_back(l);
            else 
                b.push_back(r);
        }

        if (a.size() & 1) b.push_back(a.back());
        swap(a, b);
    }

    return a.front();
}

int main()
{
    string s;
    cin >> s;
    cout << min_cyc(s);
    return 0;
}

Optimized and Simple Dual Elimination Algorithm - O(n log n) time - O(n) auxiliary space

#include <algorithm>
#include <iostream>
#include <deque>

using namespace std;

int min_cyc(const string &s)
{
    char c = *min_element(s.begin(), s.end());
    int n = s.size();
    deque<int> S; 
    for (int i = 0; i < n; ++i)
        if (s[i] == c)
            S.push_back(i); 

    while (S.size() > 1)
    {
        int a = S.front(); S.pop_front();
        int b = S.front(); S.pop_front();
        if (a > b) swap(a, b);
        int res = min(a, b);
        for (int l = a, r = b, p = r - l; p >= 1; --p, ++l, ++r)
        {
            if (l == n) l = 0;
            if (r == n) r = 0;
            if (s[l] < s[r]) { res = a; break; }
            if (s[l] > s[r]) { res = b; break; }
        }
        S.push_back(res);
    }

    return S.back();
}

int main()
{
    string s;
    cin >> s;
    cout << min_cyc(s);
    return 0;
}

B) Substring Based Elimination

1) Idea:

From inital set of candidates, keep comparing substrings and eliminate the bigger one until we get the final champion

2) Algorithm:

First, let $$$mn$$$ / $$$mx$$$ is the minimal / maximal character of $$$s$$$

Define: $$$mn = \underset{c \in S}{min}(c)$$$ and $$$mx = \underset{c \in S}{max}(c)$$$

Pre-elimination Round: We take the position $$$p$$$ that $$$s[p] = mn$$$. Since all other positions wont provide Minimal Lexicographical Circular String

Define: $$$candidate = $$$ { $$$p\ \ |\ \ p \in $$$ { $$$0, 1, \dots, |s| - 1$$$ } $$$ \cap s[p] = mn$$$ }

Then we take maximumly $$$n - 1$$$ Rounds from $$$d = 1$$$ to $$$d = n - 1$$$. For all current candidate, add the next character to it. Find the minimal substring, and eliminater unsatisfied others.

3) Optimization

Optimization: At the $$$p$$$ round we will find $$$c = $$$ minimal character among all candidate's $$$pth$$$-next character, then traverse again and eliminate the one whose $$$pth$$$-nxt character is greater than $$$c$$$
Define: $$$s[x] = s[x\ mod\ |s|]$$$ and $$$c = \underset{p \in\ candidate}{min}(s[p + d])$$$ and $$$next\ candidate = $$$ { $$$p\ \ |\ \ p \in candidate \cap s[p + d] = c$$$ }

4) Example:

$$$S = $$$"$$$abaabaaabaababaaabaaababaab$$$"

Pre-elimination - 26 candidates

Candidate: 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26
Number:    012345678901234567890123456
String:    abaabaaabaababaaabaaababaab
Remain:    abaabaaabaababaaabaaababaab
Candidate Substring:
0 - a 
1 - b
2 - a
3 - a
4 - b
5 - a
6 - a
7 - a
8 - b
9 - a
10 - a
11 - b
12 - a
13 - b 
14 - a
15 - a
16 - a
17 - b
18 - a
19 - a
20 - a
21 - b
22 - a
24 - a
25 - a
26 - b
Winner: a*

Round I - 18 candidates

Candidate: 0 2 3 5 6 7 9 10 12 14 15 16 18 19 20 22 24 25 
Number:    012345678901234567890123456
String:    abaabaaabaababaaabaaababaab
Remain:    a_aa_aaa_aa_a_aaa_aaa_a_aa_
Candidate Substring:
0 - ab
2 - aa
3 - ab
5 - aa
6 - aa
7 - ab
9 - aa
10 - ab
12 - ab 
14 - aa
15 - aa
16 - ab
18 - aa
19 - aa
20 - ab
22 - ab
24 - aa
25 - ab
Winner: aa*

Round II - 9 candidates

Candidate: 2 5 6 9 14 15 18 19 24 
Number:    012345678901234567890123456
String:    abaabaaabaababaaabaaababaab
Remain:    __aa_aa__a____aa__aa____a__
Candidate Substring:
2 - aab
5 - aaa
6 - aab
9 - aab
14 - aaa
15 - aab
18 - aaa
19 - aab
24 - aab
Winner: aaa*

Round III - 3 candidates

Candidate: 5 14 18 
Number:    012345678901234567890123456
String:    abaabaaabaababaaabaaababaab
Remain:    _____a________a___a________
Candidate Substring:
5 - aaab
14 - aaab
18 - aaab
Winner: aaab*

Round IV - 3 candidates

Candidate: 5 14 18 
Number:    012345678901234567890123456
String:    abaabaaabaababaaabaaababaab
Remain:    _____a________a___a________
Candidate Substring:
5 - aaaba
14 - aaaba
18 - aaaba
Winner: aaaba*

Round V - 3 candidates

Candidate: 5 14 18 
Number:    012345678901234567890123456
String:    abaabaaabaababaaabaaababaab
Remain:    _____a________a___a________
Candidate Substring:
5 - aaabaa
14 - aaabaa
18 - aaabab
Winner: aaabaa*

Round VI - 2 candidates

Candidate: 5 14
Number:    012345678901234567890123456
String:    abaabaaabaababaaabaaababaab
Remain:    _____a________a____________
Candidate Substring:
5 - aaabaab
14 - aaabaaa
Winner: aaabaaa*

Champion

Winner:   14 - VI
Number:   012345678901234567890123456
String:   abaabaaabaababaaabaaababaab
Remain:   ______________a____________
Champion: aaabaaa*

5) Notice

After all eliminations, if there are more than one candidate, select the one with lowest index

6) Complexity

If the string is $$$k$$$-repeated strings (each substring size $$$d$$$) or similar then there will be atleast $$$k$$$ candidate after $$$d$$$ eliminations. About $$$O(k \times d)$$$ in the best cases
Best case to say, like a string where all characters are unique, then it will be Linear $$$O(n)$$$
Worst case to say, like a string where all characters are the same, then it will be $$$O(n \times n) = O(n^2)$$$

7) Implementations

For convention, let not use obviously-not-optimized code

Detail Substring Based Elimination - O(n^2) time - O(n) auxiliary space

#include <algorithm>
#include <iostream>
#include <vector>

using namespace std;

/// Finding minimal string right-rotation to make string minimal lexicographical
int min_cyc(string s)
{
    int n = s.size(); /// For convention: n = |s|
    s += s;

    /// For convention
    char mx = *max_element(s.begin(), s.end());
    char mn = *min_element(s.begin(), s.end());

    /// Candidate list
    vector<int> a;

    /// Pre-elimination
    for (int i = 0; i < n; ++i)
        if (s[i] == mn)
            a.push_back(i);

    /// Doing dth elimination
    for (int d = 1; d < n; ++d)
    {
        /// Minimal character among substrings
        char c = mx;
        for (int x : a) c = min(c, s[x + d]);

        /// New candidate list
        vector<int> b;

        /// Elimination
        for (int x : a)
            if (s[x + d] == c)
                b.push_back(x);

        swap(a, b);
        if (a.size() == 1) break; /// Found final candidate
    }

    return a.front(); /// The one with smallest index
}

int main()
{
    /// Input
    string s;
    cin >> s;

    /// Output
    cout << min_cyc(s);
    return 0;
}

Noncomment Substring Based Elimination - O(n^2) time - O(n) auxiliary space

#include <algorithm>
#include <iostream>
#include <vector>

using namespace std;

int min_cyc(string s)
{
    int n = s.size();
    s += s;

    char mx = *max_element(s.begin(), s.end());
    char mn = *min_element(s.begin(), s.end());
    vector<int> a;
    for (int i = 0; i < n; ++i)
        if (s[i] == mn)
            a.push_back(i);

    for (int d = 1; d < n; ++d)
    {
        char c = mx;
        for (int x : a) c = min(c, s[x + d]);

        vector<int> b;
        for (int x : a)
            if (s[x + d] == c)
                b.push_back(x);

        swap(a, b);
        if (a.size() == 1) break;
    }

    return a.front();
}

int main()
{
    string s;
    cin >> s;
    cout << min_cyc(s);
    return 0;
}

C) Elimination and Colliding

1) Idea:

This is the improvement from Substring Based Elimination. It takes me few hours to think and to implement the idea
If two substrings colide then one will be eliminate

2) Algorithm:

At the $$$dth$$$ elimination
Let $$$s[p] = s[p\ mod\ |s|]$$$ for convention
Let $$$l, r$$$ are the candidates and $$$l < r$$$
Let $$$A$$$ is a circular substring of $$$s$$$ and $$$A = s[l \dots l + d - 1]$$$
Let $$$B$$$ is a circular substring of $$$s$$$ and $$$B = s[r \dots r + d - 1]$$$
If $$$A$$$ and $$$B$$$ collide ($$$l + d - 1 = r$$$), we will select $$$l$$$ and eliminate $$$r$$$ (since $$$l < r$$$)

3) Proof:

Let $$$A_1, A_2, \dots, A_k$$$ are consecutive circular substring of $$$s$$$ that satisfy

$$$\Sigma(|A_i|) = |S|$$$

$$$A_i = s[l_1, r_i]$$$

$$$l_{i+1} \equiv r_i + 1$$$ (mod $$$n$$$)

No loss of generality. Let $$$A_1$$$ and $$$A_2$$$ are the substring we are comparing, for such $$$l1, l2$$$ are the candidates, then we also have $$$l_1 = min(l_i)$$$
In lexicographical order (in comparing the strings) to say

If $$$A_1 < A_2$$$ then $$$l_2$$$ will be eliminated
If $$$A_1 > A_2$$$ then $$$l_1$$$ will not be the next candidate
If $$$A_1 = A_2 = \dots = A_k$$$ then $$$min(l_i)$$$ will be the champion. Hence $$$l_1$$$ is
Else there is such $$$p$$$ that $$$A_1 = A_2 = \dots = A_p \neq A_{p+1}$$$

There is only the case $$$A_p < A_{p+1} \Rightarrow A_1A_2\dots A_p < A_2A_3 \dots A_{p+1}$$$, hence $$$l_2$$$ will be eliminated

Because for this case $$$A_1 = A_2 = A_p > A_{p+1}$$$ — the contradiction happened where $$$l_1, l_2, \dots, l_p$$$ are all candidates (Since $$$A_1 = A_2 = \dots = A_p$$$), but $$$A_{p+1}$$$ is smaller ($$$l_{p+1}$$$ should be the candidate instead

Let $$$p_1, p_2, \dots, p_k$$$ the candidates and $$$A_1, A_2, \dots, A_k$$$ the candidate circular substrings of $$$s$$$ that start from those position

$$$S = aabaaa\dots$$$ and $$$A_1 = aab$$$, $$$A_2 = aaa$$$ then $$$A_1$$$ is eliminated, $$$p_2 = 3$$$ the next candidate
$$$S = aabaac\dots$$$ and $$$A_1 = aab$$$, $$$A_2 = aac$$$ then $$$A_2$$$ is eliminated, $$$p_1 = 0$$$ the next candidate
$$$S = aabaabaab\dots aab$$$ and $$$A_1 = A_2 = \dots = A_k = aab$$$, then $$$p_1 = 0$$$ the champion
$$$S = aabaabaab\dots aabaac\dots$$$ and $$$A_1 = A_2 = \dots = A_p = aab \neq A_{p+1} = aac$$$, then $$$p_1 = 0$$$ the next candidate
$$$S = aabaabaab\dots aabaaa\dots$$$ and $$$A_1 = A_2 = \dots = A_p = aab \neq A_{p+1} = aaa$$$, then contradiction happened since $$$aaa$$$ should be the candidate instead

4) Example:

$$$S = $$$"$$$abaabaaabaababaaabaaababaab$$$"

Pre-elimination - 26 candidates

Candidate: 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26
Number:    012345678901234567890123456
String:    abaabaaabaababaaabaaababaab
Remain:    abaabaaabaababaaabaaababaab
Minimal Elimination: a*  -> 0 2 3 5 6 7 9 10 12 14 15 16 18 19 20 22 24 25 
Collide Elimination: aa* -> 0 2 5 7 9 12 14 16 18 20 22 24

Round I - 18 candidates

Candidate: 0 2 5 7 9 12 14 16 18 20 22 24 
Number:    012345678901234567890123456
String:    abaabaaabaababaaabaaababaab
Remain:    a_a__a___a__a_a___a___a_a__
Minimal Elimination: aa*   -> 2 5 9 14 18 24
Collide Elimination: aaaa* -> 2 5 9 14 18 24

Round II - 6 candidates

Candidate: 2 5 9 14 18 24 
Number:    012345678901234567890123456
String:    abaabaaabaababaaabaaababaab
Remain:    __a__a___a____a___a_____a__
Minimal Elimination: aaa*    -> 5 14 18 
Collide Elimination: aaaaaa* -> 5 14 18

Round III - 3 candidates

Candidate: 5 14 18
Number:    012345678901234567890123456
String:    abaabaaabaababaaabaaababaab
Remain:    _____a________a___a________
Minimal Elimination: aaab*     -> 5 14 18 
Collide Elimination: aaabaaab* -> 5 14

Round IV - 2 candidates

Candidate: 5 14
Number:    012345678901234567890123456
String:    abaabaaabaababaaabaaababaab
Remain:    _____a________a____________
Minimal Elimination: aaaba*      -> 5 14
Collide Elimination: aaabaaaaba* -> 5 14

Round V - 2 candidates

Candidate: 5 14
Number:    012345678901234567890123456
String:    abaabaaabaababaaabaaababaab
Remain:    _____a________a____________
Minimal Elimination: aaabaa*       -> 5 14
Collide Elimination: aaabaaaaabaa* -> 5 14

Round VI - 2 candidates

Candidate: 5 14
Number:    012345678901234567890123456
String:    abaabaaabaababaaabaaababaab
Remain:    _____a________a____________
Minimal Elimination: aaabaaa*        -> 14
Collide Elimination: aaabaaaaaabaaa* -> 14

Champion

Winner:   14 - VI
Number:   012345678901234567890123456
String:   abaabaaabaababaaabaaababaab
Remain:   ______________a____________
Champion: aaabaaa*

5) Notice:

Collision Detecting is on Circular Substring

6) Complexity:

If the string continues to grown, then they will collide and dies
Trying to extend string size $$$k$$$ mean minimize candidate list size $$$O(k \times \lfloor \frac{n}{k} \rfloor) = O(n)$$$
Trying to maximize candidate list size $$$k$$$ mean reduce the string size $$$O(f(x)) = O(k \times (\lfloor \frac{n}{k} \rfloor \times k - k)) + O(f(\lfloor \frac{k}{2} \rfloor)) = O(k\ log\ k)$$$

7) Implementations:

For convention, let not use obviously-not-optimized code

Detail Elimination and Colliding - O(n log n) time - O(n) auxiliary space

#include <algorithm>
#include <iostream>
#include <deque>

using namespace std;

/// Finding minimal string right-rotation to make string minimal lexicographical
int min_cyc(string s)
{
    int n = s.size(); /// For convention: n = |s|
    s += s;

    /// For convention
    char mx = *max_element(s.begin(), s.end());
    char mn = *min_element(s.begin(), s.end());

    /// Candidate list
    deque<int> a;

    /// Pre-elimination
    for (int i = 0; i < n; ++i) if (s[i] == mn)
        if (a.empty() || a.back() != i - 1)
            a.push_back(i);

    /// Doing dth elimination
    for (int d = 1; d < n; ++d)
    {
        /// Minimal character among substrings
        char c = mx;
        for (int x : a) c = min(c, s[x + d]);

        /// New candidate list
        deque<int> b; 

        /// Elimination
        for (int x : a) if (s[x + d] == c)
            if (b.empty() || b.back() != x - d - 1)
                b.push_back(x);

        swap(a, b);
        if (a.size() == 1) break; /// Found final candidate
    }

    return a.front(); /// The one with smallest index
}

int main()
{
    /// Input
    string s;
    cin >> s;

    /// Output
    cout << min_cyc(s);
    return 0;
}

Elimination and Colliding - O(n log n) time - O(n) auxiliary space

#include <algorithm>
#include <iostream>
#include <deque>

using namespace std;

int min_cyc(string s)
{
    int n = s.size();
    s += s;

    char mx = *max_element(s.begin(), s.end());
    char mn = *min_element(s.begin(), s.end());
    deque<int> a;
    for (int i = 0; i < n; ++i) if (s[i] == mn)
        if (a.empty() || a.back() != i - 1)
            a.push_back(i);

    for (int d = 1; d < n; ++d)
    {
        char c = mx;
        for (int x : a) c = min(c, s[x + d]);

        deque<int> b; 
        for (int x : a) if (s[x + d] == c)
            if (b.empty() || b.back() != x - d - 1)
                b.push_back(x);

        swap(a, b);
        if (a.size() == 1) break;
    }

    return a.front();
}

int main()
{
    string s;
    cin >> s;
    cout << min_cyc(s);
    return 0;
}

D) Elimination and Merging

1) Idea:

This is the improvement from Elimination and Colliding. It takes me few days to think, to prove and to implement the idea
If many substrings collide, we can eliminate all as once. And jump directly to the1
From the above proof, we also have the property that when $$$p$$$ is one of the candidates then $$$repeated-string$$$ start from $$$p$$$ might be one of the candidate

2) Algorithm:

At the $$$dth$$$ elimination, let $$$p_1, p_2, \dots, p_k$$$ are the candidates, $$$A_1, A_2, \dots, A_k$$$ are circular substring of $$$s$$$ that $$$A_1 = s[p_1 \dots p_1 + d - 1]$$$. Notice that all of them are either collide ($$$p_i + d - 1 \equiv p_{i+1}$$$ (mod $$$n$$$)) or not intersect (because the intersect pairs are eliminated at their first collision)
Let $$$v$$$ is the maximum collision or/and exist such consecutive candidates $$$q_1, q_2, \dots, q_v$$$ that $$$A_{q_1}$$$ collides $$$A_{q_2}$$$ collides $$$\dots$$$ collides $$$A_{q_v}$$$. Then $$$A_{q_1}A_{q_2} \dots A_{q_v}$$$ is the best candidates up to the $$$(d \times v)$$$-th elimination.
Therefore, instead of eliminating all the weaker candidates, we can just merge them and teleport to the next elimination that they collides again

3) Proofs:

I think you can prove this based on Elimination and Colliding and Duval Lyndon Approach.
Good thing is that this will also prove the linear complexity of the algorithm

4) Example:

$$$S = $$$"$$$abaabaaabaababaaabaaababaab$$$"

Pre-elimination - 26 candidates

Candidate: 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26
Number:    012345678901234567890123456
String:    abaabaaabaababaaabaaababaab
Remain:    abaabaaabaababaaabaaababaab
Minimal Elimination: a*           -> 0 2 3 5 6 7 9 10 12 14 15 16 18 19 20 22 24 25 
Merging Elimination: aaa* = (a)3* -> 5 14 18
Teleport Tournament: III

Round III - 3 candidates

Candidate: 5 14 18
Number:    012345678901234567890123456
String:    abaabaaabaababaaabaaababaab
Remain:    _____a________a___a________
Minimal Elimination: aaab*                           -> 5 14 18 
Merging Elimination: aaabaaab = (aaab)2* = ((a)3b)2* -> 14
Teleport Tournament: VIII

Champion

Winner:   14 - VIII
Number:   012345678901234567890123456
String:   abaabaaabaababaaabaaababaab
Remain:   ______________a____________
Champion: aaabaaab*

5) Notice:

If the candidates arent changed, teleport to next tournament not itself
The last candidate might merge with the first candidate
When merging, you should use modulo operator for safety
Because of its complex code, you can just ignore pre-elimination and play round $$$d = 1$$$ to $$$n$$$

6) Complexity:

It seem to be about $$$O(n\ log\ n)$$$ when each time the candidates is eliminate atleast a half
But actually the complexity is Linear $$$O(n)$$$ because every position is called maximumly once for preprocessing, twice times for comparings, thrice for merging only.

7) Implementations:

Detail Elimination and Merging - O(n) time - O(n) auxiliary space


#include <iostream>
#include <vector>

using namespace std;

/// Finding minimal string right-rotation to make string minimal lexicographical
int min_cyc(string s)
{
    /// For conventions:
    char mx = 0;
    for (char c : s)
    {
        mx = max(mx, c);
    }

    int n = s.size();
    s += s;

    /// Candidate list
    vector<int> a;
    for (int p = 0; p < n; ++p)
        a.push_back(p);

    /// dth elimination
    for (int d = 1; d <= n; ++d)
    {
        /// Minimal character among substrings
        char mn = mx;
        for (int p : a)
        {
            mn = min(mn, s[p + d - 1]);
        } 
        
        /// Longest Duplicated Minimal |d|-length Substring
        int k = 1;
        vector<int> v, f;
        for (int p : a)
        {
            /// This is not a minimal substring
            if (mn == s[p + d - 1])
            {
                /// Elimination
                if (v.empty() || (v.back() + 1LL * f.back() * d) % n != p) /// Maybe next candidate
                {
                    v.push_back(p);
                    f.push_back(1);
                }
                else 
                {
                    k = max(k, ++f.back()); /// Merging Case
                }
            }
        }

        /// Merge the last with the front if possible
        if ((v.back() + 1LL * f.back() * d) % n == v.front())
        {
            k = max(k, f.back() += f.front());
        }

        /// Next candidate list
        a.clear();
        for (int i = 0; i < v.size(); ++i)
        {
            if (f[i] == k) /// Ignore less repeated substrings since they arent optimal 
            {
                a.push_back(v[i]);
            }
        }

        /// Junp to the next tournament
        d = d * k;

        /// Found the winner
        if (a.size() == 1) break;
    }

    /// Return the winner
    return a.front();
}

int main()
{
    /// Input
    string s;
    cin >> s;

    /// Output
    cout << min_cyc(s);
    return 0;
}

Noncomment Elimination and Merging - O(n) time - O(n) auxiliary space


#include <iostream>
#include <vector>

using namespace std;

int min_cyc(string s)
{
    char mx = 0;
    for (char c : s)
    {
        mx = max(mx, c);
    }

    int n = s.size();
    s += s;

    vector<int> a;
    for (int p = 0; p < n; ++p)
        a.push_back(p);

    for (int d = 1; d <= n; ++d)
    {
        char mn = mx;
        for (int p : a)
        {
            mn = min(mn, s[p + d - 1]);
        } 
        
        int k = 1;
        vector<int> v, f;
        for (int p : a)
        {
            if (mn != s[p + d - 1]) continue;
            if (v.empty() || (v.back() + 1LL * f.back() * d) % n != p)
            {
                v.push_back(p);
                f.push_back(1);
            }
            else 
            {
                k = max(k, ++f.back());
            }

        }

        if ((v.back() + 1LL * f.back() * d) % n == v.front())
            k = max(k, f.back() += f.front());

        d = d * k;

        a.clear();
        for (int i = 0; i < v.size(); ++i)
        {
            if (f[i] == k)
            {
                a.push_back(v[i]);
            }
        }

        if (a.size() == 1) break;
    }

    return a.front();
}

int main()
{
    string s;
    cin >> s;
    cout << min_cyc(s);
    return 0;
}

8) Optimization

Since every position $$$p$$$, will in range $$$0 \dots 2 \times n - 1$$$, hence we can precalculate the modulo of each

Optimized Elimination and Merging - O(n) time - O(n) auxiliary space

#include <algorithm>
#include <iostream>
#include <vector>

using namespace std;

#define all(x) (x).begin(), (x).end()
int min_cyc(const string &s)
{
    char mx = *max_element(all(s));
    int n = s.size();

    vector<int> a(n);
    for (int i = 0; i < n; ++i) 
        a[i] = c[i] = c[i + n] = i;
    
    for (int d = 1, k = 1; d <= n; d = k + 1)
    {
        char mn = mx;
        for (int p : a) mn = min(mn, s[c[p + d - 1]]);
        
        vector<int> v;
        for (int p : a) if (mn == s[c[p + d - 1]])
        {
            if (v.empty() || c[v.back() + f[v.back()]] != p)
                v.push_back(p), f[v.back()] = 0;
            
            k = max(k, f[v.back()] += d);
        }

        if (c[v.back() + f[v.back()]] == v.front())
            k = max(k, f[v.back()] += f[v.front()]);

        a.clear();
        for (int p : v) if (f[p] == k) a.push_back(p);
        if (a.size() == 1) return a.front();
    }
}

int main()
{
    string s;
    cin >> s;
    cout << min_cyc(s);
    return 0;
}

Full text and comments »

#string, #editorial, #lexicographical, #kmp, #hashing, #sqrt decomposition, #implementation, #lyndon_duval, #suffix_array, #automaton, #tournament_elimination, #booth

+106

SPyofgame
4 years ago
12

Simple Linear and Effectively Duval Algorithm for Lyndon Factorization

By SPyofgame, history, 4 years ago, In English

Table of content

Teleporter	Description
I. Lyndon Definitions	Definitions of Lyndon word, Lyndon factorization, ...
II. Duval Algorithm	Talk about the duval algorithm and how it works
III. Real Life Applications	Motivation to learn the algorithm
IV. Programming Applications	The code is short and simple to implement
V. My Questions	Are there any other applications ?
...................................................................	..........................................................................................................................

I. Lyndon Factorization

1) String Concatenation

Definition: The operation of joining character strings end-to-end
Property I: Non-empty string $$$S$$$ is a concatenation of all substrings of itself
Property II: Non-empty string $$$S$$$ is a concatenation of any empty string at any position in itself with itself
Property III: Let $$$A, B, C$$$ the non-empty string then $$$A + B + C = A + (B + C) = (A + B) + C$$$
For convention, let define the operator + is string concatenation

2) Lyndon Word

Definition: A nonempty string that is strictly smaller in lexicographic order than all of its rotations.
Property I: Lyndon word is nonempty and lexicographically strictly smaller than any of its proper suffixes.
Property II: Let $$$S, T, Z$$$ is nonempty word. $$$S$$$ is Lyndon word if $$$S < T\ \ \ \forall\ S = Z + T$$$
Property III: Lyndon word cant be factorized. It means that the only factor in its factorization is itself.

3) Lyndon Factorization

Definition: Split the string into many lyndon words in such a way that the words in the sequence are in lexicographically non-increasing order.
Property I: For $$$s = s_1 + s_2 + \dots + s_k$$$ where $$$s_1, s_2, \dots, s_k$$$ are lyndon words and in non-increasing order ($$$s1 \geq s2 \geq \dots \geq s_k$$$)
Property II: Lyndon factorization is unique.
Property III: The last Lyndon Factor is Lexicographically Smallest Suffix of the string

4) Least Starting Position (LSP)

Definition: The minimal position of some substrings that make it LMSR.
Property I: Let $$$X$$$ the substring of $$$S$$$ that its starting position $$$p$$$ is LSP. Then some Lyndon Factors in $$$X$$$ has the LSP $$$p$$$
Property II: $$$K$$$-repeated String, where each substring has size $$$L$$$ then there are $$$K$$$ LSP: $$$0, L, 2L, \dots, (K-1)L$$$
Property III: The Circular String start from LSP of given string is Lexicographically Minimal String Rotation

II. Duval Algorithm

Exist Time: 1983
Duval algorithm is an effecient algorithm for listing the Lyndon words of length at most $$$n$$$ with a given alphabet size $$$s$$$ in lexicographic order in $$$O(n)$$$
The position of the last Lyndon factorized word from Duval algorithm provides minimum cyclic string

The pseudo algorithm

Implementation - O(n) Time - O(1) Auxiliary Space

#include <iostream>

using namespace std;

void duval(const string &s)
{
    int n = s.size();
    for (int l = 0; l < n; )
    {
        int r = l, p = l + 1;
        for (; r < n && s[r] <= s[p]; ++r, ++p)
            if (s[r] < s[p]) r = l - 1;

        while (l <= r)
        {
            cout << s.substr(l, p - r) << '\n';
            l += p - r;
        }
    }
}

int main()
{
    string s;
    cin >> s;
    duval(s);
    return 0;
}

Detail Implementation - O(n) Time - O(1) Auxiliary Space


#include <iostream>

using namespace std;

/// Factorize all lyndon word in s
void duval(const string &s)
{
    int n = s.size();

    ///
    /// s = s1 + s2 + s3
    /// s1 = s[1..l-1] is handled
    /// s2 = s[l..r]   is handling
    /// s3 = s[p..n]   is going to be handled
    /// 

    for (int l = 0; l < n; )
    {
        /// Extend as much as possible lyndon word s2 = s[l..r]
        int r = l, p = l + 1;
        while (r < n)
        {
            /// (s2 + s[p]) is not a lyndon word
            if (s[r] > s[p]) 
            {
                break;
            }

            /// (s2 + s[p]) is stil a lyndon word, hence extend s2
            if (s[r] == s[p]) 
            {
                ++r;
                ++p;
                continue;
            }
            
            /// (s2 + s[p]) is a repeated string of Lyndon words
            if (s[r] < s[p]) 
            {
                r = l;
                ++p;
                continue;
            }
        }

        /// The lyndon word may have the form of s2 = sx + sx + .. + sx + sy like "1231231231"
        while (l <= r) 
        {
            /// s[l..l + p - r] is sx
            cout << s.substr(l, p - r) << '\n';
            l += p - r;
        }
    }
}

int main()
{
    string s;
    cin >> s;
    duval(s);
    return 0;
}

III. Real Life Applications

1) Finger print identification:

We can encode the finger print into many detailed circular strings. How to search such finger print again from those in very huge data base ? Circular comparision using lyndon factorization is requried.

2) Biological genetics:

In some cases, we need to know whether these two group's genetics are belonged to the same bacteria, virus.

3) Games:

Well, ofcourse we can apply the algorithm in some language/words-related games

IV. Programming Applications

1) Least rotation to make smallest lexicographical ordered string.

The problem:

Given a string $$$S$$$ of size $$$N$$$
A right-rotation is that move the leftmost character to rightmost of $$$S$$$
Find the least right-rotation to make $$$S$$$ become the smallest lexicographical ordered string

Important Property: After those right-rotations, the string will be Minimum Acyclic String

The solution:

One thing we can come in mind is to use hash or string algorithms in $$$O(n\ log\ n)$$$, but they are kinda complex
Some other approachs can be found here

Bruteforces Solution: Let $$$t = s + s$$$. Then for each substring of size $$$|s|$$$, we compare and find the smallest

Bruteforces Solution - O(n^2) Time - O(n) Auxiliary Space

#include <iostream>

using namespace std;

int main()
{
    string s;
    cin >> s;
    int n = s.size();
    s += s;

    int t = 0;
    for (int i = 1; i < n; ++i)
        if (s.substr(i, n) < s.substr(t, n))
            t = i;

    cout << t;
    return 0;
}

Optimized Bruteforces Solution - O(n) to O(n^2) Time - O(n) Auxiliary Space

#include <iostream>

using namespace std;

int main()
{
    string s;
    cin >> s;
    int n = s.size();
    s += s;

    int t = 0;
    for (int i = 1; i < n; ++i)
    {
        int cmp = 0; /// EQUAL
        for (int p = n, l = t, r = i; p > 0; --p, ++l, ++r)
        {
            if (s[l] < s[r]) { cmp = -1; break; } ///  LESS 
            if (s[l] > s[r]) { cmp = +1; break; } /// GREATER
        }

        if (cmp == +1) t = i;
    }

    cout << t;
    return 0;
}

Duval Solution: We can apply lyndon factorization with a little bit observation

Detail Duval Solution - O(n) Time - O(n) Auxiliary Space

The idea is that when we factorize duplicated string $$$t = s + s$$$
Then the answer will be a substring of maximum starting position $$$p$$$ not exceed $$$|s|$$$
The proves is already inside the code

#include <algorithm>
#include <iostream>

using namespace std;

/// Find starting position of minimum acyclic string in (s)
int min_cyc(string s)
{
    int n = s.size(); /// the real size of the string
    s += s; /// for convention since we are deadling with acyclic

    ///
    /// s = s1 + s2 + s3
    /// s1 = s[1..l-1] is handled
    /// s2 = s[l..r]   is handling
    /// s3 = s[p..n]   is going to be handled
    /// 

    int res = 0; /// minimum acyclic string
    /// while (s2) is a lyndon word, try to add s2 with s[p]
    for (int l = 0; l < n; )
    {
        ///
        /// - Case 1: 
        ///     If (s) is fully ordered, then return 0
        ///     Surely will this loop make [l..r] = [0..n-1]
        ///     Ans it is currently that (l = 0) 
        ///     => res = l is a correct answer
        ///
        /// - Case 2:
        ///     Minimum acyclic string s' = s[l..r] that 0 <= l < n <= r < 2n
        ///     Also if s2 is s', then the loop will extend its (r >= n)
        ///     Since l < n, the latest (l) will create s'    
        ///     => res = l is a correct answer
        /// 
        /// Hence in both cases, res = last(l) will return a correct answer
        ///
        res = l;

        /// Extend as much as possible lyndon word s2 = s[l..r]
        int r = l, p = l + 1;
        while (p < s.size())
        {
            /// (s2 + s[p]) is not a lyndon word
            if (s[r] > s[p]) 
            {
                break;
            }

            /// (s2 + s[p]) is stil a lyndon word, hence extend s2
            if (s[r] == s[p]) 
            {
                ++r;
                ++p;
                continue;
            }
            
            /// (s2 + s[p]) is a lyndon word, but it may be a repeated string
            if (s[r] < s[p]) 
            {
                r = l;
                ++p;
                continue;
            }
        }

        /// The lyndon word may have the form of s2 = sx + sx + .. + sx like "12312123"
        while (l <= r) 
        {
            /// s[l..l + p - r] is sx
            l += p - r;
        }
    }

    /// Dont forget to return the value ;)
    return res;
}

/// If you wanna know about that minimum acyclic string
string cyc(string s, int k)
{
    rotate(s.begin(), s.begin() + k, s.end());
    return s;
}

int main()
{
    /// Input 
    string s;
    cin >> s;

    /// Output
    cout << min_cyc(s) << '\n';
//  cout << cyc(s, min_cyc(s));
    return 0;
}

None-comment Duval Solution - O(n) Time - O(n) Auxiliary Space

#include <iostream>

using namespace std;

int min_cyc(string s)
{
    int n = s.size();
    s += s;

    int res = 0;
    for (int l = 0; l < n; )
    {
        res = l;
        int r = l, p = l + 1;
        for (; p < s.size() && s[r] <= s[p]; ++r, ++p)
            if (s[r] < s[p]) r = l - 1;

        while (l <= r) l += p - r;
    }

    return res;
}

int main()
{
    string s;
    cin >> s;
    cout << min_cyc(s) << '\n';
    return 0;
}

Optimized Duval Solution - O(n) Time - O(1) Auxiliary Space

#include <iostream>
    
using namespace std;
    
int min_cyc(const string &s) 
{
    int n = s.size();
    int res = 0;
    for (int l = 0; l < n; )
    {
        res = l;
        int r = l, p = l + 1;
        for (; r < n; ++r, ++p) /// If there is such string found, then its length wont exceed |s|
        {
            char c = (p < n) ? s[p] : s[p - n]; /// to avoid modulo
            if (s[r] > c) break;
            if (s[r] < c) r = l - 1;
        }        
        l = max(r, l + p - r); /// just skip those (s2 = sx + sx + ... + sx + sy) cases
    }
    return res;
}

int main()
{
    string s;
    cin >> s;
    cout << min_cyc(s);
    return 0;
}

Practices Problem:

V. My Question

1) Are there any other programming applications for Lyndon Factorization ?

The algorithm is simple to code while running pretty fast as its low complexities. It would be kinda sad if there is only one main application, isnt it :(

2) Are there any other problems for Lyndon Factorization ?

To remember something better and understand the algorithm deeper, we need to practice right :D It would be nice if there are some more problems

Full text and comments »

string algorithm, lyndon words, lyndon factorization, duval algorithm

+137

SPyofgame
4 years ago
5

Codeforces SPoilers Error

By SPyofgame, history, 4 years ago, In English

The problem

So I have try to post a blog, but then I realize that the text inside spoilers is outsides even if the spoiler is still closed

I searched on google and codeforces and tried several ways people say like

Use other browsers
Add small space, tab, empty char add the end of summary=""
Add small texts near (above / below) the spoiler
Add some empty lines under <spoiler > or/and over </spoiler>
Use ~~~ instead of ```cpp

But sadly none of them work. However, for somewhat a reason that sometimes the spoilers arent broken though they are similar

Do someone know how to fix it? Thank you <3

UPDATE 0: I finnaly figured out that if the spoiler has it (last line) = (a line with - at its head) will destroy all spoilers below

UPDATE 1: It is so weird that using my old blog but add extra word or remove old word can still break the spoilers :( And the spoilers are now all broken again

Full text and comments »

SPyofgame
4 years ago
0

Knapsack the tutorial

By SPyofgame, history, 4 years ago, In English

Teleporter: [Previous] | | | [Next]

Table of content

This blog isnt the best but worth spending time to read it

Teleporter	Description
I. STATEMENT	Taking about the problem
II. EXAMPLE	To understand the problem better
III. Solution for small number of element — N	How much will you get in each possible subset ?
IV. Solution for small sum of weight — C[i]	What is the maximum value possible when your bag is exact $$$W$$$ weight ?
V. Solution for small sum of value — V[i]	What is the minimum bag weight possible when it is exact $$$S$$$ sum value ?
VI. Tracing for selected elements	Which next state will lead to the best result ?
VII. Other solutions	How to solve the problem with special condition ?
VIII. Online Algorithm	How to solve the problem when you need to output the result whenever you receive a new item ?
IX. Optimizations and Heuristic	How to improve the algorithm faster, shorter, simpler, safetier or saving space
X. Debugging	Support you when you are in a trouble that you cant find your bug
XI. Knapsack Variation and Practice Problems	In case you need a place to practice or submitting
XII. Blog status	The current progress and contributor of this blogs

――――――――――――――――――――――――――――――――――――――――――――――――――――――――――――――――

Teleporter: [Previous] | | | [Next]

I. STATEMENT

Taking about the problem

Problem: During midnight, a thief breaks into a jewelry shop. There are $$$N$$$ priceful items whose value and weight are known. The item $$$p$$$ can be sold for $$$V_p$$$ money but having $$$C_p$$$ weight. There is a bag with infinity amount of space, means that any item can be pinto it while the item's value in the bag remain unchanged ! But, it can only hold maximumly $$$W$$$ weight.

Question: What is the value $$$V$$$ that the thief can steal from the shop.

――――――――――――――――――――――――――――――――――――――――――――――――――――――――――――――――

Teleporter: [Previous] | | | [Next]

II. EXAMPLE

To understand the problem better

Input

Output

Explanation:

There are 8 possible cases
{} -> 0 value, 0 weight
{1} -> 10 value, 2 weight
{2} -> 20 value, 4 weight
{3} -> 30 value, 6 weight
{1, 2} -> 30 value, 6 weight
{1, 3} -> 40 value, 8 weight - optimal
{2, 3} -> 50 value, 10 weight - invalid weight
{1, 2, 3} -> 60 value, 12 weight - invalid weight

――――――――――――――――――――――――――――――――――――――――――――――――――――――――――――――――

Teleporter: [Previous] | | | [Next]

III. Solution for small number of element — N

How much will you get in each possible subset ?

A. Permutation Approach (Bad) — $$$O(n!)$$$ time — $$$O(n)$$$ space

For each possible permutation, pick elements until it weight too much
The result is the maximum value sum, for which weight sum is not greater than $$$W$$$

Permutation Approach - O(n!) time - O(n) space

#include <algorithm>
#include <iostream>
#include <cstring>
#include <numeric>
#include <cmath>

using namespace std;

template<typename T> void maximize(T &res, const T &val) { if (res < val) res = val; }
template<typename T> void minimize(T &res, const T &val) { if (res > val) res = val; }

typedef long long ll;
/// ====*====*====*====*====*====*====*====*====*====*====*====*====*====*====*====

int main()
{
    int n, w;
    cin >> n >> w;

    int c[n], v[n];
    for (int i = 0; i < n; ++i)
        cin >> c[i] >> v[i];

    int p[n];
    iota(p, p + n, 0);

    ll res = 0;
    do {
        ll sum_weight = 0;
        ll sum_value = 0;
        for (int i = 0; i < n; ++i)
        {
            int weight = c[p[i]];
            int value = v[p[i]];

            sum_weight += weight;
            sum_value += value;
            if (sum_weight > w) 
            {
                break;
            }
            else
            {
                maximize(res, sum_value);
            }
        }

    }
    while (next_permutation(p, p + n));

    cout << res;
    return 0;
}

B. Bitmasking Approach (Good) — $$$O(2^n \times n)$$$ time — $$$O(n)$$$ space

Because the order isnt important, we just need to test all every possible subset
The result is the maximum value sum, for which weight sum is not greater than $$$W$$$

Bitmasking Approach - O(2^n * n) time - O(n) space

#include <iostream>
#include <cstring>
#include <cmath>

using namespace std;

template<typename T> void maximize(T &res, const T &val) { if (res < val) res = val; }
template<typename T> void minimize(T &res, const T &val) { if (res > val) res = val; }

typedef long long ll;
/// ====*====*====*====*====*====*====*====*====*====*====*====*====*====*====*====

int main()
{
    int n, w;
    cin >> n >> w;

    int c[n], v[n];
    for (int i = 0; i < n; ++i)
        cin >> c[i] >> v[i];

    ll res = 0;
    int lim = 1 << n;
    for (int mask = 0; mask < lim; ++mask)
    {
        int weight = 0;
        ll value = 0;
        for (int i = 0; i < n; ++i)
        {
            if (mask >> i & 1)
            {
                weight += c[i];
                value += v[i];
                if (weight > w) break;
            }
        }

        if (weight <= w)
        {
            maximize(res, value);
        }
    }

    cout << res << endl;
    return 0;
}

C. Meet-in-the-middle Approach (Better) — $$$O(2^{^{\frac{n}{2}}} \times \frac{n}{2})$$$ time — $$$O(2^{^{\frac{n}{2}}})$$$ space

Split the array into two halves $$$L$$$ and $$$R$$$. In each half, we will calculate every possible subsets. And in each subset we store a pair of $$$(value\ sum, weight\ sum)$$$
For each element $$$X(value_X, weight_X) \in L$$$, we need to find suitable element $$$Y(value_Y, weight_Y) \in R$$$ that satisfying maximum $$$value_R$$$ and $$$weight_L + weight_R \leq W$$$
Therefore, we can sort all the $$$R$$$ set by increasing weight. Let $$$maxval_Y = max(value_E | E \in R, weight_E \leq weight_Y)$$$. Then for each $$$X \in L$$$, we can find its suitable $$$Y$$$ by binary search in $$$O(log\ |R|)$$$ with $$$O(|R|)$$$ precalculation

Meet in the middle approach - O(2^(n/2) * (n/2)) time - O(2^(n/2)) space

#include <algorithm>
#include <iostream>
#include <cstring>
#include <vector>
#include <cmath>

using namespace std;

template<typename T> void maximize(T &res, const T &val) { if (res < val) res = val; }
template<typename T> void minimize(T &res, const T &val) { if (res > val) res = val; }

#define all(x) (x).begin(), (x).end()
typedef long long ll;
/// ====*====*====*====*====*====*====*====*====*====*====*====*====*====*====*====

struct Node 
{
    ll maxval = 0;

    ll value;
    int weight;
    Node (ll value = 0, int weight = 0)
    : value(value), weight(weight) {}
};

int n, w;
void solve(const vector<int> &c, const vector<int> &v, vector<Node> &S)
{
    int n = c.size(); /// Important !!!
    int lim = 1 << n;
    for (int mask = 0; mask < lim; ++mask)
    {
        ll weight = 0;
        ll value = 0;
        for (int i = 0; i < n; ++i)
        {
            if (mask >> i & 1)
            {
                weight += c[i];
                value += v[i];
                if (weight > w) break;
            }
        }

        if (weight <= w)
        {
            S.push_back(Node(value, weight));
        }    
    }
}

int main()
{
    cin >> n >> w;

    int c[n], v[n];
    for (int i = 0; i < n; ++i)
        cin >> c[i] >> v[i];

    int m = n / 2;
    vector<int> cl, cr;
    vector<int> vl, vr;
    for (int i = 0; i < n; ++i)
    {
        if (i < m)
        {
            cl.push_back(c[i]);
            vl.push_back(v[i]);
        }
        else 
        {
            cr.push_back(c[i]);
            vr.push_back(v[i]);
        }
    }

    vector<Node> Sl, Sr;
    solve(cl, vl, Sl);
    solve(cr, vr, Sr);

    sort(all(Sr), [](const Node &a, const Node &b) {
        return (a.weight != b.weight) ? a.weight < b.weight : a.value < b.value;
    });

    ll maxval = 0;
    for (Node &x : Sr)
    {
        maximize(maxval, x.value);
        x.maxval = maxval;
    }

    ll res = 0;
    for (Node &y : Sl)
    {
        for (int l = 0, r = int(Sr.size()) - 1; l <= r; )
        {
            int m = (l + r) >> 1;
            Node x = Sr[m];
            if (x.weight + y.weight <= w)
            {
                maximize(res, x.maxval + y.value);
                l = m + 1;
            }
            else 
            {
                r = m - 1;
            }
        }
    }

    cout << res;
    return 0;
}

――――――――――――――――――――――――――――――――――――――――――――――――――――――――――――――――

Teleporter: [Previous] | | | [Next]

IV. Solution for small sum of weight — C[i]

What is the maximum value possible when your bag is exact $$$W$$$ weight ?

A) Recursive Dynamic Programming — $$$O(N \times W)$$$ time — $$$O(N \times W)$$$ space

Memorization:
- f[i][s] = magic(int i, int s) stand for using from the $$$ith$$$ items, with the total weight of $$$s$$$ that maximum value is $$$f[i][s]$$$
- All $$$f[i][s]$$$ init as $$$-1$$$
Base cases
- If ($$$s > w$$$) then $$$v = -oo$$$ since we use more than what the bag can hold
- If ($$$i \geq n$$$) then $$$v = 0$$$ since there is no available item, so no weight added into the bag
Transistion
- Using current item, it will be $$$A = magic(i + 1, s + c_i) + v_i)$$$ — move to next item, weight is added with $$$c_i$$$, value is increased by $$$v_i$$$
- Not using current item, it will be $$$B = magic(i + 1, s + 0) + 0)$$$ — move to next item, weight is remained, value is not increased
- We want the maximum value so $$$magic(int\ i, int\ s) = max(A, B)$$$
The final result: $$$result = magic(1, 0)$$$ — starting from first item with $$$0$$$ weighted bag

Recursive Approach - O(NW) time - O(NW) space

#include <iostream>
#include <cstring>
#include <cmath>

using namespace std;

template<typename T> void maximize(T &res, const T &val) { if (res < val) res = val; }
template<typename T> void minimize(T &res, const T &val) { if (res > val) res = val; }

typedef long long ll;
const int INF = 0x3f3f3f3f;
const ll LINF = 0x3f3f3f3f3f3f3f3f;
const int MAXN = 101;
const int MAXW = 101010;
/// ====*====*====*====*====*====*====*====*====*====*====*====*====*====*====*====

int n, w;
int c[MAXN];
int v[MAXN];
ll f[MAXN][MAXW];
ll magic(int i = 1, int s = 0)
{
    if (s > w) return -LINF; /// Using too much weight
    if (i > n) return 0;     /// No available item to add into the bag

    ll &res = f[i][s];
    if (res != -1) return res;
        
    maximize(res, magic(i + 1, s + 0) + 0);       /// Not using this item
    maximize(res, magic(i + 1, s + c[i]) + v[i]); /// Using this item
    return res;
}

int main()
{
    cin >> n >> w;
    for (int i = 1; i <= n; ++i)
        cin >> c[i] >> v[i];

    memset(f, -1, sizeof(f));
    cout << magic();
    return 0;
}

B) Iterative Dynamic Programming — $$$O(N \times W)$$$ time — $$$O(N \times W)$$$ space

Memorization:
- f[i][s] stand for using from the $$$ith$$$ items, with the total weight exact $$$s$$$ that maximum value is $$$f[i][s]$$$
- All $$$f[i][s]$$$ init as $$$0$$$ not $$$-1$$$
Base cases:
- $$$\forall x \geq 0, f[0][x] = 0$$$ — using no item, hence return no value
- $$$\forall x \geq 0, f[x][0] = 0$$$ — having no weight, hence no using item
- $$$\forall x > 0, y < 0, f[x][y] = -oo$$$ — define it as negative infinity for easier calculation
Transistion:
- Using current item, $$$A = \underset{0 \leq t + c_i \leq s}{\underset{j \leq i}{max}}(f[j][t]) + v[i] = \underset{0 \leq t = s - c_i}{\underset{j \leq i}{max}}(f[j][t]) + v[i] = \underset{0 \leq t = s - c_i}{\underset{j = i - 1}{f[j][t]}} + v[i]$$$ maximum value among all previous bags added to current item
- Not using current item, it will be $$$B = \underset{0 \leq t + 0 \leq s}{\underset{j \leq i}{max}}(f[j][t]) + 0 = \underset{0 \leq t = s}{\underset{j \leq i}{max}}(f[j][t]) + 0 = \underset{0 \leq t = s}{\underset{j = i - 1}{f[j][t]}} + 0$$$ — move to next item, weight is remained, value is not increased
- We want the maximum value so $$$f[i][s] = max(A, B) = max(f[i - 1][s], f[i - 1][s - c_i] + v_i)$$$
The final result: $$$result = \underset{0 \leq s \leq w}{max}(f[n][s])$$$ — starting from first item with $$$0$$$ weighted bag

Bad Approach - O(N^2 * W^2) time

#include <iostream>
#include <cstring>
#include <cmath>

using namespace std;

template<typename T> void maximize(T &res, const T &val) { if (res < val) res = val; }
template<typename T> void minimize(T &res, const T &val) { if (res > val) res = val; }

typedef long long ll;
/// ====*====*====*====*====*====*====*====*====*====*====*====*====*====*====*====

int main()
{
    int n, w;
    cin >> n >> w;

    int c[n + 1], v[n + 1];
    for (int i = 1; i <= n; ++i)
        cin >> c[i] >> v[i];

    ll res = 0;
    ll f[n + 1][w + 1];
    memset(f, 0, sizeof(f));
    for (int i = 1; i <= n; ++i)
    {
        for (int s = 1; s <= w; ++s)
        {
            for (int j = 1; j < i; ++j)
            {
                for (int t = 0; t <= s; ++t)
                {
                    maximize(f[i][s], f[i][t] + 0);
                }
                
                for (int t = 0; t + c[i] <= s; ++t)
                {
                    maximize(f[i][s], f[i - 1][t] + v[i]);
                }
            }

            maximize(res, f[i][s]);
        }
    }

    cout << res;
    return 0;
}

Iterative Approach - O(NW) time - O(NW) space

#include <iostream>
#include <cstring>
#include <cmath>

using namespace std;

template<typename T> void maximize(T &res, const T &val) { if (res < val) res = val; }
template<typename T> void minimize(T &res, const T &val) { if (res > val) res = val; }

typedef long long ll;
/// ====*====*====*====*====*====*====*====*====*====*====*====*====*====*====*====

int main()
{
    int n, w;
    cin >> n >> w;

    int c[n + 1], v[n + 1];
    for (int i = 1; i <= n; ++i)
        cin >> c[i] >> v[i];

    ll f[n + 1][w + 1];
    memset(f, 0, sizeof(f));
    for (int i = 1; i <= n; ++i)
    {
        for (int s = 1; s <= w; ++s)
        {
            f[i][s] = f[i - 1][s];
            if (s >= c[i])
            {
                maximize(f[i][s], f[i - 1][s - c[i]] + v[i]);
            }
        }
    }

    ll res = 0;
    for (int s = 0; s <= w; ++s)
        maximize(res, f[n][s]);

    cout << res;
    return 0;
}

Prefixmax DP Approach - O(NW) time - O(NW) space

#include <iostream>
#include <cstring>
#include <cmath>

using namespace std;

template<typename T> void maximize(T &res, const T &val) { if (res < val) res = val; }
template<typename T> void minimize(T &res, const T &val) { if (res > val) res = val; }

typedef long long ll;
/// ====*====*====*====*====*====*====*====*====*====*====*====*====*====*====*====

int main()
{
    int n, w;
    cin >> n >> w;

    int c[n + 1], v[n + 1];
    for (int i = 1; i <= n; ++i)
        cin >> c[i] >> v[i];

    ll f[n + 1][w + 1];
    memset(f, 0, sizeof(f));
    for (int i = 1; i <= n; ++i)
    {
        for (int s = 1; s <= w; ++s)
        {
            f[i][s] = max(f[i][s - 1], f[i - 1][s]);
            if (s >= c[i])
            {
                maximize(f[i][s], f[i - 1][s - c[i]] + v[i]);
            }
        }
    }

    cout << f[n][w];
    return 0;
}

C) Recursive Dynamic Programming (Space optimization) — $$$O(N \times W)$$$ time — $$$O(N + W)$$$ space

A) O(2W) DP space

Observe: $$$\forall i > 0, f[i][x]$$$ depends on $$$f[i - 1]$$$ and $$$f[i]$$$ only, hence we just need 2 dp array space
Define: When we calculate at pth element, we have $$$\underset{x \equiv p (mod 2)}{f[x]}$$$ is current dp array, $$$\underset{y \equiv p + 1 (mod 2)}{f[y]}$$$ is previous dp array
Transistion: $$$f[i][s] = max(f[i - 1][s], f[i - 1][s - c_i] + v_i)$$$ equivalent to $$$f[x][s] = max(f[y][s], f[y][s - c_i] + v_i)$$$

Space Optimization Approach - O(NW) time - O(N + 2W) space

#include <iostream>
#include <cstring>
#include <cmath>

using namespace std;

template<typename T> void maximize(T &res, const T &val) { if (res < val) res = val; }
template<typename T> void minimize(T &res, const T &val) { if (res > val) res = val; }

typedef long long ll;
/// ====*====*====*====*====*====*====*====*====*====*====*====*====*====*====*====

int main()
{
    int n, w;
    cin >> n >> w;

    int c[n + 1], v[n + 1];
    for (int i = 1; i <= n; ++i)
        cin >> c[i] >> v[i];

    ll f[2][w + 1];
    memset(f, 0, sizeof(f));
    for (int i = 1; i <= n; ++i) /// For each item (c[i], v[i])
    {
        bool cur = i & 1;
        bool pre = !cur;

        for (int s = 1; s <= w; ++s)
        {
            f[cur][s] = f[pre][s];
            if (s >= c[i])
            {
                maximize(f[cur][s], f[pre][s - c[i]] + v[i]);
            }
        }
    }

    ll res = 0;
    for (int s = 0; s <= w; ++s)
        maximize(res, f[n & 1][s]);

    cout << res;
    return 0;
}

B) O(W) 1D — DP space

From the above algorithm, we can change the inner loop

Inner Part

    ll f[2][w + 1];
    memset(f, 0, sizeof(f));
    for (int i = 1; i <= n; ++i) /// For each item (c[i], v[i])
    {
        bool cur = i & 1;
        bool pre = !cur;

        for (int s = 1; s <= w; ++s)
            f[cur][s] = f[pre][s];
        
        for (int s = w; s >= c[i]; --s)
            maximize(f[cur][s], f[pre][s - c[i]] + v[i]);
    }

Kinda tricky, but we only need one array, for each query $$$f[s]$$$ stand for maximum value with bag of weight $$$s$$$ upto that query.

Inner Part

    ll f[w + 1];
    memset(f, 0, sizeof(f));
    for (int i = 1; i <= n; ++i) /// For each item (c[i], v[i])
    {
        bool cur = i & 1;
        bool pre = !cur;

        for (int s = 1; s <= w; ++s) /// Unneeded loop
            f[s] = f[s];
        
        for (int s = w; s >= c[i]; --s)
            maximize(f[s], f[s - c[i]] + v[i]);
    }

Notice that it is required for the second-inner-loop to iterate from $$$w$$$ downto $$$c_i$$$. Here is the reason

From c[i] upto w

        for which

            f[cur][s] = f[s] that updated
            f[pre][s] = f[s] that not update yet

        the part

            for (int s = c; s <= w; ++s)
                maximize(f[s], f[s - c] + v);

        equivalent to

            for (int s = c; s <= w; ++s)
                maximize(f[cur][s], f[cur][s - c] + v);

From w downto c[i]

        for which

            f[cur][s] = f[s] that updated
            f[pre][s] = f[s] that not update yet

        the part

            for (int s = w; s >= c; --s)
                maximize(f[s], f[s - c] + v);

        equivalent

            for (int s = w; s >= c; --s)
                maximize(f[cur][s], f[pre][s - c] + v);

Finally, here is 1D Dynamic Programming Solution

Space Optimization Approach - O(NW) time - O(N + 1W) space

#include <iostream>
#include <cstring>
#include <cmath>

using namespace std;

template<typename T> void maximize(T &res, const T &val) { if (res < val) res = val; }
template<typename T> void minimize(T &res, const T &val) { if (res > val) res = val; }

typedef long long ll;
/// ====*====*====*====*====*====*====*====*====*====*====*====*====*====*====*====

int main()
{
    int n, w;
    cin >> n >> w;

    int c[n + 1], v[n + 1];
    for (int i = 1; i <= n; ++i)
        cin >> c[i] >> v[i];

    ll f[w + 1];
    memset(f, 0, sizeof(f));
    for (int i = 1; i <= n; ++i)
        for (int s = w; s >= c[i]; --s)
            maximize(f[s], f[s - c[i]] + v[i]);

    cout << f[w];
    return 0;
}

――――――――――――――――――――――――――――――――――――――――――――――――――――――――――――――――

Teleporter: [Previous] | | | [Next]

V. Solution for small sum of value — V[i]

What is the minimum bag weight possible when it is exact $$$S$$$ sum value ?

A) Recursive Dynamic Programming — $$$O(N \times SUM)$$$ time — $$$O(N \times SUM)$$$ space

Memorization:
- f[i][s] = magic(int i, int s) stand for using from the $$$ith$$$ items, with the total value of $$$s$$$ that minimum weight is exact $$$f[i][s]$$$
- All $$$f[i][s]$$$ init as $$$-1$$$
Base cases
- If ($$$s < 0$$$) then $$$v = +oo$$$ means we use more value than expected
- If ($$$i > n$$$ and $$$s \neq 0$$$) then $$$v = +oo$$$ means there is currently no bag of exact $$$s$$$ value
- If ($$$i > n$$$ and $$$s = 0$$$) then $$$v = 0$$$ means there is actually a bag of exact $$$s$$$ value
Transistion
- Using current item, it will be $$$A = magic(i + 1, s - v_i) + c_i)$$$ — move to next item, sum value is reduce by $$$v_i$$$, weight is added with $$$c_i$$$
- Not using current item, it will be $$$B = magic(i + 1, s - 0) + 0)$$$ — move to next item, sum value is remained, weight is not increased
- We want the minimum weight so $$$magic(int\ i, int\ s) = min(A, B)$$$
The final result: $$$result = \underset{0 \leq s \leq \Sigma(v_i)}{max}(s | magic(1, s) \leq w)$$$ — maximum value whose weight is not greater than $$$W$$$

Recursive Approach - O(NSUM) time - O(NSUM) space

#include <iostream>
#include <cstring>
#include <cmath>

using namespace std;

template<typename T> void maximize(T &res, const T &val) { if (res < val) res = val; }
template<typename T> void minimize(T &res, const T &val) { if (res > val) res = val; }

typedef long long ll;
const int INF = 0x3f3f3f3f;
const ll LINF = 0x3f3f3f3f3f3f3f3f;
const int MAXN = 101;
const int MAXSUM = 101010;
/// ====*====*====*====*====*====*====*====*====*====*====*====*====*====*====*====

int n, w;
int c[MAXN];
int v[MAXN];
ll f[MAXN][MAXSUM];
ll magic(int i = 1, int s = 0)
{
    if (s < 0) return +LINF;
    if (i > n) return (s == 0) ? 0 : +LINF;

    ll &res = f[i][s];
    if (res != -1) return res;
    res = +LINF;

    minimize(res, magic(i + 1, s - 0) + 0);
    minimize(res, magic(i + 1, s - v[i]) + c[i]);
    return res;
}

int main()
{
    cin >> n >> w;
    for (int i = 1; i <= n; ++i)
        cin >> c[i] >> v[i];

    int sum = 0;
    for (int i = 1; i <= n; ++i)
        sum += v[i];

    memset(f, -1, sizeof(f));
    for (int res = sum; res >= 0; --res)
    {
        if (magic(1, res) <= w)
        {
            cout << res;
            return 0;
        }
    }

    return 0;
}

B) Iterative Dynamic Programming — $$$O(N \times SUM)$$$ time — $$$O(N \times SUM)$$$ space

Memorization:
- f[i][s] stand for using from the $$$ith$$$ items, with the total value of exact $$$s$$$ that maximum value is $$$f[i][s]$$$
- All $$$f[i][s]$$$ init as $$$+oo$$$ not $$$-1$$$
Base cases:
- $$$\forall x \geq 0, f[0][x] = 0$$$ — using no item, hence return no weight
- $$$\forall x \geq 0, f[x][0] = 0$$$ — having no value, hence no using item
- $$$\forall x > 0, y < 0, f[x][y] = +oo$$$ — define it as negative infinity for easier calculation
Transistion:
- Using current item, $$$A = \underset{0 \leq t + v_i \leq s}{\underset{j \leq i}{min}}(f[j][t]) + c[i] = \underset{0 \leq t = s - v_i}{\underset{j \leq i}{min}}(f[j][t]) + c[i] = \underset{0 \leq t = s - c_i}{\underset{j = i - 1}{f[j][t]}} + v[i]$$$ minimum weight among all previous bags added to current item
- Not using current item, it will be $$$B = \underset{0 \leq t + 0 \leq s}{\underset{j \leq i}{min}}(f[j][t]) + 0 = \underset{0 \leq t = s}{\underset{j \leq i}{min}}(f[j][t]) + 0 = \underset{0 \leq t = s}{\underset{j = i - 1}{f[j][t]}} + 0$$$ — move to next item, value is remained, weight is not increased
- We want the minimum weight so $$$f[i][s] = min(A, B) = min(f[i - 1][s], f[i - 1][s - v_i] + c_i)$$$
The final result: $$$result = \underset{0 \leq s \leq \Sigma(v_i)}{max}(s | f[n][s] \leq w)$$$ — maximum value whose weight is not greater than $$$W$$$

Iterative Approach - O(NSUM) time - O(NSUM) space

#include <iostream>
#include <cstring>
#include <cmath>

using namespace std;

template<typename T> void maximize(T &res, const T &val) { if (res < val) res = val; }
template<typename T> void minimize(T &res, const T &val) { if (res > val) res = val; }

typedef long long ll;
const int INF = 0x3f3f3f3f;
const ll LINF = 0x3f3f3f3f3f3f3f3f;
/// ====*====*====*====*====*====*====*====*====*====*====*====*====*====*====*====

int main()
{
    int n, w;
    cin >> n >> w;

    int c[n + 1], v[n + 1];
    for (int i = 1; i <= n; ++i)
        cin >> c[i] >> v[i];

    int sum = 0;
    for (int i = 1; i <= n; ++i)
        sum += v[i];

    ll f[n + 1][sum + 1];
    memset(f, +LINF, sizeof(f));
    f[0][0] = 0;

    for (int i = 1; i <= n; ++i)
    {
        for (int s = 0; s <= sum; ++s)
        {
            f[i][s] = f[i - 1][s];

            if (s >= v[i])
            {
                minimize(f[i][s], f[i - 1][s - v[i]] + c[i]);
            }
        }
    }

    for (int res = sum; res >= 0; --res)
    {
        if (f[n][res] <= w)
        {
            cout << res;
            return 0;
        }
    }

    return 0;
}

C) Iterative Dynamic Programming (Space Optimization) — $$$O(N \times SUM)$$$ time — $$$O(N + SUM)$$$ space

A) O(2SUM) DP space

Observe: $$$\forall i > 0, f[i][x]$$$ depends on $$$f[i - 1]$$$ and $$$f[i]$$$ only, hence we just need 2 dp array space
Define: When we calculate at pth element, we have $$$\underset{x \equiv p (mod 2)}{f[x]}$$$ is current dp array, $$$\underset{y \equiv p + 1 (mod 2)}{f[y]}$$$ is previous dp array
Transistion: $$$f[i][s] = min(f[i - 1][s], f[i - 1][s - v_i] + c_i)$$$ equivalent to $$$f[x][s] = min(f[y][s], f[y][s - v_i] + c_i)$$$

Space Optimization Approach - O(NSUM) time - O(N + 2SUM) space

#include <iostream>
#include <cstring>
#include <cmath>

using namespace std;

template<typename T> void maximize(T &res, const T &val) { if (res < val) res = val; }
template<typename T> void minimize(T &res, const T &val) { if (res > val) res = val; }

typedef long long ll;
const int INF = 0x3f3f3f3f;
const ll LINF = 0x3f3f3f3f3f3f3f3f;
/// ====*====*====*====*====*====*====*====*====*====*====*====*====*====*====*====

int main()
{
    int n, w;
    cin >> n >> w;

    int c[n + 1], v[n + 1];
    for (int i = 1; i <= n; ++i)
        cin >> c[i] >> v[i];

    int sum = 0;
    for (int i = 1; i <= n; ++i)
        sum += v[i];

    ll f[2][sum + 1];
    memset(f, +LINF, sizeof(f));
    f[0][0] = 0;

    for (int i = 1; i <= n; ++i)
    {
        bool cur = i & 1;
        bool pre = !cur;

        for (int s = 0; s <= sum; ++s)
        {
            f[cur][s] = f[pre][s];

            if (s >= v[i])
            {
                minimize(f[cur][s], f[pre][s - v[i]] + c[i]);
            }
        }
    }

    for (int res = sum; res >= 0; --res)
    {
        if (f[n & 1][res] <= w)
        {
            cout << res;
            return 0;
        }
    }

    return 0;
}

B) O(SUM) 1D — DP space

From the above algorithm, we can change the inner loop

Inner Part

    ll f[2][sum + 1];
    memset(f, +LINF, sizeof(f));
    f[0][0] = 0;

    for (int i = 1; i <= n; ++i)
    {
        bool cur = i & 1;
        bool pre = !cur;

        for (int s = 0; s <= sum; ++s)
        {
            f[cur][s] = f[pre][s];

            if (s >= v[i])
            {
                minimize(f[cur][s], f[pre][s - v[i]] + c[i]);
            }
        }
    }

Kinda tricky, but we only need one array, for each query $$$f[s]$$$ stand for maximum value with bag of weight $$$s$$$ upto that query.

Inner Part

    ll f[2][sum + 1];
    memset(f, +LINF, sizeof(f));
    f[0][0] = 0;

    for (int i = 1; i <= n; ++i)
    {
        bool cur = i & 1;
        bool pre = !cur;

        for (int s = 0; s <= w; ++s) /// Unneeded loop
            f[s] = f[s];
        
        for (int s = sum; s >= v[i]; --s)
            minimize(f[s], f[s - v[i]] + c[i]);
    }

Notice that it is required for the second-inner-loop to iterate from $$$sum$$$ downto $$$v_i$$$. Here is the reason

From v[i] upto sum

        for which

            f[cur][s] = f[s] that updated
            f[pre][s] = f[s] that not update yet

        the part

            for (int s = v[i]; s <= sum; ++s)
                minimize(f[s], f[s - v[i]] + c[i]);

        equivalent to

            for (int s = v[i]; s <= sum; ++s)
                minimize(f[cur][s], f[cur][s - v[i]] + c[i]);

From sum downto v[i]

        for which

            f[cur][s] = f[s] that updated
            f[pre][s] = f[s] that not update yet

        the part

            for (int s = sum; s >= v[i] --s)
                minimize(f[s], f[s - v[i]] + c[i]);

        equivalent to

            for (int s = sum; s >= v[i] --s)
                minimize(f[cur][s], f[pre][s - v[i]] + c[i]);

Finally, here is 1D Dynamic Programming Solution

Space Optimization Approach - O(NSUM) time - O(N + SUM) space

#include <iostream>
#include <cstring>
#include <cmath>

using namespace std;

template<typename T> void maximize(T &res, const T &val) { if (res < val) res = val; }
template<typename T> void minimize(T &res, const T &val) { if (res > val) res = val; }

typedef long long ll;
const int INF = 0x3f3f3f3f;
const ll LINF = 0x3f3f3f3f3f3f3f3f;
/// ====*====*====*====*====*====*====*====*====*====*====*====*====*====*====*====

int main()
{
    int n, w;
    cin >> n >> w;

    int c[n + 1], v[n + 1];
    for (int i = 1; i <= n; ++i)
        cin >> c[i] >> v[i];

    int sum = 0;
    for (int i = 1; i <= n; ++i)
        sum += v[i];

    ll f[sum + 1];
    memset(f, +LINF, sizeof(f));

    f[0] = 0;
    for (int i = 1; i <= n; ++i)
        for (int s = sum; s >= v[i]; --s)
            minimize(f[s], f[s - v[i]] + c[i]);

    for (int res = sum; res >= 0; --res)
    {
        if (f[res] <= w)
        {
            cout << res;
            return 0;
        }
    }

    return 0;
}

――――――――――――――――――――――――――――――――――――――――――――――――――――――――――――――――

Teleporter: [Previous] | | | [Next]

VII. Tracing for selected elements

Which next state will lead to the best result ?

A) Solution for small number of element — N

A) Permutation Approach: We will update selected elements when we see a better solution

Permutation - O(n!) time - O(n) space

#include <algorithm>
#include <iostream>
#include <cstring>
#include <numeric>
#include <vector>
#include <cmath>

using namespace std;

template<typename T> void maximize(T &res, const T &val) { if (res < val) res = val; }
template<typename T> void minimize(T &res, const T &val) { if (res > val) res = val; }

typedef long long ll;
/// ====*====*====*====*====*====*====*====*====*====*====*====*====*====*====*====

int main()
{
    int n, w;
    cin >> n >> w;

    int c[n], v[n];
    for (int i = 0; i < n; ++i)
        cin >> c[i] >> v[i];

    int p[n];
    iota(p, p + n, 0);

    vector<int> selected;
    ll res = 0;
    do {
        bool better = false;
        vector<int> current;
        ll sum_weight = 0;
        ll sum_value = 0;
        for (int i = 0; i < n; ++i)
        {
            int weight = c[p[i]];
            int value = v[p[i]];

            sum_weight += weight;
            sum_value += value;
            if (sum_weight > w) 
            {
                break;
            }
            else
            {
               current.push_back(p[i]);
                if (res < sum_value)
                {
                    better = true;
                    res = sum_value;
                }
            }
        }

        if (better) selected = current;
    }
    while (next_permutation(p, p + n));

    cout << res << '\n';
    sort(selected.begin(), selected.end());
    for (int p : selected)
    {
        cout << p + 1 << ' ' << c[p] << ' ' << v[p] << '\n';
    }

    return 0;
}

B) Bitmasking Approach: We will update bitmask when we see a better solution

O(2^n) time - O(n) space

#include <iostream>
#include <cstring>
#include <cmath>

using namespace std;

template<typename T> void maximize(T &res, const T &val) { if (res < val) res = val; }
template<typename T> void minimize(T &res, const T &val) { if (res > val) res = val; }

typedef long long ll;
/// ====*====*====*====*====*====*====*====*====*====*====*====*====*====*====*====

int main()
{
    int n, w;
    cin >> n >> w;

    int c[n], v[n];
    for (int i = 0; i < n; ++i)
        cin >> c[i] >> v[i];

    ll res = 0;
    int selected = 0;
    int lim = 1 << n;
    for (int mask = 0; mask < lim; ++mask)
    {
        ll weight = 0;
        ll value = 0;
        for (int i = 0; i < n; ++i)
        {
            if (mask >> i & 1)
            {
                weight += c[i];
                value += v[i];
                if (weight > w) break;
            }
        }

        if (weight <= w)
        {
            if (res <= value)
            {
                res = value;
                selected = mask;
            }
        }
    }

    cout << res << '\n';
    for (int i = 0; i < n; ++i)
    {
        if (selected >> i & 1)
        {
            cout << i + 1 << ' ' << c[i] << ' ' << v[i] << '\n';
        }
    }
    return 0;
}

C) Meet-in-the-middle Approach: We will update bitmask when we see a better solution AND ON DP-CALCULATION.

Bitmasking - O(2^(n/2) * (n/2)) time - O(2^(n/2)) space

#include <iostream>
#include <cstring>
#include <vector>
#include <cmath>
#include <bitset>

using namespace std;

template<typename T> void maximize(T &res, const T &val) { if (res < val) res = val; }
template<typename T> void minimize(T &res, const T &val) { if (res > val) res = val; }

#define all(x) (x).begin(), (x).end()
typedef long long ll;
/// ====*====*====*====*====*====*====*====*====*====*====*====*====*====*====*====

struct Node 
{
    ll maxval = 0;
    int maxmask = 0;

    int mask;
    ll value;
    int weight;
    Node (int mask = 0, ll value = 0, int weight = 0)
    : mask(mask), value(value), weight(weight) {}
};

int n, w;
void solve(const vector<int> &c, const vector<int> &v, vector<Node> &S)
{
    int n = c.size(); /// Important !!!
    int lim = 1 << n;
    for (int mask = 0; mask < lim; ++mask)
    {
        ll weight = 0;
        ll value = 0;
        for (int i = 0; i < n; ++i)
        {
            if (mask >> i & 1)
            {
                weight += c[i];
                value += v[i];
                if (weight > w) break;
            }
        }

        S.push_back(Node(mask, value, weight));
    }
}

int main()
{
    cin >> n >> w;

    int c[n], v[n];
    for (int i = 0; i < n; ++i)
        cin >> c[i] >> v[i];

    int m = n / 2;
    vector<int> cl, vl;
    for (int i = 0; i < m; ++i)
    {
        cl.push_back(c[i]);
        vl.push_back(v[i]);
    }

    vector<int> cr, vr;
    for (int i = m; i < n; ++i)
    {
        cr.push_back(c[i]);
        vr.push_back(v[i]);
    }

    vector<Node> Sl, Sr;
    solve(cl, vl, Sl);
    solve(cr, vr, Sr);

    sort(all(Sr), [](const Node &a, const Node &b) {
        return (a.weight != b.weight) ? a.weight < b.weight : a.value > b.value;
    });

    ll maxval = 0;
    int maxmask = 0;
    for (Node &x : Sr)
    {
        if (maxval < x.value)
        {
            maxval = x.value;
            maxmask = x.mask;
        }
        x.maxval = maxval;
        x.maxmask = maxmask;
    }

    ll res = 0;
    int mask_l = 0;
    int mask_r = 0;
    for (Node &y : Sl)
    {
        for (int l = 0, r = int(Sr.size()) - 1; l <= r; )
        {
            int m = (l + r) >> 1;
            Node x = Sr[m];
            if (x.weight + y.weight <= w)
            {
                if (res < x.maxval + y.value)
                {
                    res = x.maxval + y.value;
                    mask_l = y.mask;
                    mask_r = x.maxmask;
                }
                l = m + 1;
            }
            else 
            {
                r = m - 1;
            }
        }
    }

    vector<int> selected;
    for (int i = 0; i < m; ++i)
        if (mask_l >> i & 1)
            selected.push_back(i);

    for (int i = 0; i < n - m; ++i)
        if (mask_r >> i & 1)
            selected.push_back(i + m);

    cout << res << '\n';
    cout << selected.size() << '\n';
    for (int p : selected)
    {
        cout << p + 1 << ' ' << c[p] << ' ' << v[p] << '\n';
    }

    return 0;
}

B) Solution for small sum of weight — C[i]

A) Recursive Dynamic Programming: Starting from $$$(i = 0, s = 0)$$$, we already have $$$magic(i,s)$$$ return the best result, $$$magic(i + 1,s + 0) + 0)$$$ or/and $$$magic(i + 1, s + c[i]) + v[i]$$$ will be the best result

Trace cases

Recursive DP - O(NW) time - O(NW) space

#include <iostream>
#include <cstring>
#include <vector>
#include <cmath>

using namespace std;

template<typename T> void maximize(T &res, const T &val) { if (res < val) res = val; }
template<typename T> void minimize(T &res, const T &val) { if (res > val) res = val; }

typedef long long ll;
const int INF = 0x3f3f3f3f;
const ll LINF = 0x3f3f3f3f3f3f3f3f;
const int MAXN = 101;
const int MAXW = 101010;
/// ====*====*====*====*====*====*====*====*====*====*====*====*====*====*====*====

int n, w;
int c[MAXN];
int v[MAXN];
ll f[MAXN][MAXW];
ll magic(int i = 1, int s = 0)
{
    if (s > w) return -LINF; /// Using too much weight
    if (i > n) return 0;     /// No available item to add into the bag

    ll &res = f[i][s];
    if (res != -1) return res;
        
    maximize(res, magic(i + 1, s + 0) + 0);       /// Not using this item
    maximize(res, magic(i + 1, s + c[i]) + v[i]); /// Using this item
    return res;
}

vector<int> selected;
void trace(int i = 1, int s = 0)
{
    if (s > w) return ;
    if (i > n) return ;

    ll res = magic(i, s);
    if (res == magic(i + 1, s + 0) + 0)
    {
        return trace(i + 1, s + 0);
    }
    else 
    {
        selected.push_back(i);
        return trace(i + 1, s + c[i]);
    }
}

int main()
{
    cin >> n >> w;
    for (int i = 1; i <= n; ++i)
        cin >> c[i] >> v[i];

    memset(f, -1, sizeof(f));
    cout << magic() << '\n';

    trace();
    cout << selected.size() << '\n';
    for (int p : selected)
    {
        cout << p << ' ' << c[p] << ' ' << v[p] << '\n';
    }
    return 0;
}

B) Iterative Dynamic Programming:

Prefixmax Iterative DP - O(NW) time - O(NW) space

#include <iostream>
#include <cstring>
#include <vector>
#include <cmath>

using namespace std;

template<typename T> void maximize(T &res, const T &val) { if (res < val) res = val; }
template<typename T> void minimize(T &res, const T &val) { if (res > val) res = val; }

typedef long long ll;
/// ====*====*====*====*====*====*====*====*====*====*====*====*====*====*====*====

int main()
{
    int n, w;
    cin >> n >> w;

    int c[n + 1], v[n + 1];
    for (int i = 1; i <= n; ++i)
        cin >> c[i] >> v[i];

    ll f[n + 1][w + 1];
    memset(f, 0, sizeof(f));
    for (int i = 1; i <= n; ++i)
    {
        for (int s = 1; s <= w; ++s)
        {
            f[i][s] = max(f[i][s - 1], f[i - 1][s]);
            if (s >= c[i])
            {
                maximize(f[i][s], f[i - 1][s - c[i]] + v[i]);
            }
        }
    }

    vector<int> selected;
    for (int i = n, s = w; i >= 1 && s >= 1; )
    {
        if (f[i][s] == f[i - 1][s - c[i]] + v[i])
        {
            selected.push_back(i);
            s -= c[i];
            i -= 1;
            continue;
        }

        if (f[i][s - 1] > f[i - 1][s])
        {
            --s;
        }
        else /// f[i][s] = f[i - 1][s]
        {
            --i;
        }
    }

    cout << f[n][w] << '\n';
    cout << selected.size() << '\n';
    for (int p : selected)
    {
        cout << p << ' ' << c[p] << ' ' << v[p] << '\n';
    }
    return 0;
}

C) Iterative Dynamic Programming (Space Optimization):

Explanation

Code

#include <algorithm>
#include <iostream>
#include <cstring>
#include <vector>
#include <cmath>
	
using namespace std;
	
template<typename T> void maximize(T &res, const T &val) { if (res < val) res = val; }
template<typename T> void minimize(T &res, const T &val) { if (res > val) res = val; }
	
typedef long long ll;
/// ====*====*====*====*====*====*====*====*====*====*====*====*====*====*====*====
	
const int LIM_N = 111;
const int LIM_W = 1e6 + 16;
	
int n, w;
int c[LIM_N], v[LIM_N];
ll f[LIM_W];

int calc(ll *f, int l = 1, int r = n)
{
	ll upper = 0;
	for (int i = l; i <= r; ++i)
		upper += c[i];
	
	minimize(upper, (ll)w);
	for (int s = 0; s <= upper; ++s)
		f[s] = 0;
	
	for (int i = l; i <= r; ++i)
		for (int s = upper; s >= c[i]; --s)
			maximize(f[s], f[s - c[i]] + v[i]);
	
	return upper;
}
	
ll L[LIM_W], R[LIM_W];
vector<int> selected;
void trace(int s = w, int l = 1, int r = n)
{
	if (l == r)
	{
		if (s == c[l])
		{
			selected.push_back(l);
		}
	
		return ;
	}
	
	int m = (l + r) >> 1;
	int sleft  = calc(L, l, m + 0);
	int sright = calc(R, m + 1, r);
	
	ll mx = -1;
	int pleft = 0;
	int pright = s;
	for (int v = max(0, s - sright); v <= min(s, sleft); ++v)
	{
		if (mx < L[v] + R[s - v])
		{
			mx = L[v] + R[s - v];
			pleft = v;
			pright = s - v;
		}
	}
	
	trace(pleft , l, m + 0);
	trace(pright, m + 1, r);
}
		
int main()
{
	cin >> n >> w;
	for (int i = 1; i <= n; ++i)
		cin >> c[i] >> v[i];
	
	calc(f);
	ll res = f[w];
	int weight_used = max_element(f, f + w + 1) - f;
	
	trace(weight_used);	
	cout << res << '\n';	
	cout << selected.size() << '\n';
	for (int p : selected)
	{
	    cout << p << ' ' << c[p] << ' ' << v[p] << '\n';
	}
	return 0;
}

C) Solution for small sum of value — V[i]

A) Recursive Dynamic Programming: Starting from $$$(i = 0, s = res)$$$, we already have $$$magic(i,s)$$$ return the best result, $$$magic(i + 1,s + 0) + 0)$$$ or/and $$$magic(i + 1, s - v[i]) + c[i]$$$ will be the best result

Trace cases

Recursive DP - O(NSUM) time - O(NSUM) space

#include <iostream>
#include <cstring>
#include <vector>
#include <cmath>

using namespace std;

template<typename T> void maximize(T &res, const T &val) { if (res < val) res = val; }
template<typename T> void minimize(T &res, const T &val) { if (res > val) res = val; }

typedef long long ll;
const int INF = 0x3f3f3f3f;
const ll LINF = 0x3f3f3f3f3f3f3f3f;
const int MAXN = 101;
const int MAXSUM = 101010;
/// ====*====*====*====*====*====*====*====*====*====*====*====*====*====*====*====

int n, w;
int c[MAXN];
int v[MAXN];
ll f[MAXN][MAXSUM];
ll magic(int i = 1, int s = 0)
{
    if (s < 0) return +LINF;
    if (i > n) return (s == 0) ? 0 : +LINF;

    ll &res = f[i][s];
    if (res != -1) return res;
    res = +LINF;

    minimize(res, magic(i + 1, s - 0) + 0);
    minimize(res, magic(i + 1, s - v[i]) + c[i]);
    return res;
}

vector<int> selected;
void trace(int i = 1, int s = 0)
{
    if (s < 0) return ;
    if (i > n) return ;

    ll res = magic(i, s);
    if (res == magic(i + 1, s + 0) + 0)
    {
        return trace(i + 1, s + 0);
    }
    else 
    {
        selected.push_back(i);
        return trace(i + 1, s - v[i]);
    }
}

int main()
{
    cin >> n >> w;
    for (int i = 1; i <= n; ++i)
        cin >> c[i] >> v[i];

    int sum = 0;
    for (int i = 1; i <= n; ++i)
        sum += v[i];

    memset(f, -1, sizeof(f));
    for (int res = sum; res >= 0; --res)
    {
        if (magic(1, res) <= w)
        {
            trace(1, res);
            
            cout << res << '\n';
            cout << selected.size() << '\n';
            for (int p : selected)
            {
                cout << p << ' ' << c[p] << ' ' << v[p] << '\n';
            }
            return 0;
        }
    }

    return 0;
}

B) Iterative Dynamic Programming:

Iterative DP - O(NSUM) time - O(NSUM) space

#include <iostream>
#include <cstring>
#include <vector>
#include <cmath>

using namespace std;

template<typename T> void maximize(T &res, const T &val) { if (res < val) res = val; }
template<typename T> void minimize(T &res, const T &val) { if (res > val) res = val; }

typedef long long ll;
const int INF = 0x3f3f3f3f;
const ll LINF = 0x3f3f3f3f3f3f3f3f;
/// ====*====*====*====*====*====*====*====*====*====*====*====*====*====*====*====

int main()
{
    int n, w;
    cin >> n >> w;

    int c[n + 1], v[n + 1];
    for (int i = 1; i <= n; ++i)
        cin >> c[i] >> v[i];

    int sum = 0;
    for (int i = 1; i <= n; ++i)
        sum += v[i];

    ll f[n + 1][sum + 1];
    memset(f, +LINF, sizeof(f));
    f[0][0] = 0;

    for (int i = 1; i <= n; ++i)
    {
        for (int s = 0; s <= sum; ++s)
        {
            f[i][s] = f[i - 1][s];

            if (s >= v[i])
            {
                minimize(f[i][s], f[i - 1][s - v[i]] + c[i]);
            }
        }
    }

    int res = sum;
    while (f[n][res] > w) --res;
    
    vector<int> selected;
    for (int i = n, s = res; i >= 1 && s >= 1; )
    {
        if (f[i][s] == f[i - 1][s - v[i]] + c[i])
        {
            selected.push_back(i);
            s -= v[i];
            i -= 1;
        }
        else 
        {
            --i;
        }
    }

    cout << res << '\n';
    cout << selected.size() << '\n';
    for (int p : selected)
    {
        cout << p << ' ' << c[p] << ' ' << v[p] << '\n';
    }
    return 0;
}

C) Iterative Dynamic Programming (Space Optimization):

Explanation

Code

#include <algorithm>
#include <iostream>
#include <cstring>
#include <vector>
#include <cmath>
	
using namespace std;
	
template<typename T> void maximize(T &res, const T &val) { if (res < val) res = val; }
template<typename T> void minimize(T &res, const T &val) { if (res > val) res = val; }
	
typedef long long ll;
const int INF = 0x3f3f3f3f;
const ll LINF = 0x3f3f3f3f3f3f3f3f;
/// ====*====*====*====*====*====*====*====*====*====*====*====*====*====*====*====
	
const int LIM_N = 111;
const int LIM_S = 1e6 + 16;
	
int n, w;
int c[LIM_N], v[LIM_N];
ll f[LIM_S + 1];

int calc(ll *f, int l = 1, int r = n)
{
	int upper = 0;
	for (int i = l; i <= r; ++i)
		upper += v[i];

	f[0] = 0;
	for (int s = upper; s >= 1; --s)
		f[s] = +LINF;

	for (int i = l; i <= r; ++i)
		for (int s = upper; s >= v[i]; --s)
			minimize(f[s], f[s - v[i]] + c[i]);
	
	return upper;
}

vector<int> selected;
ll L[LIM_S + 1], R[LIM_S + 1];
void trace(int s = LIM_S, int l = 1, int r = n)
{
	if (l == r)
	{
		if (s == v[l])
		{
			selected.push_back(l);
		}	

		return ;
	}

	int m = (l + r) >> 1;
	int sleft  = calc(L, l, m + 0);
	int sright = calc(R, m + 1, r);

	int mn = +INF;
	int pleft = 0;
	int pright = s;
	for (int v = max(0, s - sright); v <= min(s, sleft); ++v)
	{
		if (mn > L[v] + R[s - v])
		{
			mn = L[v] + R[s - v];
			pleft = v;
			pright = s - v;
		}
	}

	trace(pleft , l, m + 0);
	trace(pright, m + 1, r);
}

int main()
{
	cin >> n >> w;
	for (int i = 1; i <= n; ++i)
		cin >> c[i] >> v[i];

	int res = calc(f);
	while (f[res] > w) --res;
	trace(res);
	
	int ans = 0;
	for (int p : selected)
		ans += v[p];

	cout << ans;
	// cout << res << '\n';
	// cout << selected.size() << '\n';
	// for (int p : selected)
	// {
	// 	cout << p << ' ' << c[p] << ' ' << v[p] << '\n';
	// }

	return 0;
}

――――――――――――――――――――――――――――――――――――――――――――――――――――――――――――――――

Teleporter: [Previous] | | | [Next]

VII. Other solutions

How to solve the problem with special condition ?

A) Fractional Knapsack & Greedy Approach

On progress...

――――――――――――――――――――――――――――――――――――――――――――――――――――――――――――――――

Teleporter: [Previous] | | | [Next]

VIII. Online Algorithm

How to solve the problem when you need to output the result whenever you receive a new item ?

A) Solution for small number of element — N

A) Permutation Approach: Really not worth being used thought it is possible to optimize it

B) Bitmasking Approach:

Explanation

Bitmasking Approach - O(2^N * N) time - O(n) space

#include <iostream>
#include <cstring>
#include <cmath>

using namespace std;

template<typename T> void maximize(T &res, const T &val) { if (res < val) res = val; }
template<typename T> void minimize(T &res, const T &val) { if (res > val) res = val; }

typedef long long ll;
/// ====*====*====*====*====*====*====*====*====*====*====*====*====*====*====*====

int main()
{
    int n, w;
    cin >> n >> w;

    ll res = 0;
    ll c[n], v[n];
    for (int i = 1; i <= n; ++i)
    {
        cin >> c[i] >> v[i];
        int L = 1 << i;
        int R = 1 << (i + 1);
        for (int mask = L; mask < R; ++mask)
        {
            ll weight = 0;
            ll value = 0;
            for (int j = 1; j <= i; ++j)
            {
                if (mask >> j & 1)
                {
                    weight += c[j];
                    value += v[j];
                    if (weight > w) break;
                }
            }

            if (weight <= w)
            {
                maximize(res, value);
            }
        }

        cout << res << '\n';
    }

    return 0;
}

DP Bitmask Approach - O(2^N) time - O(2^N) space

#include <iostream>
#include <cstring>
#include <cmath>

using namespace std;

template<typename T> void maximize(T &res, const T &val) { if (res < val) res = val; }
template<typename T> void minimize(T &res, const T &val) { if (res > val) res = val; }

typedef long long ll;
/// ====*====*====*====*====*====*====*====*====*====*====*====*====*====*====*====

const int LIM = 1 << 20;
ll dpc[LIM], dpv[LIM];
int main()
{
    int n, w;
    cin >> n >> w;

    ll res = 0;
    for (int i = 0; i < n; ++i)
    {
        ll c, v;
        cin >> c >> v;
        for (int mask = (1 << i); mask < (1 << (i + 1)); ++mask)
        {
            dpc[mask] = dpc[mask ^ (1 << i)] + c;
            dpv[mask] = dpv[mask ^ (1 << i)] + v;
            if (dpc[mask] <= w)
                maximize(res, dpv[mask]);
        }
        cout << res << '\n';
    }

    return 0;
}

C) Iterating Deque Approach:

Explanation

Deque approach with little optimization - O(min(W, 2^N)) time - O(2^N) space

#include <algorithm>
#include <iostream>
#include <cstring>
#include <vector>
#include <cstdio>
#include <cmath>
#include <deque>

using namespace std;

void file(const string FILE = "Test")
{
    freopen((FILE + ".INP").c_str(), "r", stdin);
    freopen((FILE + ".OUT").c_str(), "w", stdout);
}

char __;
template<typename T>
void getUnsign(T &x)
{
    while (__ = getchar(), __ < '0' || __ > '9');
 
    x = (__ - '0');
    while (__ = getchar(), __ >= '0' && __ <= '9')
        x = (x << 3) + (x << 1) + (__ - '0');
}

template<typename T>
void getSigned(T &x)
{
    while (__ = getchar(), __ != '-' && (__ < '0' || __ > '9'));
    bool sign(__ == '-');
    if (sign) __ = getchar();
 
    x = (__ - '0');
    while (__ = getchar(), __ >= '0' && __ <= '9')
        x = (x << 3) + (x << 1) + (__ - '0');

    if (sign) x = -x;
}

template<typename T> void maximize(T &res, const T &val) { if (res < val) res = val; }
template<typename T> void minimize(T &res, const T &val) { if (res > val) res = val; }

#define all(x) (x).begin(), (x).end()
typedef long long ll;
typedef pair<int, int> pi;

const int LIM = 0;
const int INF = 0x3f3f3f3f;
const ll LINF = 0x3f3f3f3f3f3f3f3f;
/// ====*====*====*====*====*====*====*====*====*====*====*====*====*====*====*====

struct item 
{
    ll weight;
    ll value;
    item (ll weight = 0, ll value = 0)
    : weight(weight), value(value) {}

    void operator += (const item &other)
    {
        weight += other.weight;
        value += other.value;
    }
};

int main()
{
	int n;
    ll w;
	cin >> n >> w;

    ll res = 0;
    deque<item> S;
    S.push_back(item(0, 0));
    for (int i = 1; i <= n; ++i)
    {
        ll c, v;
        cin >> c >> v;
        if (c > w) continue;

        deque<item> T;
        for (item e : S)
        {
            e += item(c, v);
            if (e.weight <= w)
            {
                T.push_back(e);
                maximize(res, e.value);
            }
        }

        for (const item &e : T)
            S.push_back(e);

        cout << res << '\n';
    }
    
	return 0;
}

Deque approach with bad optimization - O(min(W, 2^N * N)) time - O(2^N * N) space


#include <algorithm>
#include <iostream>
#include <cstring>
#include <vector>
#include <cstdio>
#include <cmath>
#include <deque>
#include <map>

using namespace std;

void file(const string FILE = "Test")
{
    freopen((FILE + ".INP").c_str(), "r", stdin);
    freopen((FILE + ".OUT").c_str(), "w", stdout);
}

char __;
template<typename T>
void getUnsign(T &x)
{
    while (__ = getchar(), __ < '0' || __ > '9');
 
    x = (__ - '0');
    while (__ = getchar(), __ >= '0' && __ <= '9')
        x = (x << 3) + (x << 1) + (__ - '0');
}

template<typename T>
void getSigned(T &x)
{
    while (__ = getchar(), __ != '-' && (__ < '0' || __ > '9'));
    bool sign(__ == '-');
    if (sign) __ = getchar();
 
    x = (__ - '0');
    while (__ = getchar(), __ >= '0' && __ <= '9')
        x = (x << 3) + (x << 1) + (__ - '0');

    if (sign) x = -x;
}

template<typename T> void maximize(T &res, const T &val) { if (res < val) res = val; }
template<typename T> void minimize(T &res, const T &val) { if (res > val) res = val; }

#define all(x) (x).begin(), (x).end()
typedef long long ll;
typedef pair<int, int> pi;

const int LIM = 0;
const int INF = 0x3f3f3f3f;
const ll LINF = 0x3f3f3f3f3f3f3f3f;
/// ====*====*====*====*====*====*====*====*====*====*====*====*====*====*====*====

struct item 
{
    ll weight;
    ll value;
    item (ll weight = 0, ll value = 0)
    : weight(weight), value(value) {}

    void operator += (const item &other)
    {
        weight += other.weight;
        value += other.value;
    }
};

map<ll, ll> minC;
map<ll, ll> maxV;
bool update(ll &c, ll &v)
{
    bool ok = false;
    if (minC.count(v) == false || minC[v] > c)
    {
        minC[v] = c;
        ok = true;
    }

    if (maxV.count(c) == false || maxV[c] < v)
    {
        maxV[c] = v;
        ok = true;
    }

    return ok;
}


int main()
{
	int n;
    ll w;
	cin >> n >> w;

    ll res = 0;
    deque<item> S;

    minC[0] = 0;
    maxV[0] = 0;
    S.push_back(item(0, 0));
    for (int i = 1; i <= n; ++i)
    {
        ll c, v;
        cin >> c >> v;
        if (c > w) continue;

        deque<item> T;
        for (item e : S)
        {
            e += item(c, v);
            if (e.weight > w) continue;
            if (update(e.weight, e.value))
            {
                T.push_back(e);
                maximize(res, e.value);
            }
        }

        for (const item &e : T)
            S.push_back(e);

        cout << res << '\n';
    }
    
	return 0;
}

D) Recursive Approach:

Explanation

Recursive Approach - O(min(W, 2^N)) time - O(N) space

#include <algorithm>
#include <iostream>
#include <cstring>
#include <vector>
#include <cstdio>
#include <cmath>
#include <deque>

using namespace std;

void file(const string FILE = "Test")
{
    freopen((FILE + ".INP").c_str(), "r", stdin);
    freopen((FILE + ".OUT").c_str(), "w", stdout);
}

char __;
template<typename T>
void getUnsign(T &x)
{
    while (__ = getchar(), __ < '0' || __ > '9');
 
    x = (__ - '0');
    while (__ = getchar(), __ >= '0' && __ <= '9')
        x = (x << 3) + (x << 1) + (__ - '0');
}

template<typename T>
void getSigned(T &x)
{
    while (__ = getchar(), __ != '-' && (__ < '0' || __ > '9'));
    bool sign(__ == '-');
    if (sign) __ = getchar();
 
    x = (__ - '0');
    while (__ = getchar(), __ >= '0' && __ <= '9')
        x = (x << 3) + (x << 1) + (__ - '0');

    if (sign) x = -x;
}

template<typename T> void maximize(T &res, const T &val) { if (res < val) res = val; }
template<typename T> void minimize(T &res, const T &val) { if (res > val) res = val; }

#define all(x) (x).begin(), (x).end()
typedef long long ll;
typedef pair<int, int> pi;

const int LIM = 25;
const int INF = 0x3f3f3f3f;
const ll LINF = 0x3f3f3f3f3f3f3f3f;
/// ====*====*====*====*====*====*====*====*====*====*====*====*====*====*====*====

int n;
ll w;
ll c[LIM], v[LIM];
ll solve(int i, ll s)
{
    if (s < 0) return -LINF;
    if (i == 0) return 0;

    ll res = 0;
    maximize(res, solve(i - 1, s));
    maximize(res, solve(i - 1, s - c[i]) + v[i]);
    return res;
}

int main()
{
    cin >> n >> w;
    for (int i = 1; i <= n; ++i)
    {
        cin >> c[i] >> v[i];
        cout << solve(i, w) << '\n';
    }
	return 0;
}

Recursive Approach with optimization - O(min(W, 2^N / 2^K)) time - O(N) space

#include <algorithm>
#include <iostream>
#include <cstring>
#include <vector>
#include <cstdio>
#include <cmath>
#include <deque>

using namespace std;

void file(const string FILE = "Test")
{
    freopen((FILE + ".INP").c_str(), "r", stdin);
    freopen((FILE + ".OUT").c_str(), "w", stdout);
}

char __;
template<typename T>
void getUnsign(T &x)
{
    while (__ = getchar(), __ < '0' || __ > '9');
 
    x = (__ - '0');
    while (__ = getchar(), __ >= '0' && __ <= '9')
        x = (x << 3) + (x << 1) + (__ - '0');
}

template<typename T>
void getSigned(T &x)
{
    while (__ = getchar(), __ != '-' && (__ < '0' || __ > '9'));
    bool sign(__ == '-');
    if (sign) __ = getchar();
 
    x = (__ - '0');
    while (__ = getchar(), __ >= '0' && __ <= '9')
        x = (x << 3) + (x << 1) + (__ - '0');

    if (sign) x = -x;
}

template<typename T> void maximize(T &res, const T &val) { if (res < val) res = val; }
template<typename T> void minimize(T &res, const T &val) { if (res > val) res = val; }

#define all(x) (x).begin(), (x).end()
typedef long long ll;
typedef pair<int, int> pi;

const int LIM = 25;
const int INF = 0x3f3f3f3f;
const ll LINF = 0x3f3f3f3f3f3f3f3f;
/// ====*====*====*====*====*====*====*====*====*====*====*====*====*====*====*====

int n;
ll w;
ll c[LIM], psc[LIM];
ll v[LIM], psv[LIM];
ll solve(int i, ll s)
{
    if (s >= psc[i]) return psv[i];
    if (s < 0) return -LINF;
    if (i == 0) return 0;

    ll res = 0;
    maximize(res, solve(i - 1, s));
    maximize(res, solve(i - 1, s - c[i]) + v[i]);
    return res;
}

int main()
{
    cin >> n >> w;

    psc[0] = psv[0] = 0;
    for (int i = 1; i <= n; ++i)
    {
        cin >> c[i] >> v[i];
        psc[i] = psc[i - 1] + c[i];
        psv[i] = psv[i - 1] + v[i];
        cout << solve(i, w) << '\n';
    }
	return 0;
}

E) Meet in the middle approach: On the progress...

B) Solution for small sum of weight — C[i]

A) Recursive Dynamic Programming:

What have changed from the orginal ?

O(NW) time - O(NW) space

#include <iostream>
#include <cstring>
#include <cmath>

using namespace std;

template<typename T> void maximize(T &res, const T &val) { if (res < val) res = val; }
template<typename T> void minimize(T &res, const T &val) { if (res > val) res = val; }

typedef long long ll;
const int INF = 0x3f3f3f3f;
const ll LINF = 0x3f3f3f3f3f3f3f3f;
const int MAXN = 101;
const int MAXW = 101010;
/// ====*====*====*====*====*====*====*====*====*====*====*====*====*====*====*====

int n, w;
int c[MAXN];
int v[MAXN];
ll f[MAXN][MAXW];
ll magic(int i = 1, int s = 0)
{
    if (s > w) return -LINF; /// Using too much weight
    if (i == 0) return 0;     /// No available item to add into the bag

    ll &res = f[i][s];
    if (res != -1) return res;
        
    maximize(res, magic(i - 1, s + 0) + 0);       /// Not using this item
    maximize(res, magic(i - 1, s + c[i]) + v[i]); /// Using this item
    return res;
}

int main()
{
    memset(f, -1, sizeof(f));

    cin >> n >> w;
    for (int i = 1; i <= n; ++i)
    {
        cin >> c[i] >> v[i];
        cout << magic(i, 0) << '\n';
    }
    return 0;
}

B) Iterative Dynamic Programming:

What have changed from the orginal ?

O(NW) time - O(NW) space

#include <iostream>
#include <cstring>
#include <cmath>

using namespace std;

template<typename T> void maximize(T &res, const T &val) { if (res < val) res = val; }
template<typename T> void minimize(T &res, const T &val) { if (res > val) res = val; }

typedef long long ll;
/// ====*====*====*====*====*====*====*====*====*====*====*====*====*====*====*====

int main()
{
    int n, w;
    cin >> n >> w;

    ll f[n + 1][w + 1];
    memset(f, 0, sizeof(f));
    for (int i = 1; i <= n; ++i)
    {
        int c, v;
        cin >> c >> v;

        for (int s = 1; s <= w; ++s)
        {
            f[i][s] = max(f[i][s - 1], f[i - 1][s]);
            if (s >= c)
            {
                maximize(f[i][s], f[i - 1][s - c] + v);
            }
        }

        cout << f[i][w] << '\n';
    }

    return 0;
}

C) Iterative Dynamic Programming (Space Optimization):

What have changed from the orginal ?

O(NW) time - O(W) space

#include <iostream>
#include <cstring>
#include <cmath>

using namespace std;

template<typename T> void maximize(T &res, const T &val) { if (res < val) res = val; }
template<typename T> void minimize(T &res, const T &val) { if (res > val) res = val; }

typedef long long ll;
/// ====*====*====*====*====*====*====*====*====*====*====*====*====*====*====*====

int main()
{
    int n, w;
    cin >> n >> w;

    ll f[w + 1];
    memset(f, 0, sizeof(f));
    for (int i = 1; i <= n; ++i)
    {
        int c, v;
        cin >> c >> v;

        for (int s = w; s >= c; --s)
            maximize(f[s], f[s - c] + v);
    
        cout << f[w] << '\n';
    }

    return 0;
}

C) Solution for small sum of value — V[i]

A) Recursive Dynamic Programming:

What have changed from the orginal ?

O(NSUM) time - O(NSUM) space

#include <iostream>
#include <cstring>
#include <cmath>

using namespace std;

template<typename T> void maximize(T &res, const T &val) { if (res < val) res = val; }
template<typename T> void minimize(T &res, const T &val) { if (res > val) res = val; }

typedef long long ll;
const int INF = 0x3f3f3f3f;
const ll LINF = 0x3f3f3f3f3f3f3f3f;
const int MAXN = 101;
const int MAXSUM = 101010;
/// ====*====*====*====*====*====*====*====*====*====*====*====*====*====*====*====

int n, w;
int c[MAXN];
int v[MAXN];
ll f[MAXN][MAXSUM];
ll magic(int i = 1, int s = 0)
{
    if (s < 0) return +LINF;
    if (i == 0) return (s == 0) ? 0 : +LINF;

    ll &res = f[i][s];
    if (res != -1) return res;
    res = +LINF;

    minimize(res, magic(i - 1, s - 0) + 0);
    minimize(res, magic(i - 1, s - v[i]) + c[i]);
    return res;
}

int main()
{
    cin >> n >> w;

    int sum = 0;
    memset(f, -1, sizeof(f));
    for (int i = 1; i <= n; ++i)
    {
        cin >> c[i] >> v[i];

        sum += v[i];
        for (int res = sum; res >= 0; --res)
        {
            if (magic(i, res) <= w)
            {
                cout << res << '\n';
                break;
            }
        }
    }

    return 0;
}

B) Iterative Dynamic Programming:

What have changed from the orginal ?

O(NSUM) time - O(NSUM) space

#include <iostream>
#include <cstring>
#include <cmath>
#include <vector>

using namespace std;

template<typename T> void maximize(T &res, const T &val) { if (res < val) res = val; }
template<typename T> void minimize(T &res, const T &val) { if (res > val) res = val; }

typedef long long ll;
const int INF = 0x3f3f3f3f;
const ll LINF = 0x3f3f3f3f3f3f3f3f;
const int LIMN = 100;
const int LIMSUM = 1e5 + 15;
/// ====*====*====*====*====*====*====*====*====*====*====*====*====*====*====*====

ll f[LIMN][LIMSUM];
int main()
{
    int n, w;
    cin >> n >> w;

    memset(f, +LINF, sizeof(f));
    f[0][0] = 0;

    int sum = 0;
    for (int i = 1; i <= n; ++i)
    {
        int c, v;
        cin >> c >> v;

        sum += v;
        for (int s = sum; s >= 0; --s)
        {
            f[i][s] = f[i - 1][s];

            if (s >= v)
            {
                minimize(f[i][s], f[i - 1][s - v] + c);
            }
        }

        for (int res = sum; res >= 0; --res)
        {
            if (f[i][res] <= w)
            {
                cout << res << '\n';
                break;
            }
        }
    }

    return 0;
}

C) Iterative Dynamic Programming (Space Optimization):

What have changed from the orginal ?

O(NSUM) time - O(NSUM) space

#include <iostream>
#include <cstring>
#include <cmath>

using namespace std;

template<typename T> void maximize(T &res, const T &val) { if (res < val) res = val; }
template<typename T> void minimize(T &res, const T &val) { if (res > val) res = val; }

typedef long long ll;
const int LIM = 1e6 + 16;
const int INF = 0x3f3f3f3f;
const ll LINF = 0x3f3f3f3f3f3f3f3f;
/// ====*====*====*====*====*====*====*====*====*====*====*====*====*====*====*====

ll f[LIM];
int main()
{
    int q, w;
    cin >> q >> w;

    memset(f, +LINF, sizeof(f));
    f[0] = 0;

    int sum = 0;
    while (q-->0) /// For each query
    {
        int c, v;
        cin >> c >> v;

        sum += v;
        for (int s = sum; s >= v; --s)
            minimize(f[s], f[s - v] + c);

        
        for (int res = sum; res >= 0; --res)
        {
            if (f[res] <= w)
            {
                cout << res << '\n';
                break;
            }
        }
    }

    return 0;
}

――――――――――――――――――――――――――――――――――――――――――――――――――――――――――――――――

Teleporter: [Previous] | | | [Next]

IX. Optimizations and Heuristic

How to improve the algorithm faster, shorter, simpler, safetier or saving space

A) Filtering the array

1) Split items into 2 types, whose weight less than $$$W$$$ and the rest

Hint

2) Compressed the array

Hint

――――――――――――――――――――――――――――――――――――――――――――――――――――――――――――――――

Teleporter: [Previous] | | | [Next]

X. Debugging

Support you when you are in a trouble that you cant find your bug

A) Wrong answer

1) Becareful when weight sum and value sum is big, it would cause overflow

Debug

long long weight = 0;
long long value = 0;

2) Becareful that in Meet-in-the-middle approach:

You have to update the bitmask that have maxvalue.
You have to update the $$$maxval$$$ and $$$maxmask$$$ before assign $$$x.maxval$$$, $$$x.maxmask$$$
You have to use also in collecting the result

Wrong

    ll maxval = 0;
    for (Node &x : Sr)
    {
        /// What if x.value > maxval ??
        x.maxval = maxval;
        x.maxmask = maxmask;
        if (maxval < x.value)
        {
            maxval = x.value;
            /// not update maxmask ?
        }
    }

Wrong

    if (res < x.value + y.value) /// where is maxvalue ?
    {
        res = x.value + y.value;
        mask_l = y.mask;
        mask_r = x.mask;
    }

Wrong

    if (res < x.maxval + y.value)
    {
        res = x.maxval + y.value;
        mask_l = y.mask;
        mask_r = x.mask; /// this mask might not given the maxval !
    }

Debug

    ll maxval = 0;
    int maxmask = 0;
    for (Node &x : Sr)
    {
        if (maxval < x.value)
        {
            maxval = x.value;
            maxmask = x.mask;
        }
        x.maxval = maxval;
        x.maxmask = maxmask;
    }

Debug


    if (res < x.maxval + y.value)
    {
        res = x.maxval + y.value;
        mask_l = y.mask;
        mask_r = x.maxmask;
    }

3) Forget base cases: In type $$$IV$$$ the DP is already init as 0, so you dont need the loop to zero, while the $$$V$$$ is not like that when you init it as $$$+oo$$$

Wrong

```cpp memset(f, +LINF, sizeof(f)); f[0][0] = 0;

int sum = 0;
for (int i = 1; i <= n; ++i)
{
    int c, v;
    cin >> c >> v;

    sum += v;
    for (int s = sum; s >= 1; --s) /// you have to make a loop from s = sum -> 0
    {
        f[i][s] = f[i - 1][s];

        if (s >= v)
        {
            minimize(f[i][s], f[i - 1][s - v] + c);
        }
    }

    for (int res = sum; res >= 0; --res)
    {
        if (f[i][res] <= w)
        {
            cout << res << '\n';
            break;
        }
    }
}

B) Time Limit Exceed

1) Global variable $$$\neq$$$ Local variable

In Meet-in-the-middle approach, the solve() function didnt use global variable (n), it use $$$n = |c| = |s|$$$.

Debug

Assign this at the head of the function


void solve(const vector<int> &c, const vector<int> &v, vector<Node> &S)
{
    int n = c.size(); /// Important !!!
    ...
}

or

void solve(const vector<int> &c, const vector<int> &v, vector<Node> &S)
{
    int n = v.size(); /// Important !!!
    ...
}

2) Forget to use memorization

Wrong

ll magic(int i = 1, int s = 0)
{
    if (s < 0) return +LINF;
    if (i > n) return (s == 0) ? 0 : +LINF;

    ll res = f[i][s]; /// is should be &res = [i][s]
    if (res != -1) return res;
    ll res = +LINF;

    minimize(res, magic(i + 1, s - 0) + 0);
    minimize(res, magic(i + 1, s - v[i]) + c[i]);
    return res;
}

Wrong

ll magic(int i = 1, int s = 0)
{
    if (s < 0) return +LINF;
    if (i > n) return (s == 0) ? 0 : +LINF;

    ll res = +LINF;
    minimize(res, magic(i + 1, s - 0) + 0);
    minimize(res, magic(i + 1, s - v[i]) + c[i]);
    return f[i][s] = res; /// It is calculated first then assigning dp value
}

3) You might get WA if you have wrong initalization or leave the value generated randomly

Wrong

    ll f[sum + 1];

    /// What if f[x > 0] negative ?
    f[0] = 0;
    for (int i = 1; i <= n; ++i)
        for (int s = sum; s >= v[i]; --s)
            minimize(f[s], f[s - v[i]] + c[i]);

4) If you wanna binary search for the result, remember that you cant do Prefixmin DP $$$O(N \times SUM)$$$ as what it like in Prefixmax DP $$$O(N \times W)$$$

Wrong


    ll f[n + 1][sum + 1];
    memset(f, +LINF, sizeof(f));
    f[0][0] = 0;

    for (int i = 1; i <= n; ++i)
    {
        for (int s = 1; s <= sum; ++s)
        {
            f[i][s] = min(f[i][s - 1], f[i - 1][s]);

            if (s >= v[i])
            {
                minimize(f[i][s], f[i - 1][s - v[i]] + c[i]);
            }
        }
    }

    int res = 0;
    for (int l = 1, r = sum; l <= r; )
    {
        int m = (l + r) >> 1;
        if (f[n][m] <= w)
        {
            res = m;
            l = m + 1;
        }
        else 
        {
            r = m - 1;
        }
    }
    cout << res;

C) Memory limit exceed

1) Though Meet-in-the-middle approach is faster than Bitmasking Approach, it requires large amount of space — $$$O(2^{^{\frac{n}{2}}})$$$, which may give you MLE !

2) In some cases you will need space optimization if the limit is too tight !

3) Becareful in tracing results

Wrong

    vector<int> selected;
    for (int i = n, s = w; i >= 1 && s >= 1; )
    {
        if (f[i][s] == f[i - 1][s - c[i]] + v[i])
        {
            selected.push_back(i);
            i -= 1;
            s -= c[i];
        }

        if (f[i][s - 1] > f[i - 1][s])
        {
            --s;
        }
        else /// f[i][s] = f[i - 1][s]
        {
            --i;
        }
    }

Fixed

    vector<int> selected;
    for (int i = n, s = w; i >= 1 && s >= 1; )
    {
        if (f[i][s] == f[i - 1][s - c[i]] + v[i])
        {
            selected.push_back(i);
            s -= c[i]; /// This first then decrease (i)
            i -= 1;
            continue; /// <--- Important in this case
        }

        if (f[i][s - 1] > f[i - 1][s])
        {
            --s;
        }
        else /// f[i][s] = f[i - 1][s]
        {
            --i;
        }
    }

D) Runtime Error

1) Out of bound

Wrong

ll f[MAXN][MAXW];
ll magic(int i = 1, int s = 0)
{
    if (i > n) return (s <= w) ? 0 : -LINF;     /// No available item to add into the bag

    ll &res = f[i][s]; /// what if (s > w) ?
    if (res != -1) return res;
        
    maximize(res, magic(i + 1, s + 0) + 0);       /// Not using this item
    maximize(res, magic(i + 1, s + c[i]) + v[i]); /// Using this item
    return res;
}

Wrong

    ll f[n + 1][w + 1];
    memset(f, 0, sizeof(f));
    for (int i = 1; i <= n; ++i)
        for (int s = w; s >= 0; --s)
            f[i][s] = max(f[i - 1][s], f[i - 1][s - c[i]] + v[i]); /// What if s < c[i] ?

2) Array too big in main function: Declare it globally with the init size bigger than problem constraint a bit

――――――――――――――――――――――――――――――――――――――――――――――――――――――――――――――――

Teleporter: [Previous] | | | [Next]

XI. Knapsack Variation and Practice Problems

In case you need a place to practice or submitting

CSES | DP 1158 | Book Shop

Hint

CSES | DP 1634 | Minimizing Coins

CSES | DP 1635 | Coin Combinations I

CSES | DP 1636 | Coin Combinations II

CSES | DP 1745 | Money Sums

CSES | DP 1093 | Two Sets II

CSES | DP 1665 | Coding Company

Easy but nice problem — (contributor TheScrasse)

Note

Codeforces #683 Div1 2020 | Problem 1446A | Knapsack

Hint

Codeforces #360 Div1 2016 | Problem 687C | The Values You Can Make

Codeforces #522 Div1 2018 | Problem 1078B | The Unbearable Lightness of Weights

Codeforces #77 Div1 2011 | Problem 95E | Lucky Country

Codeforces #26 Edu 2017 | Problem 837D | Round Subset

Codeforces #61 Edu 2019 | Problem 1132E | Knapsack

Codeforces 8VC Venture Cup 2017 | Problem 755F | PolandBall and Gifts

Codeforces Wunder Fund Round 2016 | Double Knapsack

Codeforces USP Try-outs 2016 | The Knapsack problem

SPOJ | Classical | Large Knapsack

SPOJ | Tutorial | Knapsack

Atcoder DP Contest | Problem D | Knapsack 1

Hint

Atcoder DP Contest | Problem E | Knapsack 2

Hint

DMOJ | Knapsack 3

DMOJ | Knapsack 4

LQDOJ | Bài toán ba lô 3

Hint

LQDOJ | Bài toán ba lô 4

Hint

LQDOJ | Bài toán ba lô 5

Hint

VNOI | Cái túi 1

VNOI | Cái túi 2

VNOI | Cái túi (Hard version)

VNOI | Bài toán cái túi

VNOI | Túi Fibonacci

VNOI | Siêu trộm

VNOI | Xếp ba lô

VNOI | Xếp ba lô 1

VNOI | Xếp ba lô knapsack

VNOI | Xếp ba lô (nhiều ba lô)

――――――――――――――――――――――――――――――――――――――――――――――――――――――――――――――――

Teleporter: [Previous] | | | [Next]

XII. Blog status

The current progress and contributor of this blogs

Recently
- 0) Added more practice links
On progress:
- 0) Table of content & Complexity comparision table
- 1) Online Algorithm
- 2) Optimizations and Heuristic
- 3a) Unbounded knapsack
- 3b) Bounded knapsack
- 3c) Item limitation knapsack
- 4a) Knapsack query maximum value with item in range $$$[L, R]$$$
- 4b) Knapsack query maximum value with weight in range $$$[L, R]$$$
- 4c) Knapsack query minimum weight with value in range $$$[L, R]$$$
- 5a) Multiple knapsack bags
- 5b) Multidimentional item
- 6) Online Algorithms
- 7) Remain space optimization while tracing for elements
- 8) Add more problems and ranking them
- 9) Asking local coordinator for permissions to release private knapsack problems
- 9) Knapsack 0/1
Special thank to contributors: SPyofgame, TheScrasse, Lusterdawn, jiangly

Full text and comments »

knapsack, dp, dp problem, tutorial, dynamic programming, optimization, brute force, memoization

SPyofgame
4 years ago
10

Knapsack the tutorial

By SPyofgame, history, 4 years ago, In English

Teleporter: [Previous] | | | [Next]

Table of content

Teleporter	Description
I. STATEMENT	Taking about the problem
II. EXAMPLE	To understand the problem better
III. Solution for small number of element — N	How much will you get in each possible subset ?
IV. Solution for small sum of weight — C[i]	What is the maximum value possible when your bag is exact $$$W$$$ weight ?
V. Solution for small sum of value — V[i]	What is the minimum bag weight possible when it is exact $$$S$$$ sum value ?
VI. Tracing for selected elements	Which next state will lead to the best result ?
VII. Other solutions	How to solve the problem with special condition ?
VIII. Online Algorithm	How to solve the problem when you need to output the result whenever you receive a new item ?
IX. Optimizations and Heuristic	How to improve the algorithm faster, shorter, simpler, safetier or saving space
X. Debugging	Support you when you are in a trouble that you cant find your bug
XI. Knapsack Variation and Practice Problems	In case you need a place to practice or submitting
XII. Blog status	The current progress and contributor of this blogs

――――――――――――――――――――――――――――――――――――――――――――――――――――――――――――――――

Teleporter: [Previous] | | | [Next]

I. STATEMENT

Taking about the problem

Question: What is the value $$$V$$$ that the thief can steal from the shop.

――――――――――――――――――――――――――――――――――――――――――――――――――――――――――――――――

Teleporter: [Previous] | | | [Next]

II. EXAMPLE

To understand the problem better

Input

Output

Explanation:

There are 8 possible cases
{} -> 0 value, 0 weight
{1} -> 10 value, 2 weight
{2} -> 20 value, 4 weight
{3} -> 30 value, 6 weight
{1, 2} -> 30 value, 6 weight
{1, 3} -> 40 value, 8 weight - optimal
{2, 3} -> 50 value, 10 weight - invalid weight
{1, 2, 3} -> 60 value, 12 weight - invalid weight

――――――――――――――――――――――――――――――――――――――――――――――――――――――――――――――――

Teleporter: [Previous] | | | [Next]

III. Solution for small number of element — N

How much will you get in each possible subset ?

A. Permutation Approach (Bad) — $$$O(n!)$$$ time — $$$O(n)$$$ space

For each possible permutation, pick elements until it weight too much
The result is the maximum value sum, for which weight sum is not greater than $$$W$$$

Code

#include <algorithm>
#include <iostream>
#include <cstring>
#include <numeric>
#include <cmath>

using namespace std;

template<typename T> void maximize(T &res, const T &val) { if (res < val) res = val; }
template<typename T> void minimize(T &res, const T &val) { if (res > val) res = val; }

typedef long long ll;
/// ====*====*====*====*====*====*====*====*====*====*====*====*====*====*====*====

int main()
{
    int n, w;
    cin >> n >> w;

    int c[n], v[n];
    for (int i = 0; i < n; ++i)
        cin >> c[i] >> v[i];

    int p[n];
    iota(p, p + n, 0);

    ll res = 0;
    do {
        ll sum_weight = 0;
        ll sum_value = 0;
        for (int i = 0; i < n; ++i)
        {
            int weight = c[p[i]];
            int value = v[p[i]];

            sum_weight += weight;
            sum_value += value;
            if (sum_weight > w) 
            {
                break;
            }
            else
            {
                maximize(res, sum_value);
            }
        }

    }
    while (next_permutation(p, p + n));

    cout << res;
    return 0;
}

B. Bitmasking Approach (Good) — $$$O(2^n)$$$ time — $$$O(n)$$$ space

Because the order isnt important, we just need to test all every possible subset
The result is the maximum value sum, for which weight sum is not greater than $$$W$$$

Code

#include <iostream>
#include <cstring>
#include <cmath>

using namespace std;

template<typename T> void maximize(T &res, const T &val) { if (res < val) res = val; }
template<typename T> void minimize(T &res, const T &val) { if (res > val) res = val; }

typedef long long ll;
/// ====*====*====*====*====*====*====*====*====*====*====*====*====*====*====*====

int main()
{
    int n, w;
    cin >> n >> w;

    int c[n], v[n];
    for (int i = 0; i < n; ++i)
        cin >> c[i] >> v[i];

    ll res = 0;
    int lim = 1 << n;
    for (int mask = 0; mask < lim; ++mask)
    {
        int weight = 0;
        ll value = 0;
        for (int i = 0; i < n; ++i)
        {
            if (mask >> i & 1)
            {
                weight += c[i];
                value += v[i];
                if (weight > w) break;
            }
        }

        if (weight <= w)
        {
            maximize(res, value);
        }
    }

    cout << res << endl;
    return 0;
}

C. Meet-in-the-middle Approach (Better) — $$$O(2^{n/2})$$$ time — $$$O(2^{n/2})$$$ space

Split the array into two halves $$$L$$$ and $$$R$$$. In each half, we will calculate every possible subsets. And in each subset we store a pair of $$$(value\ sum, weight\ sum)$$$
For each element $$$X(value_X, weight_X) \in L$$$, we need to find suitable element $$$Y(value_Y, weight_Y) \in R$$$ that satisfying maximum $$$value_R$$$ and $$$weight_L + weight_R \leq W$$$
Therefore, we can sort all the $$$R$$$ set by increasing weight. Let $$$maxval_Y = max(value_E | E \in R, weight_E \leq weight_Y)$$$. Then for each $$$X \in L$$$, we can find its suitable $$$Y$$$ by binary search in $$$O(log\ |R|)$$$ with $$$O(|R|)$$$ precalculation

Code

#include <algorithm>
#include <iostream>
#include <cstring>
#include <vector>
#include <cmath>

using namespace std;

template<typename T> void maximize(T &res, const T &val) { if (res < val) res = val; }
template<typename T> void minimize(T &res, const T &val) { if (res > val) res = val; }

#define all(x) (x).begin(), (x).end()
typedef long long ll;
/// ====*====*====*====*====*====*====*====*====*====*====*====*====*====*====*====

struct Node 
{
    ll maxval = 0;

    ll value;
    int weight;
    Node (ll value = 0, int weight = 0)
    : value(value), weight(weight) {}
};

int n, w;
void solve(const vector<int> &c, const vector<int> &v, vector<Node> &S)
{
    int n = c.size(); /// Important !!!
    int lim = 1 << n;
    for (int mask = 0; mask < lim; ++mask)
    {
        ll weight = 0;
        ll value = 0;
        for (int i = 0; i < n; ++i)
        {
            if (mask >> i & 1)
            {
                weight += c[i];
                value += v[i];
                if (weight > w) break;
            }
        }

        if (weight <= w)
        {
            S.push_back(Node(value, weight));
        }    
    }
}

int main()
{
    cin >> n >> w;

    int c[n], v[n];
    for (int i = 0; i < n; ++i)
        cin >> c[i] >> v[i];

    int m = n / 2;
    vector<int> cl, cr;
    vector<int> vl, vr;
    for (int i = 0; i < n; ++i)
    {
        if (i < m)
        {
            cl.push_back(c[i]);
            vl.push_back(v[i]);
        }
        else 
        {
            cr.push_back(c[i]);
            vr.push_back(v[i]);
        }
    }

    vector<Node> Sl, Sr;
    solve(cl, vl, Sl);
    solve(cr, vr, Sr);

    sort(all(Sr), [](const Node &a, const Node &b) {
        return (a.weight != b.weight) ? a.weight < b.weight : a.value < b.value;
    });

    ll maxval = 0;
    for (Node &x : Sr)
    {
        maximize(maxval, x.value);
        x.maxval = maxval;
    }

    ll res = 0;
    for (Node &y : Sl)
    {
        for (int l = 0, r = int(Sr.size()) - 1; l <= r; )
        {
            int m = (l + r) >> 1;
            Node x = Sr[m];
            if (x.weight + y.weight <= w)
            {
                maximize(res, x.maxval + y.value);
                l = m + 1;
            }
            else 
            {
                r = m - 1;
            }
        }
    }

    cout << res;
    return 0;
}

――――――――――――――――――――――――――――――――――――――――――――――――――――――――――――――――

Teleporter: [Previous] | | | [Next]

IV. Solution for small sum of weight — C[i]

What is the maximum value possible when your bag is exact $$$W$$$ weight ?

A) Recursive Dynamic Programming — $$$O(N \times W)$$$ time — $$$O(N \times W)$$$ space

Memorization:
- f[i][s] = magic(int i, int s) stand for using from the $$$ith$$$ items, with the total weight of $$$s$$$ that maximum value is $$$f[i][s]$$$
- All $$$f[i][s]$$$ init as $$$-1$$$
Base cases
- If ($$$s > w$$$) then $$$v = -oo$$$ since we use more than what the bag can hold
- If ($$$i \geq n$$$) then $$$v = 0$$$ since there is no available item, so no weight added into the bag
Transistion
- Using current item, it will be $$$A = magic(i + 1, s + c_i) + v_i)$$$ — move to next item, weight is added with $$$c_i$$$, value is increased by $$$v_i$$$
- Not using current item, it will be $$$B = magic(i + 1, s + 0) + 0)$$$ — move to next item, weight is remained, value is not increased
- We want the maximum value so $$$magic(int\ i, int\ s) = max(A, B)$$$
The final result: $$$result = magic(1, 0)$$$ — starting from first item with $$$0$$$ weighted bag

Code

#include <iostream>
#include <cstring>
#include <cmath>

using namespace std;

template<typename T> void maximize(T &res, const T &val) { if (res < val) res = val; }
template<typename T> void minimize(T &res, const T &val) { if (res > val) res = val; }

typedef long long ll;
const int INF = 0x3f3f3f3f;
const ll LINF = 0x3f3f3f3f3f3f3f3f;
const int MAXN = 101;
const int MAXW = 101010;
/// ====*====*====*====*====*====*====*====*====*====*====*====*====*====*====*====

int n, w;
int c[MAXN];
int v[MAXN];
ll f[MAXN][MAXW];
ll magic(int i = 1, int s = 0)
{
    if (s > w) return -LINF; /// Using too much weight
    if (i > n) return 0;     /// No available item to add into the bag

    ll &res = f[i][s];
    if (res != -1) return res;
        
    maximize(res, magic(i + 1, s + 0) + 0);       /// Not using this item
    maximize(res, magic(i + 1, s + c[i]) + v[i]); /// Using this item
    return res;
}

int main()
{
    cin >> n >> w;
    for (int i = 1; i <= n; ++i)
        cin >> c[i] >> v[i];

    memset(f, -1, sizeof(f));
    cout << magic();
    return 0;
}

B) Iterative Dynamic Programming — $$$O(N \times W)$$$ time — $$$O(N \times W)$$$ space

Memorization:
- f[i][s] stand for using from the $$$ith$$$ items, with the total weight exact $$$s$$$ that maximum value is $$$f[i][s]$$$
- All $$$f[i][s]$$$ init as $$$0$$$ not $$$-1$$$
Base cases:
- $$$\forall x \geq 0, f[0][x] = 0$$$ — using no item, hence return no value
- $$$\forall x \geq 0, f[x][0] = 0$$$ — having no weight, hence no using item
- $$$\forall x > 0, y < 0, f[x][y] = -oo$$$ — define it as negative infinity for easier calculation
Transistion:
- Using current item, $$$A = \underset{0 \leq t + c_i \leq s}{\underset{j \leq i}{max}}(f[j][t]) + v[i] = \underset{0 \leq t = s - c_i}{\underset{j \leq i}{max}}(f[j][t]) + v[i] = \underset{0 \leq t = s - c_i}{\underset{j = i - 1}{f[j][t]}} + v[i]$$$ maximum value among all previous bags added to current item
- Not using current item, it will be $$$B = \underset{0 \leq t + 0 \leq s}{\underset{j \leq i}{max}}(f[j][t]) + 0 = \underset{0 \leq t = s}{\underset{j \leq i}{max}}(f[j][t]) + 0 = \underset{0 \leq t = s}{\underset{j = i - 1}{f[j][t]}} + 0$$$ — move to next item, weight is remained, value is not increased
- We want the maximum value so $$$f[i][s] = max(A, B) = max(f[i - 1][s], f[i - 1][s - c_i] + v_i)$$$
The final result: $$$result = \underset{0 \leq s \leq w}{max}(f[n][s])$$$ — starting from first item with $$$0$$$ weighted bag

Bad transistion code - O(N^2 * W^2) time

#include <iostream>
#include <cstring>
#include <cmath>

using namespace std;

template<typename T> void maximize(T &res, const T &val) { if (res < val) res = val; }
template<typename T> void minimize(T &res, const T &val) { if (res > val) res = val; }

typedef long long ll;
/// ====*====*====*====*====*====*====*====*====*====*====*====*====*====*====*====

int main()
{
    int n, w;
    cin >> n >> w;

    int c[n + 1], v[n + 1];
    for (int i = 1; i <= n; ++i)
        cin >> c[i] >> v[i];

    ll res = 0;
    ll f[n + 1][w + 1];
    memset(f, 0, sizeof(f));
    for (int i = 1; i <= n; ++i)
    {
        for (int s = 1; s <= w; ++s)
        {
            for (int j = 1; j < i; ++j)
            {
                for (int t = 0; t <= s; ++t)
                {
                    maximize(f[i][s], f[i][t] + 0);
                }
                
                for (int t = 0; t + c[i] <= s; ++t)
                {
                    maximize(f[i][s], f[i - 1][t] + v[i]);
                }
            }

            maximize(res, f[i][s]);
        }
    }

    cout << res;
    return 0;
}

Normal DP

#include <iostream>
#include <cstring>
#include <cmath>

using namespace std;

template<typename T> void maximize(T &res, const T &val) { if (res < val) res = val; }
template<typename T> void minimize(T &res, const T &val) { if (res > val) res = val; }

typedef long long ll;
/// ====*====*====*====*====*====*====*====*====*====*====*====*====*====*====*====

int main()
{
    int n, w;
    cin >> n >> w;

    int c[n + 1], v[n + 1];
    for (int i = 1; i <= n; ++i)
        cin >> c[i] >> v[i];

    ll f[n + 1][w + 1];
    memset(f, 0, sizeof(f));
    for (int i = 1; i <= n; ++i)
    {
        for (int s = 1; s <= w; ++s)
        {
            f[i][s] = f[i - 1][s];
            if (s >= c[i])
            {
                maximize(f[i][s], f[i - 1][s - c[i]] + v[i]);
            }
        }
    }

    ll res = 0;
    for (int s = 0; s <= w; ++s)
        maximize(res, f[n][s]);

    cout << res;
    return 0;
}

Prefixmax DP

#include <iostream>
#include <cstring>
#include <cmath>

using namespace std;

template<typename T> void maximize(T &res, const T &val) { if (res < val) res = val; }
template<typename T> void minimize(T &res, const T &val) { if (res > val) res = val; }

typedef long long ll;
/// ====*====*====*====*====*====*====*====*====*====*====*====*====*====*====*====

int main()
{
    int n, w;
    cin >> n >> w;

    int c[n + 1], v[n + 1];
    for (int i = 1; i <= n; ++i)
        cin >> c[i] >> v[i];

    ll f[n + 1][w + 1];
    memset(f, 0, sizeof(f));
    for (int i = 1; i <= n; ++i)
    {
        for (int s = 1; s <= w; ++s)
        {
            f[i][s] = max(f[i][s - 1], f[i - 1][s]);
            if (s >= c[i])
            {
                maximize(f[i][s], f[i - 1][s - c[i]] + v[i]);
            }
        }
    }

    cout << f[n][w];
    return 0;
}

C) Recursive Dynamic Programming (Space optimization) — $$$O(N \times W)$$$ time — $$$O(N + W)$$$ space

A) O(2W) DP space

Observe: $$$\forall i > 0, f[i][x]$$$ depends on $$$f[i - 1]$$$ and $$$f[i]$$$ only, hence we just need 2 dp array space
Define: When we calculate at pth element, we have $$$\underset{x \equiv p (mod 2)}{f[x]}$$$ is current dp array, $$$\underset{y \equiv p + 1 (mod 2)}{f[y]}$$$ is previous dp array
Transistion: $$$f[i][s] = max(f[i - 1][s], f[i - 1][s - c_i] + v_i)$$$ equivalent to $$$f[x][s] = max(f[y][s], f[y][s - c_i] + v_i)$$$

Code

#include <iostream>
#include <cstring>
#include <cmath>

using namespace std;

template<typename T> void maximize(T &res, const T &val) { if (res < val) res = val; }
template<typename T> void minimize(T &res, const T &val) { if (res > val) res = val; }

typedef long long ll;
/// ====*====*====*====*====*====*====*====*====*====*====*====*====*====*====*====

int main()
{
    int n, w;
    cin >> n >> w;

    int c[n + 1], v[n + 1];
    for (int i = 1; i <= n; ++i)
        cin >> c[i] >> v[i];

    ll f[2][w + 1];
    memset(f, 0, sizeof(f));
    for (int i = 1; i <= n; ++i) /// For each item (c[i], v[i])
    {
        bool cur = i & 1;
        bool pre = !cur;

        for (int s = 1; s <= w; ++s)
        {
            f[cur][s] = f[pre][s];
            if (s >= c[i])
            {
                maximize(f[cur][s], f[pre][s - c[i]] + v[i]);
            }
        }
    }

    ll res = 0;
    for (int s = 0; s <= w; ++s)
        maximize(res, f[n & 1][s]);

    cout << res;
    return 0;
}

B) O(W) 1D — DP space

From the above algorithm, we can change the inner loop

Inner Part

    ll f[2][w + 1];
    memset(f, 0, sizeof(f));
    for (int i = 1; i <= n; ++i) /// For each item (c[i], v[i])
    {
        bool cur = i & 1;
        bool pre = !cur;

        for (int s = 1; s <= w; ++s)
            f[cur][s] = f[pre][s];
        
        for (int s = w; s >= c[i]; --s)
            maximize(f[cur][s], f[pre][s - c[i]] + v[i]);
    }

Kinda tricky, but we only need one array, for each query $$$f[s]$$$ stand for maximum value with bag of weight $$$s$$$ upto that query.

Inner Part

    ll f[w + 1];
    memset(f, 0, sizeof(f));
    for (int i = 1; i <= n; ++i) /// For each item (c[i], v[i])
    {
        bool cur = i & 1;
        bool pre = !cur;

        for (int s = 1; s <= w; ++s) /// Unneeded loop
            f[s] = f[s];
        
        for (int s = w; s >= c[i]; --s)
            maximize(f[s], f[s - c[i]] + v[i]);
    }

Notice that it is required for the second-inner-loop to iterate from $$$w$$$ downto $$$c_i$$$. Here is the reason

From c[i] upto w

        for which

            f[cur][s] = f[s] that updated
            f[pre][s] = f[s] that not update yet

        the part

            for (int s = c; s <= w; ++s)
                maximize(f[s], f[s - c] + v);

        equivalent to

            for (int s = c; s <= w; ++s)
                maximize(f[cur][s], f[cur][s - c] + v);

From w downto c[i]

        for which

            f[cur][s] = f[s] that updated
            f[pre][s] = f[s] that not update yet

        the part

            for (int s = w; s >= c; --s)
                maximize(f[s], f[s - c] + v);

        equivalent

            for (int s = w; s >= c; --s)
                maximize(f[cur][s], f[pre][s - c] + v);

Finally, here is 1D Dynamic Programming Solution

Code

#include <iostream>
#include <cstring>
#include <cmath>

using namespace std;

template<typename T> void maximize(T &res, const T &val) { if (res < val) res = val; }
template<typename T> void minimize(T &res, const T &val) { if (res > val) res = val; }

typedef long long ll;
/// ====*====*====*====*====*====*====*====*====*====*====*====*====*====*====*====

int main()
{
    int n, w;
    cin >> n >> w;

    int c[n + 1], v[n + 1];
    for (int i = 1; i <= n; ++i)
        cin >> c[i] >> v[i];

    ll f[w + 1];
    memset(f, 0, sizeof(f));
    for (int i = 1; i <= n; ++i)
        for (int s = w; s >= c[i]; --s)
            maximize(f[s], f[s - c[i]] + v[i]);

    cout << f[w];
    return 0;
}

――――――――――――――――――――――――――――――――――――――――――――――――――――――――――――――――

Teleporter: [Previous] | | | [Next]

V. Solution for small sum of value — V[i]

What is the minimum bag weight possible when it is exact $$$S$$$ sum value ?

A) Recursive Dynamic Programming — $$$O(N \times SUM)$$$ time — $$$O(N \times SUM)$$$ space

Memorization:
- f[i][s] = magic(int i, int s) stand for using from the $$$ith$$$ items, with the total value of $$$s$$$ that minimum weight is exact $$$f[i][s]$$$
- All $$$f[i][s]$$$ init as $$$-1$$$
Base cases
- If ($$$s < 0$$$) then $$$v = +oo$$$ means we use more value than expected
- If ($$$i > n$$$ and $$$s \neq 0$$$) then $$$v = +oo$$$ means there is currently no bag of exact $$$s$$$ value
- If ($$$i > n$$$ and $$$s = 0$$$) then $$$v = 0$$$ means there is actually a bag of exact $$$s$$$ value
Transistion
- Using current item, it will be $$$A = magic(i + 1, s - v_i) + c_i)$$$ — move to next item, sum value is reduce by $$$v_i$$$, weight is added with $$$c_i$$$
- Not using current item, it will be $$$B = magic(i + 1, s - 0) + 0)$$$ — move to next item, sum value is remained, weight is not increased
- We want the minimum weight so $$$magic(int\ i, int\ s) = min(A, B)$$$
The final result: $$$result = \underset{0 \leq s \leq \Sigma(v_i)}{max}(s | magic(1, s) \leq w)$$$ — maximum value whose weight is not greater than $$$W$$$

Code

#include <iostream>
#include <cstring>
#include <cmath>

using namespace std;

template<typename T> void maximize(T &res, const T &val) { if (res < val) res = val; }
template<typename T> void minimize(T &res, const T &val) { if (res > val) res = val; }

typedef long long ll;
const int INF = 0x3f3f3f3f;
const ll LINF = 0x3f3f3f3f3f3f3f3f;
const int MAXN = 101;
const int MAXSUM = 101010;
/// ====*====*====*====*====*====*====*====*====*====*====*====*====*====*====*====

int n, w;
int c[MAXN];
int v[MAXN];
ll f[MAXN][MAXSUM];
ll magic(int i = 1, int s = 0)
{
    if (s < 0) return +LINF;
    if (i > n) return (s == 0) ? 0 : +LINF;

    ll &res = f[i][s];
    if (res != -1) return res;
    res = +LINF;

    minimize(res, magic(i + 1, s - 0) + 0);
    minimize(res, magic(i + 1, s - v[i]) + c[i]);
    return res;
}

int main()
{
    cin >> n >> w;
    for (int i = 1; i <= n; ++i)
        cin >> c[i] >> v[i];

    int sum = 0;
    for (int i = 1; i <= n; ++i)
        sum += v[i];

    memset(f, -1, sizeof(f));
    for (int res = sum; res >= 0; --res)
    {
        if (magic(1, res) <= w)
        {
            cout << res;
            return 0;
        }
    }

    return 0;
}

B) Iterative Dynamic Programming — $$$O(N \times SUM)$$$ time — $$$O(N \times SUM)$$$ space

Memorization:
- f[i][s] stand for using from the $$$ith$$$ items, with the total value of exact $$$s$$$ that maximum value is $$$f[i][s]$$$
- All $$$f[i][s]$$$ init as $$$+oo$$$ not $$$-1$$$
Base cases:
- $$$\forall x \geq 0, f[0][x] = 0$$$ — using no item, hence return no weight
- $$$\forall x \geq 0, f[x][0] = 0$$$ — having no value, hence no using item
- $$$\forall x > 0, y < 0, f[x][y] = +oo$$$ — define it as negative infinity for easier calculation
Transistion:
- Using current item, $$$A = \underset{0 \leq t + v_i \leq s}{\underset{j \leq i}{min}}(f[j][t]) + c[i] = \underset{0 \leq t = s - v_i}{\underset{j \leq i}{min}}(f[j][t]) + c[i] = \underset{0 \leq t = s - c_i}{\underset{j = i - 1}{f[j][t]}} + v[i]$$$ minimum weight among all previous bags added to current item
- Not using current item, it will be $$$B = \underset{0 \leq t + 0 \leq s}{\underset{j \leq i}{min}}(f[j][t]) + 0 = \underset{0 \leq t = s}{\underset{j \leq i}{min}}(f[j][t]) + 0 = \underset{0 \leq t = s}{\underset{j = i - 1}{f[j][t]}} + 0$$$ — move to next item, value is remained, weight is not increased
- We want the minimum weight so $$$f[i][s] = min(A, B) = min(f[i - 1][s], f[i - 1][s - v_i] + c_i)$$$
The final result: $$$result = \underset{0 \leq s \leq \Sigma(v_i)}{max}(s | f[n][s] \leq w)$$$ — maximum value whose weight is not greater than $$$W$$$

Code

#include <iostream>
#include <cstring>
#include <cmath>

using namespace std;

template<typename T> void maximize(T &res, const T &val) { if (res < val) res = val; }
template<typename T> void minimize(T &res, const T &val) { if (res > val) res = val; }

typedef long long ll;
const int INF = 0x3f3f3f3f;
const ll LINF = 0x3f3f3f3f3f3f3f3f;
/// ====*====*====*====*====*====*====*====*====*====*====*====*====*====*====*====

int main()
{
    int n, w;
    cin >> n >> w;

    int c[n + 1], v[n + 1];
    for (int i = 1; i <= n; ++i)
        cin >> c[i] >> v[i];

    int sum = 0;
    for (int i = 1; i <= n; ++i)
        sum += v[i];

    ll f[n + 1][sum + 1];
    memset(f, +LINF, sizeof(f));
    f[0][0] = 0;

    for (int i = 1; i <= n; ++i)
    {
        for (int s = 0; s <= sum; ++s)
        {
            f[i][s] = f[i - 1][s];

            if (s >= v[i])
            {
                minimize(f[i][s], f[i - 1][s - v[i]] + c[i]);
            }
        }
    }

    for (int res = sum; res >= 0; --res)
    {
        if (f[n][res] <= w)
        {
            cout << res;
            return 0;
        }
    }

    return 0;
}

C) Iterative Dynamic Programming (Space Optimization) — $$$O(N \times SUM)$$$ time — $$$O(N + SUM)$$$ space

A) O(2SUM) DP space

Observe: $$$\forall i > 0, f[i][x]$$$ depends on $$$f[i - 1]$$$ and $$$f[i]$$$ only, hence we just need 2 dp array space
Define: When we calculate at pth element, we have $$$\underset{x \equiv p (mod 2)}{f[x]}$$$ is current dp array, $$$\underset{y \equiv p + 1 (mod 2)}{f[y]}$$$ is previous dp array
Transistion: $$$f[i][s] = min(f[i - 1][s], f[i - 1][s - v_i] + c_i)$$$ equivalent to $$$f[x][s] = min(f[y][s], f[y][s - v_i] + c_i)$$$

Code

#include <iostream>
#include <cstring>
#include <cmath>

using namespace std;

template<typename T> void maximize(T &res, const T &val) { if (res < val) res = val; }
template<typename T> void minimize(T &res, const T &val) { if (res > val) res = val; }

typedef long long ll;
const int INF = 0x3f3f3f3f;
const ll LINF = 0x3f3f3f3f3f3f3f3f;
/// ====*====*====*====*====*====*====*====*====*====*====*====*====*====*====*====

int main()
{
    int n, w;
    cin >> n >> w;

    int c[n + 1], v[n + 1];
    for (int i = 1; i <= n; ++i)
        cin >> c[i] >> v[i];

    int sum = 0;
    for (int i = 1; i <= n; ++i)
        sum += v[i];

    ll f[2][sum + 1];
    memset(f, +LINF, sizeof(f));
    f[0][0] = 0;

    for (int i = 1; i <= n; ++i)
    {
        bool cur = i & 1;
        bool pre = !cur;

        for (int s = 0; s <= sum; ++s)
        {
            f[cur][s] = f[pre][s];

            if (s >= v[i])
            {
                minimize(f[cur][s], f[pre][s - v[i]] + c[i]);
            }
        }
    }

    for (int res = sum; res >= 0; --res)
    {
        if (f[n & 1][res] <= w)
        {
            cout << res;
            return 0;
        }
    }

    return 0;
}

B) O(SUM) 1D — DP space

From the above algorithm, we can change the inner loop

Inner Part

    ll f[2][sum + 1];
    memset(f, +LINF, sizeof(f));
    f[0][0] = 0;

    for (int i = 1; i <= n; ++i)
    {
        bool cur = i & 1;
        bool pre = !cur;

        for (int s = 0; s <= sum; ++s)
        {
            f[cur][s] = f[pre][s];

            if (s >= v[i])
            {
                minimize(f[cur][s], f[pre][s - v[i]] + c[i]);
            }
        }
    }

Kinda tricky, but we only need one array, for each query $$$f[s]$$$ stand for maximum value with bag of weight $$$s$$$ upto that query.

Inner Part

    ll f[2][sum + 1];
    memset(f, +LINF, sizeof(f));
    f[0][0] = 0;

    for (int i = 1; i <= n; ++i)
    {
        bool cur = i & 1;
        bool pre = !cur;

        for (int s = 0; s <= w; ++s) /// Unneeded loop
            f[s] = f[s];
        
        for (int s = sum; s >= v[i]; --s)
            minimize(f[s], f[s - v[i]] + c[i]);
    }

Notice that it is required for the second-inner-loop to iterate from $$$sum$$$ downto $$$v_i$$$. Here is the reason

From v[i] upto sum

        for which

            f[cur][s] = f[s] that updated
            f[pre][s] = f[s] that not update yet

        the part

            for (int s = v[i]; s <= sum; ++s)
                minimize(f[s], f[s - v[i]] + c[i]);

        equivalent to

            for (int s = v[i]; s <= sum; ++s)
                minimize(f[cur][s], f[cur][s - v[i]] + c[i]);

From sum downto v[i]

        for which

            f[cur][s] = f[s] that updated
            f[pre][s] = f[s] that not update yet

        the part

            for (int s = sum; s >= v[i] --s)
                minimize(f[s], f[s - v[i]] + c[i]);

        equivalent to

            for (int s = sum; s >= v[i] --s)
                minimize(f[cur][s], f[pre][s - v[i]] + c[i]);

Finally, here is 1D Dynamic Programming Solution

Code

#include <iostream>
#include <cstring>
#include <cmath>

using namespace std;

template<typename T> void maximize(T &res, const T &val) { if (res < val) res = val; }
template<typename T> void minimize(T &res, const T &val) { if (res > val) res = val; }

typedef long long ll;
const int INF = 0x3f3f3f3f;
const ll LINF = 0x3f3f3f3f3f3f3f3f;
/// ====*====*====*====*====*====*====*====*====*====*====*====*====*====*====*====

int main()
{
    int n, w;
    cin >> n >> w;

    int c[n + 1], v[n + 1];
    for (int i = 1; i <= n; ++i)
        cin >> c[i] >> v[i];

    int sum = 0;
    for (int i = 1; i <= n; ++i)
        sum += v[i];

    ll f[sum + 1];
    memset(f, +LINF, sizeof(f));

    f[0] = 0;
    for (int i = 1; i <= n; ++i)
        for (int s = sum; s >= v[i]; --s)
            minimize(f[s], f[s - v[i]] + c[i]);

    for (int res = sum; res >= 0; --res)
    {
        if (f[res] <= w)
        {
            cout << res;
            return 0;
        }
    }

    return 0;
}

――――――――――――――――――――――――――――――――――――――――――――――――――――――――――――――――

Teleporter: [Previous] | | | [Next]

VII. Tracing for selected elements

Which next state will lead to the best result ?

A) Solution for small number of element — N

A) Permutation Approach: We will update selected elements when we see a better solution

Permutation - O(n!) time - O(n) space

#include <algorithm>
#include <iostream>
#include <cstring>
#include <numeric>
#include <vector>
#include <cmath>

using namespace std;

template<typename T> void maximize(T &res, const T &val) { if (res < val) res = val; }
template<typename T> void minimize(T &res, const T &val) { if (res > val) res = val; }

typedef long long ll;
/// ====*====*====*====*====*====*====*====*====*====*====*====*====*====*====*====

int main()
{
    int n, w;
    cin >> n >> w;

    int c[n], v[n];
    for (int i = 0; i < n; ++i)
        cin >> c[i] >> v[i];

    int p[n];
    iota(p, p + n, 0);

    vector<int> selected;
    ll res = 0;
    do {
        bool better = false;
        vector<int> current;
        ll sum_weight = 0;
        ll sum_value = 0;
        for (int i = 0; i < n; ++i)
        {
            int weight = c[p[i]];
            int value = v[p[i]];

            sum_weight += weight;
            sum_value += value;
            if (sum_weight > w) 
            {
                break;
            }
            else
            {
               current.push_back(p[i]);
                if (res < sum_value)
                {
                    better = true;
                    res = sum_value;
                }
            }
        }

        if (better) selected = current;
    }
    while (next_permutation(p, p + n));

    cout << res << '\n';
    sort(selected.begin(), selected.end());
    for (int p : selected)
    {
        cout << p + 1 << ' ' << c[p] << ' ' << v[p] << '\n';
    }

    return 0;
}

B) Bitmasking Approach: We will update bitmask when we see a better solution

O(2^n)) time - O(n) space

#include <iostream>
#include <cstring>
#include <cmath>

using namespace std;

template<typename T> void maximize(T &res, const T &val) { if (res < val) res = val; }
template<typename T> void minimize(T &res, const T &val) { if (res > val) res = val; }

typedef long long ll;
/// ====*====*====*====*====*====*====*====*====*====*====*====*====*====*====*====

int main()
{
    int n, w;
    cin >> n >> w;

    int c[n], v[n];
    for (int i = 0; i < n; ++i)
        cin >> c[i] >> v[i];

    ll res = 0;
    int selected = 0;
    int lim = 1 << n;
    for (int mask = 0; mask < lim; ++mask)
    {
        ll weight = 0;
        ll value = 0;
        for (int i = 0; i < n; ++i)
        {
            if (mask >> i & 1)
            {
                weight += c[i];
                value += v[i];
                if (weight > w) break;
            }
        }

        if (weight <= w)
        {
            if (res <= value)
            {
                res = value;
                selected = mask;
            }
        }
    }

    cout << res << '\n';
    for (int i = 0; i < n; ++i)
    {
        if (selected >> i & 1)
        {
            cout << i + 1 << ' ' << c[i] << ' ' << v[i] << '\n';
        }
    }
    return 0;
}

C) Meet-in-the-middle Approach: We will update bitmask when we see a better solution AND ON DP-CALCULATION.

Bitmasking - O(2^(n/2)) time - O(2^(n/2)) space

#include <iostream>
#include <cstring>
#include <vector>
#include <cmath>
#include <bitset>

using namespace std;

template<typename T> void maximize(T &res, const T &val) { if (res < val) res = val; }
template<typename T> void minimize(T &res, const T &val) { if (res > val) res = val; }

#define all(x) (x).begin(), (x).end()
typedef long long ll;
/// ====*====*====*====*====*====*====*====*====*====*====*====*====*====*====*====

struct Node 
{
    ll maxval = 0;
    int maxmask = 0;

    int mask;
    ll value;
    int weight;
    Node (int mask = 0, ll value = 0, int weight = 0)
    : mask(mask), value(value), weight(weight) {}
};

int n, w;
void solve(const vector<int> &c, const vector<int> &v, vector<Node> &S)
{
    int n = c.size(); /// Important !!!
    int lim = 1 << n;
    for (int mask = 0; mask < lim; ++mask)
    {
        ll weight = 0;
        ll value = 0;
        for (int i = 0; i < n; ++i)
        {
            if (mask >> i & 1)
            {
                weight += c[i];
                value += v[i];
                if (weight > w) break;
            }
        }

        S.push_back(Node(mask, value, weight));
    }
}

int main()
{
    cin >> n >> w;

    int c[n], v[n];
    for (int i = 0; i < n; ++i)
        cin >> c[i] >> v[i];

    int m = n / 2;
    vector<int> cl, vl;
    for (int i = 0; i < m; ++i)
    {
        cl.push_back(c[i]);
        vl.push_back(v[i]);
    }

    vector<int> cr, vr;
    for (int i = m; i < n; ++i)
    {
        cr.push_back(c[i]);
        vr.push_back(v[i]);
    }

    vector<Node> Sl, Sr;
    solve(cl, vl, Sl);
    solve(cr, vr, Sr);

    sort(all(Sr), [](const Node &a, const Node &b) {
        return (a.weight != b.weight) ? a.weight < b.weight : a.value > b.value;
    });

    ll maxval = 0;
    int maxmask = 0;
    for (Node &x : Sr)
    {
        if (maxval < x.value)
        {
            maxval = x.value;
            maxmask = x.mask;
        }
        x.maxval = maxval;
        x.maxmask = maxmask;
    }

    ll res = 0;
    int mask_l = 0;
    int mask_r = 0;
    for (Node &y : Sl)
    {
        for (int l = 0, r = int(Sr.size()) - 1; l <= r; )
        {
            int m = (l + r) >> 1;
            Node x = Sr[m];
            if (x.weight + y.weight <= w)
            {
                if (res < x.maxval + y.value)
                {
                    res = x.maxval + y.value;
                    mask_l = y.mask;
                    mask_r = x.maxmask;
                }
                l = m + 1;
            }
            else 
            {
                r = m - 1;
            }
        }
    }

    vector<int> selected;
    for (int i = 0; i < m; ++i)
        if (mask_l >> i & 1)
            selected.push_back(i);

    for (int i = 0; i < n - m; ++i)
        if (mask_r >> i & 1)
            selected.push_back(i + m);

    cout << res << '\n';
    cout << selected.size() << '\n';
    for (int p : selected)
    {
        cout << p + 1 << ' ' << c[p] << ' ' << v[p] << '\n';
    }

    return 0;
}

B) Solution for small sum of weight — C[i]

A) Recursive Dynamic Programming: Starting from $$$(i = 0, s = 0)$$$, we already have $$$magic(i,s)$$$ return the best result, $$$magic(i + 1,s + 0) + 0)$$$ or/and $$$magic(i + 1, s + c[i]) + v[i]$$$ will be the best result

Trace cases

Recursive DP - O(NW) time - O(NW) space

#include <iostream>
#include <cstring>
#include <vector>
#include <cmath>

using namespace std;

template<typename T> void maximize(T &res, const T &val) { if (res < val) res = val; }
template<typename T> void minimize(T &res, const T &val) { if (res > val) res = val; }

typedef long long ll;
const int INF = 0x3f3f3f3f;
const ll LINF = 0x3f3f3f3f3f3f3f3f;
const int MAXN = 101;
const int MAXW = 101010;
/// ====*====*====*====*====*====*====*====*====*====*====*====*====*====*====*====

int n, w;
int c[MAXN];
int v[MAXN];
ll f[MAXN][MAXW];
ll magic(int i = 1, int s = 0)
{
    if (s > w) return -LINF; /// Using too much weight
    if (i > n) return 0;     /// No available item to add into the bag

    ll &res = f[i][s];
    if (res != -1) return res;
        
    maximize(res, magic(i + 1, s + 0) + 0);       /// Not using this item
    maximize(res, magic(i + 1, s + c[i]) + v[i]); /// Using this item
    return res;
}

vector<int> selected;
void trace(int i = 1, int s = 0)
{
    if (s > w) return ;
    if (i > n) return ;

    ll res = magic(i, s);
    if (res == magic(i + 1, s + 0) + 0)
    {
        return trace(i + 1, s + 0);
    }
    else 
    {
        selected.push_back(i);
        return trace(i + 1, s + c[i]);
    }
}

int main()
{
    cin >> n >> w;
    for (int i = 1; i <= n; ++i)
        cin >> c[i] >> v[i];

    memset(f, -1, sizeof(f));
    cout << magic() << '\n';

    trace();
    cout << selected.size() << '\n';
    for (int p : selected)
    {
        cout << p << ' ' << c[p] << ' ' << v[p] << '\n';
    }
    return 0;
}

B) Iterative Dynamic Programming:

Prefixmax Iterative DP - O(NW) time - O(NW) space

#include <iostream>
#include <cstring>
#include <vector>
#include <cmath>

using namespace std;

template<typename T> void maximize(T &res, const T &val) { if (res < val) res = val; }
template<typename T> void minimize(T &res, const T &val) { if (res > val) res = val; }

typedef long long ll;
/// ====*====*====*====*====*====*====*====*====*====*====*====*====*====*====*====

int main()
{
    int n, w;
    cin >> n >> w;

    int c[n + 1], v[n + 1];
    for (int i = 1; i <= n; ++i)
        cin >> c[i] >> v[i];

    ll f[n + 1][w + 1];
    memset(f, 0, sizeof(f));
    for (int i = 1; i <= n; ++i)
    {
        for (int s = 1; s <= w; ++s)
        {
            f[i][s] = max(f[i][s - 1], f[i - 1][s]);
            if (s >= c[i])
            {
                maximize(f[i][s], f[i - 1][s - c[i]] + v[i]);
            }
        }
    }

    vector<int> selected;
    for (int i = n, s = w; i >= 1 && s >= 1; )
    {
        if (f[i][s] == f[i - 1][s - c[i]] + v[i])
        {
            selected.push_back(i);
            s -= c[i];
            i -= 1;
            continue;
        }

        if (f[i][s - 1] > f[i - 1][s])
        {
            --s;
        }
        else /// f[i][s] = f[i - 1][s]
        {
            --i;
        }
    }

    cout << f[n][w] << '\n';
    cout << selected.size() << '\n';
    for (int p : selected)
    {
        cout << p << ' ' << c[p] << ' ' << v[p] << '\n';
    }
    return 0;
}

C) Iterative Dynamic Programming (Space Optimization):

Explanation

Code

#include <algorithm>
#include <iostream>
#include <cstring>
#include <vector>
#include <cmath>
	
using namespace std;
	
template<typename T> void maximize(T &res, const T &val) { if (res < val) res = val; }
template<typename T> void minimize(T &res, const T &val) { if (res > val) res = val; }
	
typedef long long ll;
/// ====*====*====*====*====*====*====*====*====*====*====*====*====*====*====*====
	
const int LIM_N = 111;
const int LIM_W = 1e6 + 16;
	
int n, w;
int c[LIM_N], v[LIM_N];
ll f[LIM_W];

int calc(ll *f, int l = 1, int r = n)
{
	ll upper = 0;
	for (int i = l; i <= r; ++i)
		upper += c[i];
	
	minimize(upper, (ll)w);
	for (int s = 0; s <= upper; ++s)
		f[s] = 0;
	
	for (int i = l; i <= r; ++i)
		for (int s = upper; s >= c[i]; --s)
			maximize(f[s], f[s - c[i]] + v[i]);
	
	return upper;
}
	
ll L[LIM_W], R[LIM_W];
vector<int> selected;
void trace(int s = w, int l = 1, int r = n)
{
	if (l == r)
	{
		if (s == c[l])
		{
			selected.push_back(l);
		}
	
		return ;
	}
	
	int m = (l + r) >> 1;
	int sleft  = calc(L, l, m + 0);
	int sright = calc(R, m + 1, r);
	
	ll mx = -1;
	int pleft = 0;
	int pright = s;
	for (int v = max(0, s - sright); v <= min(s, sleft); ++v)
	{
		if (mx < L[v] + R[s - v])
		{
			mx = L[v] + R[s - v];
			pleft = v;
			pright = s - v;
		}
	}
	
	trace(pleft , l, m + 0);
	trace(pright, m + 1, r);
}
		
int main()
{
	cin >> n >> w;
	for (int i = 1; i <= n; ++i)
		cin >> c[i] >> v[i];
	
	calc(f);
	ll res = f[w];
	int weight_used = max_element(f, f + w + 1) - f;
	
	trace(weight_used);	
	cout << res << '\n';	
	cout << selected.size() << '\n';
	for (int p : selected)
	{
	    cout << p << ' ' << c[p] << ' ' << v[p] << '\n';
	}
	return 0;
}

C) Solution for small sum of value — V[i]

A) Recursive Dynamic Programming: Starting from $$$(i = 0, s = res)$$$, we already have $$$magic(i,s)$$$ return the best result, $$$magic(i + 1,s + 0) + 0)$$$ or/and $$$magic(i + 1, s - v[i]) + c[i]$$$ will be the best result

Trace cases

Recursive DP - O(NSUM) time - O(NSUM) space

#include <iostream>
#include <cstring>
#include <vector>
#include <cmath>

using namespace std;

template<typename T> void maximize(T &res, const T &val) { if (res < val) res = val; }
template<typename T> void minimize(T &res, const T &val) { if (res > val) res = val; }

typedef long long ll;
const int INF = 0x3f3f3f3f;
const ll LINF = 0x3f3f3f3f3f3f3f3f;
const int MAXN = 101;
const int MAXSUM = 101010;
/// ====*====*====*====*====*====*====*====*====*====*====*====*====*====*====*====

int n, w;
int c[MAXN];
int v[MAXN];
ll f[MAXN][MAXSUM];
ll magic(int i = 1, int s = 0)
{
    if (s < 0) return +LINF;
    if (i > n) return (s == 0) ? 0 : +LINF;

    ll &res = f[i][s];
    if (res != -1) return res;
    res = +LINF;

    minimize(res, magic(i + 1, s - 0) + 0);
    minimize(res, magic(i + 1, s - v[i]) + c[i]);
    return res;
}

vector<int> selected;
void trace(int i = 1, int s = 0)
{
    if (s < 0) return ;
    if (i > n) return ;

    ll res = magic(i, s);
    if (res == magic(i + 1, s + 0) + 0)
    {
        return trace(i + 1, s + 0);
    }
    else 
    {
        selected.push_back(i);
        return trace(i + 1, s - v[i]);
    }
}

int main()
{
    cin >> n >> w;
    for (int i = 1; i <= n; ++i)
        cin >> c[i] >> v[i];

    int sum = 0;
    for (int i = 1; i <= n; ++i)
        sum += v[i];

    memset(f, -1, sizeof(f));
    for (int res = sum; res >= 0; --res)
    {
        if (magic(1, res) <= w)
        {
            trace(1, res);
            
            cout << res << '\n';
            cout << selected.size() << '\n';
            for (int p : selected)
            {
                cout << p << ' ' << c[p] << ' ' << v[p] << '\n';
            }
            return 0;
        }
    }

    return 0;
}

B) Iterative Dynamic Programming:

Iterative DP - O(NSUM) time - O(NSUM) space

#include <iostream>
#include <cstring>
#include <vector>
#include <cmath>

using namespace std;

template<typename T> void maximize(T &res, const T &val) { if (res < val) res = val; }
template<typename T> void minimize(T &res, const T &val) { if (res > val) res = val; }

typedef long long ll;
const int INF = 0x3f3f3f3f;
const ll LINF = 0x3f3f3f3f3f3f3f3f;
/// ====*====*====*====*====*====*====*====*====*====*====*====*====*====*====*====

int main()
{
    int n, w;
    cin >> n >> w;

    int c[n + 1], v[n + 1];
    for (int i = 1; i <= n; ++i)
        cin >> c[i] >> v[i];

    int sum = 0;
    for (int i = 1; i <= n; ++i)
        sum += v[i];

    ll f[n + 1][sum + 1];
    memset(f, +LINF, sizeof(f));
    f[0][0] = 0;

    for (int i = 1; i <= n; ++i)
    {
        for (int s = 0; s <= sum; ++s)
        {
            f[i][s] = f[i - 1][s];

            if (s >= v[i])
            {
                minimize(f[i][s], f[i - 1][s - v[i]] + c[i]);
            }
        }
    }

    int res = sum;
    while (f[n][res] > w) --res;
    
    vector<int> selected;
    for (int i = n, s = res; i >= 1 && s >= 1; )
    {
        if (f[i][s] == f[i - 1][s - v[i]] + c[i])
        {
            selected.push_back(i);
            s -= v[i];
            i -= 1;
        }
        else 
        {
            --i;
        }
    }

    cout << res << '\n';
    cout << selected.size() << '\n';
    for (int p : selected)
    {
        cout << p << ' ' << c[p] << ' ' << v[p] << '\n';
    }
    return 0;
}

C) Iterative Dynamic Programming (Space Optimization):

Explanation

Code

#include <algorithm>
#include <iostream>
#include <cstring>
#include <vector>
#include <cmath>
	
using namespace std;
	
template<typename T> void maximize(T &res, const T &val) { if (res < val) res = val; }
template<typename T> void minimize(T &res, const T &val) { if (res > val) res = val; }
	
typedef long long ll;
const int INF = 0x3f3f3f3f;
const ll LINF = 0x3f3f3f3f3f3f3f3f;
/// ====*====*====*====*====*====*====*====*====*====*====*====*====*====*====*====
	
const int LIM_N = 111;
const int LIM_S = 1e6 + 16;
	
int n, w;
int c[LIM_N], v[LIM_N];
ll f[LIM_S + 1];

int calc(ll *f, int l = 1, int r = n)
{
	int upper = 0;
	for (int i = l; i <= r; ++i)
		upper += v[i];

	f[0] = 0;
	for (int s = upper; s >= 1; --s)
		f[s] = +LINF;

	for (int i = l; i <= r; ++i)
		for (int s = upper; s >= v[i]; --s)
			minimize(f[s], f[s - v[i]] + c[i]);
	
	return upper;
}

vector<int> selected;
ll L[LIM_S + 1], R[LIM_S + 1];
void trace(int s = LIM_S, int l = 1, int r = n)
{
	if (l == r)
	{
		if (s == v[l])
		{
			selected.push_back(l);
		}	

		return ;
	}

	int m = (l + r) >> 1;
	int sleft  = calc(L, l, m + 0);
	int sright = calc(R, m + 1, r);

	int mn = +INF;
	int pleft = 0;
	int pright = s;
	for (int v = max(0, s - sright); v <= min(s, sleft); ++v)
	{
		if (mn > L[v] + R[s - v])
		{
			mn = L[v] + R[s - v];
			pleft = v;
			pright = s - v;
		}
	}

	trace(pleft , l, m + 0);
	trace(pright, m + 1, r);
}

int main()
{
	cin >> n >> w;
	for (int i = 1; i <= n; ++i)
		cin >> c[i] >> v[i];

	int res = calc(f);
	while (f[res] > w) --res;
	trace(res);
	
	int ans = 0;
	for (int p : selected)
		ans += v[p];

	cout << ans;
	// cout << res << '\n';
	// cout << selected.size() << '\n';
	// for (int p : selected)
	// {
	// 	cout << p << ' ' << c[p] << ' ' << v[p] << '\n';
	// }

	return 0;
}

――――――――――――――――――――――――――――――――――――――――――――――――――――――――――――――――

Teleporter: [Previous] | | | [Next]

VII. Other solutions

How to solve the problem with special condition ?

A) Fractional Knapsack & Greedy Approach

On progress...

――――――――――――――――――――――――――――――――――――――――――――――――――――――――――――――――

Teleporter: [Previous] | | | [Next]

VIII. Online Algorithm

How to solve the problem when you need to output the result whenever you receive a new item ?

A) Solution for small number of element — N

On progress...

B) Solution for small sum of weight — C[i]

A) Recursive Dynamic Programming:

What have changed from the orginal ?

O(NW) time - O(NW) space

#include <iostream>
#include <cstring>
#include <cmath>

using namespace std;

template<typename T> void maximize(T &res, const T &val) { if (res < val) res = val; }
template<typename T> void minimize(T &res, const T &val) { if (res > val) res = val; }

typedef long long ll;
const int INF = 0x3f3f3f3f;
const ll LINF = 0x3f3f3f3f3f3f3f3f;
const int MAXN = 101;
const int MAXW = 101010;
/// ====*====*====*====*====*====*====*====*====*====*====*====*====*====*====*====

int n, w;
int c[MAXN];
int v[MAXN];
ll f[MAXN][MAXW];
ll magic(int i = 1, int s = 0)
{
    if (s > w) return -LINF; /// Using too much weight
    if (i == 0) return 0;     /// No available item to add into the bag

    ll &res = f[i][s];
    if (res != -1) return res;
        
    maximize(res, magic(i - 1, s + 0) + 0);       /// Not using this item
    maximize(res, magic(i - 1, s + c[i]) + v[i]); /// Using this item
    return res;
}

int main()
{
    memset(f, -1, sizeof(f));

    cin >> n >> w;
    for (int i = 1; i <= n; ++i)
    {
        cin >> c[i] >> v[i];
        cout << magic(i, 0) << '\n';
    }
    return 0;
}

B) Iterative Dynamic Programming:

What have changed from the orginal ?

O(NW) time - O(NW) space

#include <iostream>
#include <cstring>
#include <cmath>

using namespace std;

template<typename T> void maximize(T &res, const T &val) { if (res < val) res = val; }
template<typename T> void minimize(T &res, const T &val) { if (res > val) res = val; }

typedef long long ll;
/// ====*====*====*====*====*====*====*====*====*====*====*====*====*====*====*====

int main()
{
    int n, w;
    cin >> n >> w;

    ll f[n + 1][w + 1];
    memset(f, 0, sizeof(f));
    for (int i = 1; i <= n; ++i)
    {
        int c, v;
        cin >> c >> v;

        for (int s = 1; s <= w; ++s)
        {
            f[i][s] = max(f[i][s - 1], f[i - 1][s]);
            if (s >= c)
            {
                maximize(f[i][s], f[i - 1][s - c] + v);
            }
        }

        cout << f[i][w] << '\n';
    }

    return 0;
}

C) Iterative Dynamic Programming (Space Optimization):

What have changed from the orginal ?

O(NW) time - O(W) space

#include <iostream>
#include <cstring>
#include <cmath>

using namespace std;

template<typename T> void maximize(T &res, const T &val) { if (res < val) res = val; }
template<typename T> void minimize(T &res, const T &val) { if (res > val) res = val; }

typedef long long ll;
/// ====*====*====*====*====*====*====*====*====*====*====*====*====*====*====*====

int main()
{
    int n, w;
    cin >> n >> w;

    ll f[w + 1];
    memset(f, 0, sizeof(f));
    for (int i = 1; i <= n; ++i)
    {
        int c, v;
        cin >> c >> v;

        for (int s = w; s >= c; --s)
            maximize(f[s], f[s - c] + v);
    
        cout << f[w] << '\n';
    }

    return 0;
}

C) Solution for small sum of value — V[i]

A) Recursive Dynamic Programming:

What have changed from the orginal ?

O(NSUM) time - O(NSUM) space

#include <iostream>
#include <cstring>
#include <cmath>

using namespace std;

template<typename T> void maximize(T &res, const T &val) { if (res < val) res = val; }
template<typename T> void minimize(T &res, const T &val) { if (res > val) res = val; }

typedef long long ll;
const int INF = 0x3f3f3f3f;
const ll LINF = 0x3f3f3f3f3f3f3f3f;
const int MAXN = 101;
const int MAXSUM = 101010;
/// ====*====*====*====*====*====*====*====*====*====*====*====*====*====*====*====

int n, w;
int c[MAXN];
int v[MAXN];
ll f[MAXN][MAXSUM];
ll magic(int i = 1, int s = 0)
{
    if (s < 0) return +LINF;
    if (i == 0) return (s == 0) ? 0 : +LINF;

    ll &res = f[i][s];
    if (res != -1) return res;
    res = +LINF;

    minimize(res, magic(i - 1, s - 0) + 0);
    minimize(res, magic(i - 1, s - v[i]) + c[i]);
    return res;
}

int main()
{
    cin >> n >> w;

    int sum = 0;
    memset(f, -1, sizeof(f));
    for (int i = 1; i <= n; ++i)
    {
        cin >> c[i] >> v[i];

        sum += v[i];
        for (int res = sum; res >= 0; --res)
        {
            if (magic(i, res) <= w)
            {
                cout << res << '\n';
                break;
            }
        }
    }

    return 0;
}

B) Iterative Dynamic Programming:

What have changed from the orginal ?

O(NSUM) time - O(NSUM) space

#include <iostream>
#include <cstring>
#include <cmath>
#include <vector>

using namespace std;

template<typename T> void maximize(T &res, const T &val) { if (res < val) res = val; }
template<typename T> void minimize(T &res, const T &val) { if (res > val) res = val; }

typedef long long ll;
const int INF = 0x3f3f3f3f;
const ll LINF = 0x3f3f3f3f3f3f3f3f;
const int LIMN = 100;
const int LIMSUM = 1e5 + 15;
/// ====*====*====*====*====*====*====*====*====*====*====*====*====*====*====*====

ll f[LIMN][LIMSUM];
int main()
{
    int n, w;
    cin >> n >> w;

    memset(f, +LINF, sizeof(f));
    f[0][0] = 0;

    int sum = 0;
    for (int i = 1; i <= n; ++i)
    {
        int c, v;
        cin >> c >> v;

        sum += v;
        for (int s = sum; s >= 0; --s)
        {
            f[i][s] = f[i - 1][s];

            if (s >= v)
            {
                minimize(f[i][s], f[i - 1][s - v] + c);
            }
        }

        for (int res = sum; res >= 0; --res)
        {
            if (f[i][res] <= w)
            {
                cout << res << '\n';
                break;
            }
        }
    }

    return 0;
}

C) Iterative Dynamic Programming (Space Optimization):

What have changed from the orginal ?

O(NSUM) time - O(NSUM) space

#include <iostream>
#include <cstring>
#include <cmath>

using namespace std;

template<typename T> void maximize(T &res, const T &val) { if (res < val) res = val; }
template<typename T> void minimize(T &res, const T &val) { if (res > val) res = val; }

typedef long long ll;
const int LIM = 1e6 + 16;
const int INF = 0x3f3f3f3f;
const ll LINF = 0x3f3f3f3f3f3f3f3f;
/// ====*====*====*====*====*====*====*====*====*====*====*====*====*====*====*====

ll f[LIM];
int main()
{
    int q, w;
    cin >> q >> w;

    memset(f, +LINF, sizeof(f));
    f[0] = 0;

    int sum = 0;
    while (q-->0) /// For each query
    {
        int c, v;
        cin >> c >> v;

        sum += v;
        for (int s = sum; s >= v; --s)
            minimize(f[s], f[s - v] + c);

        
        for (int res = sum; res >= 0; --res)
        {
            if (f[res] <= w)
            {
                cout << res << '\n';
                break;
            }
        }
    }

    return 0;
}

――――――――――――――――――――――――――――――――――――――――――――――――――――――――――――――――

Teleporter: [Previous] | | | [Next]

IX. Optimizations and Heuristic

How to improve the algorithm faster, shorter, simpler, safetier or saving space

A) Filtering the array

1) Split items into 2 types, whose weight less than $$$W$$$ and the rest

Hint

2) Compressed the array

Hint

――――――――――――――――――――――――――――――――――――――――――――――――――――――――――――――――

Teleporter: [Previous] | | | [Next]

X. Debugging

Support you when you are in a trouble that you cant find your bug

A) Wrong answer

1) Becareful when weight sum and value sum is big, it would cause overflow

Debug

long long weight = 0;
long long value = 0;

2) Becareful that in Meet-in-the-middle approach:

You have to update the bitmask that have maxvalue.
You have to update the $$$maxval$$$ and $$$maxmask$$$ before assign $$$x.maxval$$$, $$$x.maxmask$$$
You have to use also in collecting the result

Wrong

    ll maxval = 0;
    for (Node &x : Sr)
    {
        /// What if x.value > maxval ??
        x.maxval = maxval;
        x.maxmask = maxmask;
        if (maxval < x.value)
        {
            maxval = x.value;
            /// not update maxmask ?
        }
    }

Wrong

    if (res < x.value + y.value) /// where is maxvalue ?
    {
        res = x.value + y.value;
        mask_l = y.mask;
        mask_r = x.mask;
    }

Wrong

    if (res < x.maxval + y.value)
    {
        res = x.maxval + y.value;
        mask_l = y.mask;
        mask_r = x.mask; /// this mask might not given the maxval !
    }

Debug

    ll maxval = 0;
    int maxmask = 0;
    for (Node &x : Sr)
    {
        if (maxval < x.value)
        {
            maxval = x.value;
            maxmask = x.mask;
        }
        x.maxval = maxval;
        x.maxmask = maxmask;
    }

Debug


    if (res < x.maxval + y.value)
    {
        res = x.maxval + y.value;
        mask_l = y.mask;
        mask_r = x.maxmask;
    }

3) Forget base cases: In type $$$IV$$$ the DP is already init as 0, so you dont need the loop to zero, while the $$$V$$$ is not like that when you init it as $$$+oo$$$

Wrong


    memset(f, +LINF, sizeof(f));
    f[0][0] = 0;

    int sum = 0;
    for (int i = 1; i <= n; ++i)
    {
        int c, v;
        cin >> c >> v;

        sum += v;
        for (int s = sum; s >= 1; --s) /// you have to make a loop from s = sum -> 0
        {
            f[i][s] = f[i - 1][s];

            if (s >= v)
            {
                minimize(f[i][s], f[i - 1][s - v] + c);
            }
        }

        for (int res = sum; res >= 0; --res)
        {
            if (f[i][res] <= w)
            {
                cout << res << '\n';
                break;
            }
        }
    }

B) Time Limit Exceed

1) Global variable $$$\neq$$$ Local variable

In Meet-in-the-middle approach, the solve() function didnt use global variable (n), it use $$$n = |c| = |s|$$$.

Debug

Assign this at the head of the function


void solve(const vector<int> &c, const vector<int> &v, vector<Node> &S)
{
    int n = c.size(); /// Important !!!
    ...
}

or

void solve(const vector<int> &c, const vector<int> &v, vector<Node> &S)
{
    int n = v.size(); /// Important !!!
    ...
}

2) Forget to use memorization

Wrong

ll magic(int i = 1, int s = 0)
{
    if (s < 0) return +LINF;
    if (i > n) return (s == 0) ? 0 : +LINF;

    ll res = f[i][s]; /// is should be &res = [i][s]
    if (res != -1) return res;
    ll res = +LINF;

    minimize(res, magic(i + 1, s - 0) + 0);
    minimize(res, magic(i + 1, s - v[i]) + c[i]);
    return res;
}

Wrong

ll magic(int i = 1, int s = 0)
{
    if (s < 0) return +LINF;
    if (i > n) return (s == 0) ? 0 : +LINF;

    ll res = +LINF;
    minimize(res, magic(i + 1, s - 0) + 0);
    minimize(res, magic(i + 1, s - v[i]) + c[i]);
    return f[i][s] = res; /// It is calculated first then assigning dp value
}

3) You might get WA if you have wrong initalization or leave the value generated randomly

Wrong

    ll f[sum + 1];

    /// What if f[x > 0] negative ?
    f[0] = 0;
    for (int i = 1; i <= n; ++i)
        for (int s = sum; s >= v[i]; --s)
            minimize(f[s], f[s - v[i]] + c[i]);

4) If you wanna binary search for the result, remember that you cant do Prefixmin DP $$$O(N \times SUM)$$$ as what it like in Prefixmax DP $$$O(N \times W)$$$

Wrong


    ll f[n + 1][sum + 1];
    memset(f, +LINF, sizeof(f));
    f[0][0] = 0;

    for (int i = 1; i <= n; ++i)
    {
        for (int s = 1; s <= sum; ++s)
        {
            f[i][s] = min(f[i][s - 1], f[i - 1][s]);

            if (s >= v[i])
            {
                minimize(f[i][s], f[i - 1][s - v[i]] + c[i]);
            }
        }
    }

    int res = 0;
    for (int l = 1, r = sum; l <= r; )
    {
        int m = (l + r) >> 1;
        if (f[n][m] <= w)
        {
            res = m;
            l = m + 1;
        }
        else 
        {
            r = m - 1;
        }
    }
    cout << res;

C) Memory limit exceed

1) Though Meet-in-the-middle approach is faster than Bitmasking Approach, it requires large amount of space — $$$O(2^{^{\frac{n}{2}}}$$$, which may give you MLE !

2) In some cases you will need space optimization if the limit is too tight !

3) Becareful in tracing results

Wrong

    vector<int> selected;
    for (int i = n, s = w; i >= 1 && s >= 1; )
    {
        if (f[i][s] == f[i - 1][s - c[i]] + v[i])
        {
            selected.push_back(i);
            i -= 1;
            s -= c[i];
        }

        if (f[i][s - 1] > f[i - 1][s])
        {
            --s;
        }
        else /// f[i][s] = f[i - 1][s]
        {
            --i;
        }
    }

Fixed

    vector<int> selected;
    for (int i = n, s = w; i >= 1 && s >= 1; )
    {
        if (f[i][s] == f[i - 1][s - c[i]] + v[i])
        {
            selected.push_back(i);
            s -= c[i]; /// This first then decrease (i)
            i -= 1;
            continue; /// <--- Important in this case
        }

        if (f[i][s - 1] > f[i - 1][s])
        {
            --s;
        }
        else /// f[i][s] = f[i - 1][s]
        {
            --i;
        }
    }

D) Runtime Error

1) Out of bound

Wrong

ll f[MAXN][MAXW];
ll magic(int i = 1, int s = 0)
{
    if (i > n) return (s <= w) ? 0 : -LINF;     /// No available item to add into the bag

    ll &res = f[i][s]; /// what if (s > w) ?
    if (res != -1) return res;
        
    maximize(res, magic(i + 1, s + 0) + 0);       /// Not using this item
    maximize(res, magic(i + 1, s + c[i]) + v[i]); /// Using this item
    return res;
}

Wrong

    ll f[n + 1][w + 1];
    memset(f, 0, sizeof(f));
    for (int i = 1; i <= n; ++i)
        for (int s = w; s >= 0; --s)
            f[i][s] = max(f[i - 1][s], f[i - 1][s - c[i]] + v[i]); /// What if s < c[i] ?

2) Array too big in main function: Declare it globally with the init size bigger than problem constraint a bit

――――――――――――――――――――――――――――――――――――――――――――――――――――――――――――――――

Teleporter: [Previous] | | | [Next]

XI. Knapsack Variation and Practice Problems

In case you need a place to practice or submitting

1) CSES | DP section | Book Shop

Hint

2) Easy but nice problem — (contributor TheScrasse)

Note

3) Codeforces #683 Div 1 | Problem 1446A | Knapsack

Hint

4) SPOJ | Classical | Large Knapsack

5) SPOJ | Tutorial | Knapsack

6) Codeforces #61 Edu | Problem 1132E | Knapsack

7) Atcoder DP Contest | Problem D | Knapsack 1

8) Atcoder DP Contest | Problem E | Knapsack 2

9) DMOJ | Knapsack 3

10) DMOJ | Knapsack 4

11) Codeforces Wunder Fund Round 2016 | Double Knapsack

――――――――――――――――――――――――――――――――――――――――――――――――――――――――――――――――

Teleporter: [Previous] | | | [Next]

XII. Blog status

The current progress and contributor of this blogs

Current progress;

- **1)** Online Algorithms

 - **2)** Remain space optimization while tracing for elements

On progress:

- **0)** Table of content & Complexity comparision table

- **1)** Online Algorithm

- **2)** Optimizations and Heuristic

- **3a)** Unbounded knapsack

- **3b)** Bounded knapsack

- **3c)** Item limitation knapsack

- **4a)** Knapsack query maximum value with item in range $$$[L, R]$$$

- **4b)** Knapsack query maximum value with weight in range $$$[L, R]$$$

- **4c)** Knapsack query minimum weight with value in range $$$[L, R]$$$

- **5a)** Multiple knapsack bags

- **5b)** Multidimentional item

Special thank to contributors: SPyofgame, TheScrasse, Lusterdawn, jiangly

Full text and comments »

knapsack, bruteforce, bitmasking, dp, 1d-dp, 2d-dp, tutorial, practice, meet-in-the-middle, knapsack 0/1, debugging, solutions

SPyofgame
4 years ago
2

Minimum number of charactor need to add to make a valid barrack string

By SPyofgame, history, 4 years ago, In English

The full statement

Main statement

A pair of charactor $$$(L, R)$$$ is called good or matched to each other if it satisfy one of below

$$$L =$$$ '(' and $$$R =$$$ ')'
$$$L =$$$ '[' and $$$R =$$$ ']'
$$$L =$$$ '{' and $$$R =$$$ '}'

Notice that if $$$(L, R)$$$ is good then $$$(R, L)$$$ is not good

String can have many variation of categories, one of that is good string. Let a string $$$S$$$ of size $$$N$$$ is called good if

$$$S$$$ is empty (its length $$$N = 0$$$)
$$$S = S_1 + S_2 + \dots + S_n$$$. Where $$$S_i$$$ is a good string and + symbol mean that string concaternation
$$$S = L + S_x + R$$$ where $$$S_x$$$ is a good string and $$$(L, R)$$$ is a good pair of charactor

Given a string $$$S$$$ of size $$$N$$$. We can add some charactor '(', ')', '[', ']', '{', '}' into anywhere in string $$$S$$$ but you cant replace or remove them.

The question is that: What is the minimum number of charactor need to add into string to make it good ?

Limitation: $$$N = |S| \leq 500$$$

The dynamic programming solution $$$O(n^3)$$$

Lets $$$F(l, r)$$$ is the answer for substring $$$S[l..r]$$$.

If $$$l > r$$$ then the string is empty, hence the answer is $$$F(l, r) = 0$$$
If $$$l = r$$$ then we should add one charactor to match $$$S_l$$$ to make this substring good, hence the answer is $$$F(l, r) = 1$$$
We can split into 2 other substring $$$S[l..r] = S[l..k] + S[k+1..r]$$$, for each $$$k$$$ we have $$$F(l, r) = F(l, k) + F(k+1, r)$$$ hence $$$F(l, r) = min(F(l, k) + F(k+1, r))$$$
Notice that when $$$S_l$$$ match $$$S_r$$$, $$$F(l, r) = min(F(l + 1, r - 1), min(F(l, k) + F(k+1, r)))$$$

Complexity:

$$$F(l, r)$$$ have $$$O(n^2)$$$ states
In each substring $$$S[l..r]$$$, we maybe to have a for-loop $$$O(n)$$$
Hence the upper bound of the complexity is $$$O(n^3)$$$

Recursive Code

///   Minimum number of error
/// = Minimum number of char need to add
int f[LIM][LIM]; /// Min error in substring s[l..r]
int solve(int l = 0, int r = n - 1)
{
    if (l > r) return 0; /// empty substring
    if (l == r) return 1; /// dont pair with other

    int &res = f[l][r];
    if (res != +INF) return res; /// calculated

    if (match(s[l], s[r])) /// paired up, no error
    {
        res = 0 + solve(l + 1, r - 1);
    }
        
    /// Split string into 2 halves
    /// s[l..r] = s[l..k] + s[k+1..r]
    /// f[l..r] = min(f[l..k] + f[k+1..r])
    for (int k = l; k < r; ++k)
    {
        minimize(res, solve(l, k) + solve(k + 1, r));
    }

   return res;
}

Iterative Code

///   Minimum number of error
/// = Minimum number of char need to add
int f[LIM][LIM]; /// Min error in substring s[l..r]
int solve()
{
    memset(f, 0, sizeof(f));
    for (int r = 0; r < n; ++r)
    {
        for (int l = r; l >= 0; --l)
        {
            f[l][r] = r - l + 1; /// Add one charactor to each one
            if (match(s[l], s[r])) /// Outside make a good pair of string
            {
                f[l][r] = f[l+1][r-1];
            }

            /// Split string into 2 halves
            /// s[l..r] = s[l..k] + s[k+1..r]
            /// f[l..r] = min(f[l..k] + f[k+1..r])
            for (int k = l; k < r; ++k)
            {
                minimize(f[l][r], f[l][k] + f[k+1][r]);
            }
        }
    }

    return f[0][n - 1];
}

Full Code

#include <iostream>
#include <cstring>
#include <cstdio>

using namespace std;

template<typename T> void maximize(T &res, const T &val) { if (res < val) res = val; }
template<typename T> void minimize(T &res, const T &val) { if (res > val) res = val; }
void file(const string FILE = "Test")
{
    freopen((FILE + ".INP").c_str(), "r", stdin);
    freopen((FILE + ".OUT").c_str(), "w", stdout);
}

const int LIM = 1010;
const int INF = 0x3f3f3f3f;
/// ====*====*====*====*====*====*====*====*====*====*====*====*====*====*====*====*====

int n;
string s;

///// One variation
//int solve()
//{
//    int res = 0, diff = 0;
//    for (char c : s)
//    {
//        diff += (c == '(') ? +1 : -1;
//        if (diff < 0)
//        {
//            diff = 0;
//            ++res;
//        }
//    }
//    res += diff;
//
//    return res;
//}

bool match(char l, char r)
{
    return (l == '(' && r == ')')
        || (l == '[' && r == ']')
        || (l == '{' && r == '}');
}

///   Minimum number of error
/// = Minimum number of char need to add
int f[LIM][LIM]; /// Min error in substring s[l..r]

/// Iterative
int solve()
{
    memset(f, 0, sizeof(f));
    for (int r = 0; r < n; ++r)
    {
        for (int l = r; l >= 0; --l)
        {
            f[l][r] = r - l + 1; /// Add one charactor to each one
            if (match(s[l], s[r])) /// Outside make a good pair of string
            {
                f[l][r] = f[l+1][r-1];
            }

            /// Split string into 2 halves
            /// s[l..r] = s[l..k] + s[k+1..r]
            /// f[l..r] = min(f[l..k] + f[k+1..r])
            for (int k = l; k < r; ++k)
            {
                minimize(f[l][r], f[l][k] + f[k+1][r]);
            }
        }
    }

    return f[0][n - 1];
}

/// Recursive
//int solve(int l = 0, int r = n - 1)
//{
//    if (l > r) return 0; /// empty substring
//    if (l == r) return 1; /// dont pair with other
//
//    int &res = f[l][r];
//    if (res != +INF) return res; /// calculated
//
//    if (match(s[l], s[r])) /// paired up, no error
//    {
//        res = 0 + solve(l + 1, r - 1);
//    }
//
//     /// Split string into 2 halves
//     /// s[l..r] = s[l..k] + s[k+1..r]
//     /// f[l..r] = min(f[l..k] + f[k+1..r])
//     for (int k = l; k < r; ++k)
//     {
//         minimize(res, solve(l, k) + solve(k + 1, r));
//     }

//    return res;
// }


int main()
{
    file();
    cin >> s;
    n = s.size();

    memset(f, +INF, sizeof(f));
    cout << solve();
    return 0;
}

The other dynamic programming solution $$$O(n^3)$$$

Base cases:

If $$$l > r$$$ then the string is empty, hence $$$F(l, r) = 0$$$
If $$$l = r$$$ then we should add one charactor to match $$$S_l$$$ to make this substring good, hence $$$F(l, r) = 1$$$

Branch and bound cases:

If $$$S_l$$$ is close barrack, then add a open barrack before it, hence $$$F(l, r) = F(l + 1, r) + 1$$$
If $$$S_r$$$ is open barrack, then add a close barrack after it, hence $$$F(l, r) = F(l, r - 1) + 1$$$
If $$$(S_l, S_{l+1})$$$ is good, then just paired it up, hence $$$F(l, r) = F(l + 2, r) + 0$$$
If $$$(S_{r-1}, S_r)$$$ is good, then just paired it up, hence $$$F(l, r) = F(l, r - 2) + 0$$$

Main cases:

For each $$$k = l \rightarrow r - 1$$$

If $$$S_k$$$ match $$$S_r$$$, minimize $$$F(l, r)$$$ with $$$F(l, k - 1) + 0 + F(k + 1, r - 1)$$$
Else add a open charactor at k to match $$$S_r$$$, minimize $$$F(l, r)$$$ with $$$F(l, k) + 1 + F(k + 1, r - 1)$$$

Complexity:

$$$F(l, r)$$$ have $$$O(n^2)$$$ states
In each substring $$$S[l..r]$$$, we maybe to have a for-loop $$$O(n)$$$ or $$$O(1)$$$ for transistion
Hence the upper bound complexity is $$$O(n^3)$$$
Hence the lower bound complexity is $$$O(n^2)$$$

Recursive Code


int n;
string s;
int f[LIM][LIM];
int magic(int l = 0, int r = n - 1)
{
    /// Base cases
    if (l > r)  return 0; /// Empty string
    if (l == r) return 1; /// Add one charactor to pair it up

    int &res = f[l][r];
    if (res != +INF) return res;

    // Branch and bound cases
    if (isClose(s[l]))         return res = magic(l + 1, r - 0) + 1; // Useless leftmost
    if (isOpen(s[r]))          return res = magic(l + 0, r - 1) + 1; // Useless rightmost
    if (match(s[l], s[l + 1])) return res = magic(l + 2, r - 0) + 0; // Leftmost paired
    if (match(s[r - 1], s[r])) return res = magic(l + 0, r - 2) + 0; // Rightmost paired

    /// Main cases
    for (int k = l; k <= r; ++k)
    {
        if (match(s[k], s[r])) // Match s[k] and s[r]
        {
            minimize(res, magic(l, k - 1) + 0 + magic(k + 1, r - 1));
        }
        else // Insert open barrack at s[k]
        {
            minimize(res, magic(l, k - 0) + 1 + magic(k + 1, r - 1));
        }
    }

    return res;
}

Full code

#include <algorithm>
#include <iostream>
#include <cstring>
#include <vector>
#include <cstdio>
#include <cmath>

using namespace std;

void file(const string FILE = "Test")
{
    freopen((FILE + ".INP").c_str(), "r", stdin);
    freopen((FILE + ".OUT").c_str(), "w", stdout);
}

char __;
template<typename T>
void getUnsign(T &x)
{
    while (__ = getchar(), __ < '0' || __ > '9');
 
    x = (__ - '0');
    while (__ = getchar(), __ >= '0' && __ <= '9')
        x = (x << 3) + (x << 1) + (__ - '0');
}

template<typename T>
void getSigned(T &x)
{
    while (__ = getchar(), __ != '-' && (__ < '0' || __ > '9'));
    bool sign(__ == '-');
    if (sign) __ = getchar();
 
    x = (__ - '0');
    while (__ = getchar(), __ >= '0' && __ <= '9')
        x = (x << 3) + (x << 1) + (__ - '0');

    if (sign) x = -x;
}

template<typename T> void maximize(T &res, const T &val) { if (res < val) res = val; }
template<typename T> void minimize(T &res, const T &val) { if (res > val) res = val; }

#define all(x) (x).begin(), (x).end()
typedef long long ll;
typedef pair<int, int> pi;
const int LIM = 1010;
const int INF = 0x3f3f3f3f;
const ll LINF = 0x3f3f3f3f3f3f3f3f;
/// ====*====*====*====*====*====*====*====*====*====*====*====*====*====*====*====

bool isOpen(char c)  { return (c == '(') || (c == '[') || (c == '{'); }
bool isClose(char c) { return (c == ')') || (c == ']') || (c == '}'); }
bool match(char l, char r)
{
    return (l == '(' && r == ')')
        || (l == '[' && r == ']')
        || (l == '{' && r == '}');
}

char toMatch(char c)
{
    if (c == '(') return ')';
    if (c == '[') return ']';
    if (c == '{') return '}';
    if (c == ')') return '(';
    if (c == ']') return '[';
    if (c == '}') return '{';
}

int n;
string s;
int f[LIM][LIM];
int magic(int l = 0, int r = n - 1)
{
    /// Base cases
    if (l > r)  return 0; /// Empty string
    if (l == r) return 1; /// Add one charactor to pair it up

    int &res = f[l][r];
    if (res != +INF) return res;

    // Branch and bound cases
    if (isClose(s[l]))         return res = magic(l + 1, r - 0) + 1; // Useless leftmost
    if (isOpen(s[r]))          return res = magic(l + 0, r - 1) + 1; // Useless rightmost
    if (match(s[l], s[l + 1])) return res = magic(l + 2, r - 0) + 0; // Leftmost paired
    if (match(s[r - 1], s[r])) return res = magic(l + 0, r - 2) + 0; // Rightmost paired

    /// Main cases
    for (int k = l; k <= r; ++k)
    {
        if (match(s[k], s[r])) // Match s[k] and s[r]
        {
            minimize(res, magic(l, k - 1) + 0 + magic(k + 1, r - 1));
        }
        else // Insert open barrack at s[k]
        {
            minimize(res, magic(l, k - 0) + 1 + magic(k + 1, r - 1));
        }
    }

    return res;
}

int main()
{
    cin >> s;
    n = s.size();

    memset(f, +INF, sizeof(f));
    cout << magic();
    return 0;
}

Optimize version

Instead of make a for loop of every possible cases for $$$k$$$. Just pick the places where the barrack match to $$$S_r$$$ exists in substring $$$S[l..r]$$$. Hence optimize the algorithm by 8 times

Main code


int n;
string s;
/// p[c][x]: Minimum position >= x with charactor c (s[p[c][x]] = c)
int p[256][LIM];
int f[LIM][LIM];
int magic(int l = 0, int r = n - 1)
{
    /// Base cases
    if (l > r)  return 0; /// Empty string
    if (l == r) return 1; /// Add one charactor to pair it up

    int &res = f[l][r];
    if (res != +INF) return f[l][r];

    // Branch and bound cases
    if (isClose(s[l]))         return res = magic(l + 1, r - 0) + 1; // Useless leftmost
    if (isOpen(s[r]))          return res = magic(l + 0, r - 1) + 1; // Useless rightmost
    if (match(s[l], s[l + 1])) return res = magic(l + 2, r - 0) + 0; // Leftmost paired
    if (match(s[r - 1], s[r])) return res = magic(l + 0, r - 2) + 0; // Rightmost paired

    /// Main cases
    bool ok = false;
    int need = toMatch(s[r]);
    for (int k = p[need][l]; k < r; k = p[need][k + 1])
    {
        minimize(res, magic(l, k - 1) + 0 + magic(k + 1, r - 1));
    }

    minimize(res, magic(l, r - 1) + 1);
    return res;
}

Full Code

#include <algorithm>
#include <iostream>
#include <cstdlib>
#include <cstring>
#include <vector>
#include <cstdio>
#include <cmath>
#include <deque>
#include <ctime>
#include <map>

using namespace std;

void file(const string FILE = "Test")
{
    freopen((FILE + ".INP").c_str(), "r", stdin);
    freopen((FILE + ".OUT").c_str(), "w", stdout);
}

char __;
template<typename T>
void getUnsign(T &x)
{
    while (__ = getchar(), __ < '0' || __ > '9');
 
    x = (__ - '0');
    while (__ = getchar(), __ >= '0' && __ <= '9')
        x = (x << 3) + (x << 1) + (__ - '0');
}

template<typename T>
void getSigned(T &x)
{
    while (__ = getchar(), __ != '-' && (__ < '0' || __ > '9'));
    bool sign(__ == '-');
    if (sign) __ = getchar();
 
    x = (__ - '0');
    while (__ = getchar(), __ >= '0' && __ <= '9')
        x = (x << 3) + (x << 1) + (__ - '0');

    if (sign) x = -x;
}

template<typename T> void maximize(T &res, const T &val) { if (res < val) res = val; }
template<typename T> void minimize(T &res, const T &val) { if (res > val) res = val; }

#define all(x) (x).begin(), (x).end()
typedef long long ll;
typedef pair<int, int> pi;
const int LIM = 2001;
const int INF = 0x3f3f3f3f;
const ll LINF = 0x3f3f3f3f3f3f3f3f;
/// ====*====*====*====*====*====*====*====*====*====*====*====*====*====*====*====

bool isOpen(char c)  { return (c == '(') || (c == '[') || (c == '{'); }
bool isClose(char c) { return (c == ')') || (c == ']') || (c == '}'); }
bool match(char l, char r)
{
    return (l == '(' && r == ')')
        || (l == '[' && r == ']')
        || (l == '{' && r == '}');
}

char toMatch(char c)
{
    if (c == '(') return ')';
    if (c == '[') return ']';
    if (c == '{') return '}';
    if (c == ')') return '(';
    if (c == ']') return '[';
    if (c == '}') return '{';
}

int n;
string s;
/// p[c][x]: Minimum position >= x with charactor c (s[p[c][x]] = c)
int p[256][LIM];
int f[LIM][LIM];
int magic(int l = 0, int r = n - 1)
{
    /// Base cases
    if (l > r)  return 0; /// Empty string
    if (l == r) return 1; /// Add one charactor to pair it up

    int &res = f[l][r];
    if (res != +INF) return f[l][r];

    // Branch and bound cases
    if (isClose(s[l]))         return res = magic(l + 1, r - 0) + 1; // Useless leftmost
    if (isOpen(s[r]))          return res = magic(l + 0, r - 1) + 1; // Useless rightmost
    if (match(s[l], s[l + 1])) return res = magic(l + 2, r - 0) + 0; // Leftmost paired
    if (match(s[r - 1], s[r])) return res = magic(l + 0, r - 2) + 0; // Rightmost paired

    /// Main cases
    bool ok = false;
    int need = toMatch(s[r]);
    for (int k = p[need][l]; k < r; k = p[need][k + 1])
    {
        minimize(res, magic(l, k - 1) + 0 + magic(k + 1, r - 1));
    }

    minimize(res, magic(l, r - 1) + 1);
    return res;
}

/// res: number of barrack need to be added
/// diff: difference between number of '(' - ')', diff can be negative
int solve_unique()
{
    int res = 0, diff = 0;
    for (char c : s) /// for each charactor
    {
        diff += (c == '(') ? +1 : -1;
        if (diff < 0) /// Add a '(' here, since there are more ')' then '('
        {
            diff = 0; /// After adding a '(', diff = 0 <=> equal '(' and ')'
            ++res;
        }
    }
    res += diff; /// Add (diff) close barrack at the end

    return res;
}

void precal()
{
    deque<char> S;
    for (char c : s)
    {
        if (S.empty() || match(S.back(), c) == false)
        {
            S.push_back(c);
        }
        else 
        {
            S.pop_back();
        }
    }

    if (S.size() != s.size())
    {
        s = "";
        for (char c : S)
            s += c;
    }
    n = s.size();

    for (char c : "()[]{}")
    {
        p[c][n] = n;
        for (int i = n - 1; i >= 0; --i)
        {
            p[c][i] = (s[i] == c) ? i : p[c][i + 1];
        }
    }
}

void gen()
{
    s = "";
    for (int i = 0; i < LIM - 1; ++i)
    {
        int x = rand() % 6;
        if (x == 0) s += '(';
        if (x == 1) s += ')';
        if (x == 2) s += '[';
        if (x == 3) s += ']';
        if (x == 4) s += '{';
        if (x == 5) s += '}';
    }
}

// int f[LIM][LIM];
// int solve()
// {
//     memset(f, 0, sizeof(f));
//     for (int r = 0; r < n; ++r)
//     {
//         for (int l = r; l >= 0; --l)
//         {
//             f[l][r] = r - l + 1; /// Add one charactor to each one
//             if (match(s[l], s[r])) /// Outside make a good pair of string
//             {
//                 f[l][r] = f[l+1][r-1];
//             }

//             /// Split string into 2 halves
//             /// s[l..r] = s[l..k] + s[k+1..r]
//             /// f[l..r] = min(f[l..k] + f[k+1..r])
//             for (int k = l; k < r; ++k)
//             {
//                 minimize(f[l][r], f[l][k] + f[k+1][r]);
//             }
//         }
//     }

//     return f[0][n - 1];
// }

int main()
{
    cin >> s;
    gen();
    precal();
    n = s.size();

    memset(f, +INF, sizeof(f));
    cout << magic() << endl;
    return 0;
}

My question

If the string $$$S$$$ is only consist of '(' and ')' then there is a Linear ($$$O(n)$$$) solution

The solution

/// res: number of barrack need to be added
/// diff: difference between number of '(' - ')', diff can be negative
int solve()
{
    int res = 0, diff = 0;
    for (char c : s) /// for each charactor
    {
        diff += (c == '(') ? +1 : -1;
        if (diff < 0) /// Add a '(' here, since there are more ')' then '('
        {
            diff = 0; /// After adding a '(', diff = 0 <=> equal '(' and ')'
            ++res;
        }
    }
    res += diff; /// Add (diff) close barrack at the end

    return res;
}

Can my algorithm ($$$dp[l][r] = min(dp[l][k] + dp[k + 1][r])$$$) improved into $$$O(n^2\ polylog)$$$ or lower in somehow ?
Failed to use Knuth algorithm $$$(dp[l][r] = min(dp[l][k] + dp[k][r] + cost[l][r])$$$ since fully-motone condition is not satisfied

Full text and comments »

string, parentheses, dp, #3d-dp

SPyofgame
4 years ago
5

What is the complexity of this code ?

By SPyofgame, history, 4 years ago, In English

The problem

Given $$$n$$$ points $$$(x_1, y_1),\ (x_2, y_2),\dots, (x_n, y_n)$$$

Find the minimum distance between 2 pair of points.

The problem: SPOJ

The constraint

$$$2 \leq n \leq 50000$$$
$$$x_i, y_i \leq 10^6$$$

My question

I was solving a problem that need to find the minimum distance between two points. I tried to bruteforce then cde an divide and conquer algorithm. But then I modified the bruteforce by adding some branch-and-bound to ignore not-optimal cases. For somehow my code get AC and seems to run fast while I thought it will be slow with the complexity of $$$O(n^2)$$$ with small constant.

I dont really sure about the complexity, can some one help me to calculate it ?

My main part

int main()
{
    int n;
    cin >> n;

    /// Input points
    vector<point> a(n);
    for (int i = 0; i < n; ++i)
        cin >> a[i].x >> a[i].y;

    /// Sorted by minimum(y) then minimum(x)
    set<point, cmp_y> S;
    set<point, cmp_y>::iterator it;

    double CMD = 1e9; /// Current Minimum Distance
    ll CMSD = 1e18;   /// Current Minimum Squared Distance
    for (int i = 0; i < n; ++i)
    {
        point lower(a[i].x, a[i].y - CMD); /// Lower   will give longer distance
        point upper(a[i].x, a[i].y + CMD); /// Shorter will give longer distance
        for (it = S.lower_bound(lower); it != S.end() && sorty(*it, upper); ++it) 
        {
            int dx = abs(a[i].x - it->x);
            int dy = abs(a[i].y - it->y); 
            if (dx >= CMD && dy >= CMD) it = S.erase(it); /// This point is not optimal, remove it
            if (dx >= CMD || dy >= CMD) continue;         /// Distance >= CMD -> Skip
         
            ll CD = 1LL * dx * dx + 1LL * dy * dy; /// Current Distance
            minimie(CMSD, CD);                     /// Update Distance
        }
        S.insert(a[i]);   /// Add new point
        CMD = sqrt(CMSD); /// Recalculate Minimum Distance
    }

    cout << CMD; /// Answer
    return 0;
}

My full code

#include <algorithm>
#include <iostream>
#include <iomanip>
#include <vector>
#include <cmath>
#include <set>

using namespace std;

#define all(x) (x).begin(), (x).end()

char __;
template <typename _T_>
void getSigned(_T_ &_n_) /// For inputing many numbers
{
    while (__ = getchar(), __ != '-' && (__ < '0' || __ > '9'));
    bool sign(__ == '-');
    if (sign) __ = getchar();

    _n_ = __ - '0';
    while (__ = getchar(), __ >= '0' && __ <= '9')
        _n_ = 10 * _n_ + __ - '0';
    
    if (sign) _n_ = -_n_;
}

template<typename T> void maximize(T &res, const T &val) { if (res < val) res = val; }
template<typename T> void minimize(T &res, const T &val) { if (res > val) res = val; }

struct point /// (x, y) cordinate
{
    int x, y;
    point(int x = 0, int y = 0)
    : x(x), y(y) {}
};

bool sortx (const point &a, const point &b) { return (a.x != b.x) ? (a.x < b.x) : (a.y < b.y); }
bool sorty (const point &a, const point &b) { return (a.y != b.y) ? (a.y < b.y) : (a.x < b.x); }
struct cmp_x { bool operator() (const point &a, const point &b) { return sortx(a, b); } };
struct cmp_y { bool operator() (const point &a, const point &b) { return sorty(a, b); } };
typedef long long ll;
int main()
{
    int n;
    cin >> n;

    /// Input points
    vector<point> a(n);
    for (int i = 0; i < n; ++i)
        cin >> a[i].x >> a[i].y;

    /// Sorted by minimum(y) then minimum(x)
    set<point, cmp_y> S;
    set<point, cmp_y>::iterator it;

    double CMD = 1e9; /// Current Minimum Distance
    ll CMSD = 1e18;   /// Current Minimum Squared Distance
    for (int i = 0; i < n; ++i)
    {
        point lower(a[i].x, a[i].y - CMD); /// Lower   will give longer distance
        point upper(a[i].x, a[i].y + CMD); /// Shorter will give longer distance
        for (it = S.lower_bound(lower); it != S.end() && sorty(*it, upper); ++it) 
        {
            int dx = abs(a[i].x - it->x);
            int dy = abs(a[i].y - it->y); 
            if (dx >= CMD && dy >= CMD) it = S.erase(it); /// This point is not optimal, remove it
            if (dx >= CMD || dy >= CMD) continue;         /// Distance >= CMD -> Skip
         
            ll CD = 1LL * dx * dx + 1LL * dy * dy; /// Current Distance
            minimie(CMSD, CD);                     /// Update Distance
        }
        S.insert(a[i]);   /// Add new point
        CMD = sqrt(CMSD); /// Recalculate Minimum Distance
    }

    cout << CMD; /// Answer
    return 0;
}

Full text and comments »

-1

SPyofgame
4 years ago
2

Counting nonegative integer solutions of two variables Linear Diophantine Equation

By SPyofgame, history, 4 years ago, In English

The equation

A Linear Diophantine Equation is an equation of the general form:

$$$\underset{i = 1}{\overset{n}{\Sigma}} (a_i \cdot x_i) = N$$$

Where $$$a_i$$$ and $$$N$$$ are given integers and $$$x_i$$$ are unknown integers.

The problem

Given Linear Diophantine Equation of only 2 variables:

$$$ax + by = c$$$

With given integers $$$a, b, c$$$ and unknown integers $$$x, y$$$

Some interesting property

We have to count the number of $$$(x, y)$$$ non-negative integers solutions for the equation (assume that these value are under $$$10^9$$$ so that we dont deal with overflow cases$

Can I have a simplier implementation then this ? (My algorithm based on cp-algorithm)

Recursive extended greatest common divisor

/// Return gcd(a,b)
/// Find (&x, &y) satisfy ax + by = gcd(a, b)
template<typename T>
T extgcd(T a, T b, T &x, T &y)
{
    if (a == 0) return x = 0, y = 1, b;
    T p = b / a;
    T g = extgcd(b - p * a, a, y, x);
    x -= p * y;
    return g;
}

Recursive extended greatest common divisor

/// Return gcd(a,b)
/// Find (&x, &y) satisfy ax + by = gcd(a, b)
template<typename T>
tuple<T, T, T> extgcd(T a, T b)
{
    if (a == 0) return make_tuple(b, 0, 1);
    T g, x, y; tie(g, x, y) = extgcd(b % a, a);
    return make_tuple(g, y - b / a * x, x);
}

Find one solution ax + by = c

/// Return true if there exist such (x, y) satisfy ax + by = c
/// Find (&g) = gcd(a, b)
/// Find (&x, &y) satisfy ax + by = c
template<typename T>
bool find_any_solution(T a, T b, T c, T &x, T &y, T &g)
{
    if (a == 0 && b == 0) /// 0x + 0y = c
    {
        if (c != 0) return false;
        x = y = g = 0;
        return true;
    }

    if (a == 0) /// 0x + by = c
    {
        if (c % b != 0) return false;
        x = 0, y = c / b, g = abs(b);
        return true;
    }

    if (b == 0) /// ax + 0y = c
    {
        if (c % a != 0) return false;
        x = c / a, y = 0, g = abs(a);
        return true;
    }

    /// ax + by = c
    g = extgcd(abs(a), abs(b), x, y);
    if (c % g != 0) return false;

    x *= (a < 0 ? -1 : +1) * c / g;
    y *= (b < 0 ? -1 : +1) * c / g;
    return true;
}

Shift solution

/// Find the next/prev (cnt)-th solution of ax + by = c
template<typename T>
void shift_solution(T & x, T & y, T a, T b, T cnt)
{
    x += cnt * b;
    y -= cnt * a;
}

Count number solutions of ax + by = c with given range x & range y

template<typename T = long long>
T find_all_solutions(T a, T b, T c, T min_x, T max_x, T min_y, T max_y) {
    if (min_x > max_x) return 0; /// Invalid range
    if (min_y > max_y) return 0; /// Invalid range

    if (a == 0 && b == 0) /// 0x + 0y = c
    {
        if (c != 0) return 0; /// No solution
        return 1LL * (max_x - min_x + 1) * (max_y - min_y + 1); /// Ways to select (x) and (y) in range
    }

    if (a == 0) /// 0x + by = c <=> y = c / b
    {
        if (c % b != 0) return 0; /// No solution
        if (1LL * min_y * b > c) return 0; /// Out of range: min > y
        if (1LL * max_y * b < c) return 0; /// Out of range: max < y
        return max_x - min_x + 1; /// Ways to select (x) in range    
    }

    if (b == 0) /// ax + 0y = c <=> x = c / a
    {
        if (c % a != 0) return 0; /// No solution
        if (1LL * min_x * a > c) return 0; /// Out of range: min > x
        if (1LL * max_x * a < c) return 0; /// Out of range: max < x
        return max_y - min_y + 1; /// Ways to select (y) in range    
    }

    T x, y, g;
    if (!find_any_solution(a, b, c, x, y, g)) return 0;
    a /= g;     
    b /= g;

    T sign_a = a > 0 ? +1 : -1;
    T sign_b = b > 0 ? +1 : -1;

    shift_solution(x, y, a, b, (min_x - x) / b);
    if (x < min_x) shift_solution(x, y, a, b, sign_b);
    if (x > max_x) return 0;
    T lx1 = x;

    shift_solution(x, y, a, b, (max_x - x) / b);
    if (x > max_x) shift_solution(x, y, a, b, -sign_b);
    T rx1 = x;

    shift_solution(x, y, a, b, -(min_y - y) / a);
    if (y < min_y) shift_solution(x, y, a, b, -sign_a);
    if (y > max_y) return 0;
    T lx2 = x;

    shift_solution(x, y, a, b, -(max_y - y) / a);
    if (y > max_y) shift_solution(x, y, a, b, sign_a);
    T rx2 = x;

    if (lx2 > rx2) swap(lx2, rx2);
    T lx = max(lx1, lx2);
    T rx = min(rx1, rx2);

    if (lx > rx) return 0;
    return (rx - lx) / abs(b) + 1;
}

Count all nonegative solutions (x, y) satisfy ax + by = c

typedef long long ll;
long long count_nonegative_solution(int a, int b, int c)
{
    return find_all_solutions(ll(a), ll(b), ll(c), 0LL, ll(c / a + 1), 0LL, ll(c / b + 1));
}

Full text and comments »

linear diophantine, c++, math, extended gcd

SPyofgame
4 years ago
1

C2/A2-Binary Table minimize operations

By SPyofgame, history, 4 years ago, In English

Original Problem

Easy Version: Div 2, Div 1
Hard Version: Div 2, Div 1

You are given a binary table of size $$$n×m$$$. This table consists of symbols $$$0$$$ and $$$1$$$

You can make such operation: select $$$3$$$ different cells that belong to one $$$2×2$$$ square and change the symbols in these cells (change $$$0$$$ to $$$1$$$ and $$$1$$$ to $$$0$$$)

Your task is to make all symbols in the table equal to $$$0$$$. You dont have to minimize the number of operations. (It can be proved, that it is always possible)

And the constraints are

$$$2 \leq N, M \leq 100$$$
Easy Version: Limited in $$$3 \cdot N \cdot M$$$ operations
Hard Version: Limited in $$$1 \cdot N \cdot M$$$ operations

Code solution without minimizing (with comments)

Problem C1 - Eliminate each 1x1 cells

#include <algorithm>
#include <iostream>

using namespace std;

#define all(x) (x).begin(), (x).end()
int main()
{
    int q;
    cin >> q;
    
    while (q-->0) /// For each query
    {
        /// Input
        int n, m;
        cin >> n >> m;
        
        string s[n];
        for (int i = 0; i < n; ++i)
            cin >> s[i];
            


        /// Calculation
        int cnt = 0;
        for (int i = 0; i < n; ++i)           /// For each '1' appear
            cnt += 3 * count(all(s[i]), '1'); /// We use 3 operations to turn off it
                                              /// These operations are independent
 
        /// Output the result
        cout << cnt << '\n';
        for (int i = 0; i < n; ++i)
        {
            for (int j = 0; j < m; ++j)
            {
                if (s[i][j] == '1')
                {
                    int cx = i + 1; /// +1 because we use 0-based loop
                    int cy = j + 1; /// +1 because we use 0-based loop
                    int px = (cx < n) ? cx + 1 : cx - 1; /// Next row
                    int py = (cy < m) ? cy + 1 : cy - 1; /// Next col

                    /// Notice that these operation are independent
                    /// They wont affect others cells, just turn off [i][j]

                    cout << px << ' ' << cy << ' ';  ///  X O  |  + +  |  O X
                    cout << cx << ' ' << py << ' ';  ///  O O  |  + _  |  X O
                    cout << cx << ' ' << cy << '\n'; ///   Operation: 1 -> 2

                    cout << px << ' ' << cy << ' ';  ///  O X  |  + _  |  X X
                    cout << px << ' ' << py << ' ';  ///  X O  |  + +  |  O X
                    cout << cx << ' ' << cy << '\n'; ///   Operation: 2 -> 3

                    cout << cx << ' ' << py << ' ';  ///  X X  |  + +  |  O O
                    cout << px << ' ' << py << ' ';  ///  O X  |  _ +  |  O O
                    cout << cx << ' ' << cy << '\n'; ///   Operation: 3 -> 0
                }
            }
        }
    }
    return 0;
}

Benchmark

$$$\text{Average case} = \frac{\text{Total count}}{\text{Rows}\ \cdot \text{Columns}\ \cdot \text{Total case}}$$$

For $$$N$$$ ones in table, it need exact $$$3 \times N$$$ operations

For every $$$N \cdot M$$$ table, it need averagely $$$\frac{3N}{2}$$$ operations

rows	columns	total count	total cases	average percase	maximum case
2	2	96	16	150%	12
2	3	576	64	150%	18
2	4	3072	256	150%	24
2	5	15360	1024	150%	30
2	6	73728	4096	150%	36
2	7	344064	16384	150%	42
2	8	1572864	65536	150%	48
2	9	7077888	262144	150%	54
2	10	31457280	1048576	150%	60
2	11	138412032	4194304	150%	66
2	12	603979776	16777216	150%	72
3	2	576	64	150%	18
3	3	6912	512	150%	27
3	4	73728	4096	150%	36
3	5	737280	32768	150%	45
3	6	7077888	262144	150%	54
3	7	66060288	2097152	150%	63
3	8	603979776	16777216	150%	72
4	2	3072	256	150%	24
4	3	73728	4096	150%	36
4	4	1572864	65536	150%	48
4	5	31457280	1048576	150%	60
4	6	603979776	16777216	150%	72
5	2	15360	1024	150%	30
5	3	737280	32768	150%	45
5	4	31457280	1048576	150%	60
5	5	1258291200	33554432	150%	75
6	2	73728	4096	150%	36
6	3	7077888	262144	150%	54
6	4	603979776	16777216	150%	72
7	2	344064	16384	150%	42
7	3	66060288	2097152	150%	63
8	2	1572864	65536	150%	48
8	3	603979776	16777216	150%	72
9	2	7077888	262144	150%	54
10	2	31457280	1048576	150%	60
11	2	138412032	4194304	150%	66
12	2	603979776	16777216	150%	72

Problem C2 - Eliminate odd row, odd column then eliminate 2x2 cells

#include <algorithm>
#include <iostream>
#include <vector>

using namespace std;

#define all(x) (x).begin(), (x).end()

/// Each line of the answer
struct node { 
    int a, b, c, d, e, f; 
    node (int a, int b, int c, int d, int e, int f)
    :       a(a),  b(b),  c(c),  d(d),  e(e),  f(f) {}
};

vector<node> res;
vector<vector<int> > a;
void flip(const node &x)
{
    a[x.a][x.b] ^= 1;
    a[x.c][x.d] ^= 1;
    a[x.e][x.f] ^= 1;
    res.push_back(x);
}

int main()
{
    int q;
    cin >> q;
    
    while (q-->0) /// For each query
    {
        /// Input
        int n, m;
        cin >> n >> m;
        
        a.assign(n + 1, vector<int>(m + 1, 0));
        for (int i = 1; i <= n; ++i)
        {
            string s;
            cin >> s;
            
            for (int j = 1; j <= m; ++j)
                a[i][j] = s[j - 1] - '0'; /// Convert string to array
        }
        
        res.clear();
        
        /// Odd row elimination
        if (n & 1)
        {
            for (int j = 1; j <= m; ++j) /// Eliminate 2x1 cells by once
            {
                int p = (j == m) ? (j - 1) : (j + 1);
                if (a[n][j]) flip(node(n-0,j   ,   n-1,j   ,   n-1,p));
            }
            --n; /// <- Last row eliminated
        }
        
        /// Odd column elimination
        if (m & 1)
        {
            for (int i = 1; i <= n; ++i) /// Eliminate 1x2 cells by once
            {
                int p = (i == n) ? (i - 1) : (i + 1);
                if (a[i][m]) flip(node(i,m-0   ,   i,m-1   ,   p,m-1));
            }
            --m; /// <- last col eliminated
        }
        
        /// Eliminate 2x2 cells by once
        for (int i = 1; i <= n; i += 2)
        {
            for (int j = 1; j <= m; j += 2)
            {
                int x = 0, y = 0, z = 0, t = 0;
                if (a[i + 0][j + 0]) x ^= 0, y ^= 1, z ^= 1, t ^= 1;
                if (a[i + 1][j + 0]) x ^= 1, y ^= 0, z ^= 1, t ^= 1;
                if (a[i + 0][j + 1]) x ^= 1, y ^= 1, z ^= 0, t ^= 1;
                if (a[i + 1][j + 1]) x ^= 1, y ^= 1, z ^= 1, t ^= 0;
            
                if (x) flip(node(i+1,j+0   ,   i+0,j+1   ,   i+1,j+1));
                if (y) flip(node(i+0,j+0   ,   i+0,j+1   ,   i+1,j+1));
                if (z) flip(node(i+0,j+0   ,   i+1,j+0   ,   i+1,j+1));
                if (t) flip(node(i+0,j+0   ,   i+1,j+0   ,   i+0,j+1));
            }
        }
        
        /// Output the result
        cout << res.size() << '\n';
        for (const node &x : res)
        {
            cout << x.a << ' ' << x.b << ' ';
            cout << x.c << ' ' << x.d << ' ';
            cout << x.e << ' ' << x.f << '\n';
        }
    }
    return 0;
}

Benchmark

$$$\text{Average case} = \frac{\text{Total count}}{\text{Rows}\ \cdot \text{Columns}\ \cdot \text{Total case}}$$$

For $$$S$$$ ones in table, it need maximumly $$$min(N \cdot M, 3 \times S)$$$ operations

For every $$$N \cdot M$$$ table, it need averagely $$$\frac{N \cdot M}{2}$$$ operations and maximumly $$$N * M$$$ operations

rows	columns	total count	total cases	average percase	maximum case
2	2	32	16	50%	4
2	3	192	64	50%	6
2	4	1024	256	50%	8
2	5	5120	1024	50%	10
2	6	24576	4096	50%	12
2	7	114688	16384	50%	14
2	8	524288	65536	50%	16
2	9	2359296	262144	50%	18
2	10	10485760	1048576	50%	20
2	11	46137344	4194304	50%	22
2	12	201326592	16777216	50%	24
3	2	192	64	50%	6
3	3	2304	512	50%	9
3	4	24576	4096	50%	12
3	5	245760	32768	50%	15
3	6	2359296	262144	50%	18
3	7	22020096	2097152	50%	21
3	8	201326592	16777216	50%	24
4	2	1024	256	50%	8
4	3	24576	4096	50%	12
4	4	524288	65536	50%	16
4	5	10485760	1048576	50%	20
4	6	201326592	16777216	50%	24
5	2	5120	1024	50%	10
5	3	245760	32768	50%	15
5	4	10485760	1048576	50%	20
6	2	24576	4096	50%	12
6	3	2359296	262144	50%	18
6	4	201326592	16777216	50%	24
7	2	114688	16384	50%	14
7	3	22020096	2097152	50%	21
8	2	524288	65536	50%	16
8	3	201326592	16777216	50%	24
9	2	2359296	262144	50%	18
10	2	10485760	1048576	50%	20
11	2	46137344	4194304	50%	22
12	2	201326592	16777216	50%	24

Problem C2 - Eliminate rows then columns then last 2x2 cells

#include <algorithm>
#include <iostream>
#include <vector>

using namespace std;

#define all(x) (x).begin(), (x).end()

/// Each line of the answer
struct node { 
    int a, b, c, d, e, f; 
    node (int a, int b, int c, int d, int e, int f)
    :       a(a),  b(b),  c(c),  d(d),  e(e),  f(f) {}
};

vector<node> res;
vector<vector<int> > a;
void flip(const node &x)
{
    a[x.a][x.b] ^= 1;
    a[x.c][x.d] ^= 1;
    a[x.e][x.f] ^= 1;
    res.push_back(x);
}

int main()
{
    int q;
    cin >> q;
    
    while (q-->0) /// For each query
    {
        /// Input
        int n, m;
        cin >> n >> m;

        /// Initalization
        res.clear();
        a.assign(n + 1, vector<int>(m + 1, 0));
        for (int i = 1; i <= n; ++i)
        {
            string s;
            cin >> s;
            
            for (int j = 1; j <= m; ++j)
                a[i][j] = s[j - 1] - '0'; /// Convert string to array
        }
        


        /// Odd column elimination
        if (m & 1)
        {
            for (int i0 = 1; i0 <= n; ++i0) /// Eliminate 1x2 cells by once
            {
                if (a[i0][m] == 0) continue;
                int i1 = (i0 == n) ? (i0 - 1) : (i0 + 1);
//                if (a[i1][m]) flip(node(i0,m-0   ,   i1,m-0   ,   i0,m-1));
//                else          flip(node(i0,m-0   ,   i0,m-1   ,   i1,m-1));

                if (a[i1][m]) /// Greedy one more step
                {
                    int p = a[i1][m-1] ? i1 : i0; /// Select the one with one
                    flip(node(i0,m-0   ,   i1,m-0   ,   p,m-1));
                }
                else          flip(node(i0,m-0   ,   i0,m-1   ,   i1,m-1));
            }
            --m; /// <- Last col eliminated
        }
        


        /// Row Elimination: n * m -> 2 * m remaining
        while (n > 2)
        {
            for (int j0 = 1; j0 <= m; ++j0) /// Eliminate 2x1 cells by once
            {
                if (a[n][j0] == 0) continue;
                int j1 = (j0 == m) ? (j0 - 1) : (j0 + 1);
//                if (a[n][j1]) flip(node(n-0,j0   ,   n-0,j1   ,   n-1,j0));
//                else          flip(node(n-0,j0   ,   n-1,j0   ,   n-1,j1));

                if (a[n][j1])  /// Greedy one more step
                {
                    int p = a[n-1][j1] ? j1 : j0; /// Select the one with one
                    flip(node(n-0,j0   ,   n-0,j1   ,   n-1,p));
                }
                else          flip(node(n-0,j0   ,   n-1,j0   ,   n-1,j1));
            }
            --n; /// <- Last row eliminated
        }
        


        /// Col Elimination: 2 * m -> 2 * 2 remaining
        while (m > 2)
        {
            for (int i0 = 1; i0 <= n; ++i0) /// Eliminate 1x2 cells by once
            {
                if (a[i0][m] == 0) continue;
                int i1 = (i0 == n) ? (i0 - 1) : (i0 + 1);
//                if (a[i1][m]) flip(node(i0,m-0   ,   i1,m-0   ,   i0,m-1));
//                else          flip(node(i0,m-0   ,   i0,m-1   ,   i1,m-1));

                if (a[i1][m]) /// Greedy one more step
                {
                    int p = a[i1][m-1] ? i1 : i0; /// Select the one with one
                    flip(node(i0,m-0   ,   i1,m-0   ,   p,m-1));
                }
                else          flip(node(i0,m-0   ,   i0,m-1   ,   i1,m-1));
            }
            --m; /// <- Last col eliminated
        }
        


        /// Cells Elimination: 2 * 2 -> 0 * 0 remaining - Using modular equations
        int x = 0, y = 0, z = 0, t = 0;
        if (a[1][1]) x ^= 0, y ^= 1, z ^= 1, t ^= 1;
        if (a[2][1]) x ^= 1, y ^= 0, z ^= 1, t ^= 1;
        if (a[1][2]) x ^= 1, y ^= 1, z ^= 0, t ^= 1;
        if (a[2][2]) x ^= 1, y ^= 1, z ^= 1, t ^= 0;

        if (x) flip(node(2,1   ,   1,2   ,   2,2));
        if (y) flip(node(1,1   ,   1,2   ,   2,2));
        if (z) flip(node(1,1   ,   2,1   ,   2,2));
        if (t) flip(node(1,1   ,   2,1   ,   1,2));
     


        /// Output the result
        cout << res.size() << '\n';
        for (const node &x : res)
        {
            cout << x.a << ' ' << x.b << ' ';
            cout << x.c << ' ' << x.d << ' ';
            cout << x.e << ' ' << x.f << '\n';
        }
    }
    return 0;
}

Benchmark

$$$\text{Average case} = \frac{\text{Total count}}{\text{Rows}\ \cdot \text{Columns}\ \cdot \text{Total case}}$$$

For every $$$N \cdot M$$$ table, it need averagely $$$\frac{N}{2}$$$ operations and maximumly $$$min(N \cdot M, \lceil \frac{n + 1}{2} \rceil + \lceil \frac{m(n - 2)}{2} \rceil + (m - 2) + 3)$$$ operations

rows	columns	total count	total cases	average percase	maximum case
2	2	32	16	50%	4
2	3	176	64	45.8333%	5
2	4	880	256	42.9688%	6
2	5	4240	1024	41.4062%	7
2	6	19824	4096	40.332%	8
2	7	90768	16384	39.5717%	9
2	8	408944	65536	38.9999%	10
2	9	1819280	262144	38.5556%	11
2	10	8011120	1048576	38.2%	12
2	11	34980496	4194304	37.9091%	13
2	12	151666032	16777216	37.6667%	14
3	2	176	64	45.8333%	5
3	3	1968	512	42.7083%	7
3	4	19824	4096	40.332%	8
3	5	194352	32768	39.541%	10
3	6	1816240	262144	38.4911%	11
3	7	16814096	2097152	38.179%	13
3	8	151165600	16777216	37.5424%	14
4	2	880	256	42.9688%	6
4	3	19824	4096	40.332%	8
4	4	405088	65536	38.6322%	10
4	5	7938304	1048576	37.8528%	12
4	6	148856720	16777216	36.969%	14
5	2	4240	1024	41.4062%	7
5	3	193216	32768	39.3099%	10
5	4	7897792	1048576	37.6596%	12
5	5	311233920	33554432	37.102%	15
6	2	19824	4096	40.332%	8
6	3	1816240	262144	38.4911%	11
6	4	148979200	16777216	36.9994%	14
7	2	90768	16384	39.5717%	9
7	3	16718864	2097152	37.9627%	13
8	2	408944	65536	38.9999%	10
8	3	151165600	16777216	37.5424%	14
9	2	1819280	262144	38.5556%	11
10	2	8011120	1048576	38.2%	12
11	2	34980496	4194304	37.9091%	13
12	2	151666032	16777216	37.6667%	14

Extended version

But what if I have to minimize number of operations ?

Is there an algorithm other than brute-force to find minimum number of operations in these problem ?
I am wondering if I can use Gauss-Elimination (mod 2) or Greedy-DP to solve in somehow
I wrote an analizer for small $$$N \cdot M$$$ tables so that you can check too. (Modify by a bit, we can answer query of all $$$N \cdot M$$$ tables, but the complexity is $$$O(2^{n \cdot m})$$$)

Analizer

#include <algorithm>
#include <iostream>
#include <numeric>
#include <vector>
#include <cstdio>
#include <deque>
#include <cmath>
#include <map>

using namespace std;

void file(const string FILE = "Test")
{
    freopen((FILE + ".INP").c_str(), "r", stdin);
    freopen((FILE + ".OUT").c_str(), "w", stdout);
}
#define all(x) (x).begin(), (x).end()
typedef long long ll;

int n, m;
void flip(int &mask, int i, int j) { mask ^= 1 << (i * m + j); }

int lim;
vector<int> F;
vector<int> trace;
void bfs(int s)
{
    trace.assign(lim, 0);
    F.assign(lim, -1);

    trace[s] = -1;
    F[s] = 0;

    deque<int> S;
    S.push_back(s);
    while (S.size())
    {
        int u = S.front();
        S.pop_front();

        for (int i = 1; i < n; ++i) /// Select row
        {
            for (int j = 1; j < m; ++j) /// Select column
            {
                vector<int> selected_x = {i - 1, i - 0, i - 1, i - 0};
                vector<int> selected_y = {j - 1, j - 1, j - 0, j - 0};

                int t = u; /// Make new mask
                for (int k = 0; k < 4; ++k) /// Fully 2x2 modified
                    flip(t, selected_x[k], selected_y[k]);

                for (int k = 0; k < 4; ++k) /// Select 1x1 cell not be modified
                {
                    int v = t;
                    flip(v, selected_x[k], selected_y[k]);

                    if (F[v] == -1) /// If not visited
                    {
                        trace[v] = u;
                        F[v] = F[u] + 1;
                        S.push_back(v);
                    }
                }
            }
        }
    }
}

void analize()
{
    int maximum = *max_element(all(F));
    ll total = accumulate(all(F), 0LL);
    vector<int> C(maximum + 1, 0);
    for (int mask = 0; mask < lim; ++mask)
        ++C[F[mask]];
    
    double closest = 1e9;
    int numerator = 0;
    int denominator = 0;
    int test = sqrt(lim);
    for (int nume = test; nume >= 1; --nume)
    {
        for (int deno = test; deno >= 1; --deno)
        {
            double value = double(nume) / deno * n * m;
            double delta = value - maximum;
            if (delta >= 1e-6 && closest >= delta)
            {
                closest = delta;
                numerator = nume;
                denominator = deno;
            }
        }
    }
    double mean = double(total) / lim;
    double rate = (double(total) / (lim * n * m)) * 100;

    cout << "In total amount of " << lim << " binary table " << n << " x " << m << endl;
    for (int operation = 0; operation <= maximum; ++operation)
    {
        cout << " - " << C[operation] << " cases need";
        cout << " minimum " << operation << " operations" << endl;
    }
    cout << endl;

    cout << "In an overkill solution, in each table " << n << " x " << m << endl;

    cout << "   A) In all cases, you need total " << total << " operations of " << lim << " cases" << endl;
    cout << "    > If your code return exact that value then it is operation minimized" << endl;

    cout << "   B) You need averagely " << mean << " operations" << endl;
    cout << "    > Which is about " << rate << "% of " << n * m << " operations" << endl;

    cout << "   C) And the maximum number of operations needed is " << maximum << endl;
    cout << "    > Which is approximately " << numerator << "nm/" << denominator << " operations" << endl;
    cout << "    > Which mean for each " << numerator << " ones in the table" << endl;
    cout << "      We only need maximumly " << denominator << " operations average" << endl;





    const bool show_case = false;
    const bool show_trace = false;
    if (show_case == false) return ;

    /// If you want to output cases
    vector<vector<int> > M(maximum + 1);
    for (int index = 0; index <= maximum; ++index) M[index].resize(C[index]);
    for (int mask = 0; mask < lim; ++mask)
        M[F[mask]][--C[F[mask]]] = mask; /// this look messy :D

    cout << "\n\n";
    for (int operation = 0; operation <= maximum; ++operation)
    {
        cout << "====X====X====X====X====X====X====X====X====X====X====\n";
        for (int mask : M[operation])
        {
            int cntbit = __builtin_popcount(mask);
            cout << "----x----x----x----x----x----x----x----x----\n";
            cout << "At (" << mask << ") = " << cntbit << " ones | Need " << operation << " operations :\n";

            do 
            {
                int sub = mask;
                for (int i = 1; i <= n; ++i)
                {
                    for (int j = 1; j <= m; ++j)
                    {
                        cout << (sub & 1);
                        sub >>= 1;
                    }
                    cout << '\n';
                }
                cout << '\n';
                if (show_trace == false) break; /// If you only want the mask and not the trace
            } while ((mask = trace[mask]) != -1);
        }
    }
}

int main()
{
    // file();
    cin >> n >> m;
    if (1LL * n * m > 20)
    {
        cout << "Well, I dont have enough data for that :(";
        cout << "- Please provide me smaller constraint";
        return 0;
    }

    lim = 1 << (n * m);
    bfs(0); /// fully zero binary table
    analize();

    return 0;
}

Test with 3x3 tables

In total amount of 512 binary table 3 x 3
 - 1 cases need minimum 0 operations
 - 16 cases need minimum 1 operations
 - 105 cases need minimum 2 operations
 - 220 cases need minimum 3 operations
 - 150 cases need minimum 4 operations
 - 20 cases need minimum 5 operations

In an overkill solution, in each table 3 x 3
   A) In all cases, you need total 1586 operations of 512 cases
    > If your code return exact that value then it is operation minimized
   B) You need averagely 3.09766 operations
    > Which is about 34.4184% of 9 operations
   C) And the maximum number of operations needed is 5
    > Which is approximately 9nm/16 operations
    > Which mean for each 9 ones in the table
      We only need maximumly 16 operations average

With (n, m) = (2, 2) || show_case = true; show_trace = true;

In total amount of 16 binary table 2 x 2
 - 1 cases need minimum 0 operations
 - 4 cases need minimum 1 operations
 - 6 cases need minimum 2 operations
 - 4 cases need minimum 3 operations
 - 1 cases need minimum 4 operations

In an overkill solution, in each table 2 x 2
   A) In all cases, you need total 32 operations of 16 cases
    > If your code return exact that value then it is operation minimized
   B) You need averagely 2 operations
    > Which is about 50% of 4 operations
   C) And the maximum number of operations needed is 4
    > Which is approximately 4nm/3 operations
    > Which mean for each 4 ones in the table
      We only need maximumly 3 operations average


====X====X====X====X====X====X====X====X====X====X====
----x----x----x----x----x----x----x----x----
At (0) = 0 ones | Need 0 operations :
00
00

====X====X====X====X====X====X====X====X====X====X====
----x----x----x----x----x----x----x----x----
At (14) = 3 ones | Need 1 operations :
01
11

00
00

----x----x----x----x----x----x----x----x----
At (13) = 3 ones | Need 1 operations :
10
11

00
00

----x----x----x----x----x----x----x----x----
At (11) = 3 ones | Need 1 operations :
11
01

00
00

----x----x----x----x----x----x----x----x----
At (7) = 3 ones | Need 1 operations :
11
10

00
00

====X====X====X====X====X====X====X====X====X====X====
----x----x----x----x----x----x----x----x----
At (12) = 2 ones | Need 2 operations :
00
11

11
01

00
00

----x----x----x----x----x----x----x----x----
At (10) = 2 ones | Need 2 operations :
01
01

10
11

00
00

----x----x----x----x----x----x----x----x----
At (9) = 2 ones | Need 2 operations :
10
01

01
11

00
00

----x----x----x----x----x----x----x----x----
At (6) = 2 ones | Need 2 operations :
01
10

11
01

00
00

----x----x----x----x----x----x----x----x----
At (5) = 2 ones | Need 2 operations :
10
10

01
11

00
00

----x----x----x----x----x----x----x----x----
At (3) = 2 ones | Need 2 operations :
11
00

01
11

00
00

====X====X====X====X====X====X====X====X====X====X====
----x----x----x----x----x----x----x----x----
At (8) = 1 ones | Need 3 operations :
00
01

10
10

01
11

00
00

----x----x----x----x----x----x----x----x----
At (4) = 1 ones | Need 3 operations :
00
10

11
00

01
11

00
00

----x----x----x----x----x----x----x----x----
At (2) = 1 ones | Need 3 operations :
01
00

10
10

01
11

00
00

----x----x----x----x----x----x----x----x----
At (1) = 1 ones | Need 3 operations :
10
00

01
10

11
01

00
00

====X====X====X====X====X====X====X====X====X====X====
----x----x----x----x----x----x----x----x----
At (15) = 4 ones | Need 4 operations :
11
11

00
01

10
10

01
11

00
00

And if the ones are connected, here is the analizer (I will optimize the algo later)

Small checker

/// The algorithm can be improved with polynomino-generating

#include <algorithm>
#include <iostream>
#include <numeric>
#include <vector>
#include <cstdio>
#include <deque>
#include <cmath>
#include <map>

using namespace std;

void file(const string FILE = "Test")
{
    freopen((FILE + ".INP").c_str(), "r", stdin);
    freopen((FILE + ".OUT").c_str(), "w", stdout);
}
#define all(x) (x).begin(), (x).end()
typedef long long ll;
typedef pair<int, int> pi;

template<typename T> void maximize(T &res, T val) { if (res < val) res = val; }
template<typename T> void minimize(T &res, T val) { if (res > val) res = val; }

const int mx[] = {+1, +0, -0, -1};
const int my[] = {+0, +1, -1, -0};

int n, m;
void flip(int &mask, int i, int j) { mask ^= 1 << (i * m + j); }
bool consecutive(int mask) /// All ones are connected
{
    vector<int> used(n, 0);
    used[0] = 1 << 0;
    int cnt = 0;

    deque<pi> S;
    S.push_back(pi(0, 0));
    while (S.size())
    {
        int x = S.front().first;
        int y = S.front().second;
        S.pop_front();

        for (int k = 0; k < 4; ++k)
        {
            int nx = x + mx[k];
            int ny = y + my[k];
            if (nx < 0 || nx >= n) continue;
            if (ny < 0 || ny >= m) continue;
            if (used[nx] >> ny & 1) continue;
            if ((mask >> (nx * m + ny)) & 1) 
            {
                ++cnt;
                used[nx] |= 1 << ny;
                S.push_back(pi(nx, ny));
            }
        }
    }

    int cntbit = __builtin_popcount(mask);
    return cnt == cntbit;
}

int lim;
vector<int> F;
vector<int> trace;
void bfs(int s)
{
    trace.assign(lim, 0);
    F.assign(lim, -1);

    trace[s] = -1;
    F[s] = 0;

    deque<int> S;
    S.push_back(s);
    while (S.size())
    {
        int u = S.front();
        S.pop_front();

        for (int i = 1; i < n; ++i) /// Select row
        {
            for (int j = 1; j < m; ++j) /// Select column
            {
                vector<int> selected_x = {i - 1, i - 0, i - 1, i - 0};
                vector<int> selected_y = {j - 1, j - 1, j - 0, j - 0};

                int t = u; /// Make new mask
                for (int k = 0; k < 4; ++k) /// Fully 2x2 modified
                    flip(t, selected_x[k], selected_y[k]);

                for (int k = 0; k < 4; ++k) /// Select 1x1 cell not be modified
                {
                    int v = t;
                    flip(v, selected_x[k], selected_y[k]);

                    if (F[v] == -1) /// If not visited
                    {
                        trace[v] = u;
                        F[v] = F[u] + 1;
                        S.push_back(v);
                    }
                }
            }
        }
    }
}

void analize()
{
    int maximum = *max_element(all(F));
    ll total = accumulate(all(F), 0LL);
    vector<int> C(maximum + 1, 0);
    for (int mask = 0; mask < lim; ++mask)
        ++C[F[mask]];
    
    cout << "In an overkill solution, in each table " << n << " x " << m << endl;
    /// If you want to output cases
    vector<vector<int> > M(maximum + 1);
    for (int index = 0; index <= maximum; ++index) M[index].resize(C[index]);
    for (int mask = 0; mask < lim; ++mask)
        M[F[mask]][--C[F[mask]]] = mask; /// this look messy :D

    int k = min(n, m);
    vector<int> maxi(k + 1, -1e9);
    vector<int> mini(k + 1, +1e9);
    for (int operation = 0; operation <= maximum; ++operation)
    {
        for (int mask : M[operation])
        {
            int cntbit = __builtin_popcount(mask);
            if (cntbit <= k) /// If (cntbit > k) then we lost the case 1x(cntbit) rectangle
            {
                if (consecutive(mask)) /// all ones are consecutive
                {
                    maximize(maxi[cntbit], operation);
                    minimize(mini[cntbit], operation);
                }
            }
        }
    }

    for (int i = 0; i <= k; ++i) 
    {
        cout << "- With " << i << " ones randomly on the table | ";
        cout << "You need " << "atleast " << mini[i] << " and atmost " << maxi[i] << " operations " << endl;
    }
}

int main()
{
    // file();
    cin >> n >> m;
    if (1LL * n * m > 25)
    {
        cout << "Well, I dont have enough data for that :(";
        cout << "- Please provide me smaller constraint";
        return 0;
    }

    lim = 1 << (n * m);
    bfs(0); /// fully zero binary table
    analize();

    return 0;
}
    }

    lim = 1 << (n * m);
    bfs(0); /// fully zero binary table
    analize();

    return 0;
}

Observation

Test with 9x9 tables

In an overkill solution, in each table 9 x 9
- With 0 ones consecutively random on the table | You need atleast 0 and atmost 0 operations 
- With 1 ones consecutively random on the table | You need atleast 3 and atmost 3 operations 
- With 2 ones consecutively random on the table | You need atleast 2 and atmost 2 operations 
- With 3 ones consecutively random on the table | You need atleast 1 and atmost 3 operations 
- With 4 ones consecutively random on the table | You need atleast 2 and atmost 4 operations 
- With 5 ones consecutively random on the table | You need atleast 3 and atmost 5 operations
- With 6 ones consecutively random on the table | You need atleast 2 and atmost 6 operations
- With 7 ones consecutively random on the table | You need atleast 3 and atmost 7 operations
- With 8 ones consecutively random on the table | You need atleast 4 and atmost 8 operations
- With 9 ones consecutively random on the table | You need atleast 3 and atmost 9 operations

Full text and comments »

SPyofgame
4 years ago
4

Share exact K candies to all children with a limitation of a child can get

By SPyofgame, history, 4 years ago, In English

Original Problem

M-candies-problem. In this version, we need to calculate the number of ways to share exact $$$K$$$ candies for all $$$N$$$ children that the $$$ith$$$-child doesnt have more than $$$a_i$$$ candies.

And the constraints are

$$$1 \leq N \leq 100$$$
$$$0 \leq K \leq 10^5$$$
$$$0 \leq a_i \leq K$$$

O(n * k^2) solution - Standard DP

Lets $$$DP[i][j] =$$$ number of ways to share first $$$[i]$$$ children with $$$[j]$$$ used candies

Base case $$$(DP[0][0] = 1)$$$ and $$$(DP[0][x] = 0\ \forall\ x > 0)$$$ and ($$$DP[p][x] = 0\ \forall\ x < 0$$$)
At state $$$[i][j]$$$, you can share to the $$$[i]$$$ child $$$(0 \leq x \leq a_i)$$$ candies with $$$(DP[i - 1][j - x])$$$ ways to share

So we have $$$DP[i][j] = \underset{x = 0..a_i}{Sigma}(DP[i - 1][j - x])$$$

And the answer is $$$DP[n][k]$$$



#include <iostream>
#include <vector>

using namespace std;

const int MOD = 1e9 + 7;
void quickadd(int &res, int val) { if ((res += val) >= MOD) res -= MOD; }
int main()
{
    int n, k;
    cin >> n >> k;
    
    vector<int> a(n + 1);
    for (int i = 1; i <= n; ++i)
        cin >> a[i];

    vector<vector<int>> dp(n + 1, vector<int>(k + 1, 0));
    dp[0][0] = 1;
    for (int i = 1; i <= n; ++i)
        for (int j = 0; j <= k; ++j)
            for (int t = max(0, j - a[i]); t <= j; ++t)
                quickadd(dp[i][j], dp[i - 1][t]);

    cout << dp[n][k];
    return 0;
}

O(n * k) solution - Prefixsum DP

Lets $$$F[i][j] = \underset{x = 0..a_i}{Sigma}(DP[i - 1][j - x])$$$$

From the above base case, we also have $$$(F[0][0] = 1)$$$ and $$$(F[x][0] = 1)$$$ and $$$(F[0][x] = 0)$$$ $$$\ \forall\ x \in \mathbb{N}^*$$$

From the above formula, we also have $$$F[i][j] = F[i][j - 1] + F[i - 1][j] - F[i - 1][j - a_i - 1]$$$

From the above answer, we also have $$$F[n][k]$$$

#include <iostream>
#include <vector>

using namespace std;

const int MOD = 1e9 + 7;
void quickadd(int &res, int val) { if ((res += val) >= MOD) res -= MOD; }
void quicksub(int &res, int val) { if ((res -= val)  <  0 ) res += MOD; }
int main()
{
    int n, k;
    cin >> n >> k;
    
    vector<int> a(n + 1);
    for (int i = 1; i <= n; ++i)
        cin >> a[i];

    vector<vector<int>> dp(n + 1, vector<int>(k + 1, 0));
    dp[0][0] = 1;
    for (int i = 1; i <= n; ++i)
    {
        dp[i][0] = 1;
        for (int j = 1; j <= k; ++j)
        {
            dp[i][j] = dp[i][j - 1];
            quickadd(dp[i][j], dp[i - 1][j]);
            if (j > a[i]) quicksub(dp[i][j], dp[i - 1][j - a[i] - 1]);
        }
    }

    cout << dp[n][k];
    return 0;
}

O(n * k) solution - Online algo and space optimization

To compress to 1D array, notice that the current array is build from the previous array, we already having the path $$$F[i][x] = F[i - 1][x]\ \forall\ 0 \leq x \leq k$$$

First we subtract the $$$F[j - a_i - 1]$$$ path $$$\ \forall\ a_i < j \leq k$$$

Then we keep the prefixsum with $$$F[j] += F[j - 1]$$$

#include <iostream>
#include <vector>

using namespace std;

const int MOD = 1e9 + 7;
void quickadd(int &res, int val) { if ((res += val) >= MOD) res -= MOD; }
void quicksub(int &res, int val) { if ((res -= val)  <  0 ) res += MOD; }
int main()
{
    int n, k;
    cin >> n >> k;
    
    vector<int> f(k + 1, 0);
    f[0] = 1;
    for (int i = 0; i < n; ++i)
    {
        int x;
        cin >> x;
        for (int j = k; j >= x + 1; --j) quicksub(f[j], f[j - 1 - x]);
        for (int j = 1; j <= k    ; ++j) quickadd(f[j], f[j - 1]);
    }
    cout << f[k];
    return 0;
}

Extended Version

But what if the constraints were higher, I mean for such $$$M, a_i \leq 10^{18}$$$ limitation ?

O(1) solution for N = 1

/// If (x < k) then there is no way to share candies
/// Else there are exact (k - x + 1) ways to share
int solve1(ll x, ll k)
{
    return max(0LL, k - x + 1) % MOD;
}

O(1) solution for N = 2

/// max(A.get) = x
/// max(B.get) = y
/// max(A.get + B.get) = k
/// * Take max(x) = min(x, k)
/// * Take min(x) = max(0, k - y) = k - min(y, k)
int solve2(ll x, ll y, ll k)
{
    return solve1(k - min(y, k), min(x, k));
}

O(1) solution for N = 3

/// Sigma(i = 1..n) (1)
int f1(ll n)
{
    return n % MOD;
}

/// Sigma(i = 1..n) f1(i)
int f2(ll n)
{
    int t = abs(n) % 2;
    if (t == 0) return 1LL * f1(n + 1) * f1((n + 0) / 2) % MOD;
    if (t == 1) return 1LL * f1(n + 0) * f1((n + 1) / 2) % MOD;
}

/// Sigma(i = 1..n) f2(i)
int f3(ll n)
{
    int t = abs(n) % 6;
    if (t == 0) return 1LL * f1((n + 0) / 6) * f1((n + 1) / 1) % MOD * f1((n + 2) / 1) % MOD;
    if (t == 5) return 1LL * f1((n + 0) / 1) * f1((n + 1) / 6) % MOD * f1((n + 2) / 1) % MOD;
    if (t == 4) return 1LL * f1((n + 0) / 1) * f1((n + 1) / 1) % MOD * f1((n + 2) / 6) % MOD;
    if (t == 3) return 1LL * f1((n + 0) / 3) * f1((n + 1) / 2) % MOD * f1((n + 2) / 1) % MOD;
    if (t == 2) return 1LL * f1((n + 0) / 1) * f1((n + 1) / 3) % MOD * f1((n + 2) / 2) % MOD;
    if (t == 1) return 1LL * f1((n + 0) / 1) * f1((n + 1) / 2) % MOD * f1((n + 2) / 3) % MOD;
}

int f1(ll l, ll r)   { return (l < 0 || l > r) ? 0 : fix(f1(r) - f1(l - 1));     } /// sigma(i=l..r)   i
int f2(ll l, ll r)   { return (l < 0 || l > r) ? 0 : fix(f2(r) - f2(l - 1));     } /// sigma(i=l..r) f1(i)
int f3(ll l, ll r)   { return (l < 0 || l > r) ? 0 : fix(f3(r) - f3(l - 1));     } /// sigma(i=l..r) f2(i)

/// * sigma(i=l..r) min(i, y)
/// = sigma(i=l..y-1) (i)    +    sigma(i=t..r) (y)    | t = max(l, y)
/// =      f2(l, y-1)        +        f1(t, r) * y
int g2(ll l, ll r, ll y) 
{
    minimize(y, r);     ll t = max(l, y);
    return (f2(l, y - 1) + y * f1(t, r)) % MOD;
}

/// Return the value in [0..MOD)
int fix(ll x) { x %= MOD; if (x < 0) x += MOD; return x; }

/// * max(a, b) = a + b - min(a, b)
/// * L = Take min(x) = k - min(k, x)
/// * R = Take max(x) = min(k, y + z)
/// -------------------------------------------------------
/// * Sigma(x = L..R) {  solve2(y, z, x)                  }
/// = Sigma(x = L..R) {  f1(max(0, x - y), min(z, x))     }
/// = Sigma(x = L..R) {  f1(x - min(x, y), min(z, x))     }
/// = Sigma(x = L..R) {  (1 - x + min(x, y) + min(x, z))  }
/// -------------------------------------------------------
/// > f1(L, R) = Sigma(x = L..R) (1)
/// > f2(L, R) = Sigma(x = L..R) (x)
/// > g2(L, R, y) = Sigma(x = L..R) min(x, y)
/// > g2(L, R, z) = Sigma(x = L..R) min(x, z)
int solve3(ll x, ll y, ll z, ll k)
{
    ll L = k - min(k, x);
    ll R = min(k, y + z);
    if (L > R) return 0;
    return fix(f1(L, R) - f2(L, R) + g2(L, R, y) + g2(L, R, z));
}

O(1) solution for N = 4


void quickadd(int &res, int val) { if ((res += val) >= MOD) res -= MOD; }
void quicksub(int &res, int val) { if ((res -= val) <   0 ) res += MOD; }

/// Return the value in [0..MOD)
int fix(ll x) { x %= MOD; if (x < 0) x += MOD; return x; }

/// f1(n)   = sigma(i=1..n)  (1)  = n
/// f2(n)   = sigma(i=1..n)  (i)  = n * (n + 1) / 2
/// f3(n)   = sigma(i=1..n) f2(i) = n * (n + 1) * (n + 2) / 6
/// sqf1(n) = sigma(i=1..n) i^2 = n * (n + 1) * (2n + 1) / 6 = f2(n) * (2n + 1) / 3
int f1(ll n)   { return n % MOD; }
int f2(ll n)   { return 1LL * f1(n) * f1(n + 1) % MOD * inverse_2 % MOD; }
int f3(ll n)   { return 1LL * f1(n) * f1(n + 1) % MOD * f1(n + 2 * 1) % MOD * inverse_6 % MOD; }
int sqf1(ll n) { return 1LL * f1(n) * f1(n + 1) % MOD * f1(n * 2 + 1) % MOD * inverse_6 % MOD; }

/// f1(l, r)   = sigma(i=l..r)  (1)
/// f2(l, r)   = sigma(i=l..r)  (i)
/// f3(l, r)   = sigma(i=l..r) f2(i)
/// sqf1(l, r) = sigma(i=l..r)  i^2
int f1(ll l, ll r)   { return (l < 0 || l > r) ? 0 : fix(f1(r) - f1(l - 1));     }
int f2(ll l, ll r)   { return (l < 0 || l > r) ? 0 : fix(f2(r) - f2(l - 1));     } 
int f3(ll l, ll r)   { return (l < 0 || l > r) ? 0 : fix(f3(r) - f3(l - 1));     }
int sqf1(ll l, ll r) { return (l < 0 || l > r) ? 0 : fix(sqf1(r) - sqf1(l - 1)); }

/// sigma(i=l..r) min(i, y)
int g2(ll l, ll r, ll y) 
{
    minimize(y, r);     ll t = max(l, y);
    return (f2(l, y - 1) + y * f1(t, r)) % MOD;
}
/// sigma(i=l..r) i * min(i, y)
int g3(ll l, ll r, ll y)
{
    minimize(y, r);     ll t = max(l, y);
    return (sqf1(l, t - 1) + y * f2(t, r)) % MOD;
}

///   sigma(i=l..r) min(k - i, y)
/// = sigma(i=k-r..k-l) min(i, y) 
int g2(ll l, ll r, ll y, ll k) { return g2(k - r, k - l, y); }
///   sigma(i=l..r) i * min(k - i, y)
/// = sigma(i=k-r..k-l) * min(y, i) * k     +     sigma(i=k-r..k-l) min(y, i) * k
int g3(ll l, ll r, ll y, ll k) { return g2(k - r, k - l, y) * k - g3(k - r, k - l, y); }

/// If (x < k) then there is no way to share candies
/// Else there are exact (k - x + 1) ways to share
int solve1(ll x, ll k)
{
    return max(0LL, k - x + 1) % MOD;
}

/// max(A.get) = x
/// max(B.get) = y
/// max(A.get + B.get) = k
/// * Take max(x) = min(x, k)
/// * Take min(x) = max(0, k - y) = k - min(y, k)
int solve2(ll x, ll y, ll k)
{
    return solve1(k - min(y, k), min(x, k));
}

/// * max(a, b) = a + b - min(a, b)
/// * L = Take min(x) = k - min(k, x)
/// * R = Take max(x) = min(k, y + z)
/// -------------------------------------------------------
/// * Sigma(x = L..R) {  solve2(y, z, x)                  }
/// = Sigma(x = L..R) {  f1(max(0, x - y), min(z, x))     }
/// = Sigma(x = L..R) {  f1(x - min(x, y), min(z, x))     }
/// = Sigma(x = L..R) {  (1 - x + min(x, y) + min(x, z))  }
/// = f1(L, R) - f2(L, R) + g2(L,R, y) + g2(L, R, z)
/// -------------------------------------------------------
/// > f1(L, R)    = Sigma(x = L..R) (1)
/// > f2(L, R)    = Sigma(x = L..R) (x)
/// > g2(L, R, y) = Sigma(x = L..R) min(x, y)
/// > g2(L, R, z) = Sigma(x = L..R) min(x, z)
int solve3(ll x, ll y, ll z, ll k)
{
    ll L = k - min(k, x);
    ll R = min(k, y + z);
    if (L > R) return 0;
    return fix(f1(L, R) - f2(L, R) + g2(L, R, y) + g2(L, R, z));
}

/// * max(x, y, z, t, z + t) <= R
/// * L = Take min(x + y) = k - min(k, z + t)
/// * R = Take max(x + y) = min(k, x + y)
/// -------------------------------------------------------
/// * Sigma(s = L..R) {  solve2(x, y, s) * solve2(z, t, k-s)  }
///
/// = Sigma(s = L..R) {  min(x, s) - s + min(y, s) + 1  }
///                 * {  min(t, k-s) - k + s + min(z, k-s) + 1  } 
///
/// = Sigma(s = L..R) {  min(x, s) * min(t, k-s) - s * min(t, k-s) + min(y, s) * min(t, k-s) + min(t, k-s) }
///                 + {  min(x, s) * min(z, k-s) - s * min(z, k-s) + min(y, s) * min(z, k-s) + min(z, k-s) }
///                 - {  min(x, s) * k - s * k + min(y, s) * k + k }
///                 + {  min(x, s) * s - s * s + min(y, s) * s + s }
///                 + {  min(x, s) - s + min(y, s) + 1 }
///
/// = Sigma(s = L..R) A(k) + B(x) + B(y) + C(k) + D(z, x) + D(z, y) + D(t, x) + D(t, y)
///
///    With X = max(x, L)
///    A(k)> Sigma(s = L..R) {  1 - k + s * k-s * s  }  
///        = Sigma(s = L..R) (1 - k)    +     Sigma(s = L..R) (s * k)    -    Sigma(s = L..R) (s * s)
///        =    f1(L, R) * (1 - k)      +           f2(L, R) * k         -          sqf1(L, R)  
///   
///    B(x)> Sigma(s = L..R) {  min(x, s) * (s - k + 1)  }
///        = (1 - k) * Sigma(s = L..X-1) min(x, s)    +    Sigma(s = L..X-1) min(x, s) * s
///        = (1 - k) * (f2(L, X-1) + f1(X, R) * x)    +     sqf1(L, X - 1) + f2(X, R) * x  
///
///    C(k)> Sigma(s = L..R) { min(z, k-s) * (1 - s) }
///        = Sigma(s = L..R) min(z, k-s)    -    Sigma(s = L..R) min(z, k-s) * s
///        =         g2(L, R, z, k)         -            g3(L, R, z, t)
///
///    D(z, x)> Sigma(s = L..R) { min(z, k-s) * min(x, s) }
///        = Sigma(s = L..X-1) min(z, k-s) * s    +    Sigma(s = X..R) min(z, k-s) * x
///        =         g3(L, X - 1, z, k)           +            g2(X, R, z, k) * x
/// ---------------------------------------------------------------------------------------------------------------------------------
int solve4(ll x, ll y, ll z, ll t, ll k)
{
    ll L = k - min(k, z + t);
    ll R = min(k, x + y);
    if (L > R) return 0;
    minimize(x, R);     
    minimize(y, R);

    ll X = max(L, x), Y = max(L, y);
    int res = 0;
    quickadd(res, fix(0LL + f1(L, R) * (1 - k) + f2(L, R) * k - sqf1(L, R)));                          /// A(k)
    quickadd(res, fix(0LL + (1 - k) * (f2(L, X - 1) + f1(X, R) * x) + sqf1(L, X - 1) + f2(X, R) * x)); /// B(x)
    quickadd(res, fix(0LL + (1 - k) * (f2(L, Y - 1) + f1(Y, R) * y) + sqf1(L, Y - 1) + f2(Y, R) * y)); /// B(y)
    quickadd(res, fix(0LL + g2(L, R, z, k) - g3(L, R, z, k)));         /// C(z)
    quickadd(res, fix(0LL + g2(L, R, t, k) - g3(L, R, t, k)));         /// C(z)
    quickadd(res, fix(0LL + g3(L, X - 1, z, k) + g2(X, R, z, k) * x)); /// D(x, z)
    quickadd(res, fix(0LL + g3(L, Y - 1, z, k) + g2(Y, R, z, k) * y)); /// D(y, z)
    quickadd(res, fix(0LL + g3(L, X - 1, t, k) + g2(X, R, t, k) * x)); /// D(x, t)
    quickadd(res, fix(0LL + g3(L, Y - 1, t, k) + g2(Y, R, t, k) * y)); /// D(y, t)
    return res;
}

Those fully-combinatorics codes above suck and hard to gets a simplified formula. Though I think this problem can be solved for general $$$a_i$$$ and $$$k$$$ in $$$O(n)$$$ or $$$O(n\ polylog\ n)$$$ with combinatorics or/and inclusion-exclusion, but I failed to find such formula.

Can someone give me a hint ?

Full text and comments »

combinatorics, math

SPyofgame
4 years ago
3

Antimacro

By SPyofgame, history, 4 years ago, In English

Does anyone know any simple website or tool or vscode-extension that replace all macros directly into the source code ? I tried to search on google/vscode-extension something like antimacro ..., remove macro ..., replace macro ... but find no result. Thanks for helping ^^

Full text and comments »

SPyofgame
4 years ago
7

Losing Right Navigation Html

By SPyofgame, history, 4 years ago, In English

Can someone help me. I didnt download anything or any extensions these days but suddenly my both Google and Mozilla Firefox browsers give me codeforces without right navigation tabs :(

I really need the navigations for reading recent news on codeforces and some are helpful <3 Thanks

Full text and comments »

SPyofgame
4 years ago
4

Counting such number whose digit product multiple to itself smaller than given number

By SPyofgame, history, 4 years ago, In English

The problem

Lets $$$f(x) = $$$ product of digits of $$$x$$$. Example: $$$f(123) = 1 * 2 * 3 = 6$$$, $$$f(990) = 9 * 9 * 0 = 0$$$, $$$f(1) = 1$$$

The statement is, given such limitation $$$N$$$, count the number of positive $$$x$$$ that $$$1 \leq x * f(x) \leq N$$$

Example: For $$$N = 20$$$, there are 5 valid numbers $$$1, 2, 3, 4, 11$$$

The limitation

Subtask 1: $$$N \leq 10^6$$$
Subtask 2: $$$N \leq 10^{10}$$$
Subtask 3: $$$N \leq 10^{14}$$$
Subtask 4: $$$N \leq 10^{18}$$$

My approach for subtask 1

If $$$(x > N)$$$ or $$$(f(x) > N)$$$ then $$$(x * f(x) > N)$$$. So we will only care about $$$x \leq N$$$ that $$$x * f(x) \leq N$$$

Calculating x * f(x) - O(log10(x))

ll cal(ll x, ll lim)
{
    ll res = x;
    do {
        int t = x % 10;
        if (res > lim / t) return -1; /// (x * f(x) > N)
        res *= t;
    } while (x /= 10);
    return res;
}

Counting - O(n log10(n))

ll brute(ll n)
{
    ll res = 0;
    for (ll x = 1; x <= n; ++x)
        res += cal(x, n) > 0; /// 1 <= x * f(x) <= N

    return res;
}

My approach for subtask 2

If $$$x$$$ contains $$$0$$$ then $$$f(x) = 0 \Rightarrow x \times f(x) < 1$$$. We only care about such $$$x$$$ without $$$0$$$ digit

Building function - O(result + log10(n))

/// Run 2e13 under 1 secs in (-O2) flag
/// X: current X
/// Y: f(X)
/// T: X * f(X)

ll N;
int res = 0;
vector<int> d;
void build(ll X = 0, ll Y = 1, ll T = 0)
{
    if (T >= 1) ++res; /// 1 <= x * f(x) <= N
    for (int v = 1; v <= 9; ++v) /// if (v = 0) then f(x) = 0
    {
        ll NX = X * 10 + v; /// Insert rightmost digits
        ll NY = Y * v;      /// Calculate digits production
        ll NT = NX * NY;
        if (NT > N) break;  /// x * f(x) > N
        build(NX, NY, NT);
    }
}

Here is the solution:

Let takes some $$$x$$$ satisfy $$$1 \leq x * f(x) \leq N$$$

We can easily prove that $$$f(x) \leq x$$$, and because $$$x * f(x) \leq N$$$, we have $$$f(x) \leq \sqrt{N}$$$ (notice that $$$x$$$ might bigger than $$$\sqrt{N}$$$)

Since $$$f(x)$$$ is product of digits of $$$x$$$, which can be obtain by such digits {$$$1, 2, \dots, 9$$$}. So $$$f(x) = 2^a \times 3^b \times 5^c \times 7^d$$$

So we can bruteforces all possible tuple of such $$$(a, b, c, d)$$$ satisfy ($$$P = 2^a \times 3^b \times 5^c \times 7^d \leq \sqrt{N}$$$). There are small amount of such tuples (493 tuples for $$$N = 10^9$$$ and 5914 tuples for $$$N = 10^{18}$$$)

Find all possible tuples - O(quartic_root(N))

/// [L, R] = [1, N]
/// lim = sqrt(N)
ll solve(int p2 = 0, int p3 = 0, int p5 = 0, int p7 = 0, ll P = 1)
{
    if (P > lim) return 0; /// Dont care such tuples whose P > sqrt(N)

    VL = (L + val - 1) / val; ///  ceil(L / P)
    VR = R / val;             /// floor(R / P)
    ll res = magic(0, 18, p2, p3, p5, p7); /// Calculating subproblem

    /// By doing these if-condition, it is guarantee that all tuples generated are all unique
    if (!p3 && !p5 && !p7) res += solve(p2 + 1, p3, p5, p7, P * 2); /// Continue increasing a
    if (       !p5 && !p7) res += solve(p2, p3 + 1, p5, p7, P * 3); /// Continue increasing b
    if (              !p7) res += solve(p2, p3, p5 + 1, p7, P * 5); /// Continue increasing c
                           res += solve(p2, p3, p5, p7 + 1, P * 7); /// Continue increasing d

    return res;
}

For each tuples, we need to counting the numbers of such $$$x$$$ that $$$1 \leq x \times f(x) \leq N$$$ and $$$f(x) = P$$$.

We have the value $$$P$$$, so $$$\lceil \frac{1}{P} \rceil \leq x \leq \lfloor \frac{N}{P} \rfloor$$$.
We have the value $$$f(x) = P$$$, so $$$x$$$ can be made by digits having the product exactly $$$P$$$, so we can do some DP-digit

So now we have to solve this DP-digit problem: Calculate numbers of such $$$x$$$ ($$$L \leq x \leq R$$$) whose $$$f(x) = P$$$

Solving Subproblem

We try to build each digits by digits for $$$X$$$. Because $$$X \leq N$$$, so we have to build about $$$18$$$ digits.

Lets make a recursive function $$$magic(X, N, p2, p3, p5, p7)$$$

Lets make some definition

Notice that

Ugly precalculation code - O(1)

const int l2 = 60, l3 = 37, l5 = 25, l7 = 21, l10 = 19;
ll        pw2[l2], pw3[l3], pw5[l5], pw7[l7], pw10[l10];
int       cost[10][10];
void precal()
{
    pw2[0] = pw3[0] = pw5[0] = pw7[0] = pw10[0] = 1; 
    for (int i2  = 1; i2  < l2 ; ++i2 ) pw2 [i2]  =  2 * pw2 [i2  - 1];
    for (int i3  = 1; i3  < l3 ; ++i3 ) pw3 [i3]  =  3 * pw3 [i3  - 1];
    for (int i5  = 1; i5  < l5 ; ++i5 ) pw5 [i5]  =  5 * pw5 [i5  - 1];
    for (int i7  = 1; i7  < l7 ; ++i7 ) pw7 [i7]  =  7 * pw7 [i7  - 1];
    for (int i10 = 1; i10 < l10; ++i10) pw10[i10] = 10 * pw10[i10 - 1];
                                                                                /// 2^a * 3^b * 5^c * 7^d = val
    cost[1][2] = 0;    cost[1][3] = 0;    cost[1][5] = 0;    cost[1][7] = 0;    ///  0     0     0     0  =  1
    cost[2][2] = 1;    cost[2][3] = 0;    cost[2][5] = 0;    cost[2][7] = 0;    ///  1     0     0     0  =  2
    cost[3][2] = 0;    cost[3][3] = 1;    cost[3][5] = 0;    cost[3][7] = 0;    ///  0     1     0     0  =  3
    cost[4][2] = 2;    cost[4][3] = 0;    cost[4][5] = 0;    cost[4][7] = 0;    ///  2     0     0     0  =  4
    cost[5][2] = 0;    cost[5][3] = 0;    cost[5][5] = 1;    cost[5][7] = 0;    ///  0     0     1     0  =  5
    cost[6][2] = 1;    cost[6][3] = 1;    cost[6][5] = 0;    cost[6][7] = 0;    ///  1     1     0     0  =  6
    cost[7][2] = 0;    cost[7][3] = 0;    cost[7][5] = 0;    cost[7][7] = 1;    ///  0     0     0     1  =  7
    cost[8][2] = 3;    cost[8][3] = 0;    cost[8][5] = 0;    cost[8][7] = 0;    ///  3     0     0     0  =  8
    cost[9][2] = 0;    cost[9][3] = 2;    cost[9][5] = 0;    cost[9][7] = 0;    ///  0     2     0     0  =  9
}

magic function - O(18*p2*p3*p5*p7)

ll VL, VR;
ll f[l10][l2][l3][l5][l7];
ll magic(ll X, int N, int p2, int p3, int p5, int p7)
{
    if (p2 < 0 || p3 < 0 || p5 < 0 || p7 < 0) return 0;
    if (N == 0) return (p2 + p3 + p5 + p7 == 0) && (VL <= X && X <= VR);

    ll mn = X * pw10[N];
    ll mx = mn + pw10[N] - 1;
    if (mx < VL || mn > VR) return 0;

    ll &save = f[N][p2][p3][p5][p7];
    bool memo = (VL <= mn) && (mx <= VR);
    if (memo) if (save != -1) return save;
    
    ll res = 0;
    if (X == 0) res = magic(0, N - 1, p2, p3, p5, p7);
    for (int v = 1; v <= 9; ++v)
    {
        int c2 = cost[v][2];
        int c3 = cost[v][3];
        int c5 = cost[v][5];
        int c7 = cost[v][7];
        res += magic(X * 10 + v, N - 1, p2 - c2, p3 - c3, p5 - c5, p7 - c7);
    }
    if (memo) save = res;
    
    return res;
}

About the proving stuff

1) $$$\forall x \in \mathbb{N}, f(x) \leq x$$$

Proof: $$$x = \overline{\dots dcba} = \dots + d \times 10^3 + c \times 10^2 + b \times 10^1 + a \times 10^0 \geq \dots \times d \times c \times b \times a = f(x)$$$

2) If $$$x$$$ satisfy then $$$f(x) \leq \sqrt{N}$$$ must be satisfied

Proof: $$$x \times f(x) \leq N \Rightarrow f(x) \times f(x) \leq N \Rightarrow f(x) \leq \sqrt{N}$$$

3) $$$\exists\ a, b, c, d \in \mathbb{N} \rightarrow f(x) = 2^a \times 3^b \times 5^c \times 7^d$$$

Since $$$x = \overline{\dots dcba} \Rightarrow (0 \leq \dots, d, c, b, a \leq 9)$$$ and $$$f(x) = \dots \times d \times c \times b \times a$$$
And we also have $$$\forall$$$ digit $$$v$$$ ($$$v \in \mathbb{N}, 0 \leq v \leq 9$$$) $$$\rightarrow \exists\ a, b, c, d \in \mathbb{N} \rightarrow v = 2^a \times 3^b \times 5^c \times 7^d$$$
And because $$$f(x)$$$ is the product of digits of $$$x$$$, hence the statement is correct

4) If we know $$$f(x)$$$ we can find such $$$x$$$ satisfy $$$x \in [L, R]$$$

Proof: Since $$$f(x)$$$ is created from factors of digits of $$$x$$$, so $$$x$$$ can also be generated using the factors

5) Number of tuples $$$(a, b, c, d)$$$ satisfy $$$P = 2^a \times 3^b \times 5^c \times 7^d \leq \sqrt{N}$$$ is very small

Lets $$$O(k(x)) = O(log_2(x) \times log_3(x) \times log_5(x) \times log_7(x))$$$
Since each value $$$x$$$ have the upper bound of $$$log_x(\sqrt{N})$$$. So the complexity is about $$$O(log_2(\sqrt{N}) \times log_3(\sqrt{N}) \times log_5(\sqrt{N}) \times log_7(\sqrt{N})) = O(k(\sqrt{N})) \leq O(log_2(\sqrt{N})^4)$$$
But actually for $$$R \leq 10^{18}$$$, the complexity is just about $$$O(k(\sqrt[4]{N}))$$$

Weak proof - N = 10^k

With (N = 1) -> Found 1 valid tuples || upper_bound 1
With (N = 10) -> Found 3 valid tuples || upper_bound 3
With (N = 100) -> Found 10 valid tuples || upper_bound 10
With (N = 1000) -> Found 22 valid tuples || upper_bound 24
With (N = 10000) -> Found 46 valid tuples || upper_bound 50
With (N = 100000) -> Found 83 valid tuples || upper_bound 95
With (N = 1000000) -> Found 141 valid tuples || upper_bound 166
With (N = 10000000) -> Found 225 valid tuples || upper_bound 269
With (N = 100000000) -> Found 338 valid tuples || upper_bound 414
With (N = 1000000000) -> Found 493 valid tuples || upper_bound 612
With (N = 10000000000) -> Found 694 valid tuples || upper_bound 874
With (N = 100000000000) -> Found 951 valid tuples || upper_bound 1212
With (N = 1000000000000) -> Found 1273 valid tuples || upper_bound 1640
With (N = 10000000000000) -> Found 1670 valid tuples || upper_bound 2172
With (N = 100000000000000) -> Found 2155 valid tuples || upper_bound 2825
With (N = 1000000000000000) -> Found 2736 valid tuples || upper_bound 3614
With (N = 10000000000000000) -> Found 3427 valid tuples || upper_bound 4558
With (N = 100000000000000000) -> Found 4246 valid tuples || upper_bound 5676
Error: 5701

Weak proof - N = 2^k

With (N = 1) -> Found 1 valid tuples || upper_bound 1
With (N = 2) -> Found 1 valid tuples || upper_bound 1
With (N = 4) -> Found 2 valid tuples || upper_bound 2
With (N = 8) -> Found 2 valid tuples || upper_bound 3
With (N = 16) -> Found 4 valid tuples || upper_bound 4
With (N = 32) -> Found 5 valid tuples || upper_bound 6
With (N = 64) -> Found 8 valid tuples || upper_bound 8
With (N = 128) -> Found 10 valid tuples || upper_bound 11
With (N = 256) -> Found 14 valid tuples || upper_bound 14
With (N = 512) -> Found 17 valid tuples || upper_bound 19
With (N = 1024) -> Found 23 valid tuples || upper_bound 24
With (N = 2048) -> Found 28 valid tuples || upper_bound 30
With (N = 4096) -> Found 36 valid tuples || upper_bound 38
With (N = 8192) -> Found 43 valid tuples || upper_bound 47
With (N = 16384) -> Found 53 valid tuples || upper_bound 58
With (N = 32768) -> Found 63 valid tuples || upper_bound 71
With (N = 65536) -> Found 77 valid tuples || upper_bound 85
With (N = 131072) -> Found 89 valid tuples || upper_bound 102
With (N = 262144) -> Found 106 valid tuples || upper_bound 121
With (N = 524288) -> Found 122 valid tuples || upper_bound 143
With (N = 1048576) -> Found 143 valid tuples || upper_bound 167
With (N = 2097152) -> Found 164 valid tuples || upper_bound 195
With (N = 4194304) -> Found 190 valid tuples || upper_bound 225
With (N = 8388608) -> Found 216 valid tuples || upper_bound 260
With (N = 16777216) -> Found 248 valid tuples || upper_bound 298
With (N = 33554432) -> Found 279 valid tuples || upper_bound 339
With (N = 67108864) -> Found 317 valid tuples || upper_bound 386
With (N = 134217728) -> Found 355 valid tuples || upper_bound 437
With (N = 268435456) -> Found 400 valid tuples || upper_bound 492
With (N = 536870912) -> Found 446 valid tuples || upper_bound 553
With (N = 1073741824) -> Found 498 valid tuples || upper_bound 620
With (N = 2147483648) -> Found 553 valid tuples || upper_bound 692
With (N = 4294967296) -> Found 614 valid tuples || upper_bound 770
With (N = 8589934592) -> Found 679 valid tuples || upper_bound 855
With (N = 17179869184) -> Found 749 valid tuples || upper_bound 946
With (N = 34359738368) -> Found 825 valid tuples || upper_bound 1045
With (N = 68719476736) -> Found 905 valid tuples || upper_bound 1152
With (N = 137438953472) -> Found 991 valid tuples || upper_bound 1266
With (N = 274877906944) -> Found 1083 valid tuples || upper_bound 1388
With (N = 549755813888) -> Found 1181 valid tuples || upper_bound 1520
With (N = 1099511627776) -> Found 1286 valid tuples || upper_bound 1660
With (N = 2199023255552) -> Found 1398 valid tuples || upper_bound 1810
With (N = 4398046511104) -> Found 1517 valid tuples || upper_bound 1970
With (N = 8796093022208) -> Found 1646 valid tuples || upper_bound 2140
With (N = 17592186044416) -> Found 1780 valid tuples || upper_bound 2321
With (N = 35184372088832) -> Found 1924 valid tuples || upper_bound 2513
With (N = 70368744177664) -> Found 2074 valid tuples || upper_bound 2717
With (N = 140737488355328) -> Found 2235 valid tuples || upper_bound 2933
With (N = 281474976710656) -> Found 2402 valid tuples || upper_bound 3161
With (N = 562949953421312) -> Found 2581 valid tuples || upper_bound 3403
With (N = 1125899906842624) -> Found 2767 valid tuples || upper_bound 3658
With (N = 2251799813685248) -> Found 2966 valid tuples || upper_bound 3928
With (N = 4503599627370496) -> Found 3174 valid tuples || upper_bound 4212
With (N = 9007199254740992) -> Found 3393 valid tuples || upper_bound 4511
With (N = 18014398509481984) -> Found 3625 valid tuples || upper_bound 4826
With (N = 36028797018963968) -> Found 3866 valid tuples || upper_bound 5157
With (N = 72057594037927936) -> Found 4121 valid tuples || upper_bound 5505
With (N = 144115188075855872) -> Found 4386 valid tuples || upper_bound 5870
With (N = 288230376151711744) -> Found 4665 valid tuples || upper_bound 6253
With (N = 576460752303423488) -> Found 4955 valid tuples || upper_bound 6655
With (N = 1152921504606846976) -> Found 5260 valid tuples || upper_bound 7076
Error: 23112

Code

#include <iostream>
#include <cmath>
#include <cstdio>

using namespace std;

typedef long long ll;

ll L;
ll R;
int lim;
ll solve(int p2 = 0, int p3 = 0, int p5 = 0, int p7 = 0, ll val = 1)
{
    if (val > lim) return 0;

    ll res = 1;
    if (!p3 && !p5 && !p7) res += solve(p2 + 1, p3, p5, p7, val * 2);
    if (       !p5 && !p7) res += solve(p2, p3 + 1, p5, p7, val * 3);
    if (              !p7) res += solve(p2, p3, p5 + 1, p7, val * 5);
                           res += solve(p2, p3, p5, p7 + 1, val * 7);

    return res;
}

const double EPS = 0.0188704;
int main()
{
    freopen("Test.OUT", "w", stdout);

    double error = 0;
    ll mul = 2;
    ll over = (1LL << 62) / mul;
    for (ll t = 1; t < over; t *= mul)
    {
        L = 1;
        R = t;
        lim = sqrt(R);
        int res = solve();
        double base = log(sqrt(sqrt(R))) + 1;
        int complexity = ceil(EPS + (base / log(2)) * (base / log(3)) * (base / log(5)) * (base / log(7)));

        cout << "With (N = " << R << ") -> Found " << res << " valid tuples || upper_bound " << complexity << endl;
        int delta = complexity - res;
        if (delta < 0)
        {
            cout << "Program failed at (t = " << t << ") -> ";
            cout << "Returning " << res << " compared to " << complexity << endl;
            exit(0);
        }
        else error += delta;
    }
    cout << "Error: " << error;

    return 0;
}

About the complexity:

$$$O(h(x)) = O(log_{10}(N))$$$ is number of digits we have to build
$$$O(k(x)) = O(log_2(N) \times log_3(N) \times log_5(N) \times log_7(N)) = O(log(N)^4)$$$ is product of all prime digits $$$p$$$ with its maximum power $$$k$$$ satisfy $$$p^k \leq N$$$
$$$O(g(x)) = O(k(\sqrt{N}))$$$ is number of such valid tuples, but for $$$1 \leq N \leq 10^{18}$$$ it is about $$$\approx O(k(\sqrt[4]{N})) \leq O(log_2^4{\sqrt[4]{N}})$$$
The space complexity is $$$O(SPACE) = O(h(x) \times k(x)) = O(log_2(N) \times log_3(N) \times log_5(N) \times log_7(N) \times log_{10}(N)) = O(log(N)^5)$$$
The time complexity is $$$O(TIME) = O(SPACE) + O(g(x) \times k(x)) \approx O(log(N)^4 \times log_2^4{\sqrt[4]{N}})$$$

Other

Here is the Vietnamese version of the problem where we need to count such valid numbers in range $$$[A..B]$$$ where $$$1 \leq A \leq B \leq 10^{18}$$$. (Vietnamese Editorial)

About the real complexity for $$$O(k(x))$$$, do you find a better/closer upper bound of counting such tuples of $$$(a, b, c, d \in \mathbb{N})$$$ that $$$P = 2^a \times 3^b \times 5^c \times 7^d \leq N$$$ ? Since my $$$O(log_2(N))$$$ complexity seem very far than the real counting and $$$O(log_2(\sqrt{N}))$$$ is closer but just correct for $$$N \leq 10^{18}$$$
Thanks for reading and sorry for updating this blog publicly so many times (most is for the proving path and correcting the complexity)

Full text and comments »

SPyofgame
4 years ago
7

Wondering for a better approach to this problem

By SPyofgame, history, 4 years ago, In English

I come to a fun problem, and after I tried hard to solve it, I curiously to find better algorithm, but I cant.

The problem is:

There are $$$N$$$ buildings with $$$a_1, a_2, \dots, a_n$$$ metters height. These days are hard, the heavy raining weather still not ended yet. In day $$$d$$$, every building with height $$$h$$$ only have $$$x_d = \lfloor \frac{h}{d} \rfloor$$$ height left of good space to use, while others are sunk underwater. Every building having the same value $$$x_d$$$ on day $$$d$$$ will group together (including $$$x_d = 0$$$ which sunk completely underwater) in a way that no other building with same value $$$x_d$$$ in another group.

The question is:

Output $$$N$$$ lines, in each size $$$s$$$ from $$$1$$$ to $$$n$$$, what is the earliest day $$$d$$$ that have at least one group of size $$$s$$$ (if there is no suitable day then output -1)

The constraints are:

Subtaks 1: $$$n \leq 100~ and ~a_i \leq 2.10^5$$$
Subtaks 2: $$$n \leq 300~ and ~a_i \leq 3.10^6$$$
Subtaks 3: $$$n \leq 300~ and ~a_i \leq 5.10^7$$$

The examples are:

Example 1

Input:

3
1 2 5

Output:

1
3
6

Explanation:

Day 1: $$$x[d] = $$$ { $$$ \lfloor \frac{1}{1} \rfloor, \lfloor \frac{2}{1} \rfloor, \lfloor \frac{5}{1} \rfloor$$$ } $$$=$$$ { $$$1, 2, 5$$$ } — First group of size 1: Any of {$$$1$$$} {$$$2$$$} {$$$5$$$}

Day 2: $$$x[d] = $$$ { $$$ \lfloor \frac{1}{2} \rfloor, \lfloor \frac{2}{2} \rfloor, \lfloor \frac{5}{2} \rfloor$$$ } $$$=$$$ { $$$0, 1, 2$$$ }

Day 3: $$$x[d] = $$$ { $$$ \lfloor \frac{1}{3} \rfloor, \lfloor \frac{2}{3} \rfloor, \lfloor \frac{5}{3} \rfloor$$$ } $$$=$$$ { $$$0, 0, 1$$$ } — First group of size 2: Only {$$$1, 2$$$}

Day 4: $$$x[d] = $$$ { $$$ \lfloor \frac{1}{4} \rfloor, \lfloor \frac{2}{4} \rfloor, \lfloor \frac{5}{4} \rfloor$$$ } $$$=$$$ { $$$0, 0, 1$$$ }

Day 5: $$$x[d] = $$$ { $$$ \lfloor \frac{1}{5} \rfloor, \lfloor \frac{2}{5} \rfloor, \lfloor \frac{5}{5} \rfloor$$$ } $$$=$$$ { $$$0, 0, 1$$$ }

Day 6: $$$x[d] = $$$ { $$$ \lfloor \frac{1}{6} \rfloor, \lfloor \frac{2}{6} \rfloor, \lfloor \frac{5}{6} \rfloor$$$ } $$$=$$$ { $$$0, 0, 0$$$ } — First group of size 3: Only {$$$1, 2, 5$$$}

Example 2

Input:

3
1 1 5

Output:

1
1
6

Explanation:

Day 1: $$$x[d] = $$$ { $$$ \lfloor \frac{1}{1} \rfloor, \lfloor \frac{1}{1} \rfloor, \lfloor \frac{5}{1} \rfloor$$$ } $$$=$$$ { $$$1, 1, 5$$$ } — First group of size 1: Any of {$$$1$$$} {$$$1$$$} {$$$5$$$}

Day 2: $$$x[d] = $$$ { $$$ \lfloor \frac{1}{2} \rfloor, \lfloor \frac{1}{2} \rfloor, \lfloor \frac{5}{2} \rfloor$$$ } $$$=$$$ { $$$0, 0, 2$$$ } — First group of size 2: Only {$$$1, 1$$$}

Day 3: $$$x[d] = $$$ { $$$ \lfloor \frac{1}{3} \rfloor, \lfloor \frac{1}{3} \rfloor, \lfloor \frac{5}{3} \rfloor$$$ } $$$=$$$ { $$$0, 0, 1$$$ }

Day 4: $$$x[d] = $$$ { $$$ \lfloor \frac{1}{4} \rfloor, \lfloor \frac{1}{4} \rfloor, \lfloor \frac{5}{4} \rfloor$$$ } $$$=$$$ { $$$0, 0, 1$$$ }

Day 5: $$$x[d] = $$$ { $$$ \lfloor \frac{1}{5} \rfloor, \lfloor \frac{1}{5} \rfloor, \lfloor \frac{5}{5} \rfloor$$$ } $$$=$$$ { $$$0, 0, 1$$$ }

Day 6: $$$x[d] = $$$ { $$$ \lfloor \frac{1}{6} \rfloor, \lfloor \frac{1}{6} \rfloor, \lfloor \frac{5}{6} \rfloor$$$ } $$$=$$$ { $$$0, 0, 0$$$ } — First group of size 3: Only {$$$1, 2, 5$$$}

Example 3

Input:

3
2 2 2

Output:

-1
-1
2

Explanation:

Day 1: $$$x[d] = $$$ { $$$ \lfloor \frac{2}{1} \rfloor, \lfloor \frac{2}{1} \rfloor, \lfloor \frac{2}{1} \rfloor$$$ } $$$=$$$ { $$$2, 2, 2$$$ } — First group of size 3: Only {$$$2, 2, 2$$$}

Day 2: $$$x[d] = $$$ { $$$ \lfloor \frac{2}{2} \rfloor, \lfloor \frac{2}{2} \rfloor, \lfloor \frac{2}{2} \rfloor$$$ } $$$=$$$ { $$$0, 0, 0$$$ }

Day 3: $$$x[d] = $$$ { $$$ \lfloor \frac{2}{3} \rfloor, \lfloor \frac{2}{3} \rfloor, \lfloor \frac{2}{3} \rfloor$$$ } $$$=$$$ { $$$0, 0, 0$$$ }

Day 3 $$$\leq k \rightarrow \infty$$$: $$$x[d] = $$$ { $$$ \lfloor \frac{2}{k} \rfloor, \lfloor \frac{2}{k} \rfloor, \lfloor \frac{2}{k} \rfloor$$$ } $$$=$$$ { $$$0, 0, 0$$$ } — There are no group of size 1 and 2

My approach to this problem:

Observation: Harmonic Sequence

Subtask 1: A[i] <= 2 * 10^5

Since day $$$k > max(a[i])$$$ will make all array always being as $$$x[k] =$$$ {$$$0, 0, \dots, 0$$$}, we only need to check for each value $$$k \in [1, d]$$$. Then we can calculate how many group of size $$$s$$$ in day $$$k$$$. My approach is to increase the value $$$f[\lfloor \frac{a_i}{k} \rfloor] = s$$$ means in day $$$\lfloor \frac{a_i}{k} \rfloor$$$ have group of size $$$s$$$, then minimize the first day we have the group of size $$$s$$$

This approach is $$$O(n \times max(a_i))$$$

int main()
{
    int n;
    cin >> n;

    vector<int> a(n);
    for (int &x : a) cin >> x;
    int k = *max_element(all(a)) + 1;

    vector<int> f(k + 1, 0);
    vector<int> q(n + 1, +INF);
    for (int x = 1; x <= k; ++x) /// Checking possible dividing value
    {
        for (int t : a) ++f[t / x];               /// Calculating value f[] for each day [t / x]
        for (int t : a) minimize(q[f[t / x]], x); /// Minimize first day have group of size k
        for (int t : a) --f[t / x];               /// Reset the array
    }

    for (int i = 1; i <= n; ++i)
        cout << (q[i] == +INF ? -1 : q[i]) << '\n';
    
    return 0;
}

Subtask 2: A[i] <= 3 * 10^6

Lets $$$setL$$$ be a set of good potential $$$k$$$ for dividing. Since there are some $$$k$$$ that having group of size $$$s$$$ but never will be good enough to be the smallest day. That is for each $$$a_i$$$, we find all $$$k$$$ that have smallest day $$$d$$$ that might appear a group of size $$$\lfloor \frac{a_i}{k} \rfloor$$$ and insert into $$$setL$$$.

The upper bound complexity is $$$O(n\ \times k\ log(k))$$$ where $$$k = |setL| = O(n \times \sqrt{max(a_i)})$$$ but since $$$a_i$$$ is small, and the bigger $$$n$$$ is, the more number of duplicate values will all be eliminated by using $$$set<>$$$. It would reduce both complexity and constant factor significantly.

int main()
{
    int n;
    cin >> n;

    set<int> setL;
    vector<int> a(n);
    for (int &x : a)
    {
        cin >> x;

        int sqrtx = sqrt(x);
        for (int t = 1; t <= sqrt(x); ++t)
        {
            setL.insert(x / t + 1);
            setL.insert(t + 1);
        }
        setL.insert(x + 1);
        setL.insert(1);
    }

    vector<int> res(n + 1, +INF);
    for (int x : setL)
    {
        map<int, int> F;
        for (int t : a) ++F[t / x];                 /// Calculating value f[] for each day [t / x]
        for (int t : a) minimize(res[F[t / x]], x); /// Minimize first day have group of size k
    }

    for (int g = 1; g <= n; ++g) cout << (res[g] == +INF ? -1 : res[g]) << '\n';
    return 0;
}

Subtask 3: A[i] <= 5 * 10^7

With $$$T = \sqrt{max(a_i)}$$$ then all such $$$k \leq T$$$ are potential. We only care for such $$$k > T$$$ which we can use branch and bound to reduce constant matter.

By using map, we can calculate faster by ignoring duplicates. Iterating through map reducing the $$$O(log)$$$ factor each query.

Since we solve each potential $$$k$$$ from large to smaller, the value $$$cur$$$ will be from small to larger, so we dont have to use $$$min(a, b)$$$ function, and only update for each $$$res[x]$$$ once. We exit when all $$$N$$$ query are found or we tried all potential $$$k$$$

Hence, the complexity is $$$O(n\ log\ n + n \times (k + \sqrt{max(a_i)}) + k\ log\ k)$$$ where $$$k = |setL|$$$

int n;
int cnt = 0;
vector<int> a;
map<int, int> b;
map<int, int>::iterator it;
vector<int> res;

void out()
{
    for (int i = 1; i <= n; ++i) cout << (res[i] == +INF ? -1 : res[i]) << '\n';
    exit(0);
}
void solve(int x)
{
    int sum = 0, val = -1;
    for (it = b.begin(); it != b.end(); ++it)
    {
        int cur = it->first / x;
        if (cur != val)
        {
            val = cur;
            if (sum && res[sum] == +INF) { res[sum] = x; if (++cnt == n) out(); }
            sum = 0;
        }
        sum += it->second;
    }
    if (sum && res[sum] == +INF) { res[sum] = x; if (++cnt == n) out(); }
}

int main()
{
    cin >> n;
    a.resize(n);
    for (int &x : a)
    {
        cin >> x;
        ++b[x];
    }

    
    res.assign(n + 1, +INF);
    int k = ceil(sqrt(*max_element(all(a))));
    for (int x = 1; x <= k; ++x) solve(x);

    if (k > 1)
    {
        set<int> setL;
        for (it = b.begin(); it != b.end(); ++it)
        {
            int x = it->first;
            int lim = min(k, x / (k - 1));
            for (int t = 1; t <= lim; ++t)
            {
                int v = x / t + 1;
                setL.insert(v);
            }
        }
        for (int x : setL) solve(x);
    }
    out();
    return 0;
}

My question

Is there a better algorithm for larger $$$N$$$ ? (upto $$$10^4, 10^6$$$)
Is there a better algorithm for larger $$$a_i$$$ ? (upto $$$10^{12}, 10^{16}, 10^{18}$$$)
Can I use combinatorics or euclidian algorithm for this problem ?

Full text and comments »

SPyofgame
4 years ago
7

Need to optimize the solution for a variant problem of Deque L — Atcoder DP contest

By SPyofgame, history, 4 years ago, In English

Original Problem

In this problem. The statement give you a deque of $$$n$$$ number. There are two players take turn alternately. In one turn, they can select either leftmost or rightmost element and remove it, and earn $$$x$$$ points where $$$x$$$ is the removed number. They play until the deque is empty. Lets $$$X, Y$$$ are the scores of the first player and the second. Find the maximum $$$X - Y$$$ when they play optimally

We can use dynamic-programming to solve it in $$$O(n^2)$$$ or we can improve upto $$$O(n\ polylog(n))$$$ using data-structure like Treap and fully-optimized by deque in linear $$$O(n)$$$

Recursive DP - O(n^2)

const ll LINF = 0x3f3f3f3f3f3f3f3f;

int n;
vector<int> a;
vector<vector<ll> > f;
ll magic(ll L = 0, ll R = n - 1) {
    if (f[L][R] != -LINF) return f[L][R];
    if (L >= R + 1) return f[L][R] = 0;
    if (L >= R - 1) return f[L][R] = max(a[L], a[R]);

    ll A = a[L] + min(magic(L + 2, R - 0), magic(L + 1, R - 1));
    ll B = a[R] + min(magic(L + 1, R - 1), magic(L + 0, R - 2));
    return f[L][R] = max(A, B);
}

int main()
{
    cin >> n;
    a.resize(n);
    f.assign(n, vector<ll>(n, -LINF));

    ll sum = 0;
    for (int &t : a)
    {
        cin >> t;
        sum += t;
    }

    ll x = magic(); /// Maximum Possible x can get
    ll y = sum - x; /// Calculate the score of second player
    cout << x - y;
    return 0;
}

DP iterative - O(n^2)

int main()
{
    int n;
    cin >> n;

    vector<int> a(n);
    vector<vector<ll> > dp(n, vector<ll>(n));
    for (int r = 0; r < n; ++r)
    {
        cin >> a[r];
        dp[r][r] = a[r]; /// (l = r) case
        for (int l = r - 1; l >= 0; --l)
            dp[l][r] = max(a[l] - dp[l + 1][r], a[r] - dp[l][r - 1]); /// either to take leftmost or rightmost element
    }

    cout << dp[0][n - 1];
}

Deque - O(n)

int main()
{
    int n;
    cin >> n;

    vector<ll> v(n);
    for (int i = 0; i < n; ++i)
    {
        cin >> v[i];

        /// Compress the array to bitonic a1 > a2 > ... > ak < ... < a[n - 1] < an
        for (; i >= 2 && v[i - 2] <= v[i - 1] && v[i - 1] >= v[i]; i -= 2, n -= 2)
            v[i - 2] += v[i] - v[i - 1];
    }
 
    /// Calculate result (x - y)
    ll res = 0;
    for (int l = 0, r = n - 1, t = 1; l <= r; t = -t)
        res += t *((v[l] > v[r]) ? v[l++] : v[r--]);

    cout << res;

    return 0;
}

Variant Problem

Then, I come to a problem, here is the statement.

There is a cycle of $$$n (n \leq 10^4)$$$ binary number $$${a_1, a_2, \dots, a_n}$$$ and ($$$a_i \in {0, 1}$$$) First player take a random number, lets say $$$a_p$$$ then remove it and gain $$$a_p$$$ points The second player take a number which is consecutive with last number removed ($$$a_p$$$) — select either $$$a_{p - 1}$$$ or $$$a_{p + 1}$$$ (notice that $$$a_1$$$ and $$$a_n$$$ is consecutive) They start to play alternately until there are no number left and they plays optimally

The question is in each game where as the first player select the number $$$a_p$$$, $$$p \in [1, n]$$$. How many games did the first player have more score than the second player

Example 1

Input:
3
1 1 1

Output:
3

Explain:
In three games, first player have 2 points and second player have 1 points

Example 2

Input:
2
0 1

Output:
1

Explain:
In the first game the first player lose (0 < 1)
In the second game the first player win (1 > 0)

Example 3

Input:
1
0

Output:
0

Explain:
In the only game, the first player have equal score to the second (0 = 0). So he lost

I try to use dp to solve it in $$$O(n^2)$$$ but I dont know how to optimize by using deque and only come up to an $$$O(n^2)$$$ solution. Can someone suggest me a better algorithm ?

Dynamic programming - O(n^2)

const int INF = 1e9;
int main()
{
    int n;
    cin >> n;

    vector<int> a(n << 1);
    vector<vector<int> > f(n << 1, vector<int>(n << 1, -INF));
    for (int i = 0; i < n; ++i)
    {
        int x;
        cin >> x;

        a[i]    = a[n + i]        = x;
        f[i][i] = f[n + i][n + i] = x; /// (l = r) case
    }

    for (int d = 1; d < n; ++d) /// (r - l = d)
        for (int l = 0, r = d; r < a.size(); ++l, ++r) /// iterating over the array
            f[l][r] = max(a[l] - f[l + 1][r], a[r] - f[l][r - 1]); /// either select leftmost or rightmost

    int res = 0;
    for (int i = 0; i < n; ++i)
    {
        int l = i + 1, r = i + n - 1;
        if (a[i] - f[l][r] > 0) res++; /// a[i] is the first selected number
    }

    cout << res;
    return 0;
}

Deque way - O(n^2)

int main()
{
    int n;
    cin >> n;

    vector<int> a(n);
    for (int &x : a) cin >> x;

    a.resize(n << 1);
    for (int i = 0; i < n; ++i) a[i + n] = a[i];
    vector<int> v(n - 1, false);

    int res = 0;
    for (int p = 0; p < n; ++p)
    {
        int m = n - 1;
        for (int i = 0, j = p; i < m; ++i)
        {
            v[i] = a[j++];


            /// Compress the array to bitonic a1 > a2 > ... > ak < ... < a[n - 1] < an
            for (; i >= 2 && v[i - 2] <= v[i - 1] && v[i - 1] >= v[i]; i -= 2, m -= 2)
                v[i - 2] = (v[i - 2] + v[i] - v[i - 1]);
        }

        /// Calculating (t = x + y)
        int t = 0;
        for (int i = p; i < p + n - 1; ++i)
            if (a[i]) t++;

        /// Calculating (d = x - y)
        int d = 0;
        for (int l = 0, r = m - 1, t = 1; l <= r; t = -t)
            d += t * (v[l] > v[r] ? v[l++] : v[r--]);
        
        int e = a[n - 1 + p]; /// first selected number
        int x = (t - d) / 2;  /// first  player played after first selected
        int y = (t + d) / 2;  /// second player played after first selected
        if (x + e > y) res++;
    }

    cout << res;
    return 0;
}

Full text and comments »

deque

SPyofgame
4 years ago
0

Number of partitions of n into at least two distinct parts

By SPyofgame, history, 4 years ago, In English

About the problem

The problem is to calculate the number of such subsequence $$${a_1, a_2, \dots a_n}$$$ that ($$$a_1 + a_2 + \dots + a_k = n$$$) where ($$$k \geq 2$$$) and ($$$a_i \in {1, 2, \dots, n}$$$)

It is the sequence OEIS A111133

My approach for small n

Lets $$$magic(left, last)$$$ is the number of valid subsequences whose sum equal $$$left$$$ which next selected element is such $$$next$$$ in range $$$(last, left]$$$ ($$$next$$$ is strictly greater then last selected number $$$last$$$ and not greater than current sum $$$left$$$). The recursive stop when $$$left = 0$$$ then we found one valid subsequence

Recursive dp - O(n^3) - small n

vector<vector<ll> > f; /// init as -1
ll magic(int left = n, int last = 0)
{
    if (left == 0) return 1;

    ll &res = f[left][last];
    if (res != -1) return res;
    res = 0;

    for (int next = last + 1; next <= left; ++next)
        res += magic(left - cur, next);

    return res;
}

My approach for bigger n

Lets $$$magic(sum, cur)$$$ is the number of valid subsequences whose selected sum is $$$sum$$$ and current selecting element is $$$cur$$$

$$$cur$$$ is init as $$$1$$$ (smallest element) and recursive stop when it greater than $$$n$$$ (largest element)
$$$sum$$$ is in range $$$[0, n]$$$ when it equal $$$n$$$ then we found 1 valid subsequence so we return $$$1$$$, else if it greater than $$$n$$$ we stop the recursive

The complexity is still $$$O(n^3)$$$ which $$$O(n^2)$$$ calculation and $$$O(n)$$$ for bignum implementation

Recursive dp - O(n^3) - bignum result

#include <algorithm>
#include <iostream>
#include <vector>

using namespace std;

#define all(x) (x).begin(), (x).end()
typedef pair<int, int> pi;
typedef vector<int> vi;
typedef long long ll;

struct bignum {
    vector<int> d;
    bignum(int x = 0) {
        do d.push_back(x % 10);
        while (x /= 10);
    }

    void fix()
    {
        d.push_back(0);
        for (int i = 0; i + 1 < d.size(); ++i)
        {
            d[i + 1] += d[i] / 10; d[i] %= 10;
            if (d[i] < 0) { d[i + 1]--; d[i] += 10; }
        }
        while (d.size() >= 2 && d.back() == 0) d.pop_back();
    }

    void operator += (const bignum &a) {
        const vi &c = a.d;
        d.resize(max(c.size(), d.size()));
        for (int i = 0; i < c.size(); ++i) d[i] += c[i];
        fix();
    }

    void operator -= (const bignum &a) { 
        const vi &c = a.d;
        d.resize(max(c.size(), d.size()));
        for (int i = 0; i < c.size(); ++i) d[i] -= c[i];
        fix();
    }

    void out()   {  for (int i = int(d.size()) - 1; i >= 0; --i) cout << d[i]; }
    void outln() { out(); putchar('\n'); }
};

int n;
vector<vector<bignum> > f;
vector<vector<bool> > m;
bignum magic(int sum = 0, int cur = 1) {
    if (sum == n) return bignum({1});
    if (sum > n) return bignum({0});
    if (cur > n) return bignum({0});

    bignum &res = f[sum][cur];
    if (m[sum][cur]) return res; else m[sum][cur] = true;

    res += magic(sum + cur, cur + 1);
    res += magic(sum, cur + 1);
    
    return res;
}

int main()
{
    cin >> n;
    f.assign(n + 1, vector<bignum>(n + 1, bignum({0})));
    m.assign(n + 1, vector< bool >(n + 1, false));
    bignum res = trau();
    res -= bignum({1});
    res.outln();
}

Iterative dp - O(n^3) - bignum result

#include <algorithm>
#include <iostream>
#include <vector>

using namespace std;

#define all(x) (x).begin(), (x).end()
typedef pair<int, int> pi;
typedef vector<int> vi;
typedef long long ll;

struct bignum {
    vector<int> d;
    bignum(int x = 0) {
        do d.push_back(x % 10);
        while (x /= 10);
    }
    bignum(const vi &a) {
        d = a;
        reverse(all(d));
    }

    void fix()
    {
        d.push_back(0);
        for (int i = 0; i + 1 < d.size(); ++i)
        {
            d[i + 1] += d[i] / 10; d[i] %= 10;
            if (d[i] < 0) { d[i + 1]--; d[i] += 10; }
        }
        while (d.size() >= 2 && d.back() == 0) d.pop_back();
    }

    void operator += (const bignum &a) {
        const vi &c = a.d;
        d.resize(max(c.size(), d.size()));
        for (int i = 0; i < c.size(); ++i) d[i] += c[i];
        fix();
    }

    void operator -= (const bignum &a) { 
        const vi &c = a.d;
        d.resize(max(c.size(), d.size()));
        for (int i = 0; i < c.size(); ++i) d[i] -= c[i];
        fix();
    }

    void out()   {  for (int i = int(d.size()) - 1; i >= 0; --i) cout << d[i]; }
    void outln() { out(); putchar('\n'); }
};

int main()
{
    int n;
    cin >> n;

    vector<bignum> f(n + 1);
    f[0] = f[1] = bignum(1);
    for (int i = 2; i <= n; ++i)
        for (int j = n; j >= i; --j)
            f[j] += f[j - i];

    f[n] -= 1;
    f[n].out();
}

My question

Can I solve the problem in $$$O(n^2)$$$ or in $$$O(n^2 \times polylog(n))$$$
Can I find the n_th element faster than $$$O(n^3)$$$

Full text and comments »

combinatorics, math, dp

SPyofgame
4 years ago
2

Number of nth permutations with k inversion pairs

By SPyofgame, history, 5 years ago, In English

Simplest way:

Approach

Brute-force <TLE> - O(n! * n ^ 2) time - O(n) space

int n, k;
int inversion_count(const vector<int> &perm) /// count number of inversion in permutation
{
    int res = 0;
    for (int i = 2; i <= n; ++i)
        for (int j = 1; j < i; ++j)
            if (perm[i] < perm[j])
                res++;

    return res;
}

int main()
{
    cin >> n >> k;

    vector<int> perm(n + 1);
    for (int i = 1; i <= n; ++i) /// assigning permutation
        perm[i] = i;   

    int res = 0;
    do {
        if (inversion_count(perm) == k) /// this permutation is valid (having exactly k inversion pairs)
            res++;

    } while (next_permutation(1 + all(perm))); /// iterating all possible permutation
    cout << res;
}

Improving the algorithm

Approach

BIT, Binary-search, Dynamic-programming <TLE> - O((n! * n + n) log n) time - O(n) space

vector<int> BIT;
void inc(int p) { /// increase the amount of visited
    do BIT[p]++;
    while (p -= p & -p);
}

void dec(int p) { /// decrease the amount of visited
    do BIT[p]--;
    while (p -= p & -p);
}

int getValue(int p) { /// count number of inversion pairs
    int res = 0;
    do res += BIT[p];
    while ((p += p & -p) < BIT.size());
    return res;
}

int main() 
{
    int n, m;
    cin >> n >> m;

    BIT.assign(n + 1, 0);
    vector<int> perm(n + 1, 0);
    vector<int> calc(n + 1, 0);
    for (int i = 1; i <= n; ++i)
    {
        perm[i] = i; /// assigning value
        calc[i] = calc[i - 1] + getValue(perm[i]); /// dp-precalculating
        inc(perm[i]); /// update value
    }

    int res = 0;
    res += (calc[n] == m);
    /// Iterating over all possible permutation
    for (vector<int> n_perm(perm); next_permutation(1 + all(n_perm)); next_permutation(1 + all(perm)))
    {
        int p = 1; /// position of first different between two consecutive permutations (perm, n_perm)
        for (int l = 1, r = n; l <= r; )
        {
            int m = (l + r) / 2; /// middle point
            if (perm[m] == n_perm[m] && perm[m - 1] == n_perm[m - 1]) /// this position is valid
            {
                p = m;
                l = m + 1;
            }
            else r = m - 1;
        }
        for (int i = p; i <= n; ++i) dec(perm[i]); /// reset the tree before update this path [p..n]
        for (int i = p; i <= n; ++i)
        {
            calc[i] = calc[i - 1] + getValue(n_perm[i]); /// re-calculating the different path
            inc(n_perm[i]); /// update new value
        }
        res += (calc[n] == m); /// increase the counter when it is valid
    }

    cout << res;
    return 0;
}

Using formula to improve furthur more

Approach

Recursive Dynamic Programming Approach <AC> - O(n * k) time - O(n * k) space

vector<vector<int> > f;
int solve(int n, int k)
{
    int total = n * (n - 1) / 2;       /// number of possible k
    if (k < 0 || k > total) return 0;  /// k is out of range
    minimize(k, total - k);            /// f[n][k] = f[n][total - k]
    if (f[n][k] != -1) return f[n][k]; /// if f[n][k] is calculated
    if (k == 0) return f[n][0] = 1;    /// strictly increasing permutation
    if (n == 0) return f[0][k] = 0;    /// empty permutation
    return f[n][k] = (solve(n, k - 1) + solve(n - 1, k) - solve(n - 1, k - n) + MOD) % MOD;
}

int main()
{
    int n, k;
    cin >> n >> k;

    int total = n * (n - 1) / 2;
    if (k < 0 || k > total) /// k is out of range
    {
        cout << 0;
        return 0;
    }

    minimize(k, total - k);  /// f[n][k] = f[n][total - k]
    f.assign(n + 1, vector<int>(k + 1, -1)); /// init dp-table
    cout << solve(n, k);
    return 0;
}

Iterative Dynamic Programming Approach <AC> - O(n * k) time - O(k) space

int main()
{
    int n, k;
    cin >> n >> k;

    int total = n * (n - 1) / 2;
    if (k < 0 || k > total) /// k is out of range
    {
        cout << 0;
        return 0;
    }

    minimize(k, total - k);
    int f[n + 1][k + 1];
    f[0][0] = 1; /// base case f[n][0] = 1
    for (int i = 1; i <= n; ++i) f[0][i] = 0; /// base case f[0][k] = 0
    for (int i = 1; i <= n; ++i)
    {
        bool cur = i & 1; /// current
        bool pre = !cur;  /// previous

        f[cur][0] = 1; /// base case f[n][0] = 1
        for (int j = 1; j <= k; ++j)
        {
            f[cur][j] = (f[cur][j - 1] + f[pre][j]) % MOD;
            if (j >= i)
                f[cur][j] = (f[cur][j] - f[pre][j - i] + MOD) % MOD;
        }
    }

    cout << f[n & 1][k];
    return 0;
}

I solved this problem but I wonder if there is a better way (faster than $$$O(n \times k)$$$)

So my question is "Is there a faster approach for this problem ?"

Full text and comments »

SPyofgame
5 years ago
14

Binary Extended Greatest Common Divisors ?

By SPyofgame, history, 5 years ago, In English

I read in this paper and know that Binary GCD Implementation is proven to be about 2 times faster than Normal GCD Implementation.

Binary Iterative GCD Implementation (wikipedia)

unsigned int gcd(unsigned int u, unsigned int v)
{
    unsigned int shift = 0;

    /* GCD(0,v) == v; GCD(u,0) == u, GCD(0,0) == 0 */
    if (u == 0) return v;
    if (v == 0) return u;
 
    /* Let shift := lg K, where K is the greatest power of 2
        dividing both u and v. */
    while (((u | v) & 1) == 0) {
        shift++;
        u >>= 1;
        v >>= 1;
    }
 
    while ((u & 1) == 0)
        u >>= 1;
 
    /* From here on, u is always odd. */
    do {
        /* remove all factors of 2 in v -- they are not common */
        /*   note: v is not zero, so while will terminate */
        while ((v & 1) == 0)
            v >>= 1;

        /* Now u and v are both odd. Swap if necessary so u <= v,
            then set v = v - u (which is even). For bignums, the
             swapping is just pointer movement, and the subtraction
              can be done in-place. */
        if (u > v) {
            unsigned int t = v; v = u; u = t; // Swap u and v.
        }
       
        v -= u; // Here v >= u.
    } while (v != 0);

    /* restore common factors of 2 */
    return u << shift;
}

Normal Iterative GCD Implementation

unsigned int gcd(unsigned int u, unsigned int v)
{
    while (v != 0)
    {
        u %= v;
        swap(u, v);
    }
    return u;
}

I just wonder if there is an Efficient Binary Extended GCD Implementation and how fast can it be ?

Full text and comments »

SPyofgame
5 years ago
4

How can I find Suffix Array in Linear effectively

By SPyofgame, history, 5 years ago, In English

Thanks to Suffix Array Tutorial from Codeforces I could learn easily how to build Suffix Array and solve some problems

I learn about $$$O(n \log^2(n))$$$ solution and optimize it into $$$O(n \log(n)$$$, it is fast and good. In some contests, I saw some div-1 rankers discuss about $$$O(n)$$$ solution. Now I am wondering if there is an simple implementation but efficient Suffix Array in Linear Time and Space ?

From their conversation about Linear Solution, I read from this paper and this too but I am not be able to implement it. I know those implementations are hard and higher above my levels but I just curiously to know about the implementation

Thanks for reading, sorry if this blog waste your time.

Full text and comments »

#suffix_array

SPyofgame
5 years ago
1

Can I calculate ((((a ^ n) ^ n) ^ ...) mod m) and (a ^ (n ^ (n ^ (...))) mod m) (b times n) effectively ?

By SPyofgame, history, 5 years ago, In English

In problem $$$\overbrace{(((a ^ n) ^ {{}^n}) ^ {{}^{{}^{...}}})}^{b\ times\ n} \mod m$$$

My approach is calculating each part $$$((a ^ n) \mod m)$$$ then $$$(((a ^ n) ^ n) \mod m) = ((t ^ n) \mod m)$$$ ... all together in $$$O(\log n)$$$ each part and $$$O(b \times \log n)$$$ total time complexity
Can I improve it somehow faster where $$$b$$$ is large ($$$b \leq 10^{16}$$$) and ($$$m$$$ can be composite number)

In problem $$$\overbrace{a ^ {(n ^ {(n ^ {(...)})})}}^{b\ times\ n} \mod m$$$

I only know the way to use bignum to calculate $$$n ^ n$$$ then $$$n ^ {n ^ n}$$$ all together then I just have to calculate the modulo of $$$a ^ {...} \mod m$$$ but the total complexity will be very huge (since the number of digits of bignum will raise very fast)
Can I apply these formula from phi-function:

$$$a ^ {\phi(m)} \equiv 1 \pmod m$$$ $$$a ^ n \equiv a ^ {n \mod \phi(m)} \pmod m$$$ $$$a ^ n \equiv a ^ {\phi(m) + (n \mod \phi(m))} \pmod m$$$

Can I improve it somehow faster where $$$n$$$ is large ($$$n \leq 10^{16}$$$) and ($$$m$$$ can be composite number)

Full text and comments »

asking

SPyofgame
5 years ago
14

Can someone hint me on this problem

By SPyofgame, history, 5 years ago, In English

There is an empty rectangle of size

$$$n \times m$$$

where

$$$1 \leq n, m \leq 10^6$$$

We cover the rectangles k times from (u1, v1) to (u2, v2), where

$$$1 \leq k \leq 400$$$ $$$1 \leq u1 \leq u2 \leq n$$$ $$$1 \leq v1 \leq v2 \leq m$$$

The question is: How many squares of that rectangle have been covered ?

Full text and comments »

SPyofgame
5 years ago
4

Divisor Sum Implementations

By SPyofgame, history, 5 years ago, In English

I found these implementations ^^

Sorry, I am not sure if there are some corner cases or I calculated wrong complexity. If there are, please correct me.

I am wondering if there are some other implementations ^^ (you can comment below and I will add to the post)

Implementation 0: Trivial

/// Implementation: Trivial
/// Time  Complexity: O(n)
/// Space Complexity: O(1)

int main()
{
    int n;
    cin >> n;

    ll sum = 0;
    /// n = sigma(x | n % x == 0)
    for (int i = 1; i <= n; ++i) /// O(n)
        if (n % i == 0) /// If (i) is a divisor
            sum += i;
    
    cout << sum;
    return 0;
}

Implementation 1: Optimized Trivial

/// Implementation: Optimized Trivial
/// Time  Complexity: O(√n)
/// Space Complexity: O(1)

typedef long long ll;
int main()
{
    ll n;
    cin >> n;
    int sqrtn = sqrt(n);
    
    ll sum = 0;
    /// if (i) is a divisor then (n) / (i) is also a divisor
    for (ll i = 1; i <= sqrtn; ++i) /// O(√n)
    {
        if (n % i != 0) continue; /// If (i) not a divisor
        sum += i;
        ll t = n / i;
        if (i != t) sum += t;     /// If (t) is a unique divisor
    }
    
    cout << sum;
    return 0;
}

Implementation 2: Naive Factorization

/// Implementation: Naive | Multiplicative Formula
/// Time  Complexity: O(n ^ 2)
/// Space Complexity: O(1)

typedef long long ll;
ll pw(int x, int n) /// O(n)
{
    ll res = 1;
    for (int i = 1; i <= n; ++i) res *= x;
    return res;
}

bool isPrime(int n) /// O(n)
{
    int cnt = 0;
    for (int i = 1; i <= n; ++i) cnt += (n % i == 0);
    return (cnt == 2);
}

int main() /// O(n ^ 2) ?
{
    int n;
    cin >> n;

    ll sum = 1;
    /// n = p1 ^ f1 * p2 ^ 2 * ... * pk ^ fk
    /// sum = ∏(divisor prime p){ p ^ (f + 1) - 1) / (p - 1) }
    for (int i = 1; i <= n; ++i) /// O(n * f(n)) ≤ O(n + divisor_count ^ 2) ≤ O(n ^ 2)
    {
        if (n % i != 0) continue;
        if (isPrime(i) == false) continue; /// O(i)
        
        int p = i, f = 0;
        do f++, n /= p; while (n % p == 0);  /// O(log_p(n))
        sum *= (pw(p, f + 1) - 1) / (p - 1); /// O(f)
    }
    
    cout << sum;
    return 0;
}

Implementation 3: Factorization

/// Implementation: Optimized | Multiplicative Formula
/// Time  Complexity: O(n√n)
/// Space Complexity: O(1)

typedef long long ll;
ll pw(ll x, ll n) /// O(log n)
{
    ll res = 1;
    for (; n > 0; x = x * x, n >>= 1)
        if (n & 1) res = res * x;
    
    return res;
}

bool isPrime(ll n) /// O(sqrt n)
{ 
    if (n < 2) return false; 
    if (n < 4) return true; 
    if (n % 2 == 0 || n % 3 == 0) return false; 
  
    int sqrtn = sqrt(n);
    for (ll i = 5; i <= sqrtn; i += 6) 
        if ((n % i == 0) || (n % (i + 2) == 0)) 
            return false; 
  
    return true; 
} 

int main() /// O(n√n)
{
    ll n;
    cin >> n;

    ll sum = 1;
    /// n = p1 ^ f1 * p2 ^ 2 * ... * pk ^ fk
    /// sum = ∏(divisor prime p){ p ^ (f + 1) - 1) / (p - 1) }
    for (ll i = 1; i <= n; ++i) /// O(n * f(n)) ≤ O(n + sqrt(divisor_count)) ≤ O(n * sqrt(n))
    {
        if (n % i != 0) continue;
        if (isPrime(i) == false) continue; /// O(sqrt i)  

        ll p = i; int f = 0;
        do f++, n /= p; while (n % p == 0);  /// O(log_p(n))
        sum *= (pw(p, f + 1) - 1) / (p - 1); /// O(log f)
    }
    
    cout << sum;
    return 0;
}

Implementation 4: Miller-rabin

/// Implementation: Miller-rabin Primality Test | Multiplicative Formula
/// Time  Complexity: O(n * log^5(n))
/// Space Complexity: O(1)

typedef long long ll;
ll bigmodMul(ll a, ll b, ll m = MOD) /// O(log n)
{
    ll res = 0;
    for (a %= m, b %= m; b > 0; a <<= 1, b >>= 1) {
        if (a >= m) a -= m;
        if (b & 1) { res += a; if (res >= m) res -= m; }
    }

    return res;
}
ll bigmodPow(ll x, ll n, ll m = MOD) /// O(log^2(n))
{
    ll res = 1;
    for (x %= m; n > 0; x = bigmodMul(x, x, m), n >>= 1)
        if (n & 1) res = bigmodMul(res, x, m);
    
    return res;
}

ll pw(ll x, ll n) /// O(log n)
{
    ll res = 1;
    for (; n > 0; x = x * x, n >>= 1)
        if (n & 1) res = res * x;
    
    return res;
}

bool isPrime(ll p) /// O(log^5(p)) - O(log p) rounds performed - O(log^2(p)) primality tests - O(log^2(p)) calculation
{
    if (p < 2) return false;
    if (p < 4) return true;
    if (p % 2 == 0 || p % 3 == 0) return false;
 
    ll q = p - 1;
    int k = 0;
    while ((q & 1) == 0) /// O(log p)
        q >>= 1, k++;
 
    ll a = rand() % (p - 4) + 2;
    ll t = bigmodPow(a, q, p); /// O(log^2(q))
 
    bool ok = (t == 1) || (t == p-1);
    for (int i = 1; i <= k && !ok; i++) { /// O(log k * log t)
        t = bigmodMul(t, t, p); /// O(log t)
        ok = (t == p - 1);
    }
 
    return ok;
}

int main() /// O(n * log5(n))
{
    ll n;
    cin >> n;
    ll sum = 1;
    /// n = p1 ^ f1 * p2 ^ 2 * ... * pk ^ fk
    /// sum = ∏(divisor prime p){ p ^ (f + 1) - 1) / (p - 1) }
    for (ll i = 1; i <= n; ++i) /// O(n * f(n)) ≤ O(divisor_count * log^5(n)) ≤ O(n * log^5(n))
    {
        if (n % i != 0) continue;
        if (isPrime(i) == false) continue; /// O(log^5(i))

        ll p = i; int f = 0;
        do f++, n /= p; while (n % p == 0);  /// O(log_p(n))
        sum *= (pw(p, f + 1) - 1) / (p - 1); /// O(log f)
    }
    
    cout << sum;
    return 0;
}

Implementation 5: Sieve + Factorization

/// Implementation: Sieve | Factorization | Multiplicative Formula
/// Time  Complexity: O(n) precalculation + O(log n) query
/// Space Complexity: O(n) sieve + O(1) query

typedef long long ll;
ll pw(ll x, ll n) /// O(log n)
{
    ll res = 1;
    for (; n > 0; x = x * x, n >>= 1)
        if (n & 1) res = res * x;
    
    return res;
}

vector<int> prime; /// List of prime numbers
vector<int> lpf;   /// lpf[x] = Lowest_Prime_Factor of x

void sieve(int lim = LIM) /// O(n)
{
    prime.assign(1, 2);
    lpf.assign(lim + 1, 2);

    for (int i = 3; i <= lim; i += 2)
    {
        if (lpf[i] == 2) prime.pb(lpf[i] = i);
        for (int j = 0; j < sz(lpf) && prime[j] <= lpf[i] && prime[j] * i <= lim; ++j)
            lpf[prime[j] * i] = prime[j];
    }
}

int main()
{
    int n;
    cin >> n;
    sieve(n); /// O(n) for sieving

    ll sum = 1;
    /// n = p1 ^ f1 * p2 ^ 2 * ... * pk ^ fk
    /// sum = ∏(divisor prime p){ p ^ (f + 1) - 1) / (p - 1) }
    for (int p, f; n > 1; ) /// O(log n)
    {
        for (p = lpf[n], f = 0; (n > 1) && (p == lpf[n]); n /= p) f++; /// O(log n)
        sum *= (pw(p, f + 1) - 1) / (p - 1);                           /// O(log n)
    }
    
    cout << sum;
    return 0;
}

Implementation 6: Sieve + Miller-rabin + Pollard-rho

/// Implementation: Sieve | Factorization | Pollard-rho | Miller-rabin | Multiplicative Formula
/// Time  Complexity: O(√n) precal + O(√(√(n)) polylog n) query
/// Space Complexity: O(√n) sieve + O(log n / log(log n)) query

const int MOD = 1e9 + 7;
typedef long long ll;
ll bigmodMul(ll a, ll b, ll m = MOD) /// O(log n)
{
    ll res = 0;
    for (a %= m, b %= m; b > 0; a <<= 1, b >>= 1) {
        if (a >= m) a -= m;
        if (b & 1) { res += a; if (res >= m) res -= m; }
    }

    return res;
}
ll bigmodPow(ll x, ll n, ll m = MOD) /// O(log^2(n))
{
    ll res = 1;
    for (x %= m; n > 0; x = bigmodMul(x, x, m), n >>= 1)
        if (n & 1) res = bigmodMul(res, x, m);
    
    return res;
}

ll pw(ll x, ll n) /// O(log n)
{
    ll res = 1;
    for (; n > 0; x = x * x, n >>= 1)
        if (n & 1) res = res * x;
    
    return res;
}

bool isPrime(ll p) /// O(log^5(p)) - O(log p) rounds performed - O(log^2(p)) primality tests - O(log^2(p)) calculation
{
    if (p < 2) return false;
    if (p < 4) return true;
    if (p % 2 == 0 || p % 3 == 0) return false;
 
    ll q = p - 1;
    int k = 0;
    while ((q & 1) == 0) /// O(log p)
        q >>= 1, k++;
 
    ll a = rand() % (p - 4) + 2;
    ll t = bigmodPow(a, q, p); /// O(log^2(q))
 
    bool ok = (t == 1) || (t == p-1);
    for (int i = 1; i <= k && !ok; i++) { /// O(log k * log t)
        t = bigmodMul(t, t, p); /// O(log t)
        ok = (t == p - 1);
    }
 
    return ok;
}

map<int, int> M;
int sqrtn;
ll sum = 0; 

vector<int> prime; /// List of prime numbers
vector<int> lpf;   /// lpf[x] = Lowest_Prime_Factor of x

void sieve(int lim = LIM) /// O(n)
{
    prime.assign(1, 2);
    lpf.assign(lim + 1, 2); /// O(n)

    for (int i = 3; i <= lim; i += 2) /// O(n)
    {
        if (lpf[i] == 2) prime.pb(lpf[i] = i);
        for (int j = 0; j < sz(lpf) && prime[j] <= lpf[i] && prime[j] * i <= lim; ++j)
            lpf[prime[j] * i] = prime[j];
    }
}

ll rho(ll n, ll c) {  /// O(√(√(n)) * polylog n)
    ll x = 2, y = 2, i = 2, k = 2, d;
    while (true) {
        x = (bigmodMul(x, x, n) + c); /// O(log n)
        if (x >= n)	x -= n;
        d = __gcd(abs(x - y), n);     /// O(log n)
        if (d > 1) return d;
        if (i++ == k) y = x, k <<= 1;
    }
    return n;
}
 
void big_fact(ll n) { /// O(√(√(n)) * polylog n)
    if (n == 1) /// O(1)
        return ;
 
    if (n < 1e+9) { /// O(√(n) / ln(√(n)) * log n / log log n)
        for (int p : prime) /// O(pair<int, int>(√(n)) * log(M.size)) = O(√(n) / ln(√(n)) * log n / log(log n))
        {
            if (p > sqrtn) break;
            while (n % p == 0) M[p]++, n /= p; /// O(log_p(n) * log(M.size)) = O(log_p(n) * log(n) / log(log n))
        }

        if (n > 1) M[n]++; /// O(log(M.size)) = O(log n / log(log n))
        return ;
    }
 
    if (isPrime(n))            /// O(log^5(n))
        return M[n]++, void(); /// O(log(M.size)) = O(log n / log(log n))
 
    ll d = n;
    for (int i = 2; d == n; i++) /// O(√(√(n)) * polylog n)
        d = rho(n, i);           /// O(√(√(n)) * polylog n)
 
    big_fact(    d);
    big_fact(n / d);
}

int main() /// O(√n + √(√(n)) * polylog n)
{
    ll n;
    cin >> n;

    sqrtn = sqrt(n);   /// O(√(n))   
    sieve(sqrtn);      /// O(√(n))
    
    srand(time(NULL));
    big_fact(n);       /// O(√(√(n)) * polylog n)
    
    ll sum = 1;
    /// n = ∏(e ∈ M){ e.first ^ e.second }
    /// sum = ∏(divisor prime e.first){ e.first ^ (e.second + 1) - 1) / (e.first - 1) }
    for (pair<int, int> e : M)     /// O(log n / log(log n))
        sum *= (pw(e.first, e.second + 1) - 1) / (e.first - 1); /// O(log n)

    cout << sum;    
    return 0;
}

Implementation 7: Divisor Sum Sieve

vector<int> prime;
vector<int> ds; /// divisor sum
void sieve(int lim) /// Precalculation: O(n log n)
{
    prime.clear();
    if (n < 2) return ;

    ds.assign(n + 1, 1);
    for (int i = 2; i <= lim; ++i)
        for (int j = i; j <= lim; j += i)
            ds[j] += i;
    
    return ; /// If you want to get primes, remove this line
    prime.assign(1, 2);
    for (int i = 3; i <= n; i += 2)
        if (ds[i] == i + 1)
            prime.push_back(i);
}

int main()
{
    int n;
    cin >> n;
    sieve(n);      /// O(n) for sieving
    cout << ds[n]; /// O(1) query
    return 0;
}

Compilation:

Implementation	Main Algorithm	Time Complexity	Space Complexity	Coding Space	Coding Complex
0. Trivial	Implementation	O(n)	O(1)	Short	Simple
1. Trivial	Implementation	O(√n)	O(1)	Short	Simple
2. Factorization + Math	Math	O(n ^ 2) ?	O(1)	Normal	Normal
3. Factorization + Math	Factorization	O(n√n)	O(1)	Normal	Normal
4. Factorization + Math	Miller-rabin	O(n * log^5(n))	O(1)	Long	Normal
5. Factorization + Math	Sieve	O(n) + O(log n)	O(n)	Normal	Normal
6. Factorization + Math	Pollard-rho	O(√n) + O(√(√(n)) polylog n) query	O(√n) + O(log n / log(log n)) query	Very Long	Complex

Planning:

Add relative blogs
Add some more implementations
Add tutorial & reasoning for each implementation

Full text and comments »

#divisors_sum, #implementation, #miller-rabin, #primality_test, #pollard_rho, #primes, #sieve

SPyofgame
5 years ago
5

←