Why does loop order affect execution time so much?


By ExpectoPatronum, 3 years ago, in English

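// Code 1: k is the outermost loop, j is the innermost loop
ll res = -oo;
for (int k = 1; k <= n; k++)
    for (int i = 1; i <= n - k + 1; i++)
        for (int j = 1; j <= n - k + 1; j++)
        {
            // consecutive iterations change j, the middle index of f
            f[i][j][k] = f[i][j][k - 1] + a[i + k - 1] * b[j + k - 1];
            res = max(res, f[i][j][k]);
        }
cout << res;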

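// Code 2: i is the outermost loop, k is the innermost loop
ll res = -oo;
for (int i = 1; i <= n; i++)
    for (int j = 1; j <= n; j++)
        for (int k = 1; k <= n; k++)
        {
            // note: this stops one k earlier than code 1, whose bound is i + k - 1 <= n
            if (k + i > n || k + j > n) break;
            // consecutive iterations change k, the last index of f
            f[i][j][k] = f[i][j][k - 1] + a[i + k - 1] * b[j + k - 1];
            res = max(res, f[i][j][k]);
        }
cout << res;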

These two pieces of code do not seem to differ much, but their execution times differ a lot. For example, with n = 500, code 1 runs in 3103 ms while code 2 runs in only 789 ms. What is the reason? Please explain it to me. Thanks.


Comments (8)


ben_dover

3 years ago, # |

+17

I didn't read the code, but it is probably cache misses. See https://stackoverflow.com/questions/9936132/why-does-the-order-of-the-loops-affect-performance-when-iterating-over-a-2d-arra and https://en.wikipedia.org/wiki/Loop_interchange


Dan4Life

3 years ago, # |

From my limited understanding, arrays are stored in row-major order in contiguous blocks of memory, i.e. the first row occupies one block of memory, then the second row continues immediately from where the first one left off, and so on.

This makes a loop that goes row by row cache friendly: moving to the next element just means moving to the next position in memory every time. If the loops are ordered the other way around, it takes more time, because you continuously have to jump to a different memory position when switching rows, which isn't cache friendly.


parth_kabra

3 years ago, # |

Hey, can you please put the code inside a code block?



INeedCarb

3 years ago, # |

pro vjp idol


WitchOfTruth

3 years ago, # |


Because of how memory works.
Whenever you read 4 bytes, the CPU actually loads a whole cache line (typically 64 bytes) into its caches, so if you read a contiguous range of memory you basically get the other 15 values for free (MUCH cheaper than fetching them from memory).

But if you jump around randomly, you lose the advantage of the caches: they are limited in size, so they keep being filled with data you don't currently need, and you end up reading almost every value from memory directly.
