AVX — really nice thing to optimize everything

#	User	Rating
1	tourist	3856
2	jiangly	3747
3	orzdevinwang	3706
4	jqdai0815	3682
5	ksun48	3591
6	gamegame	3477
7	Benq	3468
8	Radewoosh	3462
9	ecnerwala	3451
10	heuristica	3431

#	User	Contrib.
1	cry	167
2	-is-this-fft-	162
3	Dominater069	160
4	Um_nik	158
5	atcoder_official	157
6	Qingyu	156
7	djm03178	151
7	adamant	151
7	luogu_official	151
10	awoo	146

AVX (Advanced Vector Extensions) is an instruction set extension designed for SIMD (Single Instruction, Multiple Data) operations. It's an extension of Intel's x86 and x86-64 architectures, providing wider vector registers and additional instructions to perform parallel processing on multiple data elements simultaneously.

In C++, you can leverage AVX through intrinsics, which are special functions that map directly to low-level machine instructions. AVX intrinsics allow you to write code that explicitly uses the AVX instructions, taking advantage of SIMD parallelism to accelerate certain computations.

Here's a brief overview of using AVX in C++:

Include Header: To use AVX intrinsics, include the appropriate header file. For AVX, you'll need <immintrin.h>.

#include <immintrin.h>

Data Types: AVX introduces new data types, such as m256 for 256-bit wide vectors of single-precision floating-point numbers (float). There are corresponding types for double-precision (m256d) and integer data.
Intrinsics: Use AVX intrinsics to perform SIMD operations. For example, _mm256_add_ps adds two 256-bit vectors of single-precision floating-point numbers.

__m256 a = _mm256_set_ps(4.0, 3.0, 2.0, 1.0, 8.0, 7.0, 6.0, 5.0);
__m256 b = _mm256_set_ps(8.0, 7.0, 6.0, 5.0, 4.0, 3.0, 2.0, 1.0);
__m256 result = _mm256_add_ps(a, b);

Compiler Flags: Ensure that your compiler is configured to generate code that uses AVX instructions. For GCC, you might use flags like -mavx or -march=native to enable AVX support.

g++ -mavx -o your_program your_source.cpp

Caution: Be aware that using intrinsics ties your code to specific hardware architectures. Ensure that your target platform supports AVX before relying heavily on these instructions.
Performance Considerations: AVX can significantly boost performance for certain workloads, especially those involving parallelizable operations on large datasets. However, its effectiveness depends on the specific nature of the computations.

Always consider the trade-offs, and profile your code to ensure that the expected performance gains are achieved. Additionally, keep in mind that the use of intrinsics requires careful consideration of data alignment and memory access patterns for optimal performance.

	Rev.	Lang.	By	When	Δ	Comment
	en1		Blank_X	2023-11-21 21:05:44	2414	Initial revision for English translation
	ru1		Blank_X	2023-11-21 21:05:21	2414	Первая редакция (опубликовано)

Rev.

Lang.

When

Comment

en1

Blank_X

2023-11-21 21:05:44

2414

Initial revision for English translation

ru1

Blank_X

2023-11-21 21:05:21

2414

Первая редакция (опубликовано)

History