Use 16-byte alignment (align=4) instead of 4-byte (align=2) in the function and const macros. This improves instruction fetch and NEON load performance on modern AArch64 CPUs. |
||
|---|---|---|
| .. | ||
| asm.S | ||
| cpu.c | ||
| cpu.h | ||
| cpu_sve.S | ||
| float_dsp_init.c | ||
| float_dsp_neon.S | ||
| intreadwrite.h | ||
| Makefile | ||
| neontest.h | ||
| timer.h | ||
| tx_float_init.c | ||
| tx_float_neon.S | ||