ffmpeg-bb/libavcodec/aarch64/vvc
Zhao Zhili 39786f8cd5 aarch64/h26x: optimize sao_band_filter
int8_t[] is enough for offset_table of 8 bit streams.

On rpi5:
                             Before               After
hevc_sao_band_8_8_c:          252.3 ( 1.00x)     252.3 ( 1.00x)
hevc_sao_band_8_8_neon:        95.8 ( 2.63x)      61.0 ( 4.57x)
hevc_sao_band_16_8_c:         875.2 ( 1.00x)     864.9 ( 1.00x)
hevc_sao_band_16_8_neon:      317.5 ( 2.76x)     150.0 ( 6.26x)
hevc_sao_band_32_8_c:        3853.5 ( 1.00x)    3871.6 ( 1.00x)
hevc_sao_band_32_8_neon:     1222.3 ( 3.15x)     550.6 ( 7.39)
hevc_sao_band_48_8_c:        8203.6 ( 1.00x)    8182.6 ( 1.00x)
hevc_sao_band_48_8_neon:     2685.7 ( 3.05x)    1185.8 ( 7.36x)
hevc_sao_band_64_8_c:       14023.0 ( 1.00x)   14038.9 ( 1.00x)
hevc_sao_band_64_8_neon:     4783.2 ( 2.93x)    2078.4 ( 7.15x)

Reviewed-by: Martin Storsjö <martin@martin.st>
Signed-off-by: Zhao Zhili <zhilizhao@tencent.com>
2025-04-29 15:11:45 +08:00
..
alf.S aarch64/vvc: Fix clip in alf 2024-12-10 21:00:47 +08:00
alf_template.c avcodec/vvc: Add aarch64 neon optimization for ALF 2024-07-22 21:09:56 +08:00
dsp_init.c aarch64/h26x: optimize sao_band_filter 2025-04-29 15:11:45 +08:00
inter.S avcodec/aarch64/vvc: Optimize vvc_avg{8, 10, 12} 2025-03-07 15:51:20 +02:00
Makefile aarch64/vvc: Add avg 2024-09-14 16:36:34 +08:00
of_template.c aarch64/vvc: Add apply_bdof 2024-12-21 11:54:44 +08:00
sad.S aarch64/vvc: Add sad 2024-09-14 16:36:34 +08:00