git.baikalelectronics.ru Git - kernel.git/commit

author	Kyung Min Park <kyung.min.park@intel.com>
	Tue, 8 Dec 2020 03:34:40 +0000 (19:34 -0800)
committer	Paolo Bonzini <pbonzini@redhat.com>
	Sat, 12 Dec 2020 00:00:58 +0000 (19:00 -0500)
commit	722469e947f9e92ef26d25cda314016b0a7999d6
tree	d94a69d14d2de85e8c71fb5c1175a81225c910eb	tree \| snapshot
parent	1aecc82fc417ea6b6987e7afb30ff48abd529db9	commit \| diff

x86: Enumerate AVX512 FP16 CPUID feature flag

Enumerate AVX512 Half-precision floating point (FP16) CPUID feature
flag. Compared with using FP32, using FP16 cut the number of bits
required for storage in half, reducing the exponent from 8 bits to 5,
and the mantissa from 23 bits to 10. Using FP16 also enables developers
to train and run inference on deep learning models fast when all
precision or magnitude (FP32) is not needed.

A processor supports AVX512 FP16 if CPUID.(EAX=7,ECX=0):EDX[bit 23]
is present. The AVX512 FP16 requires AVX512BW feature be implemented
since the instructions for manipulating 32bit masks are associated with
AVX512BW.

The only in-kernel usage of this is kvm passthrough. The CPU feature
flag is shown as "avx512_fp16" in /proc/cpuinfo.

Signed-off-by: Kyung Min Park <kyung.min.park@intel.com>
Acked-by: Dave Hansen <dave.hansen@intel.com>
Reviewed-by: Tony Luck <tony.luck@intel.com>
Message-Id: <20201208033441.28207-2-kyung.min.park@intel.com>
Acked-by: Borislav Petkov <bp@suse.de>
Signed-off-by: Paolo Bonzini <pbonzini@redhat.com>

arch/x86/include/asm/cpufeatures.h		diff \| blob \| history
arch/x86/kernel/cpu/cpuid-deps.c		diff \| blob \| history