habanalabs/gaudi2: reset device upon critical ECC event
author Ofir Bitton <obitton@habana.ai>
Tue, 28 Jun 2022 05:34:28 +0000 (08:34 +0300)
committer Oded Gabbay <ogabbay@kernel.org>
Tue, 12 Jul 2022 06:09:28 +0000 (09:09 +0300)
commit 2d768a45454e14d1df8a97a7ca4fd12fa355c5ab
tree 244559480662fce6d8132120c7e13d5c0f0a637a
parent a9e90f1eded50a43c3de406ccecda480d9dfc565

Correctable ECC events are not fatal, but as they accumulate, the f/w
can decide that a hard reset is required. This indication is
propagated to the host using the existing ECC event interface.
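
A minimal user-space sketch of the idea, not the driver's code: the
struct layout, the is_critical flag, the reset_action enum and
handle_ecc_event() are all hypothetical names chosen for illustration.
It assumes the f/w marks an ECC event as critical through one extra
field in the event payload, and that the host escalates to a hard reset
only when that field is set.

```c
#include <stdbool.h>
#include <stdint.h>
#include <stdio.h>

/* Hypothetical, simplified ECC event payload. This is NOT the real
 * layout from cpucp_if.h; field names are assumptions for illustration. */
struct ecc_event {
	uint64_t address;         /* address of the ECC error */
	uint64_t syndrome;        /* ECC syndrome reported by the f/w */
	uint8_t  mem_wrapper_idx; /* index of the affected memory wrapper */
	uint8_t  is_critical;     /* assumed flag: nonzero when the f/w asks for a hard reset */
};

/* Hypothetical result type: what the host should do after the event. */
enum reset_action {
	RESET_NONE,
	RESET_HARD,
};

/* Log the event, then request a hard reset only if the f/w flagged it
 * as critical. Non-critical (correctable) events are just reported. */
static enum reset_action handle_ecc_event(const struct ecc_event *ev)
{
	fprintf(stderr,
		"ECC error: addr=0x%llx syndrome=0x%llx wrapper=%u critical=%u\n",
		(unsigned long long)ev->address,
		(unsigned long long)ev->syndrome,
		(unsigned)ev->mem_wrapper_idx,
		(unsigned)ev->is_critical);

	return ev->is_critical ? RESET_HARD : RESET_NONE;
}
```

Carrying the flag inside the existing event payload means no new event
type is needed: the host keeps one ECC handler and only the reset
decision changes.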

Signed-off-by: Ofir Bitton <obitton@habana.ai>
Reviewed-by: Oded Gabbay <ogabbay@kernel.org>
Signed-off-by: Oded Gabbay <ogabbay@kernel.org>
drivers/misc/habanalabs/gaudi2/gaudi2.c
drivers/misc/habanalabs/include/common/cpucp_if.h