This rule checks the amount of lock stalls for AI Engine tiles.
Reducing lock stalls can improve AI Engine performance. Time AI Engine was in a lock stall. This could be due to multiple reasons such as buffers being read faster than they are written to or from streams beween the PL being clocked at a slower frequency, etc.
You can resolve the stall by examining buffer access patterns using trace results and acquiring and releasing buffers on time. Use of local buffers may also resolve the issue. You should also ensure the PL interface throughput matches the AI Engine throughput in the case the PL interface is either the source or destination of the stalls.
- Use PING-PONG buffer (default).
- Balance throughput between kernels.
- Acquire and release buffer in-time. Use local buffer as needed.