Select the
Graph
view to examine the design. Selectp_d
to identify the tile as (25,0).Adjust the trace view to the correct size with the zoom in or zoom out icons, and move the marker to the end of
peak_detect
or the beginning of_main
. This is considered as the beginning of an iteration. A period of lock stall indicates data is sent from the PL to AIE tile.Observe the end of the
peak_detect
kernel corresponding to the core(25,0) and start of the core(24,0) and core(25,1). If you observe the graph view, you can notice that the kernelpeak_detect
sends data to both theupscale
anddata_shuffle
kernels. The same behavior can be observed in the trace view as well.You can calculate the execution time of one iteration as follows. Place the marker at start and end of the iteration and (1) - (2) gives 262.2 ns which is ~= 329 cycles. This matches with the
Function time
in the profile data from both the AI Engine simulation and hardware emulation.