When properly configured, the Xilinx
Runtime (XRT) collects profiling data on host applications and kernels. XRT
automatically captures profiling data for the host application as it makes calls to the
runtime either through OpenCL or XRT API calls.
You can add user calls to your host application to capture additional profiling
information, as explained in Custom Profiling of the Host Application. To
capture details of the kernel operations you must instrument kernels using the
--profile options as explained in the next section.
After the application finishes execution, the Profile Summary report is saved as a .csv file in the directory where the compiled host code is executed. The Profile Summary provides annotated details regarding the overall application performance. All data generated during the execution of the application is grouped into categories. The Profile Summary lets you examine the kernel execution and data transfer statistics.
An example of the Profile Summary report is shown below.