Vitis HLS is project based and can contain multiple variations called "solutions" to drive synthesis and simulation. Each solution can target either the Vivado IP flow, or the Vitis Kernel flow. Based on the target flow, each solution will specify different constraints and optimization directives, as described in Enabling the Vivado IP Flow and Enabling the Vitis Kernel Flow. Refer to Default Settings of Vivado/Vitis Flows for a clear list of differences between the two flows.
The following are the synthesis, analysis, and optimization steps in the typical design flow:
- Create a new Vitis HLS project.
- Verify the source code with C simulation.
- Run high-level synthesis to generate RTL files.
- Analyze the results by examining latency, initiation interval (II), throughput, and resource utilization.
- Optimize and repeat as needed.
- Verify the results using C/RTL Co-simulation.
Vitis HLS implements the solution based on the target flow, default tool configuration, design constraints, and any optimization pragmas or directives you specify. You can use optimization directives to modify and control the implementation of the internal logic and I/O ports, overriding the default behaviors of the tool.
The C/C++ code is synthesized as follows:
- Top-level function arguments synthesize into RTL I/O port interfaces automatically by Vitis HLS. As described in Defining Interfaces, the default interfaces that the tool creates depends on the target flow, the data type and direction of the function argument, the default interface mode, and any user-specified INTERFACE pragmas or directives that manually define the interface.
- Sub-functions of the top-level C/C++ function synthesize into blocks in
the hierarchy of the RTL design.
- The final RTL design includes a hierarchy of modules or entities that correspond with the original top-level C function hierarchy.
- Vitis HLS automatically inlines sub-functions into higher level functions, or the top-level function as needed to improve performance.
- You can disable automatic inlining by specifying the
INLINEpragma to a sub-function, or using
set_directive_inline, and setting it to
OFFin your solution.
- By default, each call of the C sub-function uses the same instance of
the RTL module. However, you can implement multiple instances of the RTL module to
improve performance by specifying the
ALLOCATIONpragma, or using the
set_directive_allocationin your solution.
- Loops in the C functions are kept rolled and are pipelined by default to improve performance.
- The Vitis HLS tool will not unroll loops unless it improves the performance of the solution, like unrolling nested loops to pipeline the top-level loop. When loops are rolled, synthesis creates the logic for one iteration of the loop, and the RTL design executes this logic for each iteration of the loop in sequence. Unrolled loops let some or all iterations of the loop occur in parallel, but also consume more device resources.
- You can manually unroll loops using the
UNROLLpragma, or the
- Loops can also be pipelined, either with a finite-state machine fine-grain implementation (loop pipelining) or with a more coarse-grain handshake-based implementation (dataflow).
- Arrays in the code are synthesized into block RAM (BRAM), LUT RAM, or
UltraRAM in the final FPGA design.
- If the array is on the top-level function interface, high-level synthesis implements the array as ports with access to a block RAM outside the design.
- You can reconfigure the type of memory used, or reconfigure
read/write memory transfers using the
ARRAY_RESHAPEpragmas, or the associated
set_directive_arraycommands to change the default assignments.
After synthesis, you can analyze the results in the various reports produced to determine the quality of your results. After analyzing the results, you can create additional solutions for your project specifying different constraints and optimization directives, and synthesize and analyze those results. You can compare results between different solutions to see what has worked and what has not. You can repeat this process until the design has the desired performance characteristics. Using multiple solutions allows you to proceed with development while retaining prior solutions.