Sample Bitstream Sizes - 1.3 English

UltraScale+ Devices Integrated Block for PCI Express Product Guide (PG213)

Document ID
PG213
Release Date
2023-10-19
Version
1.3 English
The final size of the stage 1 bitstream varies based on many factors, including:
  • IP: The size and shape of the first-stage Pblocks determine the number of frames required for stage 1. x8 and x16 configurations will require more GT quads in the stage 1 floorplan, which will lead to a larger stage 1 bitstream.
  • Device: Wider devices require more routing frames to connect the IP to clocking resources.
  • Design: Location of the reset pin is one of many factors introduced by the addition of the user application.
  • GT Locations: The selection of the GT quads used affects the size of the stage 1 bitstream. For the most efficient use of resources, the GT quad adjacent to the PCI Express hard block should be used.
  • Compression: As the device utilization increases, the effectiveness of compression decreases.

As a baseline, here are some sample bitstream sizes and configuration times for the example (PIO) design generated along with the PCIe IP.

Table 1. Example Bitstream Size and Configuration Times 1
Device Full Bitstream Full: BPI16

at 50 MHz

Tandem Stage 1 2 Tandem: BPI16

at 50 MHz

AU25P 117.7 Mb 147.2 ms 14.4 Mb 18.1 ms
KU15P 277.3 Mb 346.6 ms 17.6 Mb 22.0 ms
VU9P 611.6 Mb 764.5 ms 17.5 Mb 21.8 ms
  1. The configuration times shown here do not include TPOR.
  2. Because the PIO design is very small, compression is very effective in reducing the bitstream size. These numbers were obtained without compression to give a more accurate estimate of what a full design might show. These numbers were generated using a PCIe Gen3x16 configuration.
The amount of time it takes to load the stage 2 bitstream using the Tandem PCIe methodology depends on three additional factors:
  • The width and speed of PCI Express link.
  • The frequency of the clock used to program the MCAP.
  • The efficiency at which the Root Port host can deliver the bitstream to the Endpoint FPGA design. For most designs this is the limiting factor.

The lower bandwidth of these three factors determines how fast the stage 2 bitstream is loaded.