ZCU102 Evaluation Kit - 1.4 English

Vitis AI Library User Guide (UG1354)

Document ID: UG1354
Release Date: 2021-07-22
Version: 1.4 English

The ZCU102 evaluation kit uses the mid-range ZU9 UltraScale+™ device. There are two hardware versions of the ZCU102 evaluation kit: one whose serial number begins with 0432055-04 and one whose serial number begins with 0432055-05. The performance of the Vitis AI Library differs between the two versions because their DDR memory performance differs. Because the 0432055-04 version of the ZCU102 has been discontinued, the following table shows performance only for the ZCU102 (0432055-05) evaluation kit. In the ZCU102 evaluation kit, three B4096F DPU cores are implemented in the programmable logic.

Refer to the following table for the throughput (in frames per second, fps) of various neural network samples on the ZCU102 (0432055-05) with the DPU running at 281 MHz.

Note: The DPU on the ZCU102 has a hardware softmax acceleration module. Because of a limitation of the hardware softmax module, the software softmax is faster when the number of categories reaches 1000. Set XLNX_ENABLE_C_SOFTMAX=1 to enable the software softmax (softmax_c). The default value of XLNX_ENABLE_C_SOFTMAX is 0, which means the softmax method is selected in the following order of priority:
  1. Neon Acceleration
  2. Hardware Softmax
  3. Software Softmax_c
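
The selection rule in the note can be modeled with a short Python sketch. This is only an illustration of the stated priority order, not the library's implementation: the function `select_softmax`, the backend names, and the `available` set are hypothetical, and how the library actually decides which backends are usable is not described here.

```python
import os

# Priority order quoted from the note above (highest priority first).
PRIORITY = ("neon", "hardware", "software_c")


def select_softmax(available, env=None):
    """Return the backend this model of the note would choose.

    `available` is a hypothetical set of backend names usable for the
    current call; the real library determines this internally.
    """
    env = os.environ if env is None else env
    # XLNX_ENABLE_C_SOFTMAX=1 forces the software path (softmax_c).
    if env.get("XLNX_ENABLE_C_SOFTMAX", "0") == "1":
        return "software_c"
    # Default (0): take the first available backend in priority order.
    for backend in PRIORITY:
        if backend in available:
            return backend
    raise ValueError("no softmax backend available")
```

For example, `select_softmax({"hardware", "software_c"}, env={})` yields `"hardware"`, while the same call with `env={"XLNX_ENABLE_C_SOFTMAX": "1"}` yields `"software_c"`.
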
For the ZCU102, use the following command to test classification performance with the software softmax enabled.
env XLNX_ENABLE_C_SOFTMAX=1 ./test_performance_classification resnet50 test_performance_classification.list -t 8 -s 60
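
To compare the two softmax settings from a script, the command above could be wrapped as in the following Python sketch. The helper names `build_perf_command` and `run_perf` are hypothetical. The sketch also assumes that `-t` is the thread count and `-s` the run duration in seconds, and that it runs on the board from the directory containing the test binary.

```python
import os
import subprocess


def build_perf_command(model, list_file, threads=8, seconds=60, c_softmax=True):
    """Build argv and environment for the classification performance test."""
    argv = [
        "./test_performance_classification",
        model,
        list_file,
        "-t", str(threads),   # assumed: number of threads
        "-s", str(seconds),   # assumed: run duration in seconds
    ]
    # Copy the current environment and set the softmax switch explicitly.
    env = dict(os.environ)
    env["XLNX_ENABLE_C_SOFTMAX"] = "1" if c_softmax else "0"
    return argv, env


def run_perf(model, list_file, **kwargs):
    """Run the test binary; raises CalledProcessError on a non-zero exit."""
    argv, env = build_perf_command(model, list_file, **kwargs)
    return subprocess.run(argv, env=env, check=True)
```

Calling `run_perf("resnet50", "test_performance_classification.list")` reproduces the command shown above. Passing `c_softmax=False` runs the same test with the default selection order.
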
Table 1. ZCU102 (0432055-05) Performance
No.  Neural Network  Input Size  GOPS  Performance (fps, single thread)  Performance (fps, multiple threads)
1 bcc_pt 800x1000 268.9 3.3 10.7
2 centerpoint_0_pt centerpoint_1_pt 2560x40x4 54 16 48
3 densebox_320_320 320x320 0.49 418.8 1630.7
4 densebox_640_360 360x640 1.1 208.6 811.3
5 efficientNet-edgetpu-L_tf 300x300 19.36 27.9 78
6 efficientNet-edgetpu-M_tf 240x240 7.34 68.8 181.8
7 efficientNet-edgetpu-S_tf 224x224 4.72 94.8 261
8 ENet_cityscapes_pt 512x1024 8.6 9.2 36.9
9 face_landmark 96x72 0.14 843 1546.9
10 face-quality 80x60 0.06 2114.2 7279.5
11 face-quality_pt 80x60 0.06 2098.4 7178.7
12 facerec_resnet20 112x96 3.5 165.1 331.5
13 facerec-resnet20_mixed_pt 112x96 3.5 163.3 329.7
14 facerec_resnet64 112x96 11 70.7 176.9
15 facereid-large_pt 96x96 0.5 840.3 2206.3
16 facereid-small_pt 80x80 0.09 1858.7 5966.3
17 fpn 256x512 8.9 34.4 151.9
18 FPN_Res18_Medical_segmentation 320x320 45.3 12.2 45.8
19 FPN-resnet18_covid19-seg_pt 352x352 22.7 35.8 106.1
20 FPN-resnet18_Endov 240x320 13.75 36.3 155.9
21 hourglass-pe_mpii 256x256 10.2 18.2 71.9
22 inception_resnet_v2_tf 299x299 26.4 22.4 50.8
23 inception_v1 224x224 3.2 179.4 470.2
24 inception_v1_tf 224x224 3 182.9 469.1
25 inception_v2 224x224 4 131.5 303.3
26 inception_v2_tf 224x224 3.88 91.1 229.9
27 inception_v3 299x299 11.4 58.8 135.9
28 inception_v3_pt 299x299 5.7 58.8 136.1
29 inception_v3_tf 299x299 11.5 58.7 134.8
30 inception_v3_tf2 299x299 11.5 58.3 136.6
31 inception_v4 299x299 24.5 28.6 68.6
32 inception_v4_2016_09_09_tf 299x299 24.6 28.6 68.5
33 medical_seg_cell_tf2 128x128 5.3 154.8 395.3
34 MLPerf_resnet50_v1.5_tf 224x224 8.19 71.2 168.4
35 mlperf_ssd_resnet34_tf 1200x1200 433 1.8 7
36 mobilenet_1_0_224_tf2 224x224 1.1 292.5 930.4
37 mobilenet_edge_0_75_tf 224x224 0.62 237.5 707.1
38 mobilenet_edge_1_0_tf 224x224 0.99 198.1 547.6
39 mobilenet_v1_0_25_128_tf 128x128 0.027 1125.7 4184.1
40 mobilenet_v1_0_5_160_tf 160x160 0.15 715.2 2659.6
41 mobilenet_v1_1_0_224_tf 224x224 1.1 297 941.2
42 mobilenet_v2 224x224 0.6 250 733.4
43 mobilenet_v2_1_0_224_tf 224x224 0.6 244.7 694.5
44 mobilenet_v2_1_4_224_tf 224x224 1.2 179.5 468.9
45 mobilenet_v2_cityscapes_tf 1024x2048 132.74 1.6 4.6
46 MT-resnet18_mixed_pt 512x320 13.65 31.3 102
47 multi_task 288x512 14.8 37.9 126.9
48 multi_task_v3_pt 320x512 25.44 16.7 60.9
49 openpose_pruned_0_3 368x368 49.9 3.8 15.1
50 personreid-res18_pt 176x80 1.1 349 683.6
51 personreid-res50_pt 256x128 5.4 96.8 228.1
52 plate_detection 320x320 0.49 507.9 2008.2
53 plate_num 96x288 1.75 149.7 494.9
54 pmg_pt 224x224 2.28 144.7 363.4
55 pointpainting- pointpainting_nuscenes_40000_64_0_pt pointpainting_nuscenes_40000_64_1_pt 40000x64x16 112 1.3 4.4
56 pointpillars_kitti_12000_0_pt pointpillars_kitti_12000_1_pt 12000x100x4 10.8 19.8 49.9
57 pointpillars_nuscenes_40000_64_0_pt pointpillars_nuscenes_40000_64_1_pt 40000x64x5 108 2.3 9.9
58 rcan_pruned_tf 360x640 86.95 7.6 17.1
59 refinedet_baseline 480x360 123 8.5 24.7
60 RefineDet-Medical_EDD_tf 320x320 9.8 66 230.1
61 refinedet_pruned_0_8 360x480 25 32.5 100.1
62 refinedet_pruned_0_92 360x480 10.1 62.9 201.3
63 refinedet_pruned_0_96 360x480 5.1 87.2 287.9
64 refinedet_VOC_tf 320x320 81.9 11.2 34.4
65 reid 80x160 0.95 350.4 692.8
66 resnet18 224x224 3.7 183.1 478.1
67 resnet50 224x224 7.7 72.7 173.7
68 resnet50_pt 224x224 4.1 68.8 166.5
69 resnet50_tf2 224x224 7.7 70.6 171.2
70 resnet_v1_101_tf 224x224 14.4 42.6 106.5
71 resnet_v1_152_tf 224x224 21.8 29.1 74.1
72 resnet_v1_50_tf 224x224 7 79.5 185.6
73 resnet_v2_101_tf 299x299 26.78 20.5 54.4
74 resnet_v2_152_tf 299x299 40.47 14.5 37.5
75 resnet_v2_50_tf 299x299 13.1 35.1 95.6
76 retinaface 360x640 1.11 126 541.4
77 salsanext_pt 64x2048 20.4 5.5 21.0
78 salsanext_v2_pt 64x2048 32 4.0 11.1
79 SemanticFPN_cityscapes_pt 256x512 10 34 161.8
80 SemanticFPN_Mobilenetv2_pt 512x1024 5.4 10.1 54.1
81 semantic_seg_citys_tf2 512x1024 54 7.1 24
82 sp_net 128x224 0.55 514.4 1590.6
83 squeezenet 227x227 0.76 270.9 1070.5
84 squeezenet_pt 224x224 0.82 221.7 817.5
85 ssd_adas_pruned_0_95 360x480 6.3 87.3 297.4
86 ssd_inception_v2_coco_tf 300x300 9.6 38.8 101.5
87 ssdlite_mobilenet_v2_coco_tf 300x300 1.5 101.1 304.6
88 ssd_mobilenet_v1_coco_tf 300x300 2.5 107.1 331.8
89 ssd_mobilenet_v2 360x480 6.6 39.5 117.5
90 ssd_mobilenet_v2_coco_tf 300x300 3.8 78.8 212.5
91 ssd_pedestrian_pruned_0_97 360x360 5.9 76.1 278.9
92 ssd_resnet_50_fpn_coco_tf 640x640 178.4 2.9 5.2
93 ssd_traffic_pruned_0_9 360x480 11.6 54.9 199.9
94 tiny_yolov3_vmss 416x416 5.46 117.7 393.3
95 unet_chaos-CT_pt 512x512 23.3 22.2 69.7
96 vgg_16_tf 224x224 31 20.1 41.1
97 vgg_19_tf 224x224 39.3 17.3 36.6
98 vpgnet_pruned_0_99 480x640 2.5 94.4 351.2
99 yolov2_voc 448x448 34 26.4 70
100 yolov2_voc_pruned_0_66 448x448 11.6 62.9 191.7
101 yolov2_voc_pruned_0_71 448x448 9.9 72.3 224.2
102 yolov2_voc_pruned_0_77 448x448 7.8 84.4 269.3
103 yolov3_adas_pruned_0_9 256x512 5.5 90.6 263.4
104 yolov3_bdd 288x512 53.7 12.4 33
105 yolov3_voc 416x416 65.4 12.7 33.5
106 yolov3_voc_tf 416x416 65.6 13.2 34.4
107 yolov4_leaky_spp_m 416x416 60.1 13.2 33.8
108 yolov4_leaky_spp_m_pruned_0_36 416x416 38.2 18 45.5
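
As a reading aid for Table 1, the following Python sketch derives three figures for a few rows copied from the table: the multi-thread scaling factor, the effective throughput, and the fraction of an assumed peak. It rests on two assumptions that the table itself does not state: that the GOPS column is the per-frame workload in giga-operations, and that each B4096F core peaks at 4096 operations per clock cycle, giving 3 x 4096 x 281 MHz for the three cores. The names `ROWS`, `PEAK_GOPS`, and `summarize` are illustrative only.

```python
# (GOPS per frame, single-thread fps, multi-thread fps), copied from Table 1.
ROWS = {
    "resnet50": (7.7, 72.7, 173.7),
    "vgg_16_tf": (31.0, 20.1, 41.1),
    "densebox_320_320": (0.49, 418.8, 1630.7),
}

# Assumed peak: 3 cores x 4096 ops/cycle x 281 MHz, expressed in GOP/s.
PEAK_GOPS = 3 * 4096 * 281e6 / 1e9


def summarize(gops, fps_single, fps_multi):
    """Return (scaling factor, effective GOP/s, fraction of assumed peak)."""
    scaling = fps_multi / fps_single   # gain from multiple threads
    effective = gops * fps_multi       # workload per frame x frames per second
    utilization = effective / PEAK_GOPS
    return scaling, effective, utilization


for name, row in ROWS.items():
    scaling, effective, utilization = summarize(*row)
    print(f"{name}: {scaling:.2f}x scaling, {effective:.0f} GOP/s, "
          f"{utilization:.0%} of assumed peak")
```

Small networks such as densebox_320_320 tend to show a larger multi-thread gain but a lower fraction of the assumed peak, which is consistent with per-frame overhead outside the DPU dominating their run time.
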