KV260 Vision AI Starter Kit - 3.0 English

Vitis AI Library User Guide (UG1354)

Document ID: UG1354
Release Date: 2023-01-12
Version: 3.0 English

The KV260 starter kit uses a custom Zynq UltraScale+ device. One B4096 DPU core is implemented in the programmable logic and delivers a peak INT8 performance of 1.23 TOPS for deep learning inference acceleration.
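The 1.23 TOPS figure can be reproduced with a short calculation. The sketch below assumes that the "B4096" designation means 4096 operations per clock cycle and that the core runs at the 300 MHz clock quoted for the performance table; the function and constant names are illustrative only and are not part of the Vitis AI Library.

```python
# Peak INT8 throughput of the DPU, under two assumptions:
#   - "B4096" denotes 4096 operations per clock cycle
#   - the DPU clock is 300 MHz (the clock quoted for the performance table)
OPS_PER_CYCLE = 4096      # assumption derived from the B4096 name
DPU_CLOCK_HZ = 300e6      # 300 MHz
NUM_CORES = 1             # one DPU core on the KV260


def peak_tops(ops_per_cycle, clock_hz, cores=1):
    """Return peak throughput in tera-operations per second (TOPS)."""
    return ops_per_cycle * clock_hz * cores / 1e12


peak = peak_tops(OPS_PER_CYCLE, DPU_CLOCK_HZ, NUM_CORES)
print(f"{peak:.2f} TOPS")  # 1.23 TOPS
```

This is a theoretical ceiling; measured throughput for a real network is lower because not every cycle performs useful multiply-accumulate work.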

The following table lists the throughput, in frames per second (fps), of various neural network samples on the KV260 with the DPU clocked at 300 MHz.

Table 1. KV260 Starter Kit Performance
| No. | Neural Network | Input Size | GOPS | Performance (fps), Single Thread | Performance (fps), Multiple Threads |
|---|---|---|---|---|---|
| 1 | bcc_pt | 800x1000 | 268.9 | 3.6 | 4 |
| 2 | bevdet | 256x704 | 407.6 | 1.7 | 2.5 |
| 3 | c2d2_lite | 512x512 | 6.86 | 3.2 | 3.4 |
| 4 | centerpoint | 2560x40x4 | 54 | 17.2 | 20.2 |
| 5 | cflownet_pt | 128x128 | 5.21 | 67 | 73.9 |
| 6 | chen_color_resnet18_pt | 224x224 | 3.627 | 227.7 | 249.3 |
| 7 | clocs | 12000x100x4 | 41 | 3.1 | 9.1 |
| 8 | drunet_pt | 528x608 | 2.59 | 65.1 | 77.7 |
| 9 | efficientdet_d2_tf | 768x768 | 11.06 | 3.8 | 5 |
| 10 | efficientnet_lite_tf2 | 224x224 | 0.77 | 217.9 | 243.3 |
| 11 | efficientnet-b0_tf2 | 224x224 | 0.36 | 83.9 | 87.3 |
| 12 | efficientNet-edgetpu-L_tf | 300x300 | 19.36 | 37.5 | 38.6 |
| 13 | efficientNet-edgetpu-M_tf | 240x240 | 7.34 | 86 | 90 |
| 14 | efficientNet-edgetpu-S_tf | 224x224 | 4.72 | 123.9 | 131.7 |
| 15 | ENet_cityscapes_pt | 512x1024 | 8.6 | 11.2 | 29.8 |
| 16 | face_mask_detection_pt | 512x512 | 0.593 | 123.6 | 170.3 |
| 17 | face-quality_pt | 80x60 | 0.06 | 3269.8 | 5659.6 |
| 18 | facerec-resnet20_mixed_pt | 112x96 | 3.5 | 180.5 | 185.2 |
| 19 | facereid-large_pt | 96x96 | 0.5 | 1178.1 | 1465.2 |
| 20 | facereid-small_pt | 80x80 | 0.09 | 2697.7 | 4181.6 |
| 21 | fadnet | 576x960 | 441 | 1.7 | 2.3 |
| 22 | fadnet_pruned | 576x960 | 154 | 2.7 | 4.3 |
| 23 | fadnet_v2_pt | 576x960 | 412 | 1.8 | 2.5 |
| 24 | fadnet_v2_pruned_pt | 576x960 | 201 | 2.8 | 4.5 |
| 25 | FairMot_pt | 640x480 | 36 | 24.3 | 27.2 |
| 26 | FPN-resnet18_covid19-seg_pt | 352x352 | 22.7 | 39.8 | 41.5 |
| 27 | HardNet_MSeg_pt | 352x352 | 22.78 | 26.5 | 27.5 |
| 28 | hfnet_tf | 960x960 | 20.09 | 3.4 | 11.5 |
| 29 | HRNet_pt | 1024x2048 | 1511.9 | N/A | N/A |
| 30 | inception_resnet_v2_tf | 299x299 | 26.4 | 25.6 | 26.2 |
| 31 | inception_v1_tf | 224x224 | 3 | 206.9 | 229.8 |
| 32 | inception_v2_tf | 224x224 | 3.88 | 100.1 | 105.2 |
| 33 | inception_v3_pt | 299x299 | 5.7 | 64.8 | 68.4 |
| 34 | inception_v3_tf | 299x299 | 11.5 | 64.9 | 68.4 |
| 35 | inception_v3_tf2 | 299x299 | 11.5 | 64.1 | 67.6 |
| 36 | inception_v4_2016_09_09_tf | 299x299 | 24.6 | 31 | 31.8 |
| 37 | medical_seg_cell_tf2 | 128x128 | 5.3 | 168.2 | 183.2 |
| 38 | MLPerf_resnet50_v1.5_tf | 224x224 | 8.19 | 86.4 | 90.1 |
| 39 | mlperf_ssd_resnet34_tf | 1200x1200 | 433 | 2 | 2.6 |
| 40 | mobilenet_1_0_224_tf2 | 224x224 | 1.1 | 351.9 | 423.7 |
| 41 | mobilenet_edge_0_75_tf | 224x224 | 0.62 | 283.4 | 328 |
| 42 | mobilenet_edge_1_0_tf | 224x224 | 0.99 | 234.2 | 263.6 |
| 43 | mobilenet_v1_0_25_128_tf | 128x128 | 0.027 | 1452 | 2423.2 |
| 44 | mobilenet_v1_0_5_160_tf | 160x160 | 0.15 | 979.5 | 1460.7 |
| 45 | mobilenet_v1_1_0_224_tf | 224x224 | 1.1 | 357.7 | 432.3 |
| 46 | mobilenet_v2_1_0_224_tf | 224x224 | 0.6 | 291.5 | 338.9 |
| 47 | mobilenet_v2_1_4_224_tf | 224x224 | 1.2 | 206.2 | 228.8 |
| 48 | mobilenet_v2_cityscapes_tf | 1024x2048 | 132.74 | 1.9 | 3.3 |
| 49 | mobilenet_v3_small_1_0_tf2 | 224x224 | 0.132 | 377.1 | 460.5 |
| 50 | monodepth2_pt | 192x640 | 257.21 | 46.6 | 49.4 |
| 51 | movenet_ntd_pt | 192x192 | 0.5 | 102.3 | 361.9 |
| 52 | MT-resnet18_mixed_pt | 512x320 | 13.65 | 35.6 | 50.9 |
| 53 | multi_task_v3_pt | 320x512 | 25.44 | 18.6 | 30.2 |
| 54 | ocr_pt | 960x960 | 875.7 | 1.1 | 1.3 |
| 55 | ofa_depthwise_res50_pt | 176x176 | 1.25 | 117.6 | 278.1 |
| 56 | ofa_rcan_latency_pt | 360x640 | 45.7 | 18 | 19.2 |
| 57 | ofa_resnet50_0_9B_pt | 160x160 | 1.8 | 199.3 | 213.7 |
| 58 | ofa_yolo_pruned_0_30_pt | 640x640 | 34.71 | 23 | 26.5 |
| 59 | ofa_yolo_pruned_0_50_pt | 640x640 | 24.62 | 29.5 | 35.5 |
| 60 | ofa_yolo_pt | 640x640 | 48.88 | 18.1 | 20.1 |
| 61 | person-orientation_pruned_558m_pt | 224x112 | 0.558 | 846 | 982.8 |
| 62 | personreid-res18_pt | 176x80 | 1.1 | 448.1 | 519.6 |
| 63 | personreid-res50_pt | 256x128 | 5.3 | 106.4 | 128.2 |
| 64 | pmg_pt | 224x224 | 2.28 | 167.2 | 178.4 |
| 65 | pointpainting | 40000x64x16 | 112 | 1.3 | 2.8 |
| 66 | pointpillars_kitti_12000_pt | 12000x100x4 | 10.8 | 22.4 | 32.3 |
| 67 | pointpillars_nuscenes | 40000x64x5 | 108 | 2.4 | 5.4 |
| 68 | rcan_pruned_tf | 360x640 | 86.95 | 10 | 10.3 |
| 69 | refinedet_VOC_tf | 320x320 | 81.9 | 10.9 | 13.1 |
| 70 | RefineDet-Medical_EDD_tf | 320x320 | 9.8 | 72.5 | 85.6 |
| 71 | resnet_v1_101_tf | 224x224 | 14.4 | 50.4 | 51.7 |
| 72 | resnet_v1_152_tf | 224x224 | 21.8 | 34.2 | 34.8 |
| 73 | resnet_v1_50_tf | 224x224 | 7 | 96.9 | 101.6 |
| 74 | resnet_v2_101_tf | 299x299 | 26.78 | 25.3 | 25.8 |
| 75 | resnet_v2_152_tf | 299x299 | 40.47 | 17.2 | 17.5 |
| 76 | resnet_v2_50_tf | 299x299 | 13.1 | 48.5 | 50.4 |
| 77 | resnet50_pt | 224x224 | 4.1 | 85.7 | 89.4 |
| 78 | resnet50_tf2 | 224x224 | 7.7 | 87.5 | 91.3 |
| 79 | SA_gate_base_pt | 360x360 | 178 | 3.6 | 4.7 |
| 80 | salsanext_pt | 64x2048 | 20.4 | 10.4 | 37.5 |
| 81 | salsanext_v2_pt | 64x2048 | 32 | 6.5 | 12.1 |
| 82 | semantic_seg_citys_tf2 | 512x1024 | 54 | 8.1 | 15.4 |
| 83 | SemanticFPN_cityscapes_pt | 256x512 | 10 | 38.7 | 86.8 |
| 84 | SemanticFPN_Mobilenetv2_pt | 512x1024 | 5.4 | 11.5 | 32.9 |
| 85 | SESR_S_pt | 360x640 | 7.48 | 95.8 | 105.5 |
| 86 | solo_pt | 640x640 | 107 | 1.5 | 4.4 |
| 87 | squeezenet_pt | 224x224 | 0.82 | 619 | 763.2 |
| 88 | ssd_inception_v2_coco_tf | 300x300 | 9.6 | 41.8 | 47.5 |
| 89 | ssd_mobilenet_v1_coco_tf | 300x300 | 2.5 | 121.3 | 171.1 |
| 90 | ssd_mobilenet_v2_coco_tf | 300x300 | 3.8 | 87.6 | 112.6 |
| 91 | ssd_resnet_50_fpn_coco_tf | 640x640 | 178.4 | 3 | 5.4 |
| 92 | ssdlite_mobilenet_v2_coco_tf | 300x300 | 1.5 | 114.8 | 161 |
| 93 | ssr_pt | 256x256 | 39.72 | 6.5 | 6.5 |
| 94 | superpoint_tf | 480x640 | 52.4 | 11.5 | 20.6 |
| 95 | textmountain_pt | 960x960 | 575.2 | 1.8 | 1.9 |
| 96 | tsd_yolox_pt | 640x640 | 73 | 13.9 | 14.6 |
| 97 | ultrafast_pt | 288x800 | 8.4 | 38.8 | 42.8 |
| 98 | unet_chaos-CT_pt | 512x512 | 23.3 | 24.1 | 30.6 |
| 99 | vehicle_make_resnet18_pt | 224x224 | 3.627 | 227.3 | 249 |
| 100 | vehicle_type_resnet18_pt | 224x224 | 3.627 | 228 | 249.3 |
| 101 | vgg_16_tf | 224x224 | 31 | 21.8 | 22 |
| 102 | vgg_19_tf | 224x224 | 39.3 | 18.8 | 18.9 |
| 103 | xilinxSR_pt | 360x640x3 | 182.44 | 2.5 | 2.6 |
| 104 | yolov3_coco_416_tf2 | 416x416 | 65.9 | 14.2 | 14.8 |
| 105 | yolov3_voc_tf | 416x416 | 65.6 | 14.5 | 14.9 |
| 106 | yolov4_csp_pt | 640x640 | 121 | 7.9 | 8.4 |
| 107 | yolov4_leaky_416_tf | 416x416 | 60.3 | 14.4 | 15.4 |
| 108 | yolov4_leaky_512_tf | 512x512 | 91.2 | 11 | 11.7 |
| 109 | yolov5_large_pt | 640x640 | 109.6 | 9.3 | 9.9 |
| 110 | yolov5_nano_pt | 640x640 | 4.6 | 78.3 | 137.8 |
| 111 | yolov6s6_pt_placeholder | 640x640 | 17 | 11.4 | 14.9 |
| 112 | yolov6m_pt | 640x640 | 82.2 | 6.6 | 12.8 |
| 113 | yolox_nano_pt | 416x416x3 | 1 | 200.8 | 275.3 |
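
The table can be used to estimate how much a network gains from multithreading and how close it comes to the DPU's peak rate. The sketch below is a rough illustration and rests on two assumptions that the table does not state: that the GOPS column is the compute workload per inference (giga-operations per frame), and that the peak rate is 1228.8 GOPS (4096 operations per cycle at 300 MHz). The rows are copied from Table 1; `summarize` and `PEAK_GOPS` are hypothetical names introduced for this example.

```python
# Assumption: peak rate = 4096 ops/cycle x 300 MHz = 1228.8 GOPS.
PEAK_GOPS = 1228.8

# (model, GOPs per frame, single-thread fps, multi-thread fps), from Table 1.
# Assumption: the GOPS column is the workload of one inference.
ROWS = [
    ("resnet50_pt", 4.1, 85.7, 89.4),
    ("movenet_ntd_pt", 0.5, 102.3, 361.9),
    ("yolov5_nano_pt", 4.6, 78.3, 137.8),
    ("vgg_16_tf", 31.0, 21.8, 22.0),
]


def summarize(gops_per_frame, fps_single, fps_multi, peak_gops=PEAK_GOPS):
    """Return (multi-thread speedup, achieved GOPS, fraction of peak)."""
    speedup = fps_multi / fps_single
    achieved_gops = gops_per_frame * fps_multi
    return speedup, achieved_gops, achieved_gops / peak_gops


for name, gops, fps_single, fps_multi in ROWS:
    speedup, achieved, fraction = summarize(gops, fps_single, fps_multi)
    print(f"{name:16s} speedup x{speedup:.2f}  "
          f"achieved {achieved:7.1f} GOPS  ({fraction:.0%} of peak)")
```

Under these assumptions, a network whose multi-thread figure is much higher than its single-thread figure is probably limited by work outside the DPU (for example CPU-side pre- and post-processing) when run on one thread, while a network with nearly equal figures already keeps the DPU busy. The fraction-of-peak value is an estimate only, since the measured fps covers the full sample pipeline rather than DPU execution alone.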