The KV260 starter kit uses a custom Zynq UltraScale+ device. One B4096 DPU core is implemented in the programmable logic, delivering a peak INT8 performance of 1.23 TOPS for deep learning inference acceleration.
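The peak figure can be reproduced with simple arithmetic. The sketch below assumes that "B4096" denotes a peak of 4096 INT8 operations per DPU clock cycle and uses the 300 MHz DPU clock at which the measurements in this section were taken. The function name `peak_tops` is illustrative only and is not part of any Xilinx API.

```python
# Assumption: a B4096 DPU core performs at most 4096 INT8 operations per clock cycle.
OPS_PER_CYCLE = 4096
# DPU clock used for the measurements in this section.
DPU_CLOCK_HZ = 300e6


def peak_tops(ops_per_cycle: float, clock_hz: float, cores: int = 1) -> float:
    """Return theoretical peak throughput in tera-operations per second."""
    return ops_per_cycle * clock_hz * cores / 1e12


if __name__ == "__main__":
    print(f"Peak: {peak_tops(OPS_PER_CYCLE, DPU_CLOCK_HZ):.2f} TOPS")
```

This is a theoretical upper bound; the measured throughput of a real network is lower because of memory bandwidth, layer shapes, and CPU-side processing.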
The following table lists the throughput, in frames per second (fps), of various neural network samples on the KV260 with the DPU clocked at 300 MHz.
No. | Neural Network | Input Size | GOPS | Performance (fps), Single Thread | Performance (fps), Multiple Threads |
---|---|---|---|---|---|
1 | bcc_pt | 800x1000 | 268.9 | 3.6 | 4 |
2 | bevdet | 256x704 | 407.6 | 1.7 | 2.5 |
3 | c2d2_lite | 512x512 | 6.86 | 3.2 | 3.4 |
4 | centerpoint | 2560x40x4 | 54 | 17.2 | 20.2 |
5 | cflownet_pt | 128x128 | 5.21 | 67 | 73.9 |
6 | chen_color_resnet18_pt | 224x224 | 3.627 | 227.7 | 249.3 |
7 | clocs | 12000x100x4 | 41 | 3.1 | 9.1 |
8 | drunet_pt | 528x608 | 2.59 | 65.1 | 77.7 |
9 | efficientdet_d2_tf | 768x768 | 11.06 | 3.8 | 5 |
10 | efficientnet_lite_tf2 | 224x224 | 0.77 | 217.9 | 243.3 |
11 | efficientnet-b0_tf2 | 224x224 | 0.36 | 83.9 | 87.3 |
12 | efficientNet-edgetpu-L_tf | 300x300 | 19.36 | 37.5 | 38.6 |
13 | efficientNet-edgetpu-M_tf | 240x240 | 7.34 | 86 | 90 |
14 | efficientNet-edgetpu-S_tf | 224x224 | 4.72 | 123.9 | 131.7 |
15 | ENet_cityscapes_pt | 512x1024 | 8.6 | 11.2 | 29.8 |
16 | face_mask_detection_pt | 512x512 | 0.593 | 123.6 | 170.3 |
17 | face-quality_pt | 80x60 | 0.06 | 3269.8 | 5659.6 |
18 | facerec-resnet20_mixed_pt | 112x96 | 3.5 | 180.5 | 185.2 |
19 | facereid-large_pt | 96x96 | 0.5 | 1178.1 | 1465.2 |
20 | facereid-small_pt | 80x80 | 0.09 | 2697.7 | 4181.6 |
21 | fadnet | 576x960 | 441 | 1.7 | 2.3 |
22 | fadnet_pruned | 576x960 | 154 | 2.7 | 4.3 |
23 | fadnet_v2_pt | 576x960 | 412 | 1.8 | 2.5 |
24 | fadnet_v2_pruned_pt | 576x960 | 201 | 2.8 | 4.5 |
25 | FairMot_pt | 640x480 | 36 | 24.3 | 27.2 |
26 | FPN-resnet18_covid19-seg_pt | 352x352 | 22.7 | 39.8 | 41.5 |
27 | HardNet_MSeg_pt | 352x352 | 22.78 | 26.5 | 27.5 |
28 | hfnet_tf | 960x960 | 20.09 | 3.4 | 11.5 |
29 | HRNet_pt | 1024x2048 | 1511.9 | N/A | N/A |
30 | inception_resnet_v2_tf | 299x299 | 26.4 | 25.6 | 26.2 |
31 | inception_v1_tf | 224x224 | 3 | 206.9 | 229.8 |
32 | inception_v2_tf | 224x224 | 3.88 | 100.1 | 105.2 |
33 | inception_v3_pt | 299x299 | 5.7 | 64.8 | 68.4 |
34 | inception_v3_tf | 299x299 | 11.5 | 64.9 | 68.4 |
35 | inception_v3_tf2 | 299x299 | 11.5 | 64.1 | 67.6 |
36 | inception_v4_2016_09_09_tf | 299x299 | 24.6 | 31 | 31.8 |
37 | medical_seg_cell_tf2 | 128x128 | 5.3 | 168.2 | 183.2 |
38 | MLPerf_resnet50_v1.5_tf | 224x224 | 8.19 | 86.4 | 90.1 |
39 | mlperf_ssd_resnet34_tf | 1200x1200 | 433 | 2 | 2.6 |
40 | mobilenet_1_0_224_tf2 | 224x224 | 1.1 | 351.9 | 423.7 |
41 | mobilenet_edge_0_75_tf | 224x224 | 0.62 | 283.4 | 328 |
42 | mobilenet_edge_1_0_tf | 224x224 | 0.99 | 234.2 | 263.6 |
43 | mobilenet_v1_0_25_128_tf | 128x128 | 0.027 | 1452 | 2423.2 |
44 | mobilenet_v1_0_5_160_tf | 160x160 | 0.15 | 979.5 | 1460.7 |
45 | mobilenet_v1_1_0_224_tf | 224x224 | 1.1 | 357.7 | 432.3 |
46 | mobilenet_v2_1_0_224_tf | 224x224 | 0.6 | 291.5 | 338.9 |
47 | mobilenet_v2_1_4_224_tf | 224x224 | 1.2 | 206.2 | 228.8 |
48 | mobilenet_v2_cityscapes_tf | 1024x2048 | 132.74 | 1.9 | 3.3 |
49 | mobilenet_v3_small_1_0_tf2 | 224x224 | 0.132 | 377.1 | 460.5 |
50 | monodepth2_pt | 192x640 | 257.21 | 46.6 | 49.4 |
51 | movenet_ntd_pt | 192x192 | 0.5 | 102.3 | 361.9 |
52 | MT-resnet18_mixed_pt | 512x320 | 13.65 | 35.6 | 50.9 |
53 | multi_task_v3_pt | 320x512 | 25.44 | 18.6 | 30.2 |
54 | ocr_pt | 960x960 | 875.7 | 1.1 | 1.3 |
55 | ofa_depthwise_res50_pt | 176x176 | 1.25 | 117.6 | 278.1 |
56 | ofa_rcan_latency_pt | 360x640 | 45.7 | 18 | 19.2 |
57 | ofa_resnet50_0_9B_pt | 160x160 | 1.8 | 199.3 | 213.7 |
58 | ofa_yolo_pruned_0_30_pt | 640x640 | 34.71 | 23 | 26.5 |
59 | ofa_yolo_pruned_0_50_pt | 640x640 | 24.62 | 29.5 | 35.5 |
60 | ofa_yolo_pt | 640x640 | 48.88 | 18.1 | 20.1 |
61 | person-orientation_pruned_558m_pt | 224x112 | 0.558 | 846 | 982.8 |
62 | personreid-res18_pt | 176x80 | 1.1 | 448.1 | 519.6 |
63 | personreid-res50_pt | 256x128 | 5.3 | 106.4 | 128.2 |
64 | pmg_pt | 224x224 | 2.28 | 167.2 | 178.4 |
65 | pointpainting | 40000x64x16 | 112 | 1.3 | 2.8 |
66 | pointpillars_kitti_12000_pt | 12000x100x4 | 10.8 | 22.4 | 32.3 |
67 | pointpillars_nuscenes | 40000x64x5 | 108 | 2.4 | 5.4 |
68 | rcan_pruned_tf | 360x640 | 86.95 | 10 | 10.3 |
69 | refinedet_VOC_tf | 320x320 | 81.9 | 10.9 | 13.1 |
70 | RefineDet-Medical_EDD_tf | 320x320 | 9.8 | 72.5 | 85.6 |
71 | resnet_v1_101_tf | 224x224 | 14.4 | 50.4 | 51.7 |
72 | resnet_v1_152_tf | 224x224 | 21.8 | 34.2 | 34.8 |
73 | resnet_v1_50_tf | 224x224 | 7 | 96.9 | 101.6 |
74 | resnet_v2_101_tf | 299x299 | 26.78 | 25.3 | 25.8 |
75 | resnet_v2_152_tf | 299x299 | 40.47 | 17.2 | 17.5 |
76 | resnet_v2_50_tf | 299x299 | 13.1 | 48.5 | 50.4 |
77 | resnet50_pt | 224x224 | 4.1 | 85.7 | 89.4 |
78 | resnet50_tf2 | 224x224 | 7.7 | 87.5 | 91.3 |
79 | SA_gate_base_pt | 360x360 | 178 | 3.6 | 4.7 |
80 | salsanext_pt | 64x2048 | 20.4 | 10.4 | 37.5 |
81 | salsanext_v2_pt | 64x2048 | 32 | 6.5 | 12.1 |
82 | semantic_seg_citys_tf2 | 512x1024 | 54 | 8.1 | 15.4 |
83 | SemanticFPN_cityscapes_pt | 256x512 | 10 | 38.7 | 86.8 |
84 | SemanticFPN_Mobilenetv2_pt | 512x1024 | 5.4 | 11.5 | 32.9 |
85 | SESR_S_pt | 360x640 | 7.48 | 95.8 | 105.5 |
86 | solo_pt | 640x640 | 107 | 1.5 | 4.4 |
87 | squeezenet_pt | 224x224 | 0.82 | 619 | 763.2 |
88 | ssd_inception_v2_coco_tf | 300x300 | 9.6 | 41.8 | 47.5 |
89 | ssd_mobilenet_v1_coco_tf | 300x300 | 2.5 | 121.3 | 171.1 |
90 | ssd_mobilenet_v2_coco_tf | 300x300 | 3.8 | 87.6 | 112.6 |
91 | ssd_resnet_50_fpn_coco_tf | 640x640 | 178.4 | 3 | 5.4 |
92 | ssdlite_mobilenet_v2_coco_tf | 300x300 | 1.5 | 114.8 | 161 |
93 | ssr_pt | 256x256 | 39.72 | 6.5 | 6.5 |
94 | superpoint_tf | 480x640 | 52.4 | 11.5 | 20.6 |
95 | textmountain_pt | 960x960 | 575.2 | 1.8 | 1.9 |
96 | tsd_yolox_pt | 640x640 | 73 | 13.9 | 14.6 |
97 | ultrafast_pt | 288x800 | 8.4 | 38.8 | 42.8 |
98 | unet_chaos-CT_pt | 512x512 | 23.3 | 24.1 | 30.6 |
99 | vehicle_make_resnet18_pt | 224x224 | 3.627 | 227.3 | 249 |
100 | vehicle_type_resnet18_pt | 224x224 | 3.627 | 228 | 249.3 |
101 | vgg_16_tf | 224x224 | 31 | 21.8 | 22 |
102 | vgg_19_tf | 224x224 | 39.3 | 18.8 | 18.9 |
103 | xilinxSR_pt | 360x640x3 | 182.44 | 2.5 | 2.6 |
104 | yolov3_coco_416_tf2 | 416x416 | 65.9 | 14.2 | 14.8 |
105 | yolov3_voc_tf | 416x416 | 65.6 | 14.5 | 14.9 |
106 | yolov4_csp_pt | 640x640 | 121 | 7.9 | 8.4 |
107 | yolov4_leaky_416_tf | 416x416 | 60.3 | 14.4 | 15.4 |
108 | yolov4_leaky_512_tf | 512x512 | 91.2 | 11 | 11.7 |
109 | yolov5_large_pt | 640x640 | 109.6 | 9.3 | 9.9 |
110 | yolov5_nano_pt | 640x640 | 4.6 | 78.3 | 137.8 |
111 | yolov5s6_pt | 640x640 | 17 | 11.4 | 14.9 |
112 | yolov6m_pt | 640x640 | 82.2 | 6.6 | 12.8 |
113 | yolox_nano_pt | 416x416x3 | 1 | 200.8 | 275.3 |
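
The table can be used to derive two quantities per model: the multi-thread speedup and the effective compute throughput relative to the 1.23 TOPS peak. The following is a minimal sketch, assuming the GOPS column gives the workload of one frame in giga-operations (the table itself does not state this). The three rows are copied from the table; the helper names `speedup` and `effective_gops` are hypothetical.

```python
# Peak throughput quoted for one B4096 DPU core, expressed in GOPS.
PEAK_GOPS = 1230.0

# (model, GOPs per frame [assumed meaning of the GOPS column],
#  single-thread fps, multi-thread fps), copied from the table above.
ROWS = [
    ("resnet50_pt", 4.1, 85.7, 89.4),
    ("vgg_16_tf", 31.0, 21.8, 22.0),
    ("movenet_ntd_pt", 0.5, 102.3, 361.9),
]


def speedup(single_fps: float, multi_fps: float) -> float:
    """Ratio of multi-thread to single-thread throughput."""
    return multi_fps / single_fps


def effective_gops(gops_per_frame: float, fps: float) -> float:
    """Operations actually executed per second, in GOPS."""
    return gops_per_frame * fps


if __name__ == "__main__":
    for name, gops, single, multi in ROWS:
        eff = effective_gops(gops, multi)
        print(
            f"{name:16s} speedup={speedup(single, multi):.2f}x  "
            f"effective={eff:.1f} GOPS  "
            f"({100 * eff / PEAK_GOPS:.1f}% of peak)"
        )
```

A speedup close to 1 suggests that a single thread already keeps the DPU busy, while a large speedup may indicate that CPU-side pre- or post-processing limits the single-thread figure. If the computed effective throughput for a row exceeds the peak, the per-frame workload assumption does not hold for that row.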