InternImage Versions Save

[CVPR 2023 Highlight] InternImage: Exploring Large-Scale Vision Foundation Models with Deformable Convolutions

track_model

1 year ago

We train a vectormapnet based on InternImage-Small on the Online HD Map Construction Challenge For Autonomous Driving。

whl_files

1 year ago

seg_models

1 year ago

InternImage for semantic segmentation: ADE20K, Cityscapes.

ADE20K

UperNet + InternImage

backbone resolution mIoU (ss/ms) train speed train time #param FLOPs Config Download
InternImage-T 512x512 47.9 / 48.1 0.23s / iter 10.5h 59M 944G config ckpt | log
InternImage-S 512x512 50.1 / 50.9 0.25s / iter 11.5h 80M 1017G config ckpt | log
InternImage-B 512x512 50.8 / 51.3 0.26s / iter 12h 128M 1185G config ckpt | log
InternImage-L 640x640 53.9 / 54.1 0.42s / iter 19h 256M 2526G config ckpt | log
InternImage-XL 640x640 55.0 / 55.3 0.47s / iter 22h 368M 3142G config ckpt | log

Cityscapes

UperNet + InternImage

backbone resolution mIoU (ss/ms) train speed train time #params FLOPs Config Download
InternImage-T 512x1024 82.58 / 83.40 0.32s / iter 14.5h 59M 1889G config ckpt | log
InternImage-S 512x1024 82.74 / 83.45 0.36s / iter 16.5h 80M 2035G config ckpt | log
InternImage-B 512x1024 83.18 / 83.97 0.39s / iter 17h 128M 2369G config ckpt | log
InternImage-L 512x1024 83.68 / 84.41 0.50s / iter 23h 256M 3234G config ckpt | log
InternImage-XL 512x1024 83.62 / 84.28 0.56s / iter 26h 368M 4022G config ckpt | log

UperNet + InternImage (with additional data)

Mapillary 80k + Cityscapes (w/ coarse data) 160k

backbone resolution mIoU (ss/ms) train speed train time #params FLOPs Config Download
InternImage-L 512x1024 85.94 / 86.22 0.50s / iter 23h 256M 3234G config ckpt | log
InternImage-XL 512x1024 86.20 / 86.42 0.56s / iter 26h 368M 4022G config ckpt | log

SegFormerHead + InternImage (with additional data)

Mapillary 80k + Cityscapes (w/ coarse data) 160k

backbone resolution mIoU (ss/ms) train speed train time #params FLOPs Config Download
InternImage-L 512x1024 85.16 / 85.67 0.37s / iter 17h 220M 1580G config ckpt | log
InternImage-XL 512x1024 85.41 / 85.93 0.43s / iter 19.5h 330M 2364G config ckpt | log

det_model

1 year ago

InternImage for object detection: COCO.

Mask R-CNN + InternImage

backbone schd box mAP mask mAP train speed train time #param FLOPs Config Download
InternImage-T 1x 47.2 42.5 0.36s / iter 9h 49M 270G config ckpt | log
InternImage-T 3x 49.1 43.7 0.34s / iter 26h 49M 270G config ckpt | log
InternImage-S 1x 47.8 43.3 0.40s / iter 10h 69M 340G config ckpt | log
InternImage-S 3x 49.7 44.5 0.40s / iter 30h 69M 340G config ckpt | log
InternImage-B 1x 48.8 44.0 0.45s / iter 11.5h 115M 501G config ckpt | log
InternImage-B 3x 50.3 44.8 0.45s / iter 34h 115M 501G config ckpt | log

Cascade Mask R-CNN + InternImage

backbone schd box mAP mask mAP train speed train time #param FLOPs Config Download
InternImage-L 1x 54.9 47.7 0.73s / iter 18h 277M 1399G config ckpt
InternImage-L 3x 56.1 48.5 0.79s / iter 15h (n4) 277M 1399G config ckpt | log
InternImage-XL 1x 55.3 48.1 0.82s / iter 21h 387M 1782G config ckpt | log
InternImage-XL 3x 56.2 48.8 0.91s / iter 17h (n4) 387M 1782G config ckpt | log

cls_model

1 year ago