본문 바로가기

AI/Object Detection

[OD] DBNet

 

GitHub - WenmuZhou/DBNet.pytorch: A pytorch re-implementation of Real-time Scene Text Detection with Differentiable Binarization

A pytorch re-implementation of Real-time Scene Text Detection with Differentiable Binarization - GitHub - WenmuZhou/DBNet.pytorch: A pytorch re-implementation of Real-time Scene Text Detection with...

github.com

 

  • Our model is based on Cascade R-CNN[2] for Object Detection .
  • In detail, we follow the setting get better localization performance through cascading refine boxes.
  • Deformable Convolution(DCN)[4] with bottleneck ratio of 4 is applied both on the layer2,layer3 and layer4 of ResNext-101, and
  • We use data augmentation(RandomFlip, ShiftScaleRotate, Multi-Scale, CenterCrop) and
  • image process algorithm Gaussian noise, RandomBrightnessContrast, Cutout to expand the dataset.
  • After analyzing the dataset and prediction results, we mark the easily confused annotation as is crowd and other classes as is crowd.
  • In this way, we can only do loss without back propagation, so as not to have a negative impact on the model.
  • In addition, we also improved the crop function. If we cut the images directly, it will have a great impact on the boundary, we only keep the box witch IOU greater than 0.8 with original box.

'AI > Object Detection' 카테고리의 다른 글

[OD] coco2017 dataset 다운로드  (0) 2023.06.07
[OD] Mask R-CNN  (0) 2021.07.02
[OD] YOLO v3  (0) 2021.07.02
[OD] YOLO v2  (0) 2021.07.02
[OD] FPN: Feature Pyramid Network  (0) 2021.07.02