[OD] DBNet

GitHub - WenmuZhou/DBNet.pytorch: A pytorch re-implementation of Real-time Scene Text Detection with Differentiable Binarization

github.com

Our model is based on Cascade R-CNN [2] for object detection.
Specifically, we follow its setting of refining boxes through cascaded stages to get better localization performance.
Deformable Convolution (DCN) [4] with a bottleneck ratio of 4 is applied to layer2, layer3, and layer4 of ResNeXt-101.
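
As a rough illustration of which backbone stages receive DCN, the sketch below builds a config fragment. It assumes an MMDetection-style backbone config, where the keys `dcn` and `stage_with_dcn` are used; the source does not say which framework was used, and the helper `stages_with_dcn` is a hypothetical name introduced only for this example. The bottleneck ratio of 4 mentioned above is not represented here.

```python
def stages_with_dcn(layers, num_stages=4):
    """Map names like 'layer2' to one boolean per backbone stage.

    Hypothetical helper: 'layer1'..'layer4' follow the torchvision naming,
    and stage i in the tuple corresponds to 'layer{i+1}'.
    """
    enabled = {int(name.replace("layer", "")) for name in layers}
    return tuple(i in enabled for i in range(1, num_stages + 1))


# Assumed MMDetection-style fragment; only the DCN-related keys are shown.
backbone_dcn = dict(
    dcn=dict(type="DCN", deform_groups=1, fallback_on_stride=False),
    stage_with_dcn=stages_with_dcn(["layer2", "layer3", "layer4"]),
)

print(backbone_dcn["stage_with_dcn"])
```

In such a config, the fragment would be merged into the backbone dictionary alongside the usual ResNeXt settings, which are left out here because the source does not state them.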
We use data augmentation (RandomFlip, ShiftScaleRotate, Multi-Scale, CenterCrop) and
image processing operations (Gaussian noise, RandomBrightnessContrast, Cutout) to expand the dataset.
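
For detection, geometric augmentations have to transform the boxes together with the pixels. The minimal sketch below shows only the box side of a random horizontal flip, assuming boxes are `(x1, y1, x2, y2)` tuples in pixel coordinates; the function names are made up for this example, and the image flip itself is omitted.

```python
import random


def hflip_boxes(boxes, image_width):
    """Mirror (x1, y1, x2, y2) boxes around the vertical center line."""
    return [(image_width - x2, y1, image_width - x1, y2)
            for x1, y1, x2, y2 in boxes]


def random_flip(boxes, image_width, p=0.5, rng=random):
    """Flip the boxes with probability p; also report whether a flip happened."""
    if rng.random() < p:
        return hflip_boxes(boxes, image_width), True
    return list(boxes), False


flipped, did_flip = random_flip([(10, 20, 30, 40)], image_width=100, p=1.0)
print(flipped, did_flip)
```

The returned flag lets the caller apply the same flip to the image so that pixels and boxes stay aligned.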
After analyzing the dataset and the prediction results, we mark easily confused annotations, as well as annotations of other classes, as iscrowd.
These annotations are then left out of back-propagation, so they do not have a negative impact on the model.
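
The relabeling step can be sketched as follows, assuming COCO-style annotation dictionaries with `id`, `category_id`, and `iscrowd` fields. The function names, the list of confusing annotation ids, and the set of kept category ids are hypothetical inputs for illustration; how the ignored annotations are handled inside the loss depends on the training framework and is not shown.

```python
def mark_as_crowd(annotations, confusing_ids, kept_category_ids):
    """Return copies of the annotations with iscrowd=1 where they should be ignored.

    An annotation is marked if its id is listed as confusing or if its
    category is not among the categories being trained on.
    """
    confusing_ids = set(confusing_ids)
    kept_category_ids = set(kept_category_ids)
    marked = []
    for ann in annotations:
        ann = dict(ann)  # copy so the input list is left unchanged
        if ann["id"] in confusing_ids or ann["category_id"] not in kept_category_ids:
            ann["iscrowd"] = 1
        marked.append(ann)
    return marked


def split_targets(annotations):
    """Separate regular training targets from annotations flagged as iscrowd."""
    targets = [a for a in annotations if not a.get("iscrowd", 0)]
    ignored = [a for a in annotations if a.get("iscrowd", 0)]
    return targets, ignored


anns = [
    {"id": 1, "category_id": 1, "iscrowd": 0},
    {"id": 2, "category_id": 1, "iscrowd": 0},  # assumed to be a confusing label
    {"id": 3, "category_id": 9, "iscrowd": 0},  # assumed to be an "other" class
]
targets, ignored = split_targets(mark_as_crowd(anns, [2], {1}))
print(len(targets), len(ignored))
```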
In addition, we improved the crop function. Cropping images directly has a large impact on boxes at the crop boundary, so we keep only the boxes whose IoU with the original box is greater than 0.8.
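
A minimal sketch of this crop rule, assuming boxes and the crop window are `(x1, y1, x2, y2)` tuples in pixel coordinates. The function names are hypothetical, and this is one way to read the rule, not the authors' implementation. Because a clipped box always lies inside its original box, the IoU between the two reduces to the ratio of their areas.

```python
def box_area(box):
    """Area of an (x1, y1, x2, y2) box; degenerate boxes have area 0."""
    x1, y1, x2, y2 = box
    return max(0.0, x2 - x1) * max(0.0, y2 - y1)


def crop_boxes(boxes, crop, min_iou=0.8):
    """Clip boxes to the crop window and keep those still close to the original.

    Kept boxes are returned in the coordinate frame of the cropped image.
    """
    cx1, cy1, cx2, cy2 = crop
    kept = []
    for box in boxes:
        x1, y1, x2, y2 = box
        clipped = (max(x1, cx1), max(y1, cy1), min(x2, cx2), min(y2, cy2))
        original_area = box_area(box)
        if original_area <= 0:
            continue
        # clipped is contained in box, so IoU = area(clipped) / area(box)
        iou = box_area(clipped) / original_area
        if iou > min_iou:
            kept.append((clipped[0] - cx1, clipped[1] - cy1,
                         clipped[2] - cx1, clipped[3] - cy1))
    return kept


boxes = [(10, 10, 50, 50), (90, 10, 130, 50), (20, 20, 60, 110)]
print(crop_boxes(boxes, crop=(0, 0, 100, 100)))
```

In this example the second box loses three quarters of its area at the right edge and is dropped, while the third box keeps about 89% of its area and survives in clipped form.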
