- Python 3.7
- PaddlePaddle Daily version
- CUDA 10.1
- cuDNN 7.5
- NCCL 2.4.8
- All models were trained and tested in the COCO17 dataset.
- The codes of YOLOv5,YOLOv6,YOLOv7 and YOLOv8 can be found in PaddleYOLO. Note that the LICENSE of PaddleYOLO is GPL 3.0.
- Unless special instructions, all the ResNet backbone network using ResNet-B structure.
- Inference time (FPS): The reasoning time was calculated on a Tesla V100 GPU by
tools/eval.py
testing all validation sets in FPS (number of pictures/second). CuDNN version is 7.5, including data loading, network forward execution and post-processing, and Batch size is 1.
- We adopt and Detectron in the same training strategy.
- 1x strategy indicates that when the total batch size is 8, the initial learning rate is 0.01, and the learning rate decreases by 10 times after 8 epoch and 11 epoch, respectively, and the final training is 12 epoch.
- 2x strategy is twice as much as strategy 1x, and the learning rate adjustment position of epochs is twice as much as strategy 1x.
Paddle provides a skeleton network pretraining model based on ImageNet. All pre-training models were trained by standard Imagenet 1K dataset. ResNet and MobileNet are high-precision pre-training models obtained by cosine learning rate adjustment strategy or SSLD knowledge distillation training. Model details are available at PaddleClas.
Please refer to Faster R-CNN
Please refer to YOLOv3
Please refer to PP-YOLOE
Please refer to PP-YOLO
Please refer to PicoDet
Please refer to RetinaNet
Please refer to Cascade R-CNN
Please refer to SSD
Please refer to FCOS
Please refer to CenterNet
Please refer to TTFNet
Please refer to Group Normalization
Please refer to Deformable ConvNets v2
Please refer to HRNets
Please refer to Res2Net
Please refer to ConvNeXt
Please refer to GFL
Please refer to TOOD
Please refer to PSS-DET
Please refer to DETR
Please refer to Deformable DETR
Please refer to Sparse R-CNN
Please refer to Vision Transformer
Please refer to DINO
Please refer to YOLOX
Please refer to YOLOF
Please refer to Mask R-CNN
Please refer to Cascade R-CNN
Please refer to SOLOv2
Please refer to QueryInst
Please refer to Model Zoo for PaddleYOLO
Please refer to YOLOv5
Please refer to YOLOv6
Please refer to YOLOv7
Please refer to YOLOv7
Please refer to RTMDet
Please refer to Model Zoo for Face Detection
Please refer to BlazeFace
Please refer to Model Zoo for Rotated Object Detection
Please refer to PP-YOLOE-R
Please refer to FCOSR
Please refer to S2ANet
Please refer to Model Zoo for KeyPoint Detection
Please refer to PP-TinyPose
Please refer to HRNet
Please refer to Lite-HRNet
Please refer to HigherHRNet
Please refer to Model Zoo for Multi-Object Tracking
Please refer to DeepSORT
Please refer to ByteTrack
Please refer to OC-SORT
Please refer to BoT-SORT
Please refer to CenterTrack
Please refer to FairMOT
Please refer to JDE