Add model ScaledYOLOv4 Support #34

ziqi-jin · 2022-07-22T06:16:41Z

add model ScaledYOLOv4 Support
fixed the boundary problem in ScaledYOLOv4, YOLOv7 and YOLOR
optimized the normalization function in ScaledYOLOv4, YOLOv7 and YOLOR

* Fix compile problem in different python version (#26) * fix some usage problem in linux * Fix compile problem Co-authored-by: root <[email protected]> * Add PaddleDetetion/PPYOLOE model support (#22) * add ppdet/ppyoloe * Add demo code and documents * add convert processor to vision (#27) * update .gitignore * Added checking for cmake include dir * fixed missing trt_backend option bug when init from trt * remove un-need data layout and add pre-check for dtype * changed RGB2BRG to BGR2RGB in ppcls model * add model_zoo yolov6 c++/python demo * fixed CMakeLists.txt typos * update yolov6 cpp/README.md * add yolox c++/pybind and model_zoo demo * move some helpers to private * fixed CMakeLists.txt typos * add normalize with alpha and beta * add version notes for yolov5/yolov6/yolox * add copyright to yolov5.cc * revert normalize * fixed some bugs in yolox * fixed examples/CMakeLists.txt to avoid conflicts * add convert processor to vision * format examples/CMakeLists summary * Fix bug while the inference result is empty with YOLOv5 (#29) * Add multi-label function for yolov5 * Update README.md Update doc * Update fastdeploy_runtime.cc fix variable option.trt_max_shape wrong name * Update runtime_option.md Update resnet model dynamic shape setting name from images to x * Fix bug when inference result boxes are empty * Delete detection.py Co-authored-by: Jason <[email protected]> Co-authored-by: root <[email protected]> Co-authored-by: DefTruth <[email protected]> Co-authored-by: huangjianhui <[email protected]>

* Develop (#11) (#12) * Fix compile problem in different python version (#26) * fix some usage problem in linux * Fix compile problem Co-authored-by: root <[email protected]> * Add PaddleDetetion/PPYOLOE model support (#22) * add ppdet/ppyoloe * Add demo code and documents * add convert processor to vision (#27) * update .gitignore * Added checking for cmake include dir * fixed missing trt_backend option bug when init from trt * remove un-need data layout and add pre-check for dtype * changed RGB2BRG to BGR2RGB in ppcls model * add model_zoo yolov6 c++/python demo * fixed CMakeLists.txt typos * update yolov6 cpp/README.md * add yolox c++/pybind and model_zoo demo * move some helpers to private * fixed CMakeLists.txt typos * add normalize with alpha and beta * add version notes for yolov5/yolov6/yolox * add copyright to yolov5.cc * revert normalize * fixed some bugs in yolox * fixed examples/CMakeLists.txt to avoid conflicts * add convert processor to vision * format examples/CMakeLists summary * Fix bug while the inference result is empty with YOLOv5 (#29) * Add multi-label function for yolov5 * Update README.md Update doc * Update fastdeploy_runtime.cc fix variable option.trt_max_shape wrong name * Update runtime_option.md Update resnet model dynamic shape setting name from images to x * Fix bug when inference result boxes are empty * Delete detection.py Co-authored-by: Jason <[email protected]> Co-authored-by: root <[email protected]> Co-authored-by: DefTruth <[email protected]> Co-authored-by: huangjianhui <[email protected]> Co-authored-by: Jason <[email protected]> Co-authored-by: root <[email protected]> Co-authored-by: DefTruth <[email protected]> Co-authored-by: huangjianhui <[email protected]> * Develop (#13) * Fix compile problem in different python version (#26) * fix some usage problem in linux * Fix compile problem Co-authored-by: root <[email protected]> * Add PaddleDetetion/PPYOLOE model support (#22) * add ppdet/ppyoloe * Add demo code and documents * add convert processor to vision (#27) * update .gitignore * Added checking for cmake include dir * fixed missing trt_backend option bug when init from trt * remove un-need data layout and add pre-check for dtype * changed RGB2BRG to BGR2RGB in ppcls model * add model_zoo yolov6 c++/python demo * fixed CMakeLists.txt typos * update yolov6 cpp/README.md * add yolox c++/pybind and model_zoo demo * move some helpers to private * fixed CMakeLists.txt typos * add normalize with alpha and beta * add version notes for yolov5/yolov6/yolox * add copyright to yolov5.cc * revert normalize * fixed some bugs in yolox * fixed examples/CMakeLists.txt to avoid conflicts * add convert processor to vision * format examples/CMakeLists summary * Fix bug while the inference result is empty with YOLOv5 (#29) * Add multi-label function for yolov5 * Update README.md Update doc * Update fastdeploy_runtime.cc fix variable option.trt_max_shape wrong name * Update runtime_option.md Update resnet model dynamic shape setting name from images to x * Fix bug when inference result boxes are empty * Delete detection.py Co-authored-by: Jason <[email protected]> Co-authored-by: root <[email protected]> Co-authored-by: DefTruth <[email protected]> Co-authored-by: huangjianhui <[email protected]> * documents * documents * documents * documents * documents * documents * documents * documents * documents * documents * documents * documents * Develop (#14) * Fix compile problem in different python version (#26) * fix some usage problem in linux * Fix compile problem Co-authored-by: root <[email protected]> * Add PaddleDetetion/PPYOLOE model support (#22) * add ppdet/ppyoloe * Add demo code and documents * add convert processor to vision (#27) * update .gitignore * Added checking for cmake include dir * fixed missing trt_backend option bug when init from trt * remove un-need data layout and add pre-check for dtype * changed RGB2BRG to BGR2RGB in ppcls model * add model_zoo yolov6 c++/python demo * fixed CMakeLists.txt typos * update yolov6 cpp/README.md * add yolox c++/pybind and model_zoo demo * move some helpers to private * fixed CMakeLists.txt typos * add normalize with alpha and beta * add version notes for yolov5/yolov6/yolox * add copyright to yolov5.cc * revert normalize * fixed some bugs in yolox * fixed examples/CMakeLists.txt to avoid conflicts * add convert processor to vision * format examples/CMakeLists summary * Fix bug while the inference result is empty with YOLOv5 (#29) * Add multi-label function for yolov5 * Update README.md Update doc * Update fastdeploy_runtime.cc fix variable option.trt_max_shape wrong name * Update runtime_option.md Update resnet model dynamic shape setting name from images to x * Fix bug when inference result boxes are empty * Delete detection.py Co-authored-by: root <[email protected]> Co-authored-by: DefTruth <[email protected]> Co-authored-by: huangjianhui <[email protected]> Co-authored-by: Jason <[email protected]> Co-authored-by: root <[email protected]> Co-authored-by: DefTruth <[email protected]> Co-authored-by: huangjianhui <[email protected]> Co-authored-by: Jason <[email protected]>

DefTruth

fastdeploy/vision/wongkinyiu/scaledyolov4.cc 中 LetterBox 处理

Resize::Run(mat, resize_w, resize_h);

在resize前加一个判断，只在宽高不相同时进行resize

if ((mat->Height() != resize_h) || (mat->Width() != resize_w)) {
  Resize::Run(mat, resize_w, resize_h);
}

其他几个模型yolor、yolov7对应的地方也修改一下。因为在预处理中已经先做了一次resize，这次resize其实已经resize到目标维度size了，这里的mat的维度应该是和目标size一致的，所以LetterBox里面的这个resize可以做一次维度判断，避免多做一次冗余的resize

DefTruth

fastdeploy/vision/wongkinyiu/scaledyolov4.cc 中 Preprocess 处理

double ratio = (size[0] * 1.0) / std::max(static_cast<float>(mat->Height()),
                                          static_cast<float>(mat->Width()));

这里只用了size[0]，也就是只支持方形输入，不支持矩形输入，但是我们无法判断用户是不是只用方形输入，比如[640,1280]、[320,640]等这种推理的输入size也是很常见的。所以这里最好修改成既支持方形也支持矩形。比如：

float ratio = std::min(size[1] * 1.0f / static_cast<float>(mat->Height()), 
                       size[0] * 1.0f / static_cast<float>(mat->Width()));

其他几个模型yolor、yolov7对应的地方也修改一下。

DefTruth

在 fastdeploy/vision/wongkinyiu/scaledyolov4.cc 中 Postprocess 的 pad处理逻辑有遗漏：

float scale = std::min(out_h / ipt_h, out_w / ipt_w);
for (size_t i = 0; i < result->boxes.size(); ++i) {
    float pad_h = (out_h - ipt_h * scale) / 2;
    float pad_w = (out_w - ipt_w * scale) / 2;
    // ...
}

这里的pad计算逻辑没有完全和LetterBox中的逻辑对上，因为LetterBox中有2中pad逻辑，一种是is_mini_pad=false(默认的，也就是_auto=false)，另一种是is_mini_pad=true(也就是_auto=true)，这两种模式下pad的计算方式是不一样的，所以在decode的时候也要做不一样的处理，不能只做is_mini_pad=false的处理。可参考以下逻辑（可以把pad计算提到循环外面）:

float scale = std::min(out_h / ipt_h, out_w / ipt_w);
float pad_h = (out_h - ipt_h * scale) / 2.0f;
float pad_w = (out_w - ipt_w * scale) / 2.0f;
if (is_mini_pad) {
  // 和 LetterBox中_auto=true的处理逻辑对应 
  pad_h = static_cast<float>(static_cast<int>(pad_h) % stride);
  pad_w = static_cast<float>(static_cast<int>(pad_w) % stride);
}
for (size_t i = 0; i < result->boxes.size(); ++i) {
  // ... decode 逻辑
}

yolor、yolov7中的相关处理也可以修改一下。

DefTruth

在 model_zoo/vision/scaledyolov4/api.md 中函数签名中的参数没有对齐

Predict函数

ScaledYOLOv4::Predict(cv::Mat* im, DetectionResult* result,
                float conf_threshold = 0.25,
                float nms_iou_threshold = 0.5)

建议对齐一下

ziqi-jin added 30 commits July 13, 2022 13:55

first commit for yolov7

1684b05

pybind for yolov7

71c00d9

CPP README.md

21ab2f9

CPP README.md

d63e862

modified yolov7.cc

7b3b0e2

README.md

d039e80

python file modify

a34a815

merge test

eb010a8

delete license in fastdeploy/

39f64f2

repush the conflict part

d071b37

README.md modified

d5026ca

README.md modified

fb376ad

file path modified

4b8737c

file path modified

ce922a0

file path modified

6e00b82

file path modified

8c359fb

file path modified

906c730

README modified

80c1223

README modified

6072757

move some helpers to private

2c6e6a4

add examples for yolov7

48136f0

api.md modified

6feca92

api.md modified

ae70d4f

api.md modified

f591b85

YOLOv7

f0def41

yolov7 release link

15b9160

yolov7 release link

4706e8c

yolov7 release link

dc83584

copyright

086debd

change some helpers to private

4f980b9

ziqi-jin and others added 16 commits July 19, 2022 11:57

Merge branch 'PaddlePaddle:develop' into develop

8103772

gitignore

f5f7a86

Transfer some funtions to private member of class

e6cec25

Transfer some funtions to private member of class

e25e4f2

first commit for yolor

a182893

for merge

3aa015f

Merge branch 'yolor' into develop

871cfc6

Merge branch 'PaddlePaddle:develop' into develop

7a5a6d9

Merge branch 'PaddlePaddle:develop' into develop

c996117

first commit for scaled_yolov4

af1b19f

commit for documents

e2c6360

change py name

dcf0855

accelerate the normalize

5e0a867

DefTruth requested changes Jul 24, 2022

View reviewed changes

DefTruth reviewed Jul 24, 2022

View reviewed changes

This comment was marked as duplicate.

Sign in to view

ziqi-jin closed this Jul 24, 2022

ziqi-jin reopened this Jul 24, 2022

code fixed by the commets above

932c2dd

DefTruth approved these changes Jul 25, 2022

View reviewed changes

jiangjiajun approved these changes Jul 25, 2022

View reviewed changes

jiangjiajun merged commit 36fc77e into PaddlePaddle:develop Jul 25, 2022

ziqi-jin deleted the sc_yolov4 branch August 10, 2022 06:22

chriswack mentioned this pull request Oct 17, 2022

OpenCV(4.3.0) Error: Assertion failed (inv_scale_x > 0) in resize #385

Closed

marsbzp mentioned this pull request Jan 14, 2023

C++使用fastdeploy多线程Paddle Inference后端推理OCR模型出现崩溃 #1143

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add model ScaledYOLOv4 Support #34

Add model ScaledYOLOv4 Support #34

ziqi-jin commented Jul 22, 2022

DefTruth left a comment

DefTruth left a comment

DefTruth left a comment

DefTruth left a comment

This comment was marked as duplicate.

Add model ScaledYOLOv4 Support #34

Add model ScaledYOLOv4 Support #34

Conversation

ziqi-jin commented Jul 22, 2022

DefTruth left a comment

Choose a reason for hiding this comment

DefTruth left a comment

Choose a reason for hiding this comment

DefTruth left a comment

Choose a reason for hiding this comment

DefTruth left a comment

Choose a reason for hiding this comment

Predict函数

This comment was marked as duplicate.