Bitsets3 #239

wclodius2 · 2020-09-30T02:05:04Z

Branch to implement bitset data types. Committed the following new source files:
src/stdlib_bitsets.f90
src/stdlib_bitset_64.f90
src/stdlib_bitset_large.f90

Added the following new test files:
src/tests/bitsets/CMakeLists.txt
src/tests/bitsets/Makefile.manual
src/tests/bitsets/test_stdlib_bitset_64.f90
src/tests/bitsets/test_stdlib_bitset_large.f90

Added the following new documentation file:
doc/specs/stdlib_bitssets.md

Modified the following compilation files:
src/CMakeLists.txt
src/Makefile.manual
src/tests/CMakeLists.txt
src/tests/Makefile.manual

Added stdlib_bitsets.f90, stdlib_bitset_64.f90, and stdlib_bitset_large.f90 and modified CMakeLists.txt and Makefile.manual so they should compile the files. [ticket: X]

Added tests/bitsets/test_stdlib_bitset*.f90, tests/bitsets/CMakeLists.txt, and tests/bitsets/Makefile.manual and modified tests/CMakeLists.txt and tests/Makefile.manual to compile the test programs. [ticket: X]

Eliminated unused variables in stdlib_bitset_64.f90, stdlib_bitset_large.f90 and rename variables called ablock to block_ in stdlib_bitset_large.f90 [ticket: X]

Added stdlib/doc/specs/stdlib_bitsets.md [ticket: X]

jvdp1

Thank you @wclodius2 for this PR. I will try to review it this weekend or next week.

doc/specs/stdlib_bitsets.md

src/tests/Makefile.manual

14NGiestas

There's a misspelling of the word 'occurred' in this file: "occured" (line 1092)
EDIT: this file

doc/specs/stdlib_bitsets.md

wclodius2 · 2020-10-05T01:40:33Z

Looks good.

…

On Oct 4, 2020, at 11:49 AM, Jeremie Vandenplas ***@***.***> wrote: @jvdp1 commented on this pull request. In src/tests/Makefile.manual <#239 (comment)>: > @@ -10,6 +11,7 @@ all: test: $(MAKE) -f Makefile.manual --directory=ascii test + $(MAKE) -f Makefile.manual --directoru=bitsets test ⬇️ Suggested change - $(MAKE) -f Makefile.manual --directoru=bitsets test + $(MAKE) -f Makefile.manual --directory=bitsets test This should allow the Github actions to be successful — You are receiving this because you were mentioned. Reply to this email directly, view it on GitHub <#239 (review)>, or unsubscribe <https://github.com/notifications/unsubscribe-auth/APTQDOWITSG6CMZ2MNAX2VLSJCYSBANCNFSM4R6QFU5A>.

14NGiestas · 2020-10-07T02:30:47Z

We should really discuss the error handling further in #219 . The 'go to' usage in this function... I'm not a fan xD. I think only one labeled part is needed to achieve the 'return with status' vs 'stop program with error message' desired behavior, with the aid of a error handling module, we could set the message and a flag to switch between the two possible actions, reusing the code. I will bring some working code snippet soon (I hope).
Also, in the assigment functions I wonder if it is possible to use the template system (as in the statistics module)?

wclodius2 · 2020-10-07T15:57:53Z

What is xD? The error handling has several aspects that could be commented on:

Rather than intermingling the error handling with the main logic I jump to error handling at the end of the procedure. This increases the amount of code slightly, and can obscure what the error handling is doing, but I think makes the main logic clearer.
I am handling some "errors" that possibly shouldn't be handled: memory allocation problems and user errors. With virtual memory, its not clear to me that allocation can ever fail, and with lazy allocation on Linux such failure will not occur on the allocate statement.. While not important in the procedure you call out, I also attempt to provide a fallback for some user errors where simple failure might be best.
Since we lack an error handling procedure I am relying explicitly on an if branch with error stop. An error handling procedure could incorporate the if branch and slightly simplify the logic by incorporating both a message and a status argument. There are however a lot of issues to be dealt with in developing an error handling module.
- Should it be a separate module or part of the logger?
- Should it include a set of error codes?
- What should the arguments be to the main error handling procedure?
  - message
  - module and procedure names
  - status/stat flag
  - error code
- How should processing be terminated, an error stop: with an F2018 runtime determined stop code, with an error code determined compile time stop code, generic compile time stop code?
Including the module and procedure names in the error stop makes the stop code longer and harder to read
Making the stop code message a character string parameter would shorten the error handling code, at the expense of cluttering up other portions of the code.

It is of course possible to use fypp preprocessing to generate templates for the assignments to and from logical arrays. The assignment procedures are few enough and simple enough that I thought it best to avoid a dependence on the preprocessor, but that is a matter of taste. The addition of more assignments, say to and from integers would probably change my opinion.

14NGiestas · 2020-10-07T17:43:09Z

What is xD?

ASCII version of 😆smiling face with closed eyes

1-5. Yes, the current way increases the amount of code, however the main logic can be still clear if we do something like that:

    subroutine fail_behavior(..., status)
        ! doing a lot of stuff here
        ! ...
        ! case X: found some fatal error
            error_handler % status = ERROR_CODE_TO_X
            error_handler % message = "Oh no! X happened"
            go to 404
        ! case Y: found other general error
            error_handler % status = ERROR_CODE_TO_Y
            error_handler % message = "Oh no! Y happened"
            go to 404

    404 if (present(status)) then
            status = error_handler % status
            return
        else
            error stop error_handler % message
        end if
    end subroutine

This is a way we can avoid the code duplication, reusing the go to branching. Note that error_handler should be a object that couples the information about what happened (not reflecting in any way the API I got in mind for a final version).
The error handling should be a separate module with some default general purpose errors and a abstract one, that would allow it to be extended when needed by other modules, this way they can provide their own specific codes constants and messages.

It is of course possible to use fypp preprocessing to generate templates for the assignments to and from logical arrays. The assignment procedures are few enough and simple enough that I thought it best to avoid a dependence on the preprocessor, but that is a matter of taste. The addition of more assignments, say to and from integers would probably change my opinion.

IMHO, we should use the template system every time that not using it leads to code duplication, since it gets easier to modify and extend the code in the future. (DRY principle).

wclodius2 · 2020-10-07T18:26:31Z

FWIW for the error handling what I thought you were proposing was something like

subroutine fail_behavior(..., status)
    ! doing a lot of stuff here
    ! ...
    ! case X: found some fatal error
        call error_handler(message="Oh no! X happened", error=error_code_to_x, status=status)
        return
    ! case Y: found other general error
        call error_handler(message="Oh no! Y happened", error=error_code_to_y, status=status)
        return
end subroutine

where error_handler assigns error to status if status is present and stops processing with a message to error_unit otherwise.

I can change the code to use fypp, but it probably requires starting a new branch. I can see how to add or modify files in the current branch, but not how to rename or remove existing files, which I need to change the *.f90 files to *.fypp.

14NGiestas · 2020-10-07T19:42:14Z

FWIW for the error handling what I thought you were proposing was something like (...)
where error_handler assigns error to status if status is present and stops processing with a message to error_unit otherwise.

Indeed, actually this is even better because we don't need retype the branching logic with "go to" statements in every function that may fail and is doable (working example here). To avoid having any boilerplate along the function's main logic, the module that have a specific need should build a function that calls this base function error_handler, Ex:

subroutine value_error(expected, got, status)
     ....
     write(message, '("value_error: Expected ",G0,", but got ",G0," instead.")') expected, got
     call error_handler(message, ...) 
end subroutine

I can change the code to use fypp, but it probably requires starting a new branch. I can see how to add or modify files in the current branch, but not how to rename or remove existing files, which I need to change the *.f90 files to *.fypp.

You can clone your repo here, modify it locally and push the commits (It will probably sync with the open PR)

jvdp1 · 2020-10-07T21:52:16Z

I can see how to add or modify files in the current branch, but not how to rename or remove existing files, which I need to change the *.f90 files to *.fypp.

You can rename and remove a file locally with your OS, and then add/remove, commit, and push with git.

wclodius2 · 2020-10-07T21:57:49Z

When I try

git clone https://github.com/wclodius2/stdlib/tree/bitsets3 my-bitsets

I get the error message

Cloning into 'my-bitsets'...
fatal: repository 'https://github.com/wclodius2/stdlib/tree/bitsets3/' not found

Is there something wrong with my syntax/repository name?

14NGiestas · 2020-10-07T22:46:39Z

This is the wrong URL, please do the following:

git clone https://github.com/wclodius2/stdlib.git my-bitsets
cd my-bitsets
git checkout bitsets3

…set*.f90 Changed makeefiles to preprocess ths stdlib_bitset*.fypp files to stdlib_bitset*.f90 files. [ticket: X]

Renamed files stdlib_bitset*.f90 to fypp preprocessor stdlib_bitset*.fypp files [ticket: X]

Changed stdlib_bitsets.fypp, stdlib_bitset_64.fypp, and stdlib_bitset_large.fypp to generate the assignment procedures of logical arrays to and from bitsets. [ticket: X]

Removed stdlib_bitsets.f90, stdlib_bitset_64.f90, and stdlib_bitset_large.f90 as they are now generated by the preprocessor. [ticket: X]

Defined an error_handler subroutine in stdlib_bitsets.fypp and used it to handle errors in stdlib_bitset_64.fypp and stdlib_bitset_large.fypp. Also was more consistent in documenting status argument results. Added char_string_too_large_error to status results. [ticket: X]

Was more consistent in using bulleted lists in documenting status error codes. Added char_string_too_large_error to the error codes. [ticket: X]

Working with fypp made it easier to add unwanted trailing blanks. I removed them. [ticket: X]

Introduced the parameters max_digits and overflow_bits to be used in checking for overflows on reads and writes. The parameters need to be changed if bits_kind is changed, and preferred parameters for bits_kind==int64 are defined, but commented out. [ticket: X]

Replaced go to 100 with exit in both stdlib_bitsets_64.fypp and stdlib_bitsets_large.fypp. [ticket: X]

src/stdlib_bitsets_large.fypp

jvdp1 · 2020-10-20T18:01:19Z

src/tests/bitsets/test_stdlib_bitset_64.f90

@@ -0,0 +1,744 @@
+program test_stdlib_bitset_64
+    use, intrinsic :: iso_fortran_env, only : int8, int16, int32, int64


Suggested change

use, intrinsic :: iso_fortran_env, only : int8, int16, int32, int64

use stdlib_kinds, only : int8, int16, int32, int64

jvdp1

Here are some additional comments on the tests. All options seem to be covered by the tests. I added some suggestions.
On overall I am pleased with this PR. Thank you @wclodius2!

src/tests/bitsets/test_stdlib_bitset_64.f90

src/tests/bitsets/test_stdlib_bitset_large.f90

wclodius2 · 2020-10-20T19:47:02Z

It means I wrote the code when I was tired and wanted someone else to check my reasoning. It is parsing a decimal representation of the size in bits, and I want to be certain that the size does not exceed what can be represented by an integer of `bits_kind`. That could happen for example if parsing a file with `bits_kind==int32` and the file was written with `bits_kind==int64`. We want to be sure that the representation for `bits_kind==int32` does not exceed `2**31-1`. My reasoning is that it will exceed that value if: 1. it has ten digits and is about to read another digit increasing its value by a factor of ten 2. It has nine digits and has a value greater than `2**31/10` 3. It had a value of `2**31/10` and reading the next digit causes an integer overflow of nine or less resulting in a negative value.

…

On Oct 20, 2020, at 11:46 AM, Jeremie Vandenplas ***@***.***> wrote: @jvdp1 commented on this pull request. In src/stdlib_bitsets_large.fypp <#239 (comment)>: > + end do find_start + + if ( pos > len(string) - 8 ) go to 999 + + if ( string(pos:pos) /= 's' .AND. string(pos:pos) /= 'S' ) go to 999 + + pos = pos + 1 + bits = 0 + digits = 0 + + do + select case( iachar( string(pos:pos) ) ) + case(ia0:ia9) + digits = digits + 1 + if ( digits == max_digits .AND. bits > overflow_bits ) go to 996 +!! May not be quite right ⬇️ Suggested change -!! May not be quite right +!! May not be quite right What does this comment mean? What could be not quite right? — You are receiving this because you were mentioned. Reply to this email directly, view it on GitHub <#239 (review)>, or unsubscribe <https://github.com/notifications/unsubscribe-auth/APTQDOUBL3KX6BCX26LADBTSLXEG5ANCNFSM4R6QFU5A>.

Replaced the use of iso_fortran_env with stdlib_kinds. [ticket: X]

Changed handling of potential integer overflows on reads for bits_kind==int64, changing max_digits from 20 to 19. Removed comment that my treatment may not be quite right. Also fixed a typo in an error message. [ticket: X]

wclodius2 · 2020-10-21T00:17:21Z

I have now resolved my qualms about my handling of potential integer overflows on reads. I have changed max_digits for bits_kind==int64 from 20 to 19 (2**63-1 is about 9.2E18) and convinced myself that the rest of my logic was correct. I have move the comments that the code may not be quite right. (I also fixed a typo in an error message.)

Jeremie suggested numerous changes. I implemented most of them. [ticket: X]

wclodius2 · 2020-10-21T02:20:49Z

@jvdp1 thanks for the thorough review.

At the suggestion of Jeremie I replaced a number of go tos. [ticket: X]

src/tests/bitsets/test_stdlib_bitset_large.f90

Kreplaced go tos at the suggestion of Jeremie. [ticket: X]

Documented the use of the "named" forms, .EQ., .NE., .GT., .GE., .LT., .LE., as alternatives to the symbolic forms, ==, /=, >, >=, <, <= of the comparison operations. [ticket: X]

doc/specs/stdlib_bitsets.md

milancurcic

I have reviewed and made edits to the spec document. I have also built the code run the tests (successfully). For as far as I understand bitsets, I think the API is good.

I did not review the code. Due to the combination of limited time, large size of this PR, and my unfamiliarity with bitsets, I don't think I can do proper code review without putting in substantial effort. But I trust @wclodius2 and @jvdp1 that they did great work.

I recommend this PR to be merged a week from now.

wclodius2 · 2020-11-13T18:52:02Z

A bit set for each index `bit` in the range `0…bits-1` has a value of 0 or 1. A value of 1 for index `bit` can be considered as including the `bit` value in the subset, i.e., the value `1110` can be considered as defining the subset (1, 2, 3). This was how Wirth defined sets in Pascal.

…

On Nov 13, 2020, at 11:42 AM, Milan Curcic ***@***.***> wrote: @milancurcic commented on this pull request. In doc/specs/stdlib_bitsets.md <#239 (comment)>: > +equivalently be considered as a sequence of logical values or as a +subset of the integers 0 ... `bits-1`. The bits are indexed from 0 to or as a subset of the integers 0 ... bits-1. This confuses me. The equivalent integer values are 0 and 1, not 0 through bits-1, correct? Do you mean to say something like this? "It can equivalently be considered as a sequence of logical values or as a sequence of integers 0 and 1 with indices in the range 0... bits-1?" — You are receiving this because you were mentioned. Reply to this email directly, view it on GitHub <#239 (review)>, or unsubscribe <https://github.com/notifications/unsubscribe-auth/APTQDOTR4LOOIPRJ6D2QW63SPV42PANCNFSM4R6QFU5A>.

milancurcic · 2020-11-13T18:59:28Z

Okay, thanks, I understand now. All good.

milancurcic · 2020-11-13T19:02:45Z

Your example "i.e., the value 1110 can be considered as defining the subset (1, 2, 3)" was what made it click for me. Do you agree that we include it in the spec? Something like this:

"It can equivalently be considered as a sequence of logical values or as a subset of the integers 0 ... bits-1. For example, the value 1110 can be considered as defining the subset of integers [1, 2, 3]."

wclodius2 · 2020-11-13T19:04:57Z

Sounds good to me.

milancurcic · 2020-11-22T15:38:48Z

I will go ahead and merge this considering there aren't any objections. Thank you @wclodius2 and all the reviewers.

wclodius2 added 4 commits September 29, 2020 17:05

Added core files for stdlib_bitsets

cd7e19d

Added stdlib_bitsets.f90, stdlib_bitset_64.f90, and stdlib_bitset_large.f90 and modified CMakeLists.txt and Makefile.manual so they should compile the files. [ticket: X]

Prepared for testing of stdlib_bitsets

e35ebc7

Added tests/bitsets/test_stdlib_bitset*.f90, tests/bitsets/CMakeLists.txt, and tests/bitsets/Makefile.manual and modified tests/CMakeLists.txt and tests/Makefile.manual to compile the test programs. [ticket: X]

Eliminated unused variablese

e2f3d66

Eliminated unused variables in stdlib_bitset_64.f90, stdlib_bitset_large.f90 and rename variables called ablock to block_ in stdlib_bitset_large.f90 [ticket: X]

Added documentation for stdlib_bitsets

7d778cd

Added stdlib/doc/specs/stdlib_bitsets.md [ticket: X]

jvdp1 reviewed Oct 2, 2020

View reviewed changes

doc/specs/stdlib_bitsets.md Outdated Show resolved Hide resolved

doc/specs/stdlib_bitsets.md Show resolved Hide resolved

doc/specs/stdlib_bitsets.md Show resolved Hide resolved

doc/specs/stdlib_bitsets.md Show resolved Hide resolved

jvdp1 added 3 commits October 4, 2020 19:35

Update doc/specs/stdlib_bitsets.md

acfa3ac

Update doc/specs/stdlib_bitsets.md

7c5361c

formatting

eb2e5c1

jvdp1 reviewed Oct 4, 2020

View reviewed changes

src/tests/Makefile.manual Outdated Show resolved Hide resolved

Update src/tests/Makefile.manual

e70c909

14NGiestas reviewed Oct 4, 2020

View reviewed changes

jvdp1 reviewed Oct 4, 2020

View reviewed changes

doc/specs/stdlib_bitsets.md Outdated Show resolved Hide resolved

Update doc/specs/stdlib_bitsets.md

8e25812

wclodius2 added 4 commits October 7, 2020 18:22

Changed makefiles to accept stdlib_bitset*.fypp instead of stdlib_bit…

c9e851b

…set*.f90 Changed makeefiles to preprocess ths stdlib_bitset*.fypp files to stdlib_bitset*.f90 files. [ticket: X]

Renamed files stdlib_bitset*.f90 to stdlib_bitset*.fypp

d80e5d9

Renamed files stdlib_bitset*.f90 to fypp preprocessor stdlib_bitset*.fypp files [ticket: X]

Changed preprocessor files to generate logical assignments.

f2d67fc

Changed stdlib_bitsets.fypp, stdlib_bitset_64.fypp, and stdlib_bitset_large.fypp to generate the assignment procedures of logical arrays to and from bitsets. [ticket: X]

Removed files now generated by the preprocessor

2833ffa

Removed stdlib_bitsets.f90, stdlib_bitset_64.f90, and stdlib_bitset_large.f90 as they are now generated by the preprocessor. [ticket: X]

14NGiestas approved these changes Oct 8, 2020

View reviewed changes

wclodius2 added 3 commits October 9, 2020 06:47

Better documented status results

9e9c252

Was more consistent in using bulleted lists in documenting status error codes. Added char_string_too_large_error to the error codes. [ticket: X]

Removed trailing blanks

421c4d2

Working with fypp made it easier to add unwanted trailing blanks. I removed them. [ticket: X]

wclodius2 added 2 commits October 19, 2020 20:08

Replaced go to 100 with exit

9161fc7

Replaced go to 100 with exit in both stdlib_bitsets_64.fypp and stdlib_bitsets_large.fypp. [ticket: X]

jvdp1 reviewed Oct 20, 2020

View reviewed changes

src/stdlib_bitsets_large.fypp Outdated Show resolved Hide resolved

jvdp1 reviewed Oct 20, 2020

View reviewed changes

wclodius2 added 2 commits October 20, 2020 17:40

Changed used modues

3235ab4

Replaced the use of iso_fortran_env with stdlib_kinds. [ticket: X]

Changed handling of potential integer overflows on reads

523dbc6

Changed handling of potential integer overflows on reads for bits_kind==int64, changing max_digits from 20 to 19. Removed comment that my treatment may not be quite right. Also fixed a typo in an error message. [ticket: X]

Numerous changes suggested by Jeremie

20a15e5

Jeremie suggested numerous changes. I implemented most of them. [ticket: X]

Replaced go tos

57faccd

At the suggestion of Jeremie I replaced a number of go tos. [ticket: X]

jvdp1 reviewed Oct 21, 2020

View reviewed changes

src/tests/bitsets/test_stdlib_bitset_large.f90 Outdated Show resolved Hide resolved

wclodius2 and others added 3 commits October 21, 2020 07:26

Replaced go tos

9c03d16

Kreplaced go tos at the suggestion of Jeremie. [ticket: X]

Documented the "named" form for the comparison operations

5c2779d

Documented the use of the "named" forms, .EQ., .NE., .GT., .GE., .LT., .LE., as alternatives to the symbolic forms, ==, /=, >, >=, <, <= of the comparison operations. [ticket: X]

typography fixes

99fa382

milancurcic reviewed Nov 13, 2020

View reviewed changes

doc/specs/stdlib_bitsets.md Outdated Show resolved Hide resolved

milancurcic approved these changes Nov 13, 2020

View reviewed changes

milancurcic and others added 2 commits November 13, 2020 14:10

add example to the first paragraph

acb7cdb

Merge https://github.com/fortran-lang/stdlib into bitsets3

1be7ca3

milancurcic merged commit 37a1ed1 into fortran-lang:master Nov 22, 2020

This was referenced Nov 22, 2020

CI failure for macos-latest, 8 #247

Closed

Revert "Bitsets3" to diagnose CI issues #248

Closed

jvdp1 mentioned this pull request Dec 10, 2020

Failure on macOS in master #264

Closed

14NGiestas mentioned this pull request May 5, 2022

API for a bitset data type #221

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Bitsets3 #239

Bitsets3 #239

wclodius2 commented Sep 30, 2020

jvdp1 left a comment

14NGiestas left a comment •

edited

Loading

wclodius2 commented Oct 5, 2020 via email

14NGiestas commented Oct 7, 2020

wclodius2 commented Oct 7, 2020

14NGiestas commented Oct 7, 2020 •

edited

Loading

wclodius2 commented Oct 7, 2020

14NGiestas commented Oct 7, 2020

jvdp1 commented Oct 7, 2020

wclodius2 commented Oct 7, 2020

14NGiestas commented Oct 7, 2020

jvdp1 Oct 20, 2020

jvdp1 left a comment

wclodius2 commented Oct 20, 2020 via email

wclodius2 commented Oct 21, 2020

wclodius2 commented Oct 21, 2020

milancurcic left a comment

wclodius2 commented Nov 13, 2020 via email

milancurcic commented Nov 13, 2020

milancurcic commented Nov 13, 2020

wclodius2 commented Nov 13, 2020

milancurcic commented Nov 22, 2020

		@@ -0,0 +1,744 @@
		program test_stdlib_bitset_64
		use, intrinsic :: iso_fortran_env, only : int8, int16, int32, int64

	use, intrinsic :: iso_fortran_env, only : int8, int16, int32, int64
	use stdlib_kinds, only : int8, int16, int32, int64

Bitsets3 #239

Bitsets3 #239

Conversation

wclodius2 commented Sep 30, 2020

jvdp1 left a comment

Choose a reason for hiding this comment

14NGiestas left a comment • edited Loading

Choose a reason for hiding this comment

wclodius2 commented Oct 5, 2020 via email

14NGiestas commented Oct 7, 2020

wclodius2 commented Oct 7, 2020

14NGiestas commented Oct 7, 2020 • edited Loading

wclodius2 commented Oct 7, 2020

14NGiestas commented Oct 7, 2020

jvdp1 commented Oct 7, 2020

wclodius2 commented Oct 7, 2020

14NGiestas commented Oct 7, 2020

jvdp1 Oct 20, 2020

Choose a reason for hiding this comment

jvdp1 left a comment

Choose a reason for hiding this comment

wclodius2 commented Oct 20, 2020 via email

wclodius2 commented Oct 21, 2020

wclodius2 commented Oct 21, 2020

milancurcic left a comment

Choose a reason for hiding this comment

wclodius2 commented Nov 13, 2020 via email

milancurcic commented Nov 13, 2020

milancurcic commented Nov 13, 2020

wclodius2 commented Nov 13, 2020

milancurcic commented Nov 22, 2020

14NGiestas left a comment •

edited

Loading

14NGiestas commented Oct 7, 2020 •

edited

Loading