Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

cuio: reduce/improve kernel parms: avro #6399

Closed

Conversation

cwharris
Copy link
Contributor

@cwharris cwharris commented Oct 2, 2020

Contributes to #5710 and #5943

@cwharris cwharris requested a review from a team as a code owner October 2, 2020 02:12
@GPUtester
Copy link
Collaborator

Please update the changelog in order to start CI tests.

View the gpuCI docs here.

@codecov
Copy link

codecov bot commented Oct 2, 2020

Codecov Report

Merging #6399 (71f88f0) into branch-0.18 (bd537b6) will increase coverage by 1.23%.
The diff coverage is n/a.

Impacted file tree graph

@@               Coverage Diff               @@
##           branch-0.18    #6399      +/-   ##
===============================================
+ Coverage        81.92%   83.15%   +1.23%     
===============================================
  Files               96       94       -2     
  Lines            16167    14779    -1388     
===============================================
- Hits             13245    12290     -955     
+ Misses            2922     2489     -433     
Impacted Files Coverage Δ
python/cudf/cudf/utils/dtypes.py 86.36% <0.00%> (-2.36%) ⬇️
python/cudf/cudf/utils/utils.py 82.16% <0.00%> (-2.09%) ⬇️
python/cudf/cudf/core/dtypes.py 88.39% <0.00%> (-2.00%) ⬇️
python/cudf/cudf/core/reshape.py 89.06% <0.00%> (-1.98%) ⬇️
python/cudf/cudf/core/dataframe.py 90.12% <0.00%> (-0.89%) ⬇️
python/cudf/cudf/io/csv.py 95.00% <0.00%> (-0.75%) ⬇️
python/cudf/cudf/datasets.py 96.55% <0.00%> (-0.42%) ⬇️
python/cudf/cudf/core/index.py 92.56% <0.00%> (-0.33%) ⬇️
python/cudf/cudf/core/column/string.py 86.35% <0.00%> (-0.30%) ⬇️
python/cudf/cudf/core/groupby/groupby.py 93.18% <0.00%> (-0.23%) ⬇️
... and 58 more

Continue to review full report at Codecov.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update bd537b6...dac77ab. Read the comment docs.

@harrism harrism changed the title [REVIEW] cuio: reduce/impove kernel parms: avro [REVIEW] cuio: reduce/improve kernel parms: avro Oct 2, 2020
@@ -30,7 +30,7 @@ namespace avro {
*
* @returns true if successful, false if error
*/
bool container::parse(file_metadata *md, size_t max_num_rows, size_t first_row)
bool container::parse(file_metadata &md, size_t max_num_rows, size_t first_row)
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

The cudf convention is to take out params by pointer so as to indicate at call site that this will be modified.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Sounds good. I just checked to see if we can turn this in to something along the lines of std::pair<bool, file_metadata> parse(int, int);, as the invocation site suggests it can, but it requires combining the reader impl constructor+read functions to make it pretty. Saving for another time, and reverting to file_metadata *md

cpp/src/io/avro/avro_gpu.cu Outdated Show resolved Hide resolved
cpp/src/io/avro/reader_impl.cu Outdated Show resolved Hide resolved
cpp/src/io/avro/avro_gpu.cu Show resolved Hide resolved
@@ -425,40 +439,26 @@ table_with_metadata reader::impl::read(avro_reader_options const &options, cudaS
dict_pos += len;
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

If global_dictionary.host_ptr() + dict_pos is indeed correct, then you can use the offset argument global_dictionary.host_ptr(dict_pos)

total_dictionary_entries * sizeof(gpu::nvstrdesc_s) + dictionary_data_size);
hostdevice_vector<gpu::nvstrdesc_s> global_dictionary(total_dictionary_entries +
dictionary_data_size);

if (total_dictionary_entries > 0) {
size_t dict_pos = total_dictionary_entries * sizeof(gpu::nvstrdesc_s);
for (size_t i = 0; i < column_types.size(); ++i) {
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

3 lines below, there's reinterpret_cast<gpu::nvstrdesc_s *>(global_dictionary.host_ptr()) which is redundant now.

@cwharris cwharris changed the title [REVIEW] cuio: reduce/improve kernel parms: avro [WIP] cuio: reduce/improve kernel parms: avro Oct 2, 2020
@harrism
Copy link
Member

harrism commented Oct 6, 2020

Move to 0.17

@harrism harrism added 2 - In Progress Currently a work in progress code quality cuIO cuIO issue libcudf Affects libcudf (C++/CUDA) code. labels Oct 6, 2020
@harrism harrism changed the base branch from branch-0.16 to branch-0.17 October 6, 2020 05:01
@harrism
Copy link
Member

harrism commented Nov 23, 2020

@cwharris is this PR still in progress?

@cwharris cwharris closed this Dec 1, 2020
@cwharris
Copy link
Contributor Author

cwharris commented Dec 1, 2020

Closing, as @vuule has picked this up, I believe.

@vuule
Copy link
Contributor

vuule commented Dec 1, 2020

Looks like there has been a mix-up. I haven't picked this up, it's just that some of my PRs included similar parameter changes (to other cuIO components).
AFAIK, this PR was put on hold until Avro test coverage was improved. I believe the test coverage hasn't changed enough to resume development.
My suggestion would be to reopen and move to 0.18. @cwharris thoughts?

@cwharris cwharris reopened this Dec 1, 2020
@cwharris
Copy link
Contributor Author

cwharris commented Dec 1, 2020

@vuule sorry about that. moving to 0.18 seems appropriate.

@harrism harrism changed the base branch from branch-0.17 to branch-0.18 December 2, 2020 09:28
@cwharris cwharris changed the title [WIP] cuio: reduce/improve kernel parms: avro cuio: reduce/improve kernel parms: avro Dec 2, 2020
@cwharris
Copy link
Contributor Author

cwharris commented Feb 2, 2021

Will finish this PR once #7156 is in.

@github-actions
Copy link

This PR has been labeled inactive-30d due to no recent activity in the past 30 days. Please close this PR if it is no longer required. Otherwise, please respond with a comment indicating any updates. This PR will be labeled inactive-90d if there is no activity in the next 60 days.

@vuule
Copy link
Contributor

vuule commented Mar 22, 2021

@cwharris should we close this PR?

@cwharris
Copy link
Contributor Author

@vuule closing.

@cwharris cwharris closed this Apr 14, 2021
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
2 - In Progress Currently a work in progress cuIO cuIO issue libcudf Affects libcudf (C++/CUDA) code.
Projects
None yet
Development

Successfully merging this pull request may close these issues.

5 participants