Applying the jetson fixes #847

ivansmith7795 · 2023-10-27T21:52:50Z

No description provided.

ivansmith7795 · 2023-10-27T21:53:14Z

Merging changes from jetson branch

TimDettmers · 2024-01-01T16:05:22Z

Did this work out for you? It seems like a straightforward fix, and it would be a good contribution if it makes the library Jetson-compatible.

Titus-von-Koeller · 2024-01-30T00:09:15Z

@rickardp @younesbelkada

Do you have opinions on this PR? Could one of you two do the review?

rickardp · 2024-02-06T19:47:45Z

Makefile

@@ -41,14 +41,15 @@ CC_KEPLER += -gencode arch=compute_37,code=sm_37 # Kepler
 CC_CUDA11x := -gencode arch=compute_75,code=sm_75
 CC_CUDA11x += -gencode arch=compute_80,code=sm_80
 CC_CUDA11x += -gencode arch=compute_86,code=sm_86
-
+CC_CUDA11x += -gencode arch=compute_87,code=sm_87


Can we confirm that the CMake file works with the Jetson devices? It compiles, but I do not have a device to test with.

Wheels can be taken from the latest build here:
https://github.com/TimDettmers/bitsandbytes/actions/workflows/python-package.yml
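For context on the hunk above: compute capability 8.7 is the one used by the Jetson Orin family, which is why the change adds an sm_87 entry next to sm_80 and sm_86. The sketch below only illustrates how a compute capability maps to the `-gencode` flag text seen in the Makefile. The helper `gencode_flag` is hypothetical and is not part of bitsandbytes or its build system.

```cpp
#include <string>

// Hypothetical helper (not part of bitsandbytes): build the nvcc -gencode
// flag for a compute capability given as (major, minor), e.g. (8, 7).
std::string gencode_flag(int major, int minor) {
    const std::string cc = std::to_string(major) + std::to_string(minor);
    return "-gencode arch=compute_" + cc + ",code=sm_" + cc;
}
```

Calling `gencode_flag(8, 7)` yields the same text as the line added to `CC_CUDA11x` in the diff.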

rickardp · 2024-02-06T19:49:54Z

csrc/kernels.cu

@@ -2409,7 +2409,7 @@ template <int ITEMS_PER_THREAD, int SUBTILE_ROWS, int THREADS>__global__ void kd
 }


-template <int THREADS, int ITEMS_PER_THREAD, int TILE_ROWS, int TILE_COLS, int SPARSE_DECOMP> __global__ void kDoubleRowColQuant(half *__restrict__ const A, float *__restrict__ const rowStats, float * __restrict__ const colStats, char *out_col_normed, char *out_row_normed, int *rowidx, int *colidx, half *val, int * __restrict__ nnz_block_ptr, float threshold, int rows, int cols, int tiledCols)
+template <int THREADS, int ITEMS_PER_THREAD, int TILE_ROWS, int TILE_COLS, int SPARSE_DECOMP> __global__ void kDoubleRowColQuant(half *__restrict__ const A, float *__restrict__ const rowStats, float * __restrict__ const colStats, int8_t *out_col_normed, int8_t *out_row_normed, int *rowidx, int *colidx, half *val, int * __restrict__ nnz_block_ptr, float threshold, int rows, int cols, int tiledCols)


Not sure why this is needed, but as long as it compiles on all platforms (looking at you, MSVC :) ), I don't see a problem with the change either. IIRC, int8_t is exactly 8 bits, while char is at least 8 bits.
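One plausible motivation for the change (an assumption; the PR itself does not state it) is signedness rather than width. Whether plain `char` is signed is implementation-defined: it is typically signed on x86-64 and typically unsigned on AArch64 Linux, the platform Jetson boards run, whereas `int8_t` is signed everywhere it exists. The sketch below uses hypothetical helper names to show how a negative quantized value survives a round trip through each type.

```cpp
#include <cstdint>
#include <type_traits>

// True where plain char is signed (typical x86-64 ABIs),
// false where it is unsigned (typical AArch64 Linux ABIs).
constexpr bool kPlainCharIsSigned = std::is_signed<char>::value;

// Hypothetical helper: store a small integer in an int8_t and read it back.
// The sign is preserved on every target.
inline int roundtrip_int8(int q) {
    const int8_t stored = static_cast<int8_t>(q);
    return static_cast<int>(stored);
}

// Hypothetical helper: the same round trip through plain char.
// A negative input comes back negative only where char is signed.
inline int roundtrip_char(int q) {
    const char stored = static_cast<char>(q);
    return static_cast<int>(stored);
}
```

With 8-bit bytes, `roundtrip_int8(-5)` returns -5 on any target, while `roundtrip_char(-5)` returns -5 where `char` is signed and 251 where it is unsigned.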

rickardp · 2024-02-06T19:50:59Z

include/SIMD.h

@@ -28,6 +28,9 @@ FORCE_INLINE int popcnt32(int x32)

 #if defined(USE_AVX) || defined(USE_AVX2)
 #include <immintrin.h>
+#elif defined __aarch64__
+#warning "--- THIS IS AARCH64"
+#include <sse2neon.h>


We are going to need to support Neon one way or the other. I am wondering whether this is the right approach, though, or whether we should implement the Neon intrinsics directly. If it saves us time in the short run, it may be a viable option.
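To make the trade-off concrete: sse2neon.h lets existing SSE calls compile on AArch64 by mapping them onto Neon, while the direct approach adds a separate Neon branch per routine. The sketch below shows the direct style for a toy four-float addition. The function `add4` is hypothetical and is not taken from SIMD.h, and the scalar branch is a fallback for targets with neither instruction set.

```cpp
#include <cassert>

#if defined(__aarch64__)
#include <arm_neon.h>   // native Neon intrinsics
#elif defined(__SSE2__)
#include <emmintrin.h>  // SSE2 (pulls in the SSE float intrinsics too)
#endif

// Hypothetical example: out[i] = a[i] + b[i] for i in 0..3.
void add4(const float* a, const float* b, float* out) {
    assert(a && b && out);
#if defined(__aarch64__)
    // Direct Neon: load, add, and store four floats.
    vst1q_f32(out, vaddq_f32(vld1q_f32(a), vld1q_f32(b)));
#elif defined(__SSE2__)
    // SSE: unaligned load, add, and store four floats.
    _mm_storeu_ps(out, _mm_add_ps(_mm_loadu_ps(a), _mm_loadu_ps(b)));
#else
    // Portable scalar fallback.
    for (int i = 0; i < 4; ++i) out[i] = a[i] + b[i];
#endif
}
```

The cost of this style is one extra branch to write and test per routine, which is the short-term effort the sse2neon include avoids.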

rickardp · 2024-02-06T19:52:07Z

setup.py

@@ -1,3 +1,4 @@
+#!/usr/bin/python3


If we want this, #!/usr/bin/env python3 is more portable.

Also, the file is not executable. It needs chmod 755, with the mode change committed, for the shebang to make sense.

Applying the jetson fixes

b3066b1

ivansmith7795 closed this Oct 27, 2023

TimDettmers reopened this Jan 1, 2024

TimDettmers added the labels "high priority" (first issues that will be worked on) and "Low Risk" (risk of bugs in transformers and other libraries) Jan 1, 2024

rickardp mentioned this pull request Jan 2, 2024

Make native code portable and add GitHub workflow for building #949

Merged

rickardp reviewed Feb 6, 2024

Titus-von-Koeller force-pushed the main branch 2 times, most recently from 9b72679 to 7800734 Compare July 27, 2024 13:16
