-
Notifications
You must be signed in to change notification settings - Fork 5.7k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
【Hackathon 4 No.9】Add pca_lowrank API to Paddle #53743
Conversation
你的PR提交成功,感谢你对开源项目的贡献! |
❌ The PR is not created using PR's template. You can refer to this Demo. |
我对于paddle的稀疏算子的使用似乎不规范,对于当前sparse::matmul的报错,在 线上的报错为: NotFoundError: The kernel `matmul_coo_dense` is not registered. 线下的报错为:
线下python端的报错更详细一些
我想知道问题在于sparse matmul算子使用还是sparse tensor的创建。 |
Sorry to inform you that 485662d's CIs have passed for more than 7 days. To prevent PR conflicts, you need to re-run all CIs manually. |
这个意思是说矩阵的非零元素的个数比 rows * cols大,rows是矩阵的行数,cols是矩阵的列数。 要不在创建sparse tensor和调用sparse matmul的前后加点信息看看哪里开始有异常? |
这个问题的来源是当前paddle.sparse.sum API在某些情况下出现会出现非零元素数量多于tensor元素数量的情况,我不太确定这是否正常,下面是复现代码 # 原始数据 Paddle
Tensor(shape=[17, 4], dtype=paddle.float64, place=Place(gpu:0), stop_gradient=True,
indices=[[0, 1, 2, 3, 4, 5],
[1, 0, 0, 3, 1, 2]],
values=[-0.35408317, 0.34116612, 0.01369466, -0.04936034, 0.00005647,
-0.00023232]) # 原始数据 Pytorch
tensor(indices=tensor([[0, 1, 2, 3, 4, 5],
[1, 0, 0, 3, 1, 2]]),
values=tensor([-3.5408e-01, 3.4117e-01, 1.3695e-02, -4.9360e-02,
5.6474e-05, -2.3232e-04]),
size=(17, 4), nnz=6, dtype=torch.float64, layout=torch.sparse_coo) paddle.sparse.sum(x, axis=-2) # Paddle结果
Tensor(shape=[4], dtype=paddle.float64, place=Place(gpu:0), stop_gradient=True,
indices=[[1, 0, 0, 3, 1, 2]],
values=[-0.35402670, 0.35486078, 0.35486078, -0.04936034, -0.35402670,
-0.00023232])
torch.sparse.sum(s_t_x, dim=-2) # Pytorch结果
tensor(indices=tensor([[0, 1, 2, 3]]),
values=tensor([ 3.5486e-01, -3.5403e-01, -2.3232e-04, -4.9360e-02]),
size=(4,), nnz=4, dtype=torch.float64, layout=torch.sparse_coo) 因此sum的结果在sparse.matmul计算时会得到
|
@zrr1999 看起来是sparse.sum的问题,能否定位下? |
好的,我去查一下 |
将paddle.sparse.matmul绕过后单测可以通过,能麻烦对代码要更改的点都review一下吗 |
同时需要补充单测过一下coverage流水线 |
是否需要在spare.pca_lowrank代码中添加CUDA11.0的限制 |
不需要。windows流水线重新run一下 |
python/paddle/tensor/linalg.py
Outdated
positive number. The data type of x should be float32 or float64. | ||
q (int, optional): a slightly overestimated rank of :math:`X`. | ||
Default value is :math:`q=min(6,N,M)`. | ||
center (bool, optional): if True, center the input tensor, otherwise, |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
if True, center the input tensor, otherwise,Default value is True.
otherwise后面是不是少了一段话?不太通顺。
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Done
static的CI没通过,看起来是因为sparse API的文档代码跑到了 return _C_ops.sparse_matmul(x, y),而这个必须cuda11.X支持。看起来需要在spare.pca_lowrank代码中添加CUDA11.0的限制来通过static的CI。 |
我在示例代码和计算代码中都添加了CUDA11限制,请问保留哪一处比较合适? |
两处都加限制吧;我看示例代码加了限制,但static的CI还是失败了。是因为你用 os.environ["CUDA_VISIBLE_DEVICES"] = "" 隐藏GPU了,但是此CI的cuda版本大于11.0,所以跑错分支了,可以删除os的限制吧。 |
这里OS的限制是自动加的,好像是通过这种方式限制gpu在cpu模式下运行,源码在
如果使用 我的想法是简单使用注释的方式解释,如下所示,这样CI能通过,用户也能看懂 print("sparse.pca_lowrank API only support CUDA 11.x")
U, S, V = None, None, None
# U, S, V = pca_lowrank(sparse_x) |
好的,麻烦@sunzhongkai588 看看这样写文档代码可以吗? |
print("sparse.pca_lowrank API only support CUDA 11.x")
U, S, V = None, None, None
# U, S, V = pca_lowrank(sparse_x) @Patrick-Star125 注释这行,可以简单写清楚是什么情况下用。 |
简单加了一行解释,static的CI过了 |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM for docs
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM for docs
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Coverage的CI可以豁免,因为sparse的功能需要cuda11.X。
x = paddle.sparse.sparse_coo_tensor( | ||
indices_tensor, values, (rows, columns) | ||
) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
we also need to test the sparse_csr_tensor
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This API don't support sparse_csr_tensor due to sparse.sum
with axis(-2) has not been implemented.
(Unimplemented)
axis
of SumCsrKernel only support None or -1 now.More number will be supported in the future.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Please add a TODO to remind us to complement the test when sparse.sum with axis(-2) is implemented in the future.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Done
python/paddle/sparse/unary.py
Outdated
if x.is_sparse(): | ||
return paddle.sparse.matmul(x, B) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This API requires sparse tensor as input(L1084) , so there is no need for this judgment and encapsulation anymore?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
some parts of code would use paddle.matmul such as L1057 matmul(M, Q_c)
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
if matmul(M, Q_c)
use paddle.matmul
then write it directly, I think it is easier to understand.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Done
if x.is_sparse(): | ||
return paddle.sparse.transpose(x, perm) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This API requires sparse tensor as input(L1084) , so there is no need for this judgment anymore?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
some parts of code would use paddle.matmul such as L1127
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Considering saving calculation code of perm
, this can be preserved
RFC needs to be modified to match the code. e.g. API in RFC is |
rfc modify pr has been submit PaddlePaddle/community#555 |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM
PR types
New features
PR changes
APIs
Description
Add pca_lowrank API to Paddle
Rfc PR: PaddlePaddle/community#474
待完成