[BUG]libop_abi.so undefined symbol #557

Closed
yhliu918 opened this issue Apr 23, 2021 · 10 comments · Fixed by #797

@yhliu918
Contributor

Hi all, I cloned the latest version of deepmd-kit and tried both the master and devel branches.
My environment is tensorflow-gpu 2.4.1 (installed from pip) with Python 3.6, gcc/g++ 7.3.1, and CUDA 10.1.
When I run 'pip install .', I get the error message "libop_abi.so: undefined symbol: _ZN10tensorflow8OpKernel11TraceStringEPNS_15OpKernelContextEb"
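
For readers decoding the error: the undefined symbol is a mangled C++ name, and its leading part can be read by hand. The snippet below is a minimal sketch that extracts only the qualified name from simple `_ZN...E` symbols; `qualified_name` is a hypothetical helper written for this thread, not part of deepmd-kit or TensorFlow, and it ignores the argument types. A full demangler such as binutils `c++filt` also prints the signature.

```python
import re


def qualified_name(mangled: str) -> str:
    """Return the nested C++ name encoded at the start of an Itanium-ABI symbol.

    Only the simple form _ZN<len><id><len><id>...E is handled; templates,
    substitutions and argument types are deliberately not decoded.
    """
    if not mangled.startswith("_ZN"):
        raise ValueError("only simple nested names (_ZN...E) are handled")
    pos = 3
    parts = []
    # Each component is a decimal length followed by that many characters.
    while pos < len(mangled) and mangled[pos].isdigit():
        length_match = re.match(r"\d+", mangled[pos:])
        length = int(length_match.group())
        start = pos + length_match.end()
        parts.append(mangled[start:start + length])
        pos = start + length
    return "::".join(parts)


symbol = "_ZN10tensorflow8OpKernel11TraceStringEPNS_15OpKernelContextEb"
print(qualified_name(symbol))  # tensorflow::OpKernel::TraceString
```

In other words, libop_abi.so expects a `tensorflow::OpKernel::TraceString` function that the TensorFlow library loaded at runtime does not export.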

@njzjz
Member

njzjz commented Apr 23, 2021

I cannot reproduce the error in a fresh Google Colaboratory environment: https://gist.github.com/b9b7c5c3b002e3e1b6e853c5b0e7317b

@njzjz
Member

njzjz commented Apr 23, 2021

Can you post the output of tensorflow.version.COMPILER_VERSION and tensorflow.version.GIT_VERSION?
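
A minimal sketch of how those values could be collected in one go. `describe_tf_build` is a hypothetical helper, not part of deepmd-kit; it only reads `tf.version.VERSION`, `tf.version.GIT_VERSION`, `tf.version.COMPILER_VERSION` and `tf.sysconfig.CXX11_ABI_FLAG`. The ABI flag was not requested above; it is included as an assumption that it is relevant, since the later fix mentions TF_CXX11_ABI_FLAG.

```python
def describe_tf_build(tf):
    """Collect build facts about a TensorFlow module that matter for custom ops.

    `tf` is the imported tensorflow module; it is passed in as an argument so
    the helper itself does not import TensorFlow.
    """
    return {
        "version": tf.version.VERSION,
        "git_version": tf.version.GIT_VERSION,
        "compiler_version": tf.version.COMPILER_VERSION,
        # 0 or 1: the _GLIBCXX_USE_CXX11_ABI value TensorFlow was built with.
        "cxx11_abi_flag": tf.sysconfig.CXX11_ABI_FLAG,
    }


# Usage, in the environment where the error occurs (requires TensorFlow):
#   import tensorflow as tf
#   for key, value in describe_tf_build(tf).items():
#       print(f"{key}: {value}")
```

Running this in the same Python interpreter that fails to load libop_abi.so is the important part, since a machine can have several TensorFlow installations.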

@yhliu918
Contributor Author

Hi, my TF version is tensorflow-gpu 2.4.1. I think you didn't use the GPU version in the link you posted.

@AnguseZhang
Collaborator

tensorflow.version.COMPILER_VERSION
'7.3.1 20180303'
tensorflow.version.GIT_VERSION
'v2.4.0-49-g85c8b2a817f'

@njzjz
Member

njzjz commented Apr 23, 2021

It cannot be reproduced with tensorflow-gpu either. Note that the tensorflow package is also the GPU version.

@njzjz
Member

njzjz commented Apr 23, 2021

I notice that tensorflow/text#385 reported the same error, but their TensorFlow came from Docker rather than pip.

njzjz closed this as completed Apr 23, 2021
njzjz added wontfix and removed bug labels Apr 24, 2021
@AnguseZhang
Collaborator

How to fix this problem: first install tensorflow, and then install deepmd-kit.

@hyhuang00

I tried the solution above, but the problem persists:
tensorflow.python.framework.errors_impl.NotFoundError:~/.local/lib/python3.7/site-packages/deepmd/libop_abi.so: undefined symbol: _ZN10tensorflow11GetNodeAttrERKNS_9AttrSliceEN4absl14lts_2020_02_2511string_viewEPSt6vectorIiSaIiEE

@njzjz
Member

njzjz commented May 19, 2021

I tried the solution above, but the problem persists:

tensorflow.python.framework.errors_impl.NotFoundError:~/.local/lib/python3.7/site-packages/deepmd/libop_abi.so: undefined symbol: _ZN10tensorflow11GetNodeAttrERKNS_9AttrSliceEN4absl14lts_2020_02_2511string_viewEPSt6vectorIiSaIiEE

Please provide more information, such as how you installed it. If it's a new problem, you can open a new issue.

@njzjz
Member

njzjz commented Jun 25, 2021

In #797, I'm going to throw a message if this error happens.

amcadmus pushed a commit that referenced this issue Jun 26, 2021
* throw a message if tf runtime is incompatible

fix #557 and #796.

* still raise if tf version is correct

* detect TF_CXX11_ABI_FLAG

* format codes

* fix lint

* move messages into the function

* fix lint

* fix lints
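
The commit bullets above suggest the following behaviour: catch the load failure, compare the TensorFlow the op was built against with the one in use, and re-raise the original error when they agree. The snippet below is a rough sketch of that idea only; it is not the code from #797. `load_op_with_hint`, the `built_with` and `runtime` dictionaries, and the message wording are all hypothetical.

```python
def load_op_with_hint(load, path, built_with, runtime):
    """Load a custom-op library and explain a missing symbol where possible.

    load       : callable that loads the library, e.g. tf.load_op_library
    built_with : {"version": ..., "cxx11_abi_flag": ...} recorded at build time
                 (hypothetical; how it is recorded is up to the build system)
    runtime    : the same keys, read from the TensorFlow that is running
    """
    try:
        return load(path)
    except Exception as err:  # tf.errors.NotFoundError in practice
        if "undefined symbol" not in str(err):
            raise
        hints = []
        if built_with["cxx11_abi_flag"] != runtime["cxx11_abi_flag"]:
            hints.append(
                "The op was compiled with _GLIBCXX_USE_CXX11_ABI="
                f"{built_with['cxx11_abi_flag']}, but the TensorFlow runtime "
                f"uses {runtime['cxx11_abi_flag']}."
            )
        if built_with["version"] != runtime["version"]:
            hints.append(
                f"The op was compiled against TensorFlow {built_with['version']}, "
                f"but the runtime is TensorFlow {runtime['version']}."
            )
        if not hints:
            # Versions and ABI agree, so keep the original error untouched.
            raise
        raise RuntimeError(f"Failed to load {path}. " + " ".join(hints)) from err
```

Passing the loader in as an argument keeps the sketch independent of TensorFlow; in real use `load` would be `tf.load_op_library` and `runtime` would be read from `tf.version.VERSION` and `tf.sysconfig.CXX11_ABI_FLAG`.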
gzq942560379 pushed a commit to HPC-AI-Team/deepmd-kit that referenced this issue Sep 2, 2021
njzjz added Docs and removed document labels Nov 4, 2023