PowerPC builds #8

jaimergp · 2019-10-18T15:34:57Z

ppc64le builds are technically possible, but in practice there are some barriers. I will collect relevant issues and PRs here.

All

doxygen has no ppc64le build yet. I am submitting a PR here.
We will also need to ask for a build of version 1.8.14, since 1.8.16 fails with current openmm.

CUDA

The overall GPU support in conda-forge assumes a one-to-one relationship between CUDA versions and Docker images because it only considers x64 architectures. This should be addressed in conda-smithy and nvcc. Tracking issue.
Create CUDA Docker images for ppc64le. PR here.
defaults only provides cudatoolkit v9.0 for ppc64le. There are no plans to change that in defaults, but conda-forge might get their own permissions.

OpenCL

ocl-icd could be used to trigger the compilation of the OpenCL parts.

We could get an OpenCL + CPU build with relatively low effort if we fix doxygen and ocl-icd. Would this be enough?

jakirkham · 2019-10-18T15:48:18Z

cc @jayfurmanek

jayfurmanek · 2019-10-22T02:41:15Z

I saw the doxygen build going. Looks like it timed out (10mins no output). A couple things we could try there:

Set idle_timeout to 60 or so in conda-forge.yml.
Set it to use patchelf specifically for the path patcher; that might help speed up the final phase.

Also:
There was no GPU support for ppc64le on CENTOS6. In fact, CENTOS6 predates ppc64le as an arch. The anvil images and conda toolchain use CENTOS7 (cos7) on ppc64le and aarch64.

Anaconda doesn't provide newer cudatoolkit versions for ppc64le, unfortunately, although IBM does.

I don't know if anyone has tried ocl-icd on ppc64le. I know NVIDIA doesn't provide OpenCL for ppc64le so it may not be worth doing much with ocl-icd unfortunately.

jaimergp · 2019-10-22T07:23:18Z

Thanks for the valuable feedback @jayfurmanek!

> I saw the doxygen build going. Looks like it timed out (10mins no output).

We changed the provider to azure for ppc64le and, although it takes a couple of hours, it worked! Doxygen is not frequently updated, so I'd say it's ok to leave as is.

> There was no GPU support for ppc64le on CENTOS6. In fact, CENTOS6 predates ppc64le as an arch. The anvil images and conda toolchain use CENTOS7 (cos7) on ppc64le and aarch64.

Didn't know that, nice! One less thing to worry about.

> Anaconda doesn't provide newer cudatoolkit versions for ppc64le, unfortunately, although IBM does.

Is there any official way to use the IBM channels with conda-forge?

> I know NVIDIA doesn't provide OpenCL for ppc64le so it may not be worth doing much with ocl-icd unfortunately.

If that's the case (I didn't know that either), then you are right: there is probably no point in trying until we have official CUDA builds for ppc64le.

Thanks again!

jayfurmanek · 2019-11-05T16:08:22Z

The IBM channel is here:
https://public.dhe.ibm.com/ibmdl/export/pub/software/server/ibm-ai/conda/

There is a license that needs to be accepted at package install time by setting an environment variable: IBM_POWERAI_LICENSE_ACCEPT=yes

It currently has various levels of CUDA 10.1 for ppc64le and x86-64.
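A minimal sketch of how the channel and the license variable above could be combined in a scripted install. The helper name `build_install` is hypothetical, and `cudatoolkit` as the package name on the IBM channel is an assumption carried over from the rest of this thread. The function only builds the command and environment; it does not run conda.

```python
import os

# Channel URL and variable name are taken from the comment above.
IBM_CHANNEL = "https://public.dhe.ibm.com/ibmdl/export/pub/software/server/ibm-ai/conda/"
LICENSE_VAR = "IBM_POWERAI_LICENSE_ACCEPT"


def build_install(packages, base_env=None):
    """Return (argv, env) for a conda install from the IBM channel.

    Hypothetical helper: nothing is executed here, so the result can be
    inspected before it is handed to subprocess.run.
    """
    if not packages:
        raise ValueError("at least one package name is required")
    # Copy the environment so the caller's mapping is never mutated.
    env = dict(os.environ if base_env is None else base_env)
    env[LICENSE_VAR] = "yes"  # accept the license at install time
    argv = ["conda", "install", "--yes", "--channel", IBM_CHANNEL, *packages]
    return argv, env
```

To actually install, something like `argv, env = build_install(["cudatoolkit"])` followed by `subprocess.run(argv, env=env, check=True)` should work, assuming conda is on the PATH and the package name matches what the channel provides.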

giadefa · 2021-02-25T17:52:16Z

Hi,
what is the status of the openmm release for ppc64le? According to #36 (comment), there still seem to be some shortcomings.

jchodera · 2021-02-25T18:14:57Z

In particular, @giadefa pointed out that there are now new Power9 supercomputers with powerful GPUs:
https://www.hpc.cineca.it/hardware/marconi100

jaimergp · 2021-02-25T19:01:09Z

I recall master is ready for PPC, but we need to cut a new release for that. See openmm/openmm#2993

mrshirts · 2021-03-16T01:50:12Z

We would love to start running on ORNL GPUs soon, so this would be great to get finalized!

jayfurmanek · 2021-03-16T05:04:22Z

Also, conda-forge does have up-to-date cudatoolkit and ocl-icd packages for ppc64le now too, so I don't see any other blockers.

jaimergp · 2021-03-16T09:52:03Z

Once openmm/openmm#2993 is accepted for release, I'll work on the CF machinery to put the PPC builds out there!

jchodera · 2021-03-16T14:46:15Z

@peastman: Can we prioritize a 7.5.1 bugfix release to enable the ppc64le openmm toolchain to start building?

peastman · 2021-03-16T17:44:35Z

The thing blocking 7.5.1 is finding someone with an ARM Mac who can test that. If we either drop the ARM Mac support, or clearly mark it as untested, we can move ahead with releasing 7.5.1.

jaimergp · 2021-03-16T17:51:53Z

We can leave the existing warnings for 7.5.1 on arm64 and remove them when we have tested it thoroughly (either in a new build or in a new version).

jchodera · 2021-03-16T18:03:40Z

+1 for just keeping the warnings. We've had the minimal tests run, and you didn't want us to send you an ARM machine, while I'm still months away from being allowed to use one by MSK. Let's get it out there so people can give us feedback.

peastman · 2021-03-16T18:08:52Z

Ok!

raimis · 2021-04-16T10:37:26Z

OpenMM 7.5.1rc1 is out (https://anaconda.org/conda-forge/openmm/files?version=7.5.1rc2), but I don't see the packages for PowerPC. Are we still on track to support PowerPC in OpenMM 7.5.1?

jchodera · 2021-04-18T01:46:59Z

@peastman @jaimergp: Wasn't 7.5.1 supposed to have everything we need for ppc64le support?

peastman · 2021-04-18T02:55:27Z

Yes, I thought it was building for it. @jaimergp do you know why it didn't?

jaimergp · 2021-04-19T12:35:30Z

Because we (I) haven't rolled out support for CUDA on PPC yet. I was half hoping somebody else would do it while we fixed its support in OpenMM, but that didn't happen, so I'll get to it.

It shouldn't delay the release of the other builds though; I can work on it in the meantime.

raimis · 2021-04-19T13:31:05Z

@jaimergp thanks for the update. Do you have an estimate of when the PowerPC packages will be available?

jaimergp · 2021-04-20T16:02:42Z

We need three (cascading) pieces of infrastructure:

Docker images -- see "PPC: add 11.0, 11.1, 11.2 + cached cudatoolkit pkg" (conda-forge/docker-images#178).
NVCC wrapper package -- see "Add PowerPC" (conda-forge/nvcc-feedstock#66). This might require a PR for the CentOS 8 sysroot.
Submit the arch migrator -- pending.

So I can't give an estimate, but at least you can see the progress here.

peastman · 2021-04-20T16:47:33Z

Thanks! No need to hold up anything else while we wait for it.

raimis · 2021-05-14T10:53:45Z

@jaimergp

I see that conda-forge/docker-images#178 and conda-forge/nvcc-feedstock#66 have been merged. What is the situation with the last step?

jaimergp · 2021-05-14T11:05:38Z

I am working on it. I'll submit a PR later!

jaimergp · 2021-05-14T11:52:03Z

@raimis see #55

tonigi · 2022-06-10T16:04:32Z

PPC builds used to be made on CI and uploaded to conda-forge until 7.6.0 (and they worked great btw). This does not seem to be the case for 7.7.0 any more. Any chance to resume them?

peastman · 2022-06-10T16:30:53Z

PPC builds no longer work when built with the compilers used by conda-forge. A lot of the test cases fail or segfault. They work fine when built using the standard system compilers. I've tried to track down the problem but without success. I believe it's caused by a compiler bug. Unfortunately, this means distributing PPC builds through conda-forge is now impossible.

tonigi · 2022-06-10T16:34:52Z

Oh no. Is there a "single place" for the local build instructions? (I used to have an attempt at https://github.com/giorginolab/miniomm/wiki/%5BOBSOLETE%5D-Compiling-OpenMM-on-M100, but I'm not sure how much it can be trusted.)

peastman · 2022-06-10T16:53:46Z

Instructions on building from source are at http://docs.openmm.org/latest/userguide/library/02_compiling.html. We haven't done a survey of compilers to figure out which specific ones work and which fail. My general impression has been that gcc is buggier than clang, but that's based on only a few incidents. Once you build, be sure to do a make test. Using the conda-forge compilers with PPC, we get a bunch of test failures like these:

  1/9 Test #45: TestCpuCheckpoints ...............***Failed    0.24 sec
  exception: Particle coordinate is NaN.  For more information, see https://github.com/openmm/openmm/wiki/Frequently-Asked-Questions#nan
  
      Start 48: TestCpuCustomManyParticleForce
  2/9 Test #47: TestCpuCustomGBForce .............***Exception: SegFault  2.25 sec
  
      Start 49: TestCpuCustomNonbondedForce
  3/9 Test #49: TestCpuCustomNonbondedForce ......***Failed    0.20 sec
  exception: Assertion failure at TestCustomNonbondedForce.h:103.   Expected [4500, 0, 0], found [0, 0, 0]
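
When comparing compilers, it may help to reduce a log like the one above to a list of failing tests. The following is a rough sketch, not part of OpenMM or CTest: the function names are hypothetical, and the regular expression assumes the exact line layout shown above (`N/M Test #K: Name ....***Status  T sec`), which other CTest versions may format differently.

```python
import re
from collections import Counter

# Matches only result lines that carry a "***" failure marker, e.g.
#   1/9 Test #45: TestCpuCheckpoints ......***Failed    0.24 sec
FAILURE = re.compile(
    r"^\s*\d+/\d+\s+Test\s+#(\d+):\s+(\S+)\s+\.+\s*\*\*\*(.+?)\s+([\d.]+)\s+sec\s*$"
)


def failed_tests(log_text):
    """Return (test_number, name, status, seconds) for each failing test."""
    failures = []
    for line in log_text.splitlines():
        match = FAILURE.match(line)
        if match:
            number, name, status, seconds = match.groups()
            failures.append((int(number), name, status, float(seconds)))
    return failures


def count_by_status(failures):
    """Tally failures by status string, e.g. 'Failed' vs 'Exception: SegFault'."""
    return Counter(status for _, _, status, _ in failures)
```

Running `failed_tests` over the saved output of `make test` from two different compilers and comparing the resulting names would show which failures are specific to one toolchain.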

tonigi · 2022-06-11T00:39:56Z

By chance, is this a problem that only appears in CI? From what I understand conda-forge runs PPC64LE through emulation by default, which in my impression is buggy especially for numerics. A native (local) conda-build with conda-forge gcc 12.1.0-16 seems to work. (But there are other quirks, like CMake not finding CUDA)

peastman · 2022-06-11T17:39:07Z

I don't know. I don't have access to an actual PPC Linux system, so the only way I'm able to test it is through emulation. I can say, though, that it has all the hallmarks of a compiler bug. For example, I store some values into memory, load that memory into a SIMD register, and the register ends up with the wrong values. But if I print out the memory locations I just stored to before loading them into the register, then it ends up with the right values. That's the sort of behavior you tend to see if there's a bug in the compiler's optimization stage. This also isn't the first time I've run into a bug in gcc on PPC.

This was referenced Dec 3, 2019

Upgrade to Conda Build 3 (finally) and Move to Azure DevOps omnia-md/conda-dev-recipes#177

Merged

WIP · ppc64le support #16

Closed

raimis mentioned this issue May 13, 2021

ppc64 conda openmm build openmm/openmm#3116

Closed

jakirkham mentioned this issue Apr 2, 2022

Trim unneeded CUDA arch feedstocks (add 2 needed ones) conda-forge/conda-forge-pinning-feedstock#2700

Merged
