feature(wgt): enable DI using torch-rpc to support GPU-p2p and RDMA-rpc #562
Open: SolenoidWGT wants to merge 16 commits into opendilab:main from SolenoidWGT:p2p-rpc
Commits on Jan 12, 2023
- feature(wgt): enable DI using torch-rpc to support GPU-p2p and RDMA-rpc
  1. Add torchrpc message queue.
  2. Implement buffer based on CUDA-shared-tensor to optimize the data path of torchrpc.
  3. Add 'bypass_eventloop' arg in Task() and Parallel().
  4. Add thread lock in distributer.py to prevent sender and receiver competition.
  5. Add message queue perf test for torchrpc, nccl, nng, shm.
  6. Add comm_perf_helper.py to make program timing more convenient.
  7. Modified the subscribe() of class MQ, adding 'fn' parameter and 'is_once' parameter.
  8. Add new DummyLock and ConditionLock type in lock_helper.py.
  9. Add message queues perf test.
  10. Introduced a new self-hosted runner to execute cuda, multiprocess, torchrpc related tests.
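Items 7 and 8 of the commit message can be illustrated with a small, self-contained sketch. This is not the code from this PR: `ToyMQ`, its `publish()` method, the `thread_safe` flag, and the body of `DummyLock` are all hypothetical stand-ins, assuming only that `fn` is a callback invoked per message, that `is_once` removes the handler after its first delivery, and that `DummyLock` is a no-op replacement for a real lock.

```python
import threading
from collections import defaultdict
from typing import Any, Callable, Dict, List, Tuple


class DummyLock:
    """No-op stand-in for a lock (hypothetical; not the lock_helper.py code)."""

    def __enter__(self) -> "DummyLock":
        return self

    def __exit__(self, exc_type, exc, tb) -> bool:
        return False  # never swallow exceptions raised inside the block


class ToyMQ:
    """In-process toy queue; an assumption for illustration, not the PR's MQ class."""

    def __init__(self, thread_safe: bool = True) -> None:
        # Use a real lock when several threads may touch the handler table,
        # and the no-op lock otherwise.
        self._lock = threading.Lock() if thread_safe else DummyLock()
        self._handlers: Dict[str, List[Tuple[Callable[[Any], None], bool]]] = defaultdict(list)

    def subscribe(self, topic: str, fn: Callable[[Any], None], is_once: bool = False) -> None:
        """Register fn for topic; with is_once=True it fires at most one time."""
        with self._lock:
            self._handlers[topic].append((fn, is_once))

    def publish(self, topic: str, data: Any) -> None:
        with self._lock:
            handlers = list(self._handlers[topic])
            # Drop one-shot handlers from the table before invoking them.
            self._handlers[topic] = [h for h in handlers if not h[1]]
        # Call handlers outside the lock so a callback may subscribe again.
        for fn, _ in handlers:
            fn(data)


received = []
mq = ToyMQ()
mq.subscribe("step", received.append)
mq.subscribe("step", lambda d: received.append(("once", d)), is_once=True)
mq.publish("step", 1)
mq.publish("step", 2)
print(received)
```

In this sketch the persistent handler sees both messages while the one-shot handler sees only the first. Invoking callbacks after releasing the lock is a design choice of the sketch, made so that a handler can call `subscribe()` without deadlocking.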
  SHA: 5cfc2fb
- 2cee01a
- 6162b81
- 97b9bc7
Commits on Jan 13, 2023
- c5119f5
- a7a57a6
- fdd1bb9
Commits on Jan 17, 2023
- c06288e
- 3fa5319
Commits on Jan 18, 2023
- 345cc92
- dcc0a1a
- modify test_learner_with_coordinator loop range and remove python3.7, 3.8 from cudatest
  SHA: adab7fb
Commits on Feb 2, 2023
- ca25b27
Commits on Feb 13, 2023
- 44dcf13 (wangguoteng.p committed Feb 13, 2023)
- test pytest worker = 1 to avoid timeout
  SHA: 735e7cc (wangguoteng.p committed Feb 13, 2023)
Commits on Mar 9, 2023
- e32055b