Use zero-copy outputs with PyTorch #5699

mzient · 2024-11-05T14:42:05Z

Category:

Optimization

Description:

When using the dynamic executor, the outputs can be safely transferred (they won't be overwritten). This PR implements zero-copy outputs along with tests. It also updates the RN50 training script to use the new executor.

Additional information:

Affected modules and functionalities:

Key points relevant for the review:

Tests:

Checklist

Documentation

DALI team only

Requirements

Implements new requirements
Affects existing requirements
N/A

REQ IDs: N/A

JIRA TASK: N/A

…instead). Signed-off-by: Michal Zientkiewicz <[email protected]>

Signed-off-by: Michal Zientkiewicz <[email protected]>

dali-automaton · 2024-11-05T14:45:36Z

CI MESSAGE: [20097716]: BUILD STARTED

dali-automaton · 2024-11-05T17:24:24Z

CI MESSAGE: [20097716]: BUILD PASSED

jantonguirao · 2024-11-06T11:17:38Z

dali/python/nvidia/dali/plugin/pytorch/__init__.py

-                    feed_ndarray(tensor, pyt_tensors[category], cuda_stream=stream)
-                else:
-                    feed_ndarray(tensor, pyt_tensors[category])
+                    if isinstance(tensor, TensorGPU):


in the previous code we were also matching TensorListGPU. Can you provide some background for this change?

It was dead code. We replace TensorList with a tensor in line 254:

category_tensors[category] = out.as_tensor()

mzient · 2024-11-06T12:15:42Z

dali/python/nvidia/dali/plugin/pytorch/__init__.py

-                    pyt_tensors[category] = [
-                        torch.empty(
-                            shape,
+            if copy:


OMG, GitHub made a big mess here. It's just indented and reformatted. Nothing more happened in the if copy branch.

mzient · 2024-11-06T12:17:14Z

dali/test/python/test_fw_iterators.py

@@ -1575,7 +1586,7 @@ def test_pytorch_iterator_pass_reader_name():
                        LastBatchPolicy.FILL,
                        LastBatchPolicy.DROP,
                    ]:
-                        for iters in [1, 2, 3, 2 * shards_num]:


Varying the number of iterations doesn't add anything to the test.

mdabek-nvidia · 2024-11-06T13:35:43Z

dali/python/nvidia/dali/plugin/pytorch/__init__.py

-                    pyt_tensors[category] = [
-                        torch.empty(
-                            shape,
+            if copy:


This method is long and complex. My advice is to refactor it into shorter methods.

It might be a complex task of its own, given the number of local variables that would need to be passe around.

mzient added 2 commits November 4, 2024 17:50

Don't copy outputs to PyTorch when using the new execuor (use DLPack …

9953196

…instead). Signed-off-by: Michal Zientkiewicz <[email protected]>

Use zero-copy in ragged iterator.

68333f3

Signed-off-by: Michal Zientkiewicz <[email protected]>

mzient force-pushed the zero_copy_outputs_pytorch branch from 4c11075 to 9e1f0bb Compare November 5, 2024 14:43

Use new executor in PyTorch RN50 convergence test.

4a94a74

Signed-off-by: Michal Zientkiewicz <[email protected]>

mzient force-pushed the zero_copy_outputs_pytorch branch from 9e1f0bb to 4a94a74 Compare November 5, 2024 14:43

dali-automaton assigned jantonguirao and mdabek-nvidia Nov 6, 2024

jantonguirao reviewed Nov 6, 2024

View reviewed changes

mzient commented Nov 6, 2024

View reviewed changes

jantonguirao approved these changes Nov 6, 2024

View reviewed changes

mdabek-nvidia approved these changes Nov 6, 2024

View reviewed changes

mzient merged commit 1119827 into NVIDIA:main Nov 6, 2024
6 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Use zero-copy outputs with PyTorch #5699

Use zero-copy outputs with PyTorch #5699

Uh oh!

mzient commented Nov 5, 2024

Uh oh!

dali-automaton commented Nov 5, 2024

Uh oh!

dali-automaton commented Nov 5, 2024

Uh oh!

jantonguirao Nov 6, 2024

Uh oh!

mzient Nov 6, 2024

Uh oh!

mzient Nov 6, 2024

Uh oh!

mzient Nov 6, 2024

Uh oh!

mdabek-nvidia Nov 6, 2024

Uh oh!

mzient Nov 6, 2024

Uh oh!

Uh oh!

Uh oh!

Use zero-copy outputs with PyTorch #5699

Use zero-copy outputs with PyTorch #5699

Uh oh!

Conversation

mzient commented Nov 5, 2024

Category:

Description:

Additional information:

Affected modules and functionalities:

Key points relevant for the review:

Tests:

Checklist

Documentation

DALI team only

Requirements

Uh oh!

dali-automaton commented Nov 5, 2024

Uh oh!

dali-automaton commented Nov 5, 2024

Uh oh!

jantonguirao Nov 6, 2024

Choose a reason for hiding this comment

Uh oh!

mzient Nov 6, 2024

Choose a reason for hiding this comment

Uh oh!

mzient Nov 6, 2024

Choose a reason for hiding this comment

Uh oh!

mzient Nov 6, 2024

Choose a reason for hiding this comment

Uh oh!

mdabek-nvidia Nov 6, 2024

Choose a reason for hiding this comment

Uh oh!

mzient Nov 6, 2024

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!