Fix coverity issues 10/23 #5083

Merged: 1 commit merged into NVIDIA:main on Oct 6, 2023
Conversation

@banasraf (Collaborator) commented Oct 6, 2023

Category:

Other

Description:

Fixes issues detected by Coverity.

Additional information:

Affected modules and functionalities:

Key points relevant for the review:

Tests:

  • Existing tests apply
  • New tests added
    • Python tests
    • GTests
    • Benchmark
    • Other
  • N/A

Checklist

Documentation

  • Existing documentation applies
  • Documentation updated
    • Docstring
    • Doxygen
    • RST
    • Jupyter
    • Other
  • N/A

DALI team only

Requirements

  • Implements new requirements
  • Affects existing requirements
  • N/A

REQ IDs: N/A

JIRA TASK: N/A

@@ -351,7 +351,7 @@ void CopyDlTensorBatchGpu(TensorList<GPUBackend> &output, std::vector<DLMTensorP
   }
   int element_size, ndim;
   ValidateBatch(element_size, ndim, dl_tensors, batch_size);
-  SmallVector<strided_copy::StridedCopyDesc, 128> sample_descs;
+  SmallVector<strided_copy::StridedCopyDesc, 32> sample_descs;
@banasraf (Collaborator, Author):

Stack allocation was too large.

Contributor:

Well, it worked, so I wouldn't say with any amount of certainty that it was "too large". For our target it seems to have been OK. Your call.

@awolant (Contributor) commented Oct 6, 2023:

Strided copy tests have had some random test failures in the CI for a while. Maybe there are still some issues there.

Contributor:

If we ran out of stack it wouldn't be "some issues"; it would be a hard fault (SIGSEGV; I don't think Linux has a separate stack-overflow error).
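For context on the stack-size concern, here is a minimal standalone sketch (not DALI code; FakeStridedCopyDesc is a hypothetical stand-in whose size is only illustrative) of how the inline capacity of a small-vector type scales the stack footprint of the enclosing function:

  #include <array>
  #include <cstdio>

  // Hypothetical stand-in for strided_copy::StridedCopyDesc; the real struct's
  // size may differ, this only illustrates the arithmetic.
  struct FakeStridedCopyDesc {
    void *out;
    const void *in;
    long long shape[8];
    long long in_strides[8];
  };

  int main() {
    // A SmallVector<T, N> keeps up to N elements in inline storage (here, on
    // the stack of the caller), so its stack footprint grows roughly as
    // N * sizeof(T). Dropping N from 128 to 32 cuts that footprint by ~4x.
    std::printf("inline storage for 128 descs: %zu bytes\n",
                sizeof(std::array<FakeStridedCopyDesc, 128>));
    std::printf("inline storage for  32 descs: %zu bytes\n",
                sizeof(std::array<FakeStridedCopyDesc, 32>));
  }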

@@ -181,7 +181,7 @@ DALI_DEVICE DALI_FORCEINLINE void AlignedCopy(const StridedCopyDesc &sample,
                                               MismatchedNdimT mismatched_ndim) {
   using T = typename ElementTypeDesc::type;
   using VecT = typename ElementTypeDesc::vec_type;
-  constexpr int vec_len = ElementTypeDesc::vec_len;
+  constexpr int64_t vec_len = ElementTypeDesc::vec_len;
@banasraf (Collaborator, Author):

It's later used in expressions that could overflow with 32-bit types.

Contributor:

Nit: indentation.
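To illustrate the overflow concern, here is a standalone sketch (not the DALI kernel; the names and values are made up): when vec_len is a 32-bit int, a product such as vec_len * volume is computed in 32 bits and can wrap before any widening assignment, whereas an int64_t vec_len promotes the whole expression to 64 bits.

  #include <cstdint>
  #include <cstdio>
  #include <type_traits>

  int main() {
    constexpr int     vec_len_i32 = 4;
    constexpr int64_t vec_len_i64 = 4;
    int volume = 1'000'000'000;  // a per-sample element count that fits in int

    // Both operands are int, so the product has type int and is computed in
    // 32 bits; 4 * 1e9 would overflow before reaching a 64-bit destination.
    static_assert(std::is_same_v<decltype(vec_len_i32 * volume), int>);

    // With a 64-bit vec_len the other operand is promoted and the product is
    // computed in 64 bits, so the same expression cannot overflow here.
    static_assert(std::is_same_v<decltype(vec_len_i64 * volume), int64_t>);

    int64_t offset = vec_len_i64 * volume;
    std::printf("64-bit product: %lld\n", static_cast<long long>(offset));
  }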

@awolant awolant self-assigned this Oct 6, 2023
@mzient mzient self-assigned this Oct 6, 2023
Comment on lines 261 to 264:

  try {
    auto interpreter_lock = py::gil_scoped_acquire();
  } catch (...) {
  }
@mzient (Contributor) commented Oct 6, 2023:

This code is broken now. This "lock" is like a lock_guard. It goes out of scope and the lock is released; then we do everything outside of the lock.

Also, I don't think that letting this error just disappear is a good idea.

Contributor:

Wouldn't this clear up anyway, since this is in ~DLTensorPythonFunctionImpl? I considered suggesting that this should kill the whole thing, but I'm not sure it can occur.

@banasraf (Collaborator, Author):

Yeah, this should only throw in some extreme cases. If we cannot acquire the GIL, there's probably something more seriously wrong with the process, so maybe it's better to just let it terminate.
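A minimal sketch of the RAII point being made (illustrative only; CleanupPythonState is a hypothetical helper, not the DALI destructor): the py::gil_scoped_acquire guard has to outlive the work that touches Python state, rather than being constructed and immediately destroyed inside a try block.

  #include <pybind11/pybind11.h>

  namespace py = pybind11;

  // Hypothetical cleanup helper: the guard stays in scope for the whole body,
  // so the Python reference is dropped while the GIL is actually held.
  void CleanupPythonState(py::object &callback) {
    py::gil_scoped_acquire gil;  // acquire the GIL for the full scope
    callback = py::object();     // release the Python reference under the GIL
  }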

Comment on lines 1595 to 1598:

  try {
    py::gil_scoped_release interpreter_unlock{};
  } catch (...) {
  }
Contributor:

That's trading a nice error for a deadlock. Please revert.
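For context, a sketch of the pattern under discussion (assumed names; not the DALI code): py::gil_scoped_release is also an RAII guard, so it must wrap the blocking section itself. If its construction throws and the exception is swallowed, execution falls through to the wait while still holding the GIL, which is the deadlock the reviewer describes.

  #include <functional>
  #include <pybind11/pybind11.h>

  namespace py = pybind11;

  // Hypothetical helper: give up the GIL for as long as we block, because the
  // worker threads being joined may need it to finish their Python callbacks.
  void WaitForWorkers(const std::function<void()> &join_all) {
    py::gil_scoped_release no_gil;  // released here, re-acquired on scope exit
    join_all();
  }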

@mzient (Contributor) left a review comment:

Broken GIL handling in ~DLTensorPythonFunctionImpl

@banasraf (Collaborator, Author) commented Oct 6, 2023:

!build

@dali-automaton (Collaborator):

CI MESSAGE: [10135281]: BUILD STARTED

@dali-automaton (Collaborator):

CI MESSAGE: [10135281]: BUILD PASSED

Signed-off-by: Rafal Banas <rbanas@nvidia.com>

@banasraf merged commit a9d5f76 into NVIDIA:main on Oct 6, 2023 (3 of 4 checks passed).

JanuszL pushed a commit to JanuszL/DALI that referenced this pull request on Oct 13, 2023
Signed-off-by: Rafal Banas <rbanas@nvidia.com>