forked from NVIDIA/DALI
Commit
Synchronize CUDA stream once in operator benchmark (NVIDIA#3525)
* Synchronize CUDA stream once in operator benchmark

The CUDA stream was synchronized after each iteration of the operator benchmark, which distorted the measurements, especially for small data and small batch sizes. In a real pipeline, synchronization would not happen after each operation.

This commit moves the synchronization out of the loop, so the stream is synchronized only once per benchmark. It also adds a sync_each_n parameter.

Signed-off-by: Szymon Karpiński <hugo@staszic.waw.pl>
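The effect of moving the synchronization out of the loop can be illustrated with a small simulation. This is a minimal sketch, not the code from the commit: `FakeStream` and `run_benchmark` are hypothetical names, no real CUDA calls are made, and the exact semantics of `sync_each_n` are an assumption (here, a positive value means "synchronize after every n-th iteration", and a non-positive value means "synchronize only once at the end").

```python
class FakeStream:
    """Hypothetical stand-in for a CUDA stream: counts syncs instead of blocking."""

    def __init__(self):
        self.pending = 0      # operations enqueued since the last synchronization
        self.sync_count = 0   # how many times synchronize() was called

    def launch(self):
        # Enqueue one asynchronous operator run.
        self.pending += 1

    def synchronize(self):
        # Wait for all enqueued work (simulated) and record the call.
        self.pending = 0
        self.sync_count += 1


def run_benchmark(stream, iterations, sync_each_n=-1):
    """Enqueue `iterations` operator runs and return the number of synchronizations.

    Assumed semantics: if sync_each_n > 0, synchronize after every
    sync_each_n-th iteration; in all cases, synchronize once at the end
    if any work is still pending.
    """
    for i in range(iterations):
        stream.launch()
        if sync_each_n > 0 and (i + 1) % sync_each_n == 0:
            stream.synchronize()
    if stream.pending:
        stream.synchronize()  # the single synchronization outside the loop
    return stream.sync_count


if __name__ == "__main__":
    print(run_benchmark(FakeStream(), 10))                 # sync only at the end
    print(run_benchmark(FakeStream(), 10, sync_each_n=1))  # old per-iteration behavior
```

With the assumed semantics, `sync_each_n=1` reproduces the per-iteration synchronization described in the commit message, while the default leaves a single synchronization after the loop, so the host-side wait is paid once instead of on every iteration.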
1 parent ad81faf, commit 01450e3
Showing 1 changed file with 18 additions and 10 deletions.