CUDA Visual Profiler Changelog

What's new in CUDA Visual Profiler 1.1.6 Beta

Dec 22, 2008
  • New columns added for kernel methods:
  • Size of grid of blocks (grid size X, grid size Y)
  • Size of a thread block (block size X, block size Y, block size Z)
  • Register count per thread
  • Static shared memory size per block
  • Dynamic shared memory size per block
  • StreamID of kernel launched
  • New columns added for memcopy methods:
  • number of bytes
  • direction of transfer (host to device or device to host)
  • cputime
  • New view options added:
  • Comparison summary plot : This plot can be used to compare summary profiling data for two sessions.
  • Kernel table : This lists number of calls, grid size, block size, shared memory size per block and register count per thread for each kernel.
  • Memcopy table: This lists number of calls, memory transfer size in bytes and memory transfer direction for each memcopy.
  • cudaprof now detects whether a CUDA capable device is available on the system. If a CUDA device is not found the following message is displayed: "Unable to load cuda library. CUDA Visual Profiler device features will be disabled."
  • and certain options like Profile menu options are disabled.
  • Also based on the device type certain options are enabled or disabled. A new option "Profile->Device Properties" is provided to display cuda device properties.
  • The cputime value displayed is adjusted based on whether kernel execution is asynchronous or not.
  • Summary table has a new method display option. User can choose between "base name", "base name with suffix" or "full name". The "base name" option is useful to combine data for different template based kernel methods having the same name.
  • Width plot
  • display with cpu time enabled is changed. cputime is shown as a separate bar below gputime.
  • A new option is added to use occupancy as a bar height option in width plot.
  • Added height zoom option for height plot.
  • Common improvements to plots
  • Added title for each plot.
  • Added option to display plot configuration options.
  • The error reporting during program execution is changed to help in identifying the specific cause of the error.
  • The cudaprof user document (earlier README file) is now converted to HTML format (cudaprof.html) and can also be viewed using the "Help->Cuda Visual Profiler Help" menu option or using the function key. This new help option is currently only supported on Windows.
  • The format of the cudaprof .cpj project files is changed from plain text to XML. The information for each session which was earlier in separate .csn files is now part of the .cpj file. The new format is used for any new projects created or when existing projects are updated. Existing projects in the old format can also be opened.
  • CUDA device names for all available CUDA devices are now saved for each session and they are shown in Session Properties.
  • Added menu option "File->Delete" to delete a cuda profiler project.
  • In the Windows version the Microsoft Visual C++ libraries are no longer included in the cudaprof ZIP file. If you do not have Microsoft Visual C++ 2005 SP1 installed you will need to download and install the Microsoft Visual C++ 2005 SP1 Redistributable Package.