Blog

DLProf is a framework provided by NVIDIA that can profile major deep learning frameworks such as TensorFlow and PyTorch. DLProf has the ability to provide detailed profiling of CPU and GPU processes. DLProf is an essential tool for optimizing hardware usage of training and custom written CUDA code. DLProf can also offer suggestion on how to improve code.