Synapse Slayer: The Timing profile of the Neural network

Tuesday, September 29, 2009

The Timing profile of the Neural network

The code was profiled using the NVIDIA cuda profiler which gives the execution time of the individual kernel calls.

It looks like the cross correlations are taking the most time. The parallelization is done on the filter pixels which is the smallest size compared to the sensitivities and feature map sizes. CudaFFT might be a good option for the same.

Synapse Slayer

Tuesday, September 29, 2009

The Timing profile of the Neural network

No comments:

About Me

Blog Archive

Contact Me

Chat with me!!

Call me

Synapse Slayer

Tuesday, September 29, 2009

The Timing profile of the Neural network

No comments:

About Me

Subscribe To Synapses Slayer

Blog Archive

Contact Me

Chat with me!!

Call me