- Hands-On GPU Programming with Python and CUDA
- Dr. Brian Tuomanen
- 213字
- 2021-06-10 19:25:34
Profiling your code
We saw in the previous example that we can individually time different functions and components with the standard time function in Python. While this approach works fine for our small example program, this won't always be feasible for larger programs that call on many different functions, some of which may or may not be worth our effort to parallelize, or even optimize on the CPU. Our goal here is to find the bottlenecks and hotspots of a program—even if we were feeling energetic and used time around every function call we make, we might miss something, or there might be some system or library calls that we don't even consider that happen to be slowing things down. We should find candidate portions of the code to offload onto the GPU before we even think about rewriting the code to run on the GPU; we must always follow the wise words of the famous American computer scientist Donald Knuth: Premature optimization is the root of all evil.
We use what is known as a profiler to find these hot spots and bottlenecks in our code. A profiler will conveniently allow us to see where our program is taking the most time, and allow us to optimize accordingly.
- Linux運維之道(第3版)
- Kubernetes修煉手冊
- Modern Web Testing with TestCafe
- Social Media Mining with R
- WordPress Mobile Web Development:Beginner's Guide
- 操作系統(tǒng)基礎(chǔ)與實踐:基于openEuler平臺
- PLC控制系統(tǒng)應(yīng)用與維護
- Windows Phone 7.5 Data Cookbook
- 移動應(yīng)用UI設(shè)計模式(第2版)
- Windows 7中文版從入門到精通(修訂版)
- 完美應(yīng)用RHEL 8
- Application Development in iOS 7
- OpenSolaris設(shè)備驅(qū)動原理與開發(fā)
- Linux設(shè)備驅(qū)動開發(fā)
- UI設(shè)計手繪表現(xiàn)從入門到精通