Information Technology Reference
In-Depth Information
number of GPU cores employed for data race detection, GUARD is able to perform
data race detection for 8-core, 16-core, and 32-core CPUs with near-zero performance
overhead. With minimal hardware support, GUARD can be invoked for data race de-
tection with negligible performance impact. Overall, GUARD proves to be a powerful
tool in the parallel programming environment, necessitated by the emergence of many-
core processors, and facilitated by the development of heterogeneous architectures with
on-chip data-parallel cores.
Acknowledgments. This work is supported in part by National Science Foundation
grants CCF-0916583 and CPS-0931931. We would like to thank all anonymous review-
ers for their constructive comments that helped to improve the quality of this paper. We
would also like to thank Jieming Yin and Ragavendra Natarajan for their suggestions to
improve the paper.
References
1. Lu, S., Park, S., Seo, E., Zhou, Y.: Learning from mistakes: a comprehensive study on real
world concurrency bug characteristics. In: Proceedings of the 13th International Conference
on Architectural Support for Programming Languages and Operating Systems (2008)
2. Xiong, W., Park, S., Zhang, J., Zhou, Y., Ma, Z.: Ad hoc synchronization considered harmful.
In: Proceedings of the 9th USENIX Conference on Operating Systems Design and Imple-
mentation (2010)
3. Intel Corporation: Intel Thread Checker, http://www.intel.com
4. Savage, S., Burrows, M., Nelson, G., Sobalvarro, P., Anderson, T.: Eraser: A dynamic data
race detector for multithreaded programs. ACM Transactions on Computer Systems (1997)
5. Muzahid, A., Suarez, D., Qi, S., Torrellas, J.: Sigrace: signature-based data race detection.
In: International Symposium on Computer Architecture, ISCA (2009)
6. Zhou, P., Teodorescu, R., Zhou, Y.: Hard: Hardware-assisted lockset-based race detection.
In: International Symposium on High Performance Computer Architecture, HPCA (2007)
7. Brookwood, N.: AMD Fusion Family of APUs: Enabling a Superior, Immersive PC Experi-
ence. Advanced Micro Devices(AMD) White Paper (2010)
8. Intel Corporation: Intel Sandy Bridge Microarchitecture, http://www.intel.com
9. NVIDIA Corporation: NVIDIA Project Denver, http://goo.gl/i2Q3Z
10. Lamport, L.: Time, clocks, and the ordering of events in a distributed system. Communica-
tions of ACM (1978)
11. Engler, D., Ashcraft, K.: Racerx: effective, static detection of race conditions and deadlocks.
In: Proceedings of the Nineteenth ACM Symposium on Operating Systems Principles (2003)
12. Adve, S.V., Hill, M.D., Miller, B.P., Netzer, R.H.B.: Detecting data races on weak memory
systems. In: Proceedings of the 18th Annual International Symposium on Computer Archi-
tecture (1991)
13. Gupta, S., Sultan, F., Cadambi, S., Ivancic, F., Rotteler, M.: Using hardware transactional
memory for data race detection. In: IEEE International Symposium on Parallel Distributed
Processing, IPDPS (2009)
14. Boyer, M., Skadron, K., Weimer, W.: Automated dynamic analysis of cuda programs. In:
Third Workshop on Software Tools for MultiCore Systems (2008)
15. Hou, Q., Zhou, K., Guo, B.: Debugging gpu stream programs through automatic dataflow
recording and visualization. ACM Transactions on Graphics, TOG (2009)
Search WWH ::




Custom Search