Posted on 2008-03-15 14:46
ZelluX 阅读(1026)
评论(6) 编辑 收藏 所属分类:
Laboratory
包括各种paper, survey以及workshop上的讲座等内容
待读:Memory Resource Management in VMware ESX Server,
Carl A. Waldspurger, OSDI 02
A Performance Study of General Purpose Applications on Graphics Processors, First Workshop on General Purpose Processing on GPU
The Google File System,
Sanjay Ghemawat, Howard Gobioff, and Shun-Tak Leung, SOSP 2003
Parallelization:1.
Automatic Parallelization for Graphic Processing Units,
Alan Leung, 6th Workshop on Compiler-Driven Performance
自动识别Java程序中的并行化代码,放到GPU上做,用到了RapidMind库。
2.
Liquid SIMD: Abstracting SIMD Hardware using Lightweight Dynamic Mapping,
Nathan Clark.etc, University of Michigan & Cambridge, HPCA 07
粗读
3.
Parallel Computing: What has changed lately?,
David B.Kirk, NVIDIA Corporation 2007
CUDA比CTM好用多了。。
4.
Data-Parallel Programming on the Cell BE and the GPU using the RapidMind Development Platform, GSPx Multicore Applications Conference
粗读
5.
Mars: A MapReduce Framework on Graphics Processors香港科大、微软、雅虎一起做的一个MapReduce库,利用GPU进行map操作会提升不少性能,不过reduce操作应该会慢不少吧。
6.
A Survey of General-Purpose Computation on Graphics Hardware, EUROGRAPHICS 2005
写得很全面,GPU计算的大致流程,相关算法,以及调试工具等方面都涉及了,里面提到的一些GPU缺陷现在也已经被完善了。
7.
HMPP™: A Hybrid Multi-core Parallel Programming Environment,
CAPS entreprise
粗读。和OpenMP蛮像的,有一些HMPP directives。
8.
Streamware: Programming General-Purpose Multicore Processors Using Streams,
Jayanth Gummaraju.etc, Stanford University, ASPLOS 08
精读。适用于多种并行架构(多核CPU、GPGPU)的编程平台,对cache hierarchy的管理很出色(甚至通过这个平台,一些程序跑在单处理器的条件下也能加速)。
Compilers:1.
Open Research Compiler (ORC): Beyond Version 1.0, PACT 02
主要看了其中的Loop Nested Optimization和Interprocedural Optimization
Virtulization:1.
SubVirt: Implementing malware with virtual machines,
Samuel T.King, etc., IEEE SP 06
阅读笔记:
http://www.blogjava.net/zellux/archive/2008/05/05/198564.htmlhttp://www.blogjava.net/zellux/archive/2008/05/06/198693.html2.
Comptibility is Not Transparency: VMM Detection Myths and Realities,
Stanford University, VMWare, UBC/XenSource, Carnegie Mellon, Hot OS 07
听过:1.
Accelerating Two-Dimensional PageWalks for Virtualized Systems,
Ravi Bhargava.etc小组例会上CC讲的,Shadow Page Table开始就听大不懂了 =_=。不过还是先放上来,占个位,恩。
2.
Inter-domain Socket Communications Supporting High Performance and Full Binary Compatibility on Xen,
Kangho Kim.etc, VEE 08
通过一套新的XWAY机制来提高Xen中不同Domain互相之间网络传输速率。不过仅限于TCP协议。
3.
Dispersing Proprietary Applications as Benchmarks through Code Mutation,
Luk Van Ertvelde & Lieven Eeckhout, ASPLOS 08
很好玩的一篇paper。现在的benchmark跑的程序通常都是开源的或是免费的软件,但是商业软件就会涉及到版权问题。
这篇paper讲的是如何通过二进制代码的突变(mutation)达到更改软件功能,但又不影响软件跑在benchmark上时所耗的时间。
4.
Threads Cannot Be Implemented As a Library, ACM SIGPLAN 2005
举例说明了用库实现线程(如pthread)的一些缺陷