The IMPACT Research Group
Illinois Microarchitecture Project utilizing Advanced Compiler Technology

Publications
[2008] [2007] [2006] [2005] [2004] [2003] [2002] [2001] [2000] [1999] [1998] [1997] [1996] [1995] [1994] [1993] [1992] [1991] [1990] [1989]
2008 Publications

Analyses for Extensive Parallelization of Video Applications in C (PS)
Shane Ryoo and Wen-mei W. Hwu
IMPACT Technical Report, IMPACT-08-02, University of Illinois at Urbana-Champaign, Urbana, IL, June 2008.

CUBA: An Architecture for Efficient CPU/Co-processor Data Communication
Isaac Gelado, John H. Kelm, Shane Ryoo, Nacho Navarro, Steve S. Lumetta, and Wen-mei W. Hwu
Proceedings of the 22nd ACM International Conference on Supercomputing, June 2008.

GPU Acceleration of Cutoff Pair Potential for Molecular Modeling Applications (PS | PDF)
(Posted versions copyright ACM 2008. For personal use only.)
Christopher I. Rodrigues, David J. Hardy, John E. Stone, Klaus Schulten, and Wen-Mei W. Hwu
Proceedings of the 2008 International Conference on Computing Frontiers, May 2008.

Program Optimization Strategies for Data-Parallel Many-Core Processors (PS | PDF)
Shane Ryoo
PhD Dissertation, Department of Electrical and Computer Engineering, University of Illinois, Urbana, IL, 2008.

Program Optimization Space Pruning for a Multithreaded GPU (PS | PDF | PDF slides)
(Posted versions copyright ACM 2008. For personal use only.)
Shane Ryoo, Christopher I. Rodrigues, Sam S. Stone, Sara S. Baghsorkhi, Sain-Zee Ueng, John A. Stratton, and Wen-mei W. Hwu
Proceedings of the 2008 International Symposium on Code Generation and Optimization, April 2008.

MCUDA: An Efficient Implementation of CUDA Kernels on Multi-cores
(PS | PDF)
John A. Stratton, Sam S. Stone, Wen-mei W. Hwu
IMPACT Technical Report, IMPACT-08-01, University of Illinois, Urbana, IL, 2008.

Optimization Principles and Application Performance Evaluation of a Multithreaded GPU Using CUDA (PS | PDF | PDF slides)
(Posted versions copyright ACM 2008. For personal use only.)
Shane Ryoo, Christopher I. Rodrigues, Sara S. Baghsorkhi, Sam S. Stone, David B. Kirk (NVIDIA), and Wen-mei W. Hwu
Proceedings of the 13th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, February 2008.
2007 Publications

Iteration Disambiguation for Parallelism Identification in Time-Sliced Applications
(PS | PDF | PDF slides)
Shane Ryoo, Christopher I. Rodrigues, and Wen-mei W. Hwu
The 20th International Workshop on Languages and Compilers for Parallel Computing, October 2007.

How GPUs Can Improve the Quality of Magnetic Resonance Imaging
(PDF (with minor corrections) | PDF slides)
Sam S. Stone, Haoran Yi, Wen-mei W. Hwu, Justin P. Haldar, Bradley P. Sutton, and Zhi-Pei Liang
The First Workshop on General Purpose Processing on Graphics Processing Units, October 2007.

Program Optimization Study on a 128-Core GPU
(PS | PDF | PDF slides)
Shane Ryoo, Christopher I. Rodrigues, Sam S. Stone, Sara S. Baghsorkhi, Sain-Zee Ueng, and Wen-mei W. Hwu
The First Workshop on General Purpose Processing on Graphics Processing Units, October 2007.

Performance Insights on Executing Non-Graphics Applications on CUDA on the NVIDIA GeForce 8800 GTX (PDF slides)
Wen-mei Hwu, David Kirk (NVIDIA), Shane Ryoo, Christopher Rodrigues, John Stratton, and Kuangwei Hwang
Presentation at Hot Chips 19, August 2007.

Implicit Parallel Programming Models for Thousand-Core Microprocessors (PDF)
Wen-mei Hwu, Shane Ryoo, Sain-Zee Ueng, John H. Kelm, Isaac Gelado, Sam S. Stone, Robert E. Kidd, Sara S. Baghsorkhi, Aqeel A. Mahesri, Stephanie C. Tsao, Nacho Navarro, Steve S. Lumetta, Matthew I. Frank, and Sanjay J. Patel
Proceedings of the 44th Annual Design Automation Conference, June 2007.

Automatic Discovery of Coarse-Grained Parallelism in Media Applications.
Shane Ryoo, Sain-Zee Ueng, Christopher I. Rodrigues, Robert E. Kidd, Matthew I. Frank, and Wen-mei W. Hwu
Transactions on HiPEAC I, LNCS 2050, pp. 194-213, 2007.

Dynamic Tracking of Information-Flow Signatures for Security Checking
William Healey, Karthik Pattabiraman, Shane Ryoo, Ravishanker Iyer and Wen-mei W. Hwu
Technical Report UILU-ENG-02-2002, University of Illinois at Urbana-Champaign, January 2007.
2006 Publications

Improved Superblock Optimization in GCC. (PDF version Talk)
Robert Kidd and Wen-mei W. Hwu
Proceedings of the GCC Developer's Summit, pp. 85-96, June 2006.

P3DE: Profile-Directed Predicated Partial Dead Code Elimination. (PowerPoint Slides)
Shane Ryoo, Sain-Zee Ueng, and Wen-mei W. Hwu
The 5th Workshop on EPIC Architectures and Compiler Technology, March 2006.

Tolerating Cache-Miss Latency With Multipass Pipelines.
Ronald D. Barnes, Shane Ryoo, and Wen-mei W. Hwu
IEEE Micro, Vol. 26, No. 1, January-February 2006.
2005 Publications

"Flea-flicker" Multipass Pipelining: An Alternative to the High-Power Out-of-Order Offense. (PS PDF Version)
Ronald D. Barnes, Shane Ryoo, and Wen-mei W. Hwu
Proceedings of the 38th Annual IEEE/ACM International Symposium on Microarchitecture, November 2005.

Fulcra Pointer Analysis Framework. (PDF version)
Erik M. Nystrom
PhD thesis, Department of Electrical and Computer Engineering, University of Illinois, Urbana IL, 2005.

A Systematic Approach to Delivering Instruction-Level Parallelism in EPIC Systems. (PDF version)
John W. Sias
PhD. Dissertation, Department of Electrical and Computer Engineering, University of Illinois, Urbana IL, 2005.
Abstract

Multiple-Pass Pipelining: Enhancing in-order Microarchitectures to Out-Of-Order Performance. (PDF version)
Ronald D. Barnes, JR.
PhD thesis, Department of Electrical and Computer Engineering, University of Illinois, Urbana IL, 2005
2004 Publications

Matching On-Chip Data Storage To Telecommunication And Media Application Properties. (PDF version)
Hillery C. Hunter
PhD thesis, Department of Electrical and Computer Engineering, University of Illinois, Urbana IL, 2004.

Partial Code Elimination in the IMPACT Compiler Framework. (PS PDF Version)
Shane Ryoo
MS Thesis, Department of Electrical and Computer Engineering, University of Illinois, Urbana, IL, 2004.

Template Bundling for EPIC Architectures. (PS PDF Version)
Sain-Zee Ueng
MS thesis, Department of Electrical and Computer Engineering, University of Illinois at Urbana-Champaign, 2004.

Applying Scalable Interprocedural Pointer Analysis to Embedded Applications.
Hillery C. Hunter, Erik M. Nystrom, Shane Ryoo, and Wen-mei W. Hwu
Workshop on Compilers and Tools for Constrained Embedded Systems, September 2004.

Bottom-up and Top-down Context-Sensitive Summary-based Pointer Analysis. (PostScript version, PDF version)
E. M. Nystrom, H. -S. Kim, W. -M. Hwu.
The Proceedings of the 11th Static Analysis Symposium, August 2004.
Abstract

Field-testing IMPACT EPIC Research Results in Itanium 2.
(PostScript version PDF version)
John W. Sias, Sain-zee Ueng, Geoff A. Kent, Ian M. Steiner, Erik M. Nystrom and Wen-mei W. Hwu
Proceedings of the 31st Annual International Symposium on Computer Architecture, pp. 26-37, July 2004.
Abstract

Extracting Data Flow Model from von Neumann Program for Synthesis(PS version)
Chien-Wei Li, Hong-Seok Kim and Wen-mei W.
Proceedings of the 13th International Workshop on Logic and Synthesis, June 2004.

Exploiting Load Flexibility for Embedded Power Savings.
Hillery C. Hunter, Shane Ryoo, James Player, Daniel A. Connors, and Wen-mei W. Hwu.
IMPACT Technical Report, IMPACT-04-01, University of Illinois, at Urbana-Champaign, June 2004.

Importance of Heap Specialization in Pointer Analysis. (PostScript version, PDF version)
E. M. Nystrom, H. -S. Kim, W. -M. Hwu.
The Proceedings of Program Analysis for Software Tools and Engineering, June 2004.
Abstract
2003 Publications

A Dynamic Application Analysis Framework. (PDF version)
Marie Therese Conte
PhD thesis, Department of Electrical and Computer Engineering, University of Illinois, at Urbana-Champaign, 2003.

A New Look at Exploiting Data Parallelism in Embedded Systems.
Hillery C. Hunter and Jaime H. Moreno
Proceedings of the International Conference on Compilers, Architecture, and Synthesis for Embedded Systems, October 2003.

Memory Profiling: Expanding the 3G Developer's Bag of Tricks.
Hillery C. Hunter and Wen-mei W. Hwu
Workshop on Compilers and Tools for Constrained Embedded Systems, October 2003.

Motivating use of Memory Profiling in the 3G Domain. (SRC account required)
Hillery C. Hunter, Chien-Wei Li and Wen-mei W. Hwu
Proceedings of the SRC TECHCON 2003, August 2003.

An Innovative Low-Power High-Performance Programmable Signal Processor for Digital Communications. (Available through IBM JRD)
J. Moreno, V. Zyuban, U. Shvadron, F. Nesser, J. Derby, M. Ware, K. Kailas, A. Zaks, A. Geva, S. Ben-David, S. Asaad, T. Fox, D. Littrell, M. Biberstein, D. Naishlos, and H. Hunter
IBM Journal of Research and Development, vol. 47, no 2/3, March/May 2003.

Compaction algorithm for precise modular context-sensitive pointer analysis. (PostScript version, PDF version)
H. -S. Kim, E. M. Nystrom, R. D. Barnes, W. -M. Hwu.
IMPACT Technical Report, IMPACT-03-03, University of Illinois, Urbana, IL, 2003.
Abstract

Scalable, precise context-sensitive top-down process for modular points-to analysis. (PostScript version, PDF version)
E. M. Nystrom, H. -S. Kim, W. -M. Hwu.
IMPACT Technical Report, IMPACT-03-03, University of Illinois, Urbana, IL, 2003.
Abstract
2002 Publications

Vacuum Packing: Extracting Hardware-Detected Program Phases for Post-link Optimization.
(PostScript version, PDF version)

Ronald D. Barnes, Erik M. Nystrom, Matthew C. Merten, and Wen-mei W. Hwu
Proceedings of the 35th International Symposium on Microarchitecture, November 2002.
Abstract

Code Coverage and Input Variability: Effects on Architecture and Compiler Research. (PostScript version, PDF version)
Hillery C. Hunter and Wen-mei W. Hwu
Proceedings of the International Conference on Compilers, Architecture, and Synthesis for Embedded Systems, October, 2002.
Abstract

The IMPACT SC140 Code Generator. (PostScript version, PDF version)
Christopher J. Shannon
MS thesis, Department of Electrical and Computer Engineering, University of Illinois, Urbana IL, April 2002

Extracting Hardware-Detected Program Phases for Post-Link Optimization. Ronald D. Barnes
Master's thesis, Department of Electrical and Computer Engineering, University of Illinois, Urbana IL, 2002.
2001 Publications

Enhancing Loop Buffering of Media and Telecommunications Applications Using Low-overhead Predication. (PostScript version, PDF version)
John W. Sias, Hillery C. Hunter and Wen-mei W. Hwu
Proceedings of the 34th International Symposium on Microarchitecture, December, 2001.
Abstract

Modulo Schedule Buffers. (PostScript version, PDF version)
Matthew C. Merten and Wen-mei W. Hwu
Proceedings of the 34th International Symposium on Microarchitecture, December, 2001
Abstract Presentation Slides (PostScript version, PDF version)

Program Decision Logic Optimization Using Predication and Control Speculation (not available online at this time)
Wen-mei W. Hwu, David I. August and John W. Sias
Proceedings of the IEEE, November, 2001, pp. 1660-1675.

Itanium Performance Insights. (PDF slides PS Version handouts )
Wen-mei W. Hwu, John W. Sias, Matthew C. Merten, Erik M. Nystrom, Ronald D. Barnes, Christopher J. Shannon, Shane Ryoo and Jeff V. Olivier
Presentation at Microprocessor Forum, October 2001.

Code Reordering and Speculation Support for Dynamic Optimization Systems. (PostScript version, PDF version)
Erik M. Nystrom, Ronald D. Barnes, Matthew C. Merten, and Wen-mei W. Hwu
Proceedings of the International Conference on Parallel Architectures and Compilation Techniques, September 8-12, 2001.
Abstract

Itanium Performance Insights from the IMPACT Compiler. (Slides PDF version)(Handouts PostScript version, PDF version)
John W. Sias, Matthew C. Merten, Erik M. Nystrom, Ronald D. Barnes, Christopher J. Shannon, Joe D. Matarazzo, Shane Ryoo, Jeff V. Olivier, and Wen-mei W. Hwu
Presentation at Hot Chips 13, August 2001.

Characterization of Repeating Data Access Patterns in Integer Benchmarks. (PostScript version, PDF version)
Erik M. Nystrom, Roy Dz-ching Ju, and Wen-mei W. Hwu
Memory Performance Issues Workshop at the 28th International Symposium on Computer Architecture, July 2001.
Abstract

An Architectural Framework for Run-Time Optimization. (PostScript version, PDF version)
Matthew C. Merten, Andrew R. Trick, Ronald D. Barnes, Erik M. Nystrom, Christopher N. George, John C. Gyllenhaal, and Wen-mei W. Hwu
IEEE Transactions on Computers, Vol. 50, No. 6, June 2001, pp. 567-589.
Abstract
2000 Publications

Accurate and Efficient Predicate Analysis with Binary Decision Diagrams. (PostScript version, PDF version)
John W. Sias, David I. August, and Wen-mei W. Hwu
Proceedings of the 33rd International Symposium on Microarchitecture, December, 2000
Abstract

Hardware Support for Dynamic Activation of Compiler-Directed Computation Reuse. (PostScript version, PDF version)
Daniel. A. Connors, Hillery C. Hunter, Ben-Chung Cheng, and Wen-mei W. Hwu
Proceedings of the 9th International Conference on Architecture Support for Programming Languages and Operating Systems, November 2000.
Abstract

A Hardware Mechanism for Dynamic Extraction and Relayout of Program Hot Spots. (PostScript version, PDF version)
Matthew C. Merten, Andrew R. Trick, Erik M. Nystrom, Ronald D. Barnes, and Wen-mei W. Hwu
Proceedings of the 27th International Symposium on Computer Architecture, June 2000, pp. 59-70.
Abstract Presentation Slides (PostScript version, PDF version)

Modular Interprocedural Pointer Analysis Using Access Paths: Design, Implementation, and Evaluation. (PostScript version, PDF version)
Ben-Chung Cheng, and Wen-mei W. Hwu
Proceedings of the 2000 ACM SIGPLAN Conference on Programming Language Design and Implementation, Vancouver, British Columbia, Canada, June, 2000

Compile-Time Memory Disambiguation for C Programs. (PostScript version, PDF version)
Ben-Chung Cheng
PhD thesis, Department of Computer Science, University of Illinois, Urbana, IL, May 2000

Eliminating Dynamic Computation Redundancy ( PostScript version, PDF version)
D. A. Connors
Ph.D. dissertation, Department of Electrical and Computer Engineering, University of Illinois, Urbana IL, May 2000
Abstract

Systematic Compilation for Predicated Execution. (PostScript version, PDF version)
David I. August
Ph.D. dissertation, Department of Electrical and Computer Engineering,
University of Illinois, Urbana IL, Feb. 2000
Abstract
1999 Publications

Condition Awareness Support for Predicate Analysis and Optimization. (PostScript version, PDF version)
John Wollenburg Sias
MS thesis, Department of Electrical and Computer Engineering, University of Illinois, Urbana IL, 1999

Run-Time Cache Bypassing. (PostScript version, PDF version)
T. L. Johnson, D. A. Connors, M. C. Merten, and W. W. Hwu
IEEE Transactions on Computers, Vol. 48, No. 12, December 1999, pp. 1338-1354
Abstract

Compiler-Directed Dynamic Computation Reuse: Rationale and Initial Results. (PostScript version, PDF version)
Daniel A. Connors and Wen-mei W. Hwu
Proceedings of the 32nd International Symposium on Microarchitecture, November, 1999
Abstract

Feedback-Directed Data Cache Optimizations for the x86.
Ronald D. Barnes, Ronnie Chaiken, and David M. Gillies
Proceedings of the 2nd ACM Workshop on Feedback-Directed Optimization, November 1999.

A Framework for Profile-Driven Optimization in the IMPACT Binary Reoptimization System. (PostScript version, PDF version)
Matthew C. Merten
MS thesis, Department of Electrical and Computer Engineering, University of Illinois, Urbana IL, June 1999

A Framework for Install-Time Optimization of Binary Dynamic-Link Libraries. (PostScript version, PDF version)
Christopher N. George
MS thesis, Department of Electrical and Computer Engineering, University of Illinois, Urbana IL, May 1999

A Hardware-Driven Profiling Scheme for Identifying Program Hot Spots to Support Runtime Optimization. (PostScript version, PDF version)
Matthew C. Merten, Andrew R. Trick, Christopher N. George, John C. Gyllenhaal, and Wen-mei W. Hwu
Proceedings of the 26th International Symposium on Computer Architecture, May, 1999, pp. 136-147.
Abstract Presentation Slides (PostScript version, PDF version)

A New Framework for Debugging Globally Optimized Code. (PostScript version, PDF version )
Le-Chun Wu, Rajiv Mirani, Harish Patil, Bruce Olsen and Wen-mei W. Hwu
Proceedings of the 1999 ACM SIGPLAN Conference on Programming Language Design and Implementation, Atlanta, Georgia, May, 1999
Abstract

The Program Decision Logic Approach to Predicated Execution. (PostScript version, PDF version)
David I. August, John W. Sias, Jean-Michel Puiatti, Scott A. Mahlke, Daniel A. Connors, Kevin M. Crozier, and Wen-mei W. Hwu
Proceedings of the 26th International Symposium on Computer Architecture, May, 1999
Abstract

Optimizing Memory Accesses Using Advanced Compile-Time Memory Disambiguation Techniques. (PostScript version)
Ben-Chung Cheng and Daniel A. Connors and Wen-mei W. Hwu
IMPACT Technical Report, IMPACT-99-03, University of Illinois, Urbana, IL 1999

An Empirical Study of Function Pointers Using Spec Benchmarks. (PostScript version, PDF version)
Ben-Chung Cheng and Wen-mei W. Hwu
IMPACT Technical Report, IMPACT-99-02, University of Illinois, Urbana, IL 1999

A Practical Interprocedural Pointer Analysis Framework. (PostScript version, PDF version)
Ben-Chung Cheng and Wen-mei W. Hwu
IMPACT Technical Report, IMPACT-99-01, University of Illinois, Urbana, IL 1999
1998 Publications

A Software-Oriented Floating-Point Format for Enhancing Automotive Control Systems. (PostScript version, PDF version)
Yoji Yamada, Daniel A. Connors, and Wen-mei W. Hwu
Workshop on Compiler and Architecture Support for Embedded Computing Systems (CASES98), December, 1998
Abstract

Compiler-Directed Early Load-Address Generation. (PostScript version, PDF version)
Ben-Chung Cheng, Daniel A. Connors, and Wen-mei W. Hwu
Proceedings of the 31st International Symposium on Microarchitecture, December, 1998
Abstract

Effective Modulo Scheduling for Clustered Architectures. (PostScript version, PDF version)
Erik M. Nystrom and Alexandre E. Eichenberger
31st Annual International Symposium on Microarchitecture December, 1998
Abstract

A Study of the Cache and Branch Performance Issues with Running Java on Current Hardware Platforms. (PostScript version, PDF version)
Cheng-Hsueh A. Hsieh, Marie T. Conte, Teresa L. Johnson, John C. Gyllenhaal and Wen-mei W. Hwu
Proceedings of COMPCON, February 1997, pp. 211-216
Abstract

Improving Static Branch Prediction in a Compiler (PostScript version, PDF version)
Brian L. Deitrich, Ben-Chung Cheng, and Wen-mei W. Hwu
Proceedings of International Parallel Architecture and Compilation Techniques, October 12-18, 1998
Abstract

Optimization and Executable Regeneration in the IMPACT Binary Reoptimization Framework. (PostScript version, PDF version)
Michael S. Thiems
MS thesis, Department of Electrical and Computer Engineering, University of Illinois, Urbana IL, August 1998

Optimization of Machine Descriptions for Efficient Use. (IJPP owns distribution rights, see CatchWord or UnCover for a reprint)
John C. Gyllenhaal, Wen-mei W. Hwu, and B. Ramakrishna Rau
International Journal of Parallel Programming, vol. 26, No. 4, August 1998, pp. 417-447
Abstract

Integrated Predicated and Speculative Execution in the IMPACT EPIC Architecture. (PostScript version, PDF version)
David I. August, Daniel A. Connors, Scott A. Mahlke, John W. Sias, Kevin M. Crozier, Ben-Chung Cheng, Patrick R. Eaton, Qudus B. Olaniran, and Wen-mei W. Hwu
Proceedings of the 25th International Symposium on Computer Architecture, July, 1998
Abstract

New Data-Location Tracking Scheme for the Recovery of Expected Variable Values (PostScript version, PDF version)
Le-Chun Wu and Wen-mei W. Hwu
IMPACT Technical Report, IMPACT-98-07, University of Illinois, Urbana, IL 1998

A New Breakpoint Implementation Scheme for Debugging Globally Optimized Code (PostScript version, PDF version)
Le-Chun Wu and Wen-mei W. Hwu
IMPACT Technical Report, IMPACT-98-06, University of Illinois, Urbana, IL 1998
Abstract

An Overview of the IMPACT X86 Binary Reoptimization Framework (PostScript version, PDF version)
Matthew C. Merten, and Michael S. Thiems
IMPACT Technical Report, IMPACT-98-05, University of Illinois, Urbana, IL 1998

A Novel Breakpoint Implementation Scheme for Debugging Optimized Code (PostScript version, PDF version)
Le-Chun Wu and Wen-mei W. Hwu
IMPACT Technical Report, IMPACT-98-01, University of Illinois, Urbana, IL 1998
Abstract
1997 Publications

A Framework for Balancing Control Flow and Predication (PostScript version, PDF version)
David I. August, Wen-mei W. Hwu, and Scott A. Mahlke
Proceedings of the 30th International Symposium on Microarchitecture, December 1997.
Abstract

Run-time Spatial Locality Detection and Optimization (PostScript version, PDF version)
Teresa L. Johnson, Matthew C. Merten, and Wen-mei W. Hwu,
Proceedings of the 30th International Symposium on Microarchitecture, December 1-3, 1997
Abstract

An Efficient Framework For Performing Execution-Constraint-Sensitive Transformations That Increase Instruction-Level Parallelism ( PostScript version, PDF version)
Appendix A ( PostScript version, PDF version) (not included in above file)
J. Gyllenhaal
PhD thesis, Department of Electrical and Computer Engineering, University of Illinois, Urbana IL, September 1997
Abstract

Optimizing NET Compilers for Improved Java Performance ( Draft (PostScript), Draft (PDF), see IEEE Computer for final version)
Cheng-Hsueh A. Hsieh, Marie T. Conte, Teresa L. Johnson, John C. Gyllenhaal, and Wen-mei W. Hwu,
IEEE Computer, June 1997, pp. 67-75
Abstract

Region-Based Compilation: An Introduction and Motivation (IJPP owns distribution rights, see UnCover for a reprint)
R. E. Hank, and W. W. Hwu, and B. R. Rau
International Journal of Parallel Programming, vol. 25, no. 2, April 1997, pp. 113-146
Abstract

A Study of the Cache and Branch Performance Issues with Running Java on Current Hardware Platforms ( PostScript version, PDF version)
Cheng-Hsueh A. Hsieh, Marie T. Conte, Teresa L. Johnson, John C. Gyllenhaal and Wen-mei W. Hwu,
Proceedings of COMPCON, February 1997, pp. 211-216
Abstract

Architectural Support Compiler-Synthesized Dynamic Branch Prediction Strategies: Rationale and Initial Results (PostScript version, PDF version)
David I. August, Daniel A. Connors, John C. Gyllenhaal, and Wen-mei W. Hwu
The 3rd International Symposium on High-Performance Computer Architecture, Feb. 1997.
Abstract
1996 Publications

Java Bytecode to Native Code Translation: The Caffeine Prototype and Preliminary Results (PostScript version, PDF version)
Cheng-Hsueh A. Hsieh, John C. Gyllenhaal, and Wen-mei W. Hwu
Proceedings of the 29th International Symposium on Microarchitecture, December 1996, pp. 90-99
Abstract

Modulo Scheduling of Loops in Control-Intensive Non-Numeric Programs (PostScript version, PDF version)
D. M. Lavery, and W. W. Hwu
Proceedings of the 29th Annual International Symposium on Microarchitecture, Dec. 1996, pp. 126-141
Abstract

Optimization of Machine Descriptions for Efficient Use (PostScript version, PDF version)
John C. Gyllenhaal, Wen-mei W. Hwu, and B. Ramakrishna Rau
Proceedings of the 29th International Symposium on Microarchitecture, December 1996, pp. 349-358.
Abstract

Speculative Hedge: Regulating Compile-Time Speculation Against Profile Variations (PostScript version, PDF version)
Brian L. Deitrich, Wen-mei W. Hwu
Proceedings of the 29th International Symposium on Microarchitecture, December 2-4, 1996, pp.70-79
Abstract

Region-Based Compilation (PostScript version, PDF version)
R. E. Hank
PhD thesis, Department of Electrical and Computer Engineering, University of Illinois, Urbana IL, May 1996
Abstract

Supporting Predicated Execution: Techniques and Tradeoffs (PostScript version, PDF version)
Jim McCormick
MS thesis, Department of Electrical and Computer Engineering, University of Illinois, Urbana IL, May 1996
Abstract

HMDES Version 2.0 Specification (PostScript version, PDF version)
John C. Gyllenhaal, W. W. Hwu and B. Ramakrishna Rau
IMPACT Technical Report, IMPACT-96-03, University of Illinois, Urbana, IL, 1996.
1995 Publications

Data Relocation and Prefetching for Programs with Large Data Sets (PostScript version, PDF version)
Yoji Yamada
PhD thesis, Department of Computer Science, University of Illinois, Urbana IL, 1995.
Abstract

Compiler Technology for Future Microprocessors (PostScript version, PDF version)
W. W. Hwu, R. E. Hank, D. M. Gallagher, S. A. Mahlke, D. M. Lavery, G. E. Haab, J. C. Gyllenhaal, and D. I. August
Proceedings of the IEEE, Vol. 83, No. 12, December 1995, pp. 1625-1640
Abstract

Region-Based Compilation: An Introduction and Motivation (PostScript version, PDF version)
R. E. Hank, and W. W. Hwu, and B. R. Rau
Proceedings of the 28th Annual International Symposium on Microarchitecture, Dec. 1995, pp. 158-168
Abstract

Unrolling-Based Optimizations for Modulo Scheduling (PostScript version, PDF version)
D. M. Lavery, and W. W. Hwu
Proceedings of the 28th Annual International Symposium on Microarchitecture, Dec. 1995, pp. 327-337
Abstract

A Comparison of Full and Partial Predicated Execution Support for ILP Processors (PostScript version, PDF version)
S. A. Mahlke, R. E. Hank, J.E. McCormick, D. I. August, and W. W. Hwu
Proceedings of the 22nd International Symposium on Computer Architecture, June 1995, pp. 138-150
Abstract

Code Scheduling and Optimization for a Superscalar X86 Microprocessor (PostScript version, PDF version)
Wayne Dugal
MS thesis, Department of Computer Science, University of Illinois, Urbana IL, May, 1995.

Enhancing Instruction Level Parallelism Through Complier-Controlled Speculation (PostScript version, PDF version)
Roger Bringmann
PhD thesis, Department of Computer Science, University of Illinois, Urbana IL, May 1995
Abstract

Performance and Cost Analysis of the Execution Stage of Superscalar Microprocessors (PostScript version, PDF version)
Dimitri C. Argyres
MS thesis, Department of Computer Science, University of Illinois, Urbana IL, May 1995.

Three Architectural Models for Compiler-Controlled Speculative Execution (PostScript version, PDF version)
P. P. Chang, N.J. Warter, S. A. Mahlke, W. Y. Chen, and W. W. Hwu
IEEE Transactions on Computers, Vol. 44, No. 4, April 1995, pp. 481-494
Abstract

The Importance of Prepass Code Scheduling for Superscalar and Superpipelined Processors (PostScript version, PDF version)
P. P. Chang, D. M. Lavery, S. A. Mahlke, W. Y. Chen, and W. W. Hwu
IEEE Transactions on Computers, Vol. 44, No. 3, March 1995, pp. 353-370
Abstract

Sentinel Scheduling with Recovery Blocks (PostScript version, PDF version)
David I. August, Brian L. Dietrich, and Scott A. Mahlke
Technical Report CRHC-95-05, 1995, Center for Reliable and High-Performance Computing, University of Illinois, Urbana, IL, Feb, 1995.
Abstract

Compiler-Assisted Multiple Instruction Retry (PostScript version, PDF version)
C. C. Li, S. K. Chen, W. K. Fuchs, and W. W. Hwu
IEEE Transactions on Computers, Vol.44, No.1, Jan. 1995.
Abstract
1994 Publications

Characterizing the Impact of Predicated Execution on Branch Prediction (PostScript version, PDF version)
S. A. Mahlke, R. E. Hank, R. A. Bringmann, J. C. Gyllenhaal, D. M. Gallagher, and W. W. Hwu
Proceedings of the 27th International Symposium on Microarchitecture, December 1994, pp. 217-227
Abstract

Data Relocation and Prefetching for Large Data Sets (PostScript version, PDF version)
Y. Yamada, J. Gyllenhaal, G. Haab, W. W. Hwu
Proceedings of the 27th Annual ACM/IEEE International Symposium on Microarchitecture, December, 1994, pp. 118-127
Abstract

Dynamic Memory Disambiguation Using the Memory Conflict Buffer (PostScript version, PDF version)
D. M. Gallagher, W. Y. Chen, S. A. Mahlke, J. C. Gyllenhaal, W. W. Hwu
Proceedings of the 6th International Conference on Architecture Support for Programming Languages and Operating Systems, San Jose, California, October, 1994. pp.183-195.
Abstract

A Machine Description Language for Compilation (PostScript version, PDF version)
J. Gyllenhaal
MS thesis, Department of Electrical and Computer Engineering, University of Illinois, Urbana IL, Sept. 1994

Compiler Support for SPARC Architecture Processors (PostScript version, PDF version)
Roland G. Ouellette
MS thesis, Department of Computer Science, University of Illinois, Urbana IL, Sept. 1994

Incremental Compiler Transformations for Multiple Instruction Retry (PostScript version, PDF version)
S. K. Chen, N. J. Alewine, W. K. Fuchs, and W. W. Hwu
Software, Practice & Experience, John Wiley & Sons Ltd., Vol. 24(9), Sept. 1994, pp. 1-20.
Abstract

Modulo Scheduling with Isomorphic Control Transformations (PostScript version, PDF version)
Nancy J. Warter
PhD thesis, Department of Computer Science, University of Illinois, Urbana IL, Sept. 1994
Abstract

The Susceptibility of Programs to Context Switching Effects
W. W. Hwu And T. M. Conte
IEEE Transactions on Computers, Vol. 43, No. 9, Sept. 1994.
Abstract

Profile-Assisted Instruction Scheduling (PostScript version, PDF version)
W. Y. Chen, S. A. Mahlke, N. J. Warter, S. Anik, and W. W. Hwu
International Journal for parallel Programming, Vol. 22, No. 2, April 1994, pp. 151-181
Abstract

Compiler-Assisted Multiple Instruction Rollback Recovery Using A Read Buffer
N. Alewine, S.K. Chen, W.K. Fuchs, and W. W. Hwu
IEEE Transactions on Computers, 1994.

Performance Implications of Synchronization Support for Parallel FORTRAN Programs (PostScript version, PDF version)
S. Anik and W. W. Hwu
Journal of Parallel and Distributed Computing, Vol. 22, 1994, pp. 202-215
Abstract
1993 Publications

Speculative Execution Exception Recovery using Write-back Suppression (PostScript version, PDF version)
Roger A. Bringmann, Scott A. Mahlke, Richard E. Hank, John C. Gyllenhaal, and Wen-mei W. Hwu
Proceedings of the 26th Annual ACM/IEEE Int'l Symposium on Microarchitecture, Austin, Texas, Dec. 1993, pp. 214-223
Abstract

Superblock Formation Using Static Program Analysis (PostScript version, PDF version)
Richard E. Hank, Scott A. Mahlke, Roger A. Bringmann, John C. Gyllenhaal, and Wen-mei W. Hwu
Proceedings of the 26th Annual ACM/IEEE Int'l Symposium on Microarchitecture, Austin, Texas, Dec. 1993, pp. 247-256
Abstract

Sentinel Scheduling: A Model for Compiler-Controlled Speculative Execution
S. A. Mahlke, W. Y. Chen, R. A. Bringmann, R. E. Hank, W. W. Hwu, B. R. Rau, and M. S. Schlansker
ACM Transactions on Computer Systems, Vol. 11, No. 4, Nov. 1993
Abstract

Architectural and Software Support for Executing Numerical Applications on High Performance Computers (PostScript version, PDF version)
Sadun Anik
PhD thesis, Department of Computer Science, University of Illinois, Urbana IL, Sept. 1993, CRHC-93-19
Abstract

Data Preload for Superscalar and VLIW Processors (PostScript version, PDF version)
William Y. Chen
PhD thesis, Department of Computer Science, University of Illinois, Urbana IL, Sept. 1993
Abstract

Register Connection: A New Approach to Adding Registers into Instruction Set Architectures (PostScript version, PDF version)
Tokuzo Kiyohara, Scott Mahlke, William Chen, Roger Bringmann, Richard Hank, Sadun Anik, Wen-mei Hwu
Proceedings of the 20th Annual International Symposium on Computer Architecture, San Diego, CA, May 17-19, 1993, pp. 247-256
Abstract

XPROF: An Execution Profiler for Window-oriented Applications (PostScript version, PDF version)
Aloke Gupta and Wen-mei W. Hwu
Software, Practice & Experience, John Wiley & Sons Ltd., Vol. 23 (5), May 1993, pp. 487-510
Abstract

Performance Aspects of Computers with Graphical User Interfaces (PostScript version, PDF version)
Aloke Gupta
PhD thesis, Department of Computer Science, University of Illinois, Urbana IL, April 1993, CRHC-93-09
Abstract

The Benefit of Predicated Execution for Software Pipelining (PostScript version, PDF version)
Nancy J. Warter, Daniel M. Lavery, and Wen-mei W. Hwu
Proceedings of the 26th Annual Hawaii Int'l Conference on system Sciences, Wailea, Hawaii, Jan. 5-8, 1993, pp 497-506
Abstract

Machine Independent Register Allocation for the IMPACT-I C Compiler (PostScript version, PDF version)
R. E. Hank
MS thesis, Department of Electrical and Computer Engineering, University of Illinois, Urbana IL, 1993
1992 Publications

Code Scheduling for VLIW/Superscalar Processors with Limited Register Files (PostScript version, PDF version)
T. Kiyohara and J. Gyllenhaal
Proceedings of the 25th International Symposium on Microarchitecture, Dec. 1992, pp. 197-201
Abstract

Effective Compiler Support for Predicated Execution Using the Hyperblock (PostScript version, PDF version)
S. A. Mahlke, D. C. Lin, W. Y. Chen, R. E. Hank, and R. A. Bringmann
Proceedings of the 25th International Symposium on Microarchitecture, Dec. 1992, pp. 45-54
Abstract

Efficient Instruction Sequencing with Inline Target Insertion (PostScript version, PDF version)
Wen-mei W. Hwu and Pohua Chang
IEEE Transactions on Computers, Vol. 41, No.12, Dec. 1992, pp. 1537-1551
Abstract

Enhanced Modulo Scheduling for Loops with Conditional Branches (PostScript version, PDF version)
Nancy J. Warter, Grant E. Haab, Krishna Subramanian, and John W. Backhaus
Proceedings of 25th Annual ACM/IEEE Int'l Symposium on Microarchitecture, Dec. 1992 pp. 170-179
Abstract

Compiler Code Transformations for Superscalar-Based High-Performance Systems (PostScript version, PDF version)
Scott A. Mahlke, William Y. Chen, John C. Gyllenhaal, Wen-mei W. Hwu, P. P. Chang, and T.Kiyohara
Proceedings of Supercomputing 1992, Minneapolis, Minnesota, Nov. 16-20, 1992, pp. 808-817
Abstract

Sentinel Scheduling for VLIW and Superscalar Processors (PostScript version, PDF version)
Scott A. Mahlke, William Y. Chen, Wen-mei W. Hwu, B. Ramakrishna Rau, and Michael S. Schlansker
Proceedings of the Fifth Int'l Conference on Architecture Support for Programming Languages and Operating Systems (ASPLOS-V), Boston, MA, Oct. 12-15, 1992, pp.238-247
Abstract

A Template for Code Generator Development Using the IMPACT-I C Compile (PostScript version, PDF version)
Roger Bringmann
MS thesis, Department of Computer Science, University of Illinois, Urbana IL, Sept. 1992

Design and Implementation of a Portable Global Code Optimizer (PostScript version, PDF version)
Scott A. Mahlke
MS thesis, Department of Computer Science, University of Illinois, Urbana IL, Sept. 1992

Evaluation of Some Superscalar and VLIW Processor Designs (PostScript version, PDF version)
John Holm
MS thesis, Department of Computer Science, University of Illinois, Urbana IL, Sept. 1992

Systematic Computer Architecture Prototyping (PostScript version, PDF version)
Thomas M. Conte
PhD thesis, Department of Computer Science, University of Illinois, Urbana IL, Sept. 1992
Abstract

Executing Nested Parallel Loops On Shared-Memory Multiprocessors (PostScript version, PDF version)
Sadun. Anik and Wen-mei Hwu
Proceedings of the 21st Annual Int'l Conference on Parallel Processing, St. Charles, IL, Aug. 1992, pp.(III) 241-244
Abstract

Tolerating First Level Memory Access Latency In High-Performance Systems (PostScript version, PDF version)
William Chen, Scott Mahlke, and Wen-mei Hwu
Proceedings of the 21st Annual Int'l Conference on Parallel Processing, St Charles, IL, Aug. 1992, pp.(I) 36-43
Abstract

Branch Recovery with Compiler-Assisted Multiple Instruction Retry
N. J. Alewine, S. K. Chen, C. C. Li, W. K. Fuchs, and W. W. Hwu
Proceedings of the 22nd Annual International Symposium on Fault-Tolerant Computing, Boston, MA, July 8-10, 1992, pp. 66-73.

Tolerating Data Access Latency with Register Preloading (PostScript version, PDF version)
William Y. Chen, Scott A. Mahlke, and Wen-mei W. Hwu
Proceedings of the 1992 Int'l Conf. on Supercomputing, Washington D.C., July, 1992, pp. 104-113
Abstract

Profile-Guided Automatic Inline Expansion for C Programs (PostScript version, PDF version)
Pohua P. Chang, Scott A. Mahlke, William Y. Chen, and Wen-mei W. Hwu
Software Practice and Experience, May 1992, Vol. 22, No. 5, pp. 349-369
Abstract

An Execution Profiler for Window-Oriented Applications (PostScript version, PDF version)
Aloke Gupta and Wen-Mei W. Hwu
Coordinated Science Lab, University of Illinois, Urbana, IL, Technical Report CRHC-92-02, 1992.
Abstract

Scalar Program Performance on Multiple-Instruction-Issue Processors with a Limited Number of Registers (PostScript version, PDF version)
Scott A. Mahlke, William Y. Chen, Pohua P. Chang, and Wen-mei W. Hwu
Proceedings of the 25th Annual Hawaii Int'l Conference on System Sciences, Jan. 6-9, 1992, pp. 34-44
Abstract
1991 Publications

Three Superblock Scheduling Models for Superscalar and Superpipelined Processors (PostScript version, PDF version)
Pohua P. Chang, Nancy J. Warter, Scott Mahlke, William Y. Chen, and Wen-mei W. Hwu
Technical Report CRHC-91-29, Center for Reliable and High-Performance Computing, University of Illinois, Urbana, IL, Dec. 1991
Abstract

Using Profile Information to Assist Classic Compiler Code Optimizations (PostScript version, PDF version)
Pohua P. Chang, Scott A. Mahlke, and Wen-mei W. Hwu
Software Practice and Experience, Dec. 1991, Vol. 21, No. 12, pp. 1301-1321
Abstract

Comparing Static And Dynamic Code Scheduling for Multiple-Instruction-Issue Processors (PostScript version, PDF version)
Pohua P. Chang, William Y. Chen, Scott A. Mahlke, and Wen-mei W. Hwu
Proceedings of the 24th Annual ACM/IEEE Int'l Symposium on Microarchitecture, Albuquerque, New Mexico, Nov. 18-20,1991, pp. 69-73
Abstract

Data Access Microarchitectures for Superscalar Processor with Compiler-Assisted Data Prefetching (PostScript version, PDF version)
William Y. Chen, Scott A. Mahlke, Pohua P. Chang, and Wen-mei W. Hwu
Proceedings of the 24th Annual ACM/IEEE Int'l Symposium on Microarchitecture, Albuquerque, New Mexico, Nov. 1991, pp. 69-73
Abstract

An Optimizing Compiler Code Generator: A platform for RISC Performance Analysis (PostScript version, PDF version)
William Y. Chen
MS thesis, Department of Computer Science, University of Illinois, Urbana IL, Sept. 1991

The Effect of Compiler Optimizations On Available Parallelism In Scalar Programs (PostScript version, PDF version)
Scott A. Mahlke, Nancy J. Warter, William Y. Chen, Pohua P. Chang, and Wen-mei W. Hwu
Proceedings of the 20th Annual Int'l Conference on Parallel Processing, St. Charles, IL, Aug. 12-16, 1991, pp. 142-145
Abstract

Performance Implications of Synchronization Support for Parallel Fortran Programs (PostScript version, PDF version)
Sadun Anik, and Wen-mei W. Hwu
Technical Report CRHC-91-21, Center for Reliable and High-Performance Computing, University of Illinois, Urbana, IL, Jun. 1991
Abstract

IMPACT: An Architectural Framework for Multiple-Instruction-Issue Processors (PostScript version, PDF version)
Pohua P. Chang, Scott A. Mahlke, William Y. Chen, Nancy J. Water, and Wen-mei W. Hwu
Proceedings of the 18th Annual Int'l Symposium on Computer Architecture, Toronto, Canada, May 28, 1991, pp. 266-275
Abstract

The Effect of Code Expanding Optimizations of Instruction Cache Design (PostScript version, PDF version)
William Y. Chen, Pohua Chang, Thomas M. Conte, and Wen-mei W. Hwu
Technical Report CRHC-91-17, Center for Reliable and High-Performance, university of Illinois, Urbana, IL, May 1991
Abstract

Benchmark Characterization
T.M. Conte and W. W. Hwu
Proceedings of the 24th Annual Hawaii International Conference on System Sciences, Jan. 8-11, 1991, pp. 364-372.
1990 Publications

A Multiported Nonblocking Cache for a Superscalar Uniprocessor (PostScript version, PDF version)
James Sicolo
MS thesis, Department of Computer Science, University of Illinois, Urbana IL, Sept. 1990

Compiler Support for Predicated Execution in Superscalar Processors (PostScript version, PDF version)
David Lin
MS thesis, Department of Computer Science, University of Illinois, Urbana IL, Sept. 1990
1989 Publications

Comparing Software and Hardware Schemes For Reducing the Cost of Branches (PostScript version, PDF version)
Wen-mei W. Hwu, Thomas M. Conte, and Pohua Chang
Proceedings of the 16th Annual International Symposium on Computer Architecture, Jerusalem, Israel, May 28- June 1, 1989, pp. 224-233.
Abstract

Control Flow Optimization for Supercomputer Scalar Processing (PostScript version, PDF version)
P. P. Chang and W. W. Hwu
Proceedings of the 1989 Int'l Conf. on Supercomputing, Crete, Greece, Jun. 5-9, 1989
Abstract

Maintained by John Stratton (stratton at crhc dot uiuc dot edu)