The Research Group
|
|
Illinois Microarchitecture Project utilizing Advanced Compiler Technology
|
| [2008]
[2007]
[2006]
[2005]
[2004]
[2003]
[2002]
[2001]
[2000]
[1999]
[1998]
[1997]
[1996]
[1995]
[1994]
[1993]
[1992]
[1991]
[1990]
[1989] |
|
|
|
|
|
|
2008 Publications
|
-
Analyses for Extensive Parallelization of Video Applications in C
(PS)
Shane Ryoo and Wen-mei W. Hwu
IMPACT Technical Report, IMPACT-08-02, University of Illinois at
Urbana-Champaign, Urbana, IL, June 2008.
-
CUBA: An Architecture for Efficient CPU/Co-processor Data Communication
Isaac Gelado, John H. Kelm, Shane Ryoo, Nacho
Navarro, Steve S. Lumetta, and Wen-mei W. Hwu
Proceedings of the 22nd ACM International Conference on
Supercomputing, June 2008.
-
GPU Acceleration of Cutoff Pair Potential for Molecular Modeling Applications
(PS |
PDF)
(Posted versions copyright ACM 2008. For personal use only.)
Christopher I. Rodrigues, David J. Hardy, John E. Stone, Klaus Schulten, and Wen-Mei W. Hwu
Proceedings of the 2008 International Conference on Computing Frontiers, May 2008.
-
Program Optimization Strategies for Data-Parallel Many-Core Processors
(PS |
PDF)
Shane Ryoo
PhD Dissertation, Department of Electrical and Computer Engineering,
University of Illinois, Urbana, IL, 2008.
-
Program Optimization Space Pruning for a Multithreaded GPU
(PS
|
PDF |
PDF slides)
(Posted versions copyright ACM 2008. For personal use only.)
Shane Ryoo, Christopher I. Rodrigues, Sam S. Stone,
Sara S. Baghsorkhi, Sain-Zee Ueng, John A. Stratton, and Wen-mei W. Hwu
Proceedings of the 2008 International Symposium on Code
Generation and Optimization, April 2008.
-
MCUDA: An Efficient Implementation of CUDA Kernels on Multi-cores
(PS |
PDF)
John A. Stratton, Sam S. Stone, Wen-mei W. Hwu
IMPACT Technical Report, IMPACT-08-01, University of Illinois, Urbana, IL,
2008.
-
Optimization Principles and Application Performance Evaluation of a
Multithreaded GPU Using CUDA
(PS
|
PDF |
PDF slides)
(Posted versions copyright ACM 2008. For personal use only.)
Shane Ryoo, Christopher I. Rodrigues, Sara S. Baghsorkhi, Sam S. Stone,
David B. Kirk (NVIDIA), and Wen-mei W. Hwu
Proceedings of the 13th ACM SIGPLAN Symposium on
Principles and Practice of Parallel Programming, February 2008.
| |
2007 Publications
|
-
Iteration Disambiguation for Parallelism Identification in Time-Sliced
Applications
(PS |
PDF |
PDF slides)
Shane Ryoo, Christopher I. Rodrigues, and Wen-mei W. Hwu
The 20th International Workshop on Languages and Compilers
for Parallel Computing, October 2007.
-
How GPUs Can Improve the Quality of Magnetic Resonance Imaging
(PDF (with minor corrections) |
PDF slides)
Sam S. Stone, Haoran Yi, Wen-mei W. Hwu, Justin P. Haldar, Bradley P. Sutton, and Zhi-Pei Liang
The First Workshop on General Purpose Processing on Graphics Processing Units, October 2007.
-
Program Optimization Study on a 128-Core GPU
(PS |
PDF |
PDF
slides)
Shane Ryoo, Christopher I. Rodrigues, Sam S. Stone, Sara S. Baghsorkhi, Sain-Zee
Ueng, and Wen-mei W. Hwu
The First Workshop on General Purpose Processing on Graphics
Processing Units, October 2007.
-
Performance Insights on Executing Non-Graphics Applications on CUDA on the
NVIDIA GeForce 8800 GTX
(PDF slides)
Wen-mei Hwu, David Kirk (NVIDIA), Shane Ryoo, Christopher Rodrigues, John
Stratton, and Kuangwei Hwang
Presentation at Hot Chips 19, August 2007.
-
Implicit Parallel Programming Models for Thousand-Core Microprocessors
(PDF)
Wen-mei Hwu, Shane Ryoo, Sain-Zee Ueng,
John H. Kelm, Isaac Gelado, Sam S. Stone, Robert E. Kidd, Sara S.
Baghsorkhi, Aqeel A. Mahesri, Stephanie C. Tsao, Nacho Navarro, Steve S.
Lumetta, Matthew I. Frank, and Sanjay J. Patel
Proceedings of the 44th Annual Design Automation Conference, June 2007.
-
Automatic Discovery of Coarse-Grained Parallelism in Media Applications.
Shane Ryoo, Sain-Zee Ueng, Christopher I. Rodrigues, Robert E. Kidd,
Matthew I. Frank, and Wen-mei W. Hwu
Transactions on HiPEAC I, LNCS 2050, pp. 194-213, 2007.
-
Dynamic Tracking of Information-Flow Signatures for Security Checking
William Healey, Karthik Pattabiraman, Shane Ryoo, Ravishanker Iyer and Wen-mei
W. Hwu
Technical Report UILU-ENG-02-2002, University of
Illinois at Urbana-Champaign, January 2007.
| |
2006 Publications
|
-
Improved Superblock Optimization in GCC. (PDF
version
Talk)
Robert Kidd and Wen-mei W. Hwu
Proceedings of the GCC Developer's Summit, pp. 85-96, June 2006.
-
P3DE: Profile-Directed Predicated Partial Dead Code Elimination.
(PowerPoint Slides)
Shane Ryoo, Sain-Zee Ueng, and Wen-mei W. Hwu
The 5th Workshop on EPIC Architectures and Compiler Technology, March
2006.
-
Tolerating Cache-Miss Latency With Multipass Pipelines.
Ronald D. Barnes, Shane Ryoo, and Wen-mei W. Hwu
IEEE Micro, Vol. 26, No. 1, January-February 2006.
| |
2005 Publications
|
-
"Flea-flicker" Multipass Pipelining: An Alternative to the High-Power
Out-of-Order Offense.
(PS
PDF Version)
Ronald D. Barnes, Shane Ryoo, and Wen-mei W. Hwu
Proceedings of the 38th Annual IEEE/ACM International Symposium on
Microarchitecture, November 2005.
-
Fulcra Pointer Analysis Framework.
(PDF
version)
Erik M. Nystrom
PhD thesis, Department of Electrical and Computer Engineering, University
of Illinois, Urbana IL, 2005.
-
A Systematic Approach to Delivering Instruction-Level Parallelism in
EPIC Systems.
(PDF
version)
John W. Sias
PhD. Dissertation, Department of Electrical and Computer Engineering,
University of Illinois, Urbana IL, 2005. Abstract
-
Multiple-Pass Pipelining: Enhancing in-order Microarchitectures to
Out-Of-Order Performance.
(PDF
version)
Ronald D. Barnes, JR.
PhD thesis, Department of Electrical and Computer Engineering, University
of Illinois, Urbana IL, 2005
| | 2004 Publications
|
-
Matching On-Chip Data Storage To Telecommunication And Media Application
Properties.
(PDF
version)
Hillery C. Hunter
PhD thesis, Department of Electrical and Computer Engineering, University of
Illinois, Urbana IL, 2004.
-
Partial Code Elimination in the IMPACT Compiler Framework.
(PS
PDF
Version)
Shane Ryoo
MS Thesis, Department of Electrical and Computer Engineering, University
of Illinois, Urbana, IL, 2004.
-
Template Bundling for EPIC Architectures.
(PS
PDF Version)
Sain-Zee Ueng
MS thesis, Department of Electrical and Computer Engineering, University of Illinois at
Urbana-Champaign, 2004.
-
Applying Scalable Interprocedural Pointer Analysis to Embedded
Applications.
Hillery C. Hunter, Erik M. Nystrom, Shane Ryoo, and Wen-mei W. Hwu
Workshop on Compilers and Tools for Constrained Embedded Systems, September 2004.
-
Bottom-up and Top-down Context-Sensitive Summary-based Pointer
Analysis.
(PostScript
version, PDF
version)
E. M. Nystrom, H. -S. Kim, W. -M. Hwu.
The Proceedings of the 11th Static Analysis Symposium, August
2004.
Abstract
-
Field-testing IMPACT EPIC Research Results in Itanium 2.
(PostScript
version
PDF
version)
John W. Sias, Sain-zee Ueng, Geoff A. Kent, Ian M. Steiner, Erik M. Nystrom and
Wen-mei W. Hwu
Proceedings of the 31st Annual International Symposium on Computer Architecture, pp.
26-37, July 2004.
Abstract
-
Extracting Data Flow Model from von Neumann Program for Synthesis(PS
version)
Chien-Wei Li, Hong-Seok Kim and Wen-mei W.
Proceedings of the 13th International Workshop on Logic and Synthesis,
June 2004.
-
Exploiting Load Flexibility for Embedded Power Savings.
Hillery C. Hunter, Shane Ryoo, James Player, Daniel A. Connors, and Wen-mei W. Hwu.
IMPACT Technical Report, IMPACT-04-01, University of Illinois, at Urbana-Champaign,
June 2004.
-
Importance of Heap Specialization in Pointer Analysis.
(PostScript
version, PDF
version)
E. M. Nystrom, H. -S. Kim, W. -M. Hwu.
The Proceedings of Program Analysis for Software Tools and Engineering, June
2004.
Abstract
|
| 2003 Publications |
-
A Dynamic Application Analysis Framework.
(PDF
version)
Marie Therese Conte
PhD thesis, Department of Electrical and Computer Engineering, University of
Illinois, at Urbana-Champaign, 2003.
-
A New Look at Exploiting Data Parallelism in Embedded Systems.
Hillery C. Hunter and Jaime H. Moreno
Proceedings of the International Conference on Compilers, Architecture, and Synthesis
for Embedded Systems, October 2003.
-
Memory Profiling: Expanding the 3G Developer's Bag of Tricks.
Hillery C. Hunter and Wen-mei W. Hwu
Workshop on Compilers and Tools for Constrained Embedded Systems, October 2003.
-
Motivating use of Memory Profiling in the 3G Domain.
(SRC account required)
Hillery C. Hunter, Chien-Wei Li and Wen-mei W. Hwu
Proceedings of the SRC TECHCON 2003, August 2003.
-
An Innovative Low-Power High-Performance Programmable Signal
Processor for Digital Communications.
(Available
through IBM JRD)
J. Moreno, V. Zyuban, U. Shvadron, F. Nesser, J. Derby, M. Ware, K. Kailas, A. Zaks,
A. Geva, S. Ben-David, S. Asaad, T. Fox, D. Littrell, M. Biberstein, D. Naishlos, and
H. Hunter
IBM Journal of Research and Development, vol. 47, no 2/3, March/May 2003.
-
Compaction algorithm for precise modular context-sensitive pointer
analysis.
(PostScript
version, PDF
version)
H. -S. Kim, E. M. Nystrom, R. D. Barnes, W. -M. Hwu.
IMPACT Technical Report, IMPACT-03-03, University of Illinois, Urbana, IL,
2003.
Abstract
-
Scalable, precise context-sensitive top-down process for modular
points-to analysis.
(PostScript
version, PDF
version)
E. M. Nystrom, H. -S. Kim, W. -M. Hwu.
IMPACT Technical Report, IMPACT-03-03, University of Illinois, Urbana, IL,
2003.
Abstract
|
| 2002 Publications |
-
Vacuum Packing: Extracting Hardware-Detected Program Phases for
Post-link Optimization.
(PostScript
version, PDF
version)
Ronald D. Barnes, Erik M. Nystrom, Matthew C. Merten, and Wen-mei
W. Hwu
Proceedings of the 35th International Symposium on Microarchitecture,
November
2002.
Abstract
-
Code Coverage and Input Variability: Effects on Architecture and
Compiler Research.
(PostScript
version, PDF
version)
Hillery C. Hunter and Wen-mei W. Hwu
Proceedings of the International Conference on Compilers,
Architecture, and Synthesis for Embedded Systems, October, 2002.
Abstract
-
The IMPACT SC140 Code Generator.
(PostScript
version, PDF
version)
Christopher J. Shannon
MS thesis, Department of Electrical and Computer Engineering, University of
Illinois, Urbana IL, April 2002
-
Extracting Hardware-Detected Program Phases for Post-Link
Optimization.
Ronald D. Barnes
Master's thesis, Department of Electrical and Computer Engineering,
University of Illinois, Urbana IL, 2002.
|
| 2001 Publications |
-
Enhancing Loop Buffering of Media and Telecommunications Applications
Using Low-overhead Predication.
(PostScript
version, PDF
version)
John W. Sias, Hillery C. Hunter and Wen-mei W. Hwu
Proceedings of the 34th International Symposium on Microarchitecture,
December,
2001.
Abstract
-
Modulo Schedule Buffers.
(PostScript
version, PDF
version)
Matthew C. Merten and Wen-mei W. Hwu
Proceedings of the 34th International Symposium on Microarchitecture,
December,
2001
Abstract Presentation Slides (PostScript
version, PDF
version)
-
Program Decision Logic Optimization Using Predication and Control
Speculation (not available online at this time)
Wen-mei W. Hwu, David I. August and John W. Sias
Proceedings of the IEEE, November, 2001, pp. 1660-1675.
-
Itanium Performance Insights.
(PDF slides
PS Version
handouts )
Wen-mei W. Hwu, John W. Sias, Matthew C. Merten, Erik M. Nystrom, Ronald D. Barnes,
Christopher J. Shannon, Shane Ryoo and Jeff V. Olivier
Presentation at Microprocessor Forum, October 2001.
-
Code Reordering and Speculation Support for Dynamic Optimization
Systems.
(PostScript
version, PDF
version)
Erik M. Nystrom, Ronald D. Barnes, Matthew C. Merten, and Wen-mei W.
Hwu
Proceedings of the International Conference on Parallel Architectures and Compilation
Techniques, September 8-12, 2001.
Abstract
-
Itanium Performance Insights from the IMPACT Compiler.
(Slides PDF
version)(Handouts PostScript
version, PDF
version)
John W. Sias, Matthew C. Merten, Erik M. Nystrom, Ronald D. Barnes, Christopher
J. Shannon, Joe D. Matarazzo, Shane Ryoo, Jeff V. Olivier, and Wen-mei W. Hwu
Presentation at Hot Chips 13, August 2001.
-
Characterization of Repeating Data Access Patterns in Integer
Benchmarks.
(PostScript
version, PDF
version)
Erik M. Nystrom, Roy Dz-ching Ju, and Wen-mei W. Hwu
Memory Performance Issues Workshop at the 28th International Symposium on Computer
Architecture, July 2001.
Abstract
-
An Architectural Framework for Run-Time Optimization.
(PostScript
version, PDF
version)
Matthew C. Merten, Andrew R. Trick, Ronald D. Barnes, Erik M. Nystrom, Christopher
N. George, John C. Gyllenhaal, and Wen-mei W. Hwu
IEEE Transactions on Computers, Vol. 50, No. 6, June 2001, pp. 567-589.
Abstract
|
| 2000 Publications |
-
Accurate and Efficient Predicate Analysis with Binary Decision
Diagrams.
(PostScript
version, PDF
version)
John W. Sias, David I. August, and Wen-mei W. Hwu
Proceedings of the 33rd International Symposium on Microarchitecture,
December, 2000
Abstract
-
Hardware Support for Dynamic Activation of Compiler-Directed
Computation Reuse.
(PostScript
version, PDF
version)
Daniel. A. Connors, Hillery C. Hunter, Ben-Chung Cheng, and
Wen-mei W. Hwu
Proceedings of the 9th International Conference on Architecture Support for
Programming Languages and Operating Systems, November 2000.
Abstract
-
A Hardware Mechanism for Dynamic Extraction and Relayout of Program
Hot Spots.
(PostScript
version, PDF
version)
Matthew C. Merten, Andrew R. Trick, Erik M. Nystrom, Ronald D. Barnes, and Wen-mei
W. Hwu
Proceedings of the 27th International Symposium on Computer Architecture, June
2000, pp. 59-70.
Abstract
Presentation Slides (PostScript
version, PDF
version)
-
Modular Interprocedural Pointer Analysis Using Access Paths: Design,
Implementation, and Evaluation.
(PostScript
version, PDF
version)
Ben-Chung Cheng, and Wen-mei W. Hwu
Proceedings of the 2000 ACM SIGPLAN Conference on Programming Language Design
and Implementation, Vancouver, British Columbia, Canada, June, 2000
-
Compile-Time Memory Disambiguation for C Programs.
(PostScript
version, PDF
version)
Ben-Chung Cheng
PhD thesis, Department of Computer Science, University of Illinois,
Urbana, IL, May 2000
-
Eliminating Dynamic Computation Redundancy (
PostScript version, PDF
version)
D. A. Connors
Ph.D. dissertation, Department of Electrical and Computer Engineering, University
of Illinois,
Urbana IL, May 2000
Abstract
-
Systematic Compilation for Predicated Execution.
(PostScript
version, PDF
version)
David I. August
Ph.D. dissertation, Department of Electrical and Computer Engineering,
University of Illinois, Urbana IL, Feb. 2000
Abstract
|
| 1999 Publications |
-
Condition Awareness Support for Predicate Analysis and
Optimization.
(PostScript
version, PDF
version)
John Wollenburg Sias
MS thesis, Department of Electrical and Computer Engineering, University of
Illinois, Urbana IL, 1999
-
Run-Time Cache Bypassing.
(PostScript
version,
PDF version)
T. L. Johnson, D. A. Connors, M. C. Merten, and W. W. Hwu
IEEE Transactions on Computers, Vol. 48, No. 12, December 1999, pp. 1338-1354
Abstract
-
Compiler-Directed Dynamic Computation Reuse: Rationale and Initial
Results.
(PostScript
version, PDF
version)
Daniel A. Connors and Wen-mei W. Hwu
Proceedings of the 32nd International Symposium on Microarchitecture, November,
1999
Abstract
-
Feedback-Directed Data Cache Optimizations for the x86.
Ronald D. Barnes, Ronnie Chaiken, and David M. Gillies
Proceedings of the 2nd ACM Workshop on Feedback-Directed Optimization, November
1999.
-
A Framework for Profile-Driven Optimization in the IMPACT Binary
Reoptimization System.
(PostScript
version, PDF
version)
Matthew C. Merten
MS thesis, Department of Electrical and Computer Engineering, University of Illinois,
Urbana IL, June 1999
-
A Framework for Install-Time Optimization of Binary Dynamic-Link
Libraries.
(PostScript
version,
PDF version)
Christopher N. George
MS thesis, Department of Electrical and Computer Engineering, University of
Illinois, Urbana IL, May 1999
-
A Hardware-Driven Profiling Scheme for Identifying Program Hot Spots to
Support Runtime Optimization.
(PostScript
version, PDF
version)
Matthew C. Merten, Andrew R. Trick, Christopher N. George,
John C. Gyllenhaal, and Wen-mei W. Hwu
Proceedings of the 26th International Symposium on Computer Architecture, May, 1999,
pp. 136-147.
Abstract
Presentation Slides (PostScript
version, PDF
version)
-
A New Framework for Debugging Globally Optimized Code.
(PostScript
version, PDF
version
)
Le-Chun Wu, Rajiv Mirani, Harish Patil, Bruce Olsen and Wen-mei W. Hwu
Proceedings of the 1999 ACM SIGPLAN Conference on Programming Language Design
and Implementation, Atlanta, Georgia, May, 1999
Abstract
-
The Program Decision Logic Approach to Predicated Execution.
(PostScript version, PDF
version)
David I. August, John W. Sias, Jean-Michel Puiatti, Scott A. Mahlke, Daniel A.
Connors, Kevin M. Crozier, and Wen-mei W. Hwu
Proceedings of the 26th International Symposium on Computer Architecture, May,
1999
Abstract
-
Optimizing Memory Accesses Using Advanced Compile-Time Memory
Disambiguation Techniques.
(PostScript
version)
Ben-Chung Cheng and Daniel A. Connors and Wen-mei W. Hwu
IMPACT Technical Report, IMPACT-99-03, University of Illinois, Urbana, IL 1999
-
An Empirical Study of Function Pointers Using Spec Benchmarks.
(PostScript
version, PDF
version)
Ben-Chung Cheng and Wen-mei W. Hwu
IMPACT Technical Report, IMPACT-99-02, University of Illinois, Urbana, IL 1999
-
A Practical Interprocedural Pointer Analysis Framework.
(PostScript
version, PDF
version)
Ben-Chung Cheng and Wen-mei W. Hwu
IMPACT Technical Report, IMPACT-99-01, University of Illinois, Urbana, IL 1999
|
| 1998 Publications |
-
A Software-Oriented Floating-Point Format for Enhancing Automotive
Control Systems.
(PostScript
version,
PDF version)
Yoji Yamada, Daniel A. Connors, and Wen-mei W. Hwu
Workshop on Compiler and Architecture Support for Embedded Computing Systems
(CASES98), December, 1998
Abstract
-
Compiler-Directed Early Load-Address Generation.
(PostScript
version, PDF
version)
Ben-Chung Cheng, Daniel A. Connors, and Wen-mei W. Hwu
Proceedings of the 31st International Symposium on Microarchitecture, December,
1998
Abstract
-
Effective Modulo Scheduling for Clustered Architectures.
(PostScript
version, PDF
version)
Erik M. Nystrom and Alexandre E. Eichenberger
31st Annual International Symposium on Microarchitecture December, 1998
Abstract
-
A Study of the Cache and Branch Performance Issues with Running Java
on Current Hardware Platforms.
(PostScript
version, PDF
version)
Cheng-Hsueh A. Hsieh, Marie T. Conte, Teresa L. Johnson, John C. Gyllenhaal
and Wen-mei W. Hwu
Proceedings of COMPCON, February 1997, pp. 211-216
Abstract
-
Improving Static Branch Prediction in a Compiler
(PostScript
version,
PDF version)
Brian L. Deitrich, Ben-Chung Cheng, and Wen-mei W. Hwu
Proceedings of International Parallel Architecture and Compilation Techniques,
October 12-18, 1998
Abstract
-
Optimization and Executable Regeneration in the IMPACT Binary
Reoptimization Framework.
(PostScript
version, PDF
version)
Michael S. Thiems
MS thesis, Department of Electrical and Computer Engineering, University of
Illinois, Urbana IL, August 1998
-
Optimization of Machine Descriptions for Efficient Use.
(IJPP owns distribution rights, see CatchWord or
UnCover for a reprint)
John C. Gyllenhaal, Wen-mei W. Hwu, and B. Ramakrishna Rau
International Journal of Parallel Programming, vol. 26, No. 4,
August 1998, pp. 417-447
Abstract
-
Integrated Predicated and Speculative Execution in the IMPACT EPIC
Architecture.
(PostScript
version, PDF
version)
David I. August, Daniel A. Connors, Scott A. Mahlke, John W. Sias,
Kevin M. Crozier, Ben-Chung Cheng, Patrick R. Eaton,
Qudus B. Olaniran, and Wen-mei W. Hwu
Proceedings of the 25th International Symposium on Computer Architecture, July,
1998
Abstract
-
New Data-Location Tracking Scheme for the Recovery of Expected
Variable Values
(PostScript
version, PDF
version)
Le-Chun Wu and Wen-mei W. Hwu
IMPACT Technical Report, IMPACT-98-07, University of Illinois, Urbana, IL 1998
-
A New Breakpoint Implementation Scheme for Debugging Globally Optimized Code
(PostScript
version, PDF
version)
Le-Chun Wu and Wen-mei W. Hwu
IMPACT Technical Report, IMPACT-98-06, University of Illinois, Urbana, IL 1998
Abstract
-
An Overview of the IMPACT X86 Binary Reoptimization Framework (PostScript
version, PDF
version)
Matthew C. Merten, and Michael S. Thiems
IMPACT Technical Report, IMPACT-98-05, University of Illinois, Urbana, IL 1998
-
A Novel Breakpoint Implementation Scheme for Debugging Optimized Code (PostScript
version, PDF
version)
Le-Chun Wu and Wen-mei W. Hwu
IMPACT Technical Report, IMPACT-98-01, University of Illinois, Urbana, IL 1998
Abstract
|
| 1997 Publications |
-
A Framework for Balancing Control Flow and Predication (PostScript
version,
PDF version)
David I. August, Wen-mei W. Hwu, and Scott A. Mahlke
Proceedings of the 30th International Symposium on Microarchitecture,
December 1997.
Abstract
-
Run-time Spatial Locality Detection and Optimization (PostScript
version,
PDF version)
Teresa L. Johnson, Matthew C. Merten, and Wen-mei W. Hwu,
Proceedings of the 30th International Symposium on Microarchitecture,
December 1-3, 1997
Abstract
-
An Efficient Framework For Performing Execution-Constraint-Sensitive
Transformations That Increase Instruction-Level Parallelism (
PostScript version,
PDF version)
Appendix A (
PostScript version,
PDF version) (not included in above file)
J. Gyllenhaal
PhD thesis, Department of Electrical and Computer Engineering,
University of Illinois, Urbana IL, September 1997
Abstract
-
Optimizing NET Compilers for Improved Java Performance
(
Draft (PostScript),
Draft (PDF), see IEEE Computer for final version)
Cheng-Hsueh A. Hsieh, Marie T. Conte, Teresa L. Johnson,
John C. Gyllenhaal, and Wen-mei W. Hwu,
IEEE Computer, June 1997, pp. 67-75
Abstract
-
Region-Based Compilation: An Introduction and Motivation
(IJPP owns distribution rights, see UnCover for a reprint)
R. E. Hank, and W. W. Hwu, and B. R. Rau
International Journal of Parallel Programming, vol. 25, no. 2,
April 1997, pp. 113-146
Abstract
-
A Study of the Cache and Branch Performance Issues with Running Java on
Current Hardware Platforms (
PostScript version,
PDF version)
Cheng-Hsueh A. Hsieh, Marie T. Conte, Teresa L. Johnson, John C.
Gyllenhaal
and Wen-mei W. Hwu,
Proceedings of COMPCON, February 1997, pp. 211-216
Abstract
-
Architectural Support Compiler-Synthesized Dynamic Branch Prediction
Strategies: Rationale and Initial Results (PostScript
version,
PDF version)
David I. August, Daniel A. Connors, John C. Gyllenhaal, and Wen-mei W.
Hwu
The 3rd International Symposium on High-Performance Computer
Architecture, Feb. 1997.
Abstract
|
| 1996 Publications |
-
Java Bytecode to Native Code Translation: The Caffeine Prototype and
Preliminary Results (PostScript
version, PDF
version)
Cheng-Hsueh A. Hsieh, John C. Gyllenhaal, and Wen-mei W. Hwu
Proceedings of the 29th International Symposium on Microarchitecture,
December 1996, pp. 90-99
Abstract
-
Modulo Scheduling of Loops in Control-Intensive Non-Numeric Programs (PostScript
version, PDF
version)
D. M. Lavery, and W. W. Hwu
Proceedings of the 29th Annual International Symposium on Microarchitecture,
Dec. 1996, pp. 126-141
Abstract
-
Optimization of Machine Descriptions for Efficient Use (PostScript
version,
PDF version)
John C. Gyllenhaal, Wen-mei W. Hwu, and B. Ramakrishna Rau
Proceedings of the 29th International Symposium on Microarchitecture,
December 1996, pp. 349-358.
Abstract
-
Speculative Hedge: Regulating Compile-Time Speculation Against Profile
Variations (PostScript
version, PDF
version)
Brian L. Deitrich, Wen-mei W. Hwu
Proceedings of the 29th International Symposium on Microarchitecture, December
2-4, 1996, pp.70-79
Abstract
-
Region-Based Compilation (PostScript
version, PDF
version)
R. E. Hank
PhD thesis, Department of Electrical and Computer Engineering, University of
Illinois, Urbana IL, May 1996
Abstract
-
Supporting Predicated Execution: Techniques and Tradeoffs (PostScript
version, PDF
version)
Jim McCormick
MS thesis, Department of Electrical and Computer Engineering, University of
Illinois, Urbana IL, May 1996
Abstract
-
HMDES Version 2.0 Specification (PostScript
version, PDF
version)
John C. Gyllenhaal, W. W. Hwu and B. Ramakrishna Rau
IMPACT Technical Report, IMPACT-96-03, University of Illinois, Urbana, IL,
1996.
|
| 1995 Publications |
-
Data Relocation and Prefetching for Programs with Large Data Sets (PostScript
version, PDF
version)
Yoji Yamada
PhD thesis, Department of Computer Science, University of Illinois, Urbana
IL, 1995.
Abstract
-
Compiler Technology for Future Microprocessors (PostScript
version, PDF
version)
W. W. Hwu, R. E. Hank, D. M. Gallagher, S. A. Mahlke, D. M. Lavery, G. E. Haab,
J. C. Gyllenhaal, and D. I. August
Proceedings of the IEEE, Vol. 83, No. 12, December 1995, pp. 1625-1640
Abstract
-
Region-Based Compilation: An Introduction and Motivation (PostScript
version, PDF
version)
R. E. Hank, and W. W. Hwu, and B. R. Rau
Proceedings of the 28th Annual International Symposium on Microarchitecture,
Dec. 1995,
pp. 158-168
Abstract
-
Unrolling-Based Optimizations for Modulo Scheduling (PostScript
version, PDF
version)
D. M. Lavery, and W. W. Hwu
Proceedings of the 28th Annual International Symposium on Microarchitecture,
Dec. 1995,
pp. 327-337
Abstract
-
A Comparison of Full and Partial Predicated Execution Support for ILP Processors
(PostScript
version,
PDF version)
S. A. Mahlke, R. E. Hank, J.E. McCormick, D. I. August, and W. W. Hwu
Proceedings of the 22nd International Symposium on Computer Architecture, June
1995, pp. 138-150
Abstract
-
Code Scheduling and Optimization for a Superscalar X86 Microprocessor (PostScript
version, PDF
version)
Wayne Dugal
MS thesis, Department of Computer Science, University of Illinois, Urbana
IL, May, 1995.
-
Enhancing Instruction Level Parallelism Through Complier-Controlled
Speculation (PostScript
version,
PDF version)
Roger Bringmann
PhD thesis, Department of Computer Science, University of Illinois, Urbana
IL, May 1995
Abstract
-
Performance and Cost Analysis of the Execution Stage of Superscalar
Microprocessors (PostScript
version,
PDF version)
Dimitri C. Argyres
MS thesis, Department of Computer Science, University of Illinois, Urbana
IL, May 1995.
-
Three Architectural Models for Compiler-Controlled Speculative Execution (PostScript
version, PDF
version)
P. P. Chang, N.J. Warter, S. A. Mahlke, W. Y. Chen, and W. W. Hwu
IEEE Transactions on Computers, Vol. 44, No. 4, April 1995, pp. 481-494
Abstract
-
The Importance of Prepass Code Scheduling for Superscalar and Superpipelined
Processors
(PostScript
version, PDF
version)
P. P. Chang, D. M. Lavery, S. A. Mahlke, W. Y. Chen, and W. W. Hwu
IEEE Transactions on Computers, Vol. 44, No. 3, March 1995, pp. 353-370
Abstract
-
Sentinel Scheduling with Recovery Blocks (PostScript
version, PDF
version)
David I. August, Brian L. Dietrich, and Scott A. Mahlke
Technical Report CRHC-95-05, 1995, Center for Reliable and High-Performance
Computing, University of Illinois,
Urbana, IL, Feb, 1995.
Abstract
-
Compiler-Assisted Multiple Instruction Retry (PostScript
version, PDF
version)
C. C. Li, S. K. Chen, W. K. Fuchs, and W. W. Hwu
IEEE Transactions on Computers, Vol.44, No.1, Jan. 1995.
Abstract
|
| 1994 Publications |
-
Characterizing the Impact of Predicated Execution on Branch Prediction (PostScript
version, PDF
version)
S. A. Mahlke, R. E. Hank, R. A. Bringmann, J. C. Gyllenhaal, D. M. Gallagher,
and W. W. Hwu
Proceedings of the 27th International Symposium on Microarchitecture,
December 1994, pp. 217-227
Abstract
-
Data Relocation and Prefetching for Large Data Sets (PostScript
version, PDF
version)
Y. Yamada, J. Gyllenhaal, G. Haab, W. W. Hwu
Proceedings of the 27th Annual ACM/IEEE International Symposium
on Microarchitecture, December, 1994, pp. 118-127
Abstract
-
Dynamic Memory Disambiguation Using the Memory Conflict Buffer (PostScript
version, PDF
version)
D. M. Gallagher, W. Y. Chen, S. A. Mahlke, J. C. Gyllenhaal, W. W. Hwu
Proceedings of the 6th International Conference on Architecture Support for
Programming Languages and Operating Systems, San Jose, California, October,
1994. pp.183-195.
Abstract
-
A Machine Description Language for Compilation (PostScript
version,
PDF version)
J. Gyllenhaal
MS thesis, Department of Electrical and Computer Engineering, University of
Illinois, Urbana IL, Sept. 1994
-
Compiler Support for SPARC Architecture Processors (PostScript
version,
PDF version)
Roland G. Ouellette
MS thesis, Department of Computer Science, University of Illinois, Urbana
IL, Sept. 1994
-
Incremental Compiler Transformations for Multiple Instruction Retry (PostScript
version, PDF
version)
S. K. Chen, N. J. Alewine, W. K. Fuchs, and W. W. Hwu
Software, Practice & Experience, John Wiley & Sons Ltd., Vol. 24(9), Sept. 1994,
pp. 1-20.
Abstract
-
Modulo Scheduling with Isomorphic Control Transformations (PostScript
version, PDF
version)
Nancy J. Warter
PhD thesis, Department of Computer Science, University of Illinois, Urbana
IL, Sept. 1994
Abstract
-
The Susceptibility of Programs to Context Switching Effects
W. W. Hwu And T. M. Conte
IEEE Transactions on Computers, Vol. 43, No. 9, Sept. 1994.
Abstract
-
Profile-Assisted Instruction Scheduling (PostScript
version, PDF
version)
W. Y. Chen, S. A. Mahlke, N. J. Warter, S. Anik, and W. W. Hwu
International Journal for parallel Programming, Vol. 22, No. 2, April 1994,
pp. 151-181
Abstract
-
Compiler-Assisted Multiple Instruction Rollback Recovery Using A Read
Buffer
N. Alewine, S.K. Chen, W.K. Fuchs, and W. W. Hwu
IEEE Transactions on Computers, 1994.
-
Performance Implications of Synchronization Support for Parallel FORTRAN
Programs (PostScript
version, PDF
version)
S. Anik and W. W. Hwu
Journal of Parallel and Distributed Computing, Vol. 22, 1994, pp. 202-215
Abstract
|
| 1993 Publications |
-
Speculative Execution Exception Recovery using Write-back Suppression (PostScript
version,
PDF version)
Roger A. Bringmann, Scott A. Mahlke, Richard E. Hank, John
C. Gyllenhaal, and Wen-mei
W. Hwu
Proceedings of the 26th Annual ACM/IEEE Int'l Symposium on Microarchitecture,
Austin,
Texas, Dec. 1993, pp. 214-223
Abstract
-
Superblock Formation Using Static Program Analysis (PostScript
version, PDF
version)
Richard E. Hank, Scott A. Mahlke, Roger A. Bringmann, John C. Gyllenhaal, and
Wen-mei W. Hwu
Proceedings of the 26th Annual ACM/IEEE Int'l Symposium on Microarchitecture,
Austin, Texas, Dec. 1993, pp. 247-256
Abstract
-
Sentinel Scheduling: A Model for Compiler-Controlled Speculative
Execution
S. A. Mahlke, W. Y. Chen, R. A. Bringmann, R. E. Hank,
W. W. Hwu, B. R. Rau, and M. S. Schlansker
ACM Transactions on Computer Systems, Vol. 11, No. 4, Nov. 1993
Abstract
-
Architectural and Software Support for Executing Numerical Applications
on High Performance Computers (PostScript
version, PDF
version)
Sadun Anik
PhD thesis, Department of Computer Science, University of Illinois, Urbana
IL, Sept. 1993, CRHC-93-19
Abstract
-
Data Preload for Superscalar and VLIW Processors (PostScript
version, PDF
version)
William Y. Chen
PhD thesis, Department of Computer Science, University of Illinois, Urbana
IL, Sept. 1993
Abstract
-
Register Connection: A New Approach to Adding Registers into
Instruction Set Architectures (PostScript
version, PDF
version)
Tokuzo Kiyohara, Scott Mahlke, William Chen, Roger Bringmann, Richard Hank,
Sadun Anik, Wen-mei Hwu
Proceedings of the 20th Annual International Symposium on Computer Architecture,
San Diego, CA, May 17-19, 1993, pp. 247-256
Abstract
-
XPROF: An Execution Profiler for Window-oriented Applications (PostScript
version, PDF
version)
Aloke Gupta and Wen-mei W. Hwu
Software, Practice & Experience, John Wiley & Sons Ltd., Vol. 23 (5), May
1993, pp. 487-510
Abstract
-
Performance Aspects of Computers with Graphical User Interfaces (PostScript
version, PDF
version)
Aloke Gupta
PhD thesis, Department of Computer Science, University of Illinois, Urbana
IL, April 1993, CRHC-93-09
Abstract
-
The Benefit of Predicated Execution for Software Pipelining (PostScript
version, PDF
version)
Nancy J. Warter, Daniel M. Lavery, and Wen-mei W. Hwu
Proceedings of the 26th Annual Hawaii Int'l Conference on system Sciences,
Wailea,
Hawaii, Jan. 5-8, 1993, pp 497-506
Abstract
-
Machine Independent Register Allocation for the IMPACT-I C Compiler (PostScript
version, PDF
version)
R. E. Hank
MS thesis, Department of Electrical and Computer Engineering, University of
Illinois,
Urbana IL, 1993
|
| 1992 Publications |
-
Code Scheduling for VLIW/Superscalar Processors with Limited Register Files
(PostScript
version, PDF
version)
T. Kiyohara and J. Gyllenhaal
Proceedings of the 25th International Symposium on Microarchitecture, Dec. 1992,
pp. 197-201
Abstract
-
Effective Compiler Support for Predicated Execution Using the Hyperblock (PostScript
version, PDF
version)
S. A. Mahlke, D. C. Lin, W. Y. Chen, R. E. Hank, and R. A. Bringmann
Proceedings of the 25th International Symposium on Microarchitecture, Dec. 1992,
pp. 45-54
Abstract
-
Efficient Instruction Sequencing with Inline Target Insertion (PostScript
version, PDF
version)
Wen-mei W. Hwu and Pohua Chang
IEEE Transactions on Computers, Vol. 41, No.12, Dec. 1992, pp. 1537-1551
Abstract
-
Enhanced Modulo Scheduling for Loops with Conditional Branches (PostScript
version, PDF
version)
Nancy J. Warter, Grant E. Haab, Krishna Subramanian, and John W. Backhaus
Proceedings of 25th Annual ACM/IEEE Int'l Symposium on Microarchitecture, Dec.
1992 pp. 170-179
Abstract
-
Compiler Code Transformations for Superscalar-Based High-Performance Systems
(PostScript
version,
PDF version)
Scott A. Mahlke, William Y. Chen, John C. Gyllenhaal, Wen-mei W. Hwu, P. P.
Chang, and
T.Kiyohara
Proceedings of Supercomputing 1992, Minneapolis, Minnesota, Nov. 16-20, 1992,
pp.
808-817
Abstract
-
Sentinel Scheduling for VLIW and Superscalar Processors (PostScript
version, PDF
version)
Scott A. Mahlke, William Y. Chen, Wen-mei W. Hwu, B. Ramakrishna Rau,
and Michael S. Schlansker
Proceedings of the Fifth Int'l Conference on Architecture Support for
Programming
Languages and Operating Systems (ASPLOS-V), Boston, MA, Oct. 12-15, 1992,
pp.238-247
Abstract
-
A Template for Code Generator Development Using the IMPACT-I C Compile (PostScript
version,
PDF version)
Roger Bringmann
MS thesis, Department of Computer Science, University of Illinois, Urbana
IL, Sept. 1992
-
Design and Implementation of a Portable Global Code Optimizer (PostScript
version, PDF
version)
Scott A. Mahlke
MS thesis, Department of Computer Science, University of Illinois, Urbana
IL, Sept. 1992
-
Evaluation of Some Superscalar and VLIW Processor Designs (PostScript
version, PDF
version)
John Holm
MS thesis, Department of Computer Science, University of Illinois, Urbana
IL, Sept. 1992
-
Systematic Computer Architecture Prototyping (PostScript
version, PDF
version)
Thomas M. Conte
PhD thesis, Department of Computer Science, University of Illinois, Urbana
IL, Sept. 1992
Abstract
-
Executing Nested Parallel Loops On Shared-Memory Multiprocessors (PostScript
version, PDF
version)
Sadun. Anik and Wen-mei Hwu
Proceedings of the 21st Annual Int'l Conference on Parallel Processing,
St. Charles, IL, Aug. 1992, pp.(III) 241-244
Abstract
-
Tolerating First Level Memory Access Latency In High-Performance Systems (PostScript
version, PDF
version)
William Chen, Scott Mahlke, and Wen-mei Hwu
Proceedings of the 21st Annual Int'l Conference on Parallel Processing, St
Charles,
IL, Aug. 1992, pp.(I) 36-43
Abstract
-
Branch Recovery with Compiler-Assisted Multiple Instruction Retry
N. J. Alewine, S. K. Chen, C. C. Li, W. K. Fuchs, and W. W. Hwu
Proceedings of the 22nd Annual International Symposium on Fault-Tolerant
Computing, Boston, MA, July 8-10, 1992, pp. 66-73.
-
Tolerating Data Access Latency with Register Preloading (PostScript
version, PDF
version)
William Y. Chen, Scott A. Mahlke, and Wen-mei W. Hwu
Proceedings of the 1992 Int'l Conf. on Supercomputing, Washington D.C., July,
1992,
pp. 104-113
Abstract
-
Profile-Guided Automatic Inline Expansion for C Programs (PostScript
version, PDF
version)
Pohua P. Chang, Scott A. Mahlke, William Y. Chen, and Wen-mei W. Hwu
Software Practice and Experience, May 1992, Vol. 22, No. 5, pp. 349-369
Abstract
-
An Execution Profiler for Window-Oriented Applications (PostScript
version, PDF
version)
Aloke Gupta and Wen-Mei W. Hwu
Coordinated Science Lab, University of Illinois, Urbana, IL,
Technical Report CRHC-92-02, 1992.
Abstract
-
Scalar Program Performance on Multiple-Instruction-Issue Processors with a
Limited Number of Registers (PostScript
version, PDF
version)
Scott A. Mahlke, William Y. Chen, Pohua P. Chang, and Wen-mei W. Hwu
Proceedings of the 25th Annual Hawaii Int'l Conference on System Sciences, Jan.
6-9, 1992, pp. 34-44
Abstract
|
| 1991 Publications |
-
Three Superblock Scheduling Models for Superscalar and Superpipelined
Processors (PostScript
version, PDF
version)
Pohua P. Chang, Nancy J. Warter, Scott Mahlke, William Y. Chen, and Wen-mei W.
Hwu
Technical Report CRHC-91-29, Center for Reliable and High-Performance Computing,
University of Illinois, Urbana, IL, Dec. 1991
Abstract
-
Using Profile Information to Assist Classic Compiler Code Optimizations (PostScript
version, PDF
version)
Pohua P. Chang, Scott A. Mahlke, and Wen-mei W. Hwu
Software Practice and Experience, Dec. 1991, Vol. 21, No. 12, pp. 1301-1321
Abstract
-
Comparing Static And Dynamic Code Scheduling for Multiple-Instruction-Issue
Processors (PostScript
version, PDF
version)
Pohua P. Chang, William Y. Chen, Scott A. Mahlke, and Wen-mei W. Hwu
Proceedings of the 24th Annual ACM/IEEE Int'l Symposium on Microarchitecture,
Albuquerque, New Mexico, Nov. 18-20,1991, pp. 69-73
Abstract
-
Data Access Microarchitectures for Superscalar Processor with
Compiler-Assisted Data Prefetching (PostScript
version, PDF
version)
William Y. Chen, Scott A. Mahlke, Pohua P. Chang, and Wen-mei W. Hwu
Proceedings of the 24th Annual ACM/IEEE Int'l Symposium on Microarchitecture,
Albuquerque, New Mexico, Nov. 1991, pp. 69-73
Abstract
-
An Optimizing Compiler Code Generator: A platform for RISC Performance Analysis
(PostScript
version, PDF
version)
William Y. Chen
MS thesis, Department of Computer Science, University of Illinois, Urbana IL,
Sept. 1991
-
The Effect of Compiler Optimizations On Available Parallelism In Scalar
Programs (PostScript
version, PDF
version)
Scott A. Mahlke, Nancy J. Warter, William Y. Chen, Pohua P. Chang, and Wen-mei
W. Hwu
Proceedings of the 20th Annual Int'l Conference on Parallel Processing, St.
Charles, IL,
Aug. 12-16, 1991, pp. 142-145
Abstract
-
Performance Implications of Synchronization Support for Parallel Fortran
Programs (PostScript
version, PDF
version)
Sadun Anik, and Wen-mei W. Hwu
Technical Report CRHC-91-21, Center for Reliable and High-Performance Computing,
University of Illinois, Urbana, IL, Jun. 1991
Abstract
-
IMPACT: An Architectural Framework for Multiple-Instruction-Issue Processors
(PostScript
version, PDF
version)
Pohua P. Chang, Scott A. Mahlke, William Y. Chen, Nancy J. Water, and Wen-mei W.
Hwu
Proceedings of the 18th Annual Int'l Symposium on Computer Architecture,
Toronto,
Canada, May 28, 1991, pp. 266-275
Abstract
-
The Effect of Code Expanding Optimizations of Instruction Cache Design (PostScript
version, PDF
version)
William Y. Chen, Pohua Chang, Thomas M. Conte, and Wen-mei W. Hwu
Technical Report CRHC-91-17, Center for Reliable and High-Performance,
university
of Illinois, Urbana, IL, May 1991
Abstract
-
Benchmark Characterization
T.M. Conte and W. W. Hwu
Proceedings of the 24th Annual Hawaii International Conference on System
Sciences,
Jan. 8-11, 1991, pp. 364-372.
|
| 1990 Publications |
-
A Multiported Nonblocking Cache for a Superscalar Uniprocessor (PostScript
version, PDF
version)
James Sicolo
MS thesis, Department of Computer Science, University of Illinois, Urbana
IL, Sept. 1990
-
Compiler Support for Predicated Execution in Superscalar Processors (PostScript
version, PDF
version)
David Lin
MS thesis, Department of Computer Science, University of Illinois, Urbana
IL, Sept. 1990
|
| 1989 Publications |
-
Comparing Software and Hardware Schemes For Reducing the Cost of Branches
(PostScript
version, PDF
version)
Wen-mei W. Hwu, Thomas M. Conte, and Pohua Chang
Proceedings of the 16th Annual International Symposium on Computer Architecture,
Jerusalem, Israel, May 28- June 1, 1989, pp. 224-233.
Abstract
-
Control Flow Optimization for Supercomputer Scalar Processing (PostScript
version, PDF
version)
P. P. Chang and W. W. Hwu
Proceedings of the 1989 Int'l Conf. on Supercomputing, Crete, Greece, Jun. 5-9,
1989
Abstract
|
|