Publications in Selected Areas

Parallel Computing: PL/Compiler, Runtime/OS, and Architectural Support



SC ParaStack: Efficient Hang Detection for MPI Programs at Large Scale
H. Li, Z. Chen, and R. Gupta
ACM/IEEE International Conference for High Performance Computing, Networking, Storage and Analysis,
12 pages, Denver, Colorado, November 2017.
ASPLOS KickStarter: Fast and Accurate Computations on Streaming Graphs via Trimmed Approximations
K. Vora, R. Gupta, and G. Xu
ACM 22nd International Conference on Architectural Support for Programming Languages and Operating Systems,
13 pages, Xi'an, China, April 2017.
ASPLOS CoRAL: Confined Recovery in Distributed Asynchronous Graph Processing
K. Vora, C. Tian, R. Gupta, and Z. Hu
ACM 22nd International Conference on Architectural Support for Programming Languages and Operating Systems,
13 pages, Xi'an, China, April 2017.
TACO Synergistic Analysis of Evolving Graphs
K. Vora, R. Gupta, and G. Xu
ACM Transactions on Architecture and Code Optimization,
Volume 13, Issue 4, Article No. 32, 27 pages, October 2016.
USENIX
ATC
Load the Edges You Need: A Generic I/O Optimization for Disk-based Graph Processing
K. Vora, G. Xu, and R. Gupta
USENIX Annual Technical Conference,
pages 507-522, Denver, Colorado, June 2016.
HPDC Efficient Processing of Large Graphs via Input Reduction
A. Kusum, K. Vora, R. Gupta, and I. Neamtiu
25th ACM International Symposium on High-Performance Parallel and Distributed Computing,
pages 245-257, Kyoto, Japan, May-June 2016.
HPDC Parallel Execution Profiles
Z. Benavides, R. Gupta, and X. Zhang
25th ACM International Symposium on High-Performance Parallel and Distributed Computing,
pages 215-218, Kyoto, Japan, May-June 2016.
ICS CuMAS: Data Transfer Aware Multi-Application Scheduling for Shared GPUs
M. Belviranli, F. Khorasani, L.N. Bhuyan, and R. Gupta
ACM 30th International Conference on Supercomputing,
12 pages, Istanbul, Turkey, June 2016.
IPDPS Eliminating Intra-warp Load Imbalance in Irregular Nested Patterns via Collaborative Task Engagement
F. Khorasani, B. Rowe, R. Gupta, and L.N. Bhuyan
IEEE International Parallel and Distributed Processing Symposium,
pages 524-533, May 2016.
TACO Tumbler: An Effective Load Balancing Technique for MultiCPU Multicore Systems
K.K. Pusukuri, R. Gupta, and L.N. Bhuyan
ACM Transactions on Architecture and Code Optimization,
Volume 12, Issue 4, Article No. 36, 24 pages, January 2016.
MICRO Efficient Warp Execution in Presence of Divergence with Collaborative Context Collection
F. Khorasani, R. Gupta, and L.N. Bhuyan
The 48th Annual IEEE/ACM International Symposium on Microarchitecture,
pages 204-215, Waikiki, Hawaii, December 2015.
PACT Scalable SIMD-Efficient Graph Processing on GPUs
F. Khorasani, R. Gupta, and L.N. Bhuyan
International Conference on Parallel Architectures and Compilation Techniques,
pages 39-50, San Francisco, California, October 2015.
 WS & VR download: https://github.com/farkhor/WS-VR/
PACT Stadium Hashing: Scalable and Flexible Hashing on GPUs
F. Khorasani, M. Belviranli, R. Gupta, and L.N. Bhuyan
International Conference on Parallel Architectures and Compilation Techniques,
pages 63-74, San Francisco, California, October 2015.
ICS PeerWave: Exploiting Wavefront Parallelism on GPUs with Peer-SM Synchronization
M. Belviranli, P. Deng, L.N. Bhuyan, R. Gupta, and Q. Zhu
ACM 29th International Conference on Supercomputing,
pages 25-35, Newport Beach, June 2015.
SC Fence Scoping
C. Lin, V. Nagarajan, and R. Gupta
ACM/IEEE International Conference for High Performance Computing, Networking, Storage and Analysis,
pages 105-116, New Orleans, Louisiana, November 2014.
OOPSLA ASPIRE: Exploiting Asynchronous Parallelism in Iterative Algorithms using a Relaxed Consistency based DSM
K. Vora, S-C. Koduru, and R. Gupta
ACM SIGPLAN International Conference on Object Oriented Programming Systems, Languages and Applications,
pages 861-878, Portland, Oregon, October 2014.
PACT Shuffling: A Framework for Lock Contention Aware Thread Scheduling for Multicore Multiprocessor Systems
K.K. Pusukuri, R. Gupta, L.N. Bhuyan
International Conference on Parallel Architectres and Compilation Techniques,
pages 289-300, Edmonton, Alberta, Canada, August 2014.
HPDC CuSha: Vertex-Centric Graph Processing on GPUs
F. Khorasani, K. Vora, R. Gupta, and L.N. Bhuyan
23rd ACM International Symposium on High-Performance Parallel and Distributed Computing,
pages 239-251, Vancouver, Canada, June 2014.
download: http://farkhor.github.io/CuSha/
ICS Address-aware Fences
C. Lin, V. Nagarajan, and R. Gupta
ACM 27th International Conference on Supercomputing,
pages 313-324, June 2013.
TACO
HiPEAC
ADAPT: A Framework for Coscheduling Multithreaded Programs
K.K. Pusukuri, R. Gupta, and L.N. Bhuyan
ACM Transactions on Architecture and Code Optimization, special issue of papers presented at HiPEAC,
Volume 9, Issue 4, Article No. 45, 25 pages, January 2013.
TACO
HiPEAC
A Dynamic Self Scheduling Scheme for Heterogeneous Multiprocessor Architectures
M.E. Belviranli, L.N. Bhuyan, and R. Gupta
ACM Transactions on Architecture and Code Optimization, special issue of papers presented at HiPEAC,
Volume 9, Issue 4, Article No. 57, 20 pages, January 2013.
PLDI Effective Parallelization of Loops in the Presence of I/O Operations
M. Feng, R. Gupta, and I. Neamtiu
ACM SIGPLAN Conference on Programming Language Design and Implementation,
pages 487-498, Beijing, China, June 2012.
ASPLOS Efficient Sequential Consistency via Conflict Ordering
C. Lin, V. Nagarajan, R. Gupta, and B. Rajaram
ACM 17th International Conference on Architectural Support for Programming
Languages and Operating Systems,
pages 273-286, London, UK, March 2012.
TACO
HiPEAC
PLDS: Partitioning Linked Data Structures for Parallelism
M. Feng, C. Lin, and R. Gupta
ACM Transactions on Architecture and Code Optimization, special issue of papers presented at HiPEAC,
Volume 8, Issue 4, Article No. 38, 21 pages, January 2012.
TACO
HiPEAC
Thread Tranquilizer: Dynamically Reducing Performance Variation
K.K. Pusukuri, R. Gupta, L.N. Bhuyan
ACM Transactions on Architecture and Code Optimization, special issue of papers presented at HiPEAC,
Volume 8, Issue 4, Article No. 46, 21 pages, January 2012.
PACT No More Backstabbing... A Faithful Scheduling Policy for Multithreaded Programs
K.K. Pusukuri, R. Gupta, L.N. Bhuyan
The 20th International Conference on Parallel Architectres and Compilation Techniques,
pages 12-21, Galveston Island, Texas, October 2011.
PPoPP SpiceC: Scalable Parallelism via implicit copying and explicit Commit
M. Feng, R. Gupta, and Y. Hu
16th ACM SIGPLAN Symposium on Principles and Practices of Parallel Programming,
pages 69-80, San Antonio, Texas, February 2011.
PPoPP Enhanced Speculative Parallelization Via Incremental Recovery
C. Tian, C. Lin, M. Feng, and R. Gupta
16th ACM SIGPLAN Symposium on Principles and Practices of Parallel Programming,
pages 189-200, San Antonio, Texas, February 2011.
PACT Recipient of a PACT 2010 Best Paper Award
Efficient Sequential Consistency Using Conditional Fences
C. Lin, V. Nagarajan, and R. Gupta
The 19th International Conference on Parallel Architectures and Compilation Techniques,
pages 295-306, Vienna, Austria, September 2010.
PLDI Supporting Speculative Parallelization in the Presence of Dynamic Data Structures
C. Tian, M. Feng, and R. Gupta
ACM SIGPLAN Conference on Programming Language Design and Implementation,
pages 62-73, Toronto, Canada, June 2010.
ISMM Speculative Parallelization Using State Separation and Multiple Value Prediction
C. Tian, M. Feng, and R. Gupta
Ninth International Symposium on Memory Management,
pages 63-72, Toronto, Canada, June 2010.
ISCA ECMon: Exposing Cache Events for Monitoring
V. Nagarajan and R. Gupta
ACM/IEEE 36th International Symposium on Computer Architecture,
Austin, Texas, June 2009.
SIGOPS Runtime Monitoring on Multicores via OASES
V. Nagarajan and R. Gupta
ACM SIGOPS Operating Systems Review,
special issue on the interaction among the OS, Compilers, and Multicore Processors,
pages 15-24, Vol. 43, No. 2, April 2009 (Invited Paper).
VEE Architectural Support for Shadow Memory in Multiprocessors
V. Nagarajan and R. Gupta
ACM SIGPLAN/SIGOPS International Conference on Virtual Execution Environments,
pages 1-10, Washington DC, March 2009.
MICRO Copy Or Discard Execution Model For Speculative Parallelization On Multicores
C. Tian, M. Feng, V. Nagarajan, and R. Gupta
IEEE/ACM 41th International Symposium on Microarchitecture,
pages 330-341, Lake Como, Italy, Nov. 2008.
HPCA SENSS: Security Enhancement to Symmeteric Shared Memory Multiprocessors
Y. Zhang, L. Gao, J. Yang, X. Zhang, and R. Gupta
IEEE 11th International Symposium on High Performance Computer Architecture,
pages 352-362, San Francisco, California, February 2005.
HPCA Distributed Path Reservation Algorithms for Multiplexed All-Optical Interconnection Networks
X. Yuan, R. Melhem, and R. Gupta
IEEE 3rd International Symposium on High-Performance Computer Architecture,
pages 38-47, San Antonio, Texas, February 1997.
SC Compiled Communication for All-Optical TDM Networks
X. Yuan, R. Melhem, and R. Gupta
Supercomputing'96, Article No. 25, 15 pages, Pittsburgh, Pennsylvania, November 1996.
MICRO A Shape Matching Approach for Scheduling Fine-Grained Parallelism
B. Malloy, R. Gupta, and M.L. Soffa
IEEE/ACM 25th International Symposium on Microarchitecture,
pages 264-267, Portland, Oregon, December 1992.
JPDC SPMD Execution of Programs with Pointer-based Data Structures on Distributed-Memory Machines
R. Gupta
Journal of Parallel and Distributed Computing,
special issue on Multicomputer Programming and Application, Vol. 16, No. 2, pages 92-107, October 1992.
ICCL SPMD Execution of Programs with Dynamic Data Structures on Distributed Memory Machines
R. Gupta
IEEE 4th International Conference on Computer Languages,
pages 232-241, Oakland, California, April 1992.
SC Techniques for Integrating Parallelizing Transformations and Compiler Based Scheduling Methods,
T. Watts, M.L. Soffa, and R. Gupta
Supercomputing'92,
pages 830-839, Minneapolis, Minnesota, November 1992.
MICRO Executing Loops on a Fine-Grained MIMD Architecture
S. Lee and R. Gupta
IEEE/ACM 24th International Symposium on Microarchitecture,
pages 199-205, Albuquerque, New Mexico, November 1991.
MICRO A Fine-grained MIMD Architecture based upon Register Channels
R. Gupta
IEEE/ACM 23rd Workshop on Microprogramming and Microarchitecture,
pages 28-37, Orlando, Florida, December 1990.
SC The Design of a RISC based Multiprocessor Chip
R. Gupta, M. Epstein, and M. Whelan
Supercomputing'90,
pages 920-929, New York, November 1990.
SC Loop Displacement: An Approach for Transforming and Scheduling Loops for Parallel Execution
R. Gupta
Supercomputing'90,
pages 388-397, New York, November 1990.
PPoPP Employing Register Channels for the Exploitation of Instruction Level Parallelism
R. Gupta
ACM SIGPLAN 2nd Symposium on Principles and Practice of Parallel Programming,
pages 118-127, Seattle, Washington, March 1990.
ASPLOS The Fuzzy Barrier: A Mechanism for High-Speed Synchronization of Processors
R. Gupta
ACM 3rd International Conference on Architectural Support for Programming
Languages and Operating Systems
, pages 54-64, Boston, April 1989.


Software Analysis, Debugging, & Testing



OOPSLA RAIVE: Runtime Assessment of Floating-Point Instability by Vectorization
W-C. Lee, T. Bao, Y. Zheng, X. Zhang, K. Vora, and R. Gupta
ACM SIGPLAN International Conference on Object Oriented Programming Systems, Languages and Applications,
pages 623-638, Pittsburgh, Pennsylvania, October 2015.
CGO DrDebug: Deterministic Replay based Cyclic Debugging with Dynamic Slicing
Y. Wang, H. Patil, C. Pereira, G. Lueck, R. Gupta, and I. Neamtiu
IEEE/ACM International Symposium on Code Generation and Optimization,
pages 98-108, Orlando, Florida, February 2014.
DrDebug download: Replay & Slicing
CGO Lightweight Fault Detection in Parallelized Programs
L. Tan, M. Feng, and R. Gupta
IEEE/ACM International Symposium on Code Generation and Optimization,
pages 1-11, Shenzhen, China, February 2013.
TOPLAS Execution Suppression: An Automated Iterative Technique for locating Memory Errors
D. Jeffrey, V. Nagarajan, and R. Gupta
ACM Transactions on Programming Languages and Systems,
Vol. 32, No. 5, 36 pages, May 2010.
PASTE Learning Universal Probabilistic Models for Fault Localization
M. Feng and R. Gupta
Ninth ACM SIGPLAN-SIGSOFT Workshop on Program Analysis for Software Tools and Engineering,
pages 81-88, Toronto, Canada, June 2010.
ICSM Detecting Virus Mutations Via Dynamic Matching
M. Feng and R. Gupta
25th IEEE International Conference on Software Maintenance,
pages 105-114, Edmonton, Canada, September 2009.
ICSM Effective and Efficient Localization of Multiple Faults Using Value Replacement
D. Jeffrey, N. Gupta, and R. Gupta
25th IEEE International Conference on Software Maintenance,
pages 221-230, Edmonton, Canada, September 2009.
ICSM Identifying the Root Causes of Memory Bugs Using Corrupted Memory Location Suppression
D. Jeffrey, N. Gupta, and R. Gupta
International Conference on Software Maintenance,
pages 356-365, Beijing, China, September 2008.
ICSM Dynamic Slicing of Multithreaded Programs for Race Detection
S. Tallam, C. Tian, and R. Gupta
International Conference on Software Maintenance,
pages 97-106, Beijing, China, September 2008.
ISSTA Fault Localization Using Value Replacement
D. Jeffrey, N. Gupta, and R. Gupta
International Symposium on Software Testing and Analysis,
pages 167-178, Seattle, July 2008.
ISSTA Dynamic Recognition of Synchronization Operations for Improved Data Race Detection
C. Tian, V. Nagarajan, R. Gupta, and S. Tallam
International Symposium on Software Testing and Analysis,
pages 143-154, Seattle, July 2008.
ICSM ONTRAC: A System for Efficient ONline TRACing for Debugging
V. Nagarajan, D. Jeffrey, R. Gupta, and N. Gupta
International Conference on Software Maintenance,
pages 445-454, Paris, September 2007.
ICSM Matching Control Flow of Program Versions
V. Nagarajan, R. Gupta, X. Zhang, M. Madou, B. De Sutter, and K. De Bosschere
International Conference on Software Maintenance,
pages 84-93, Paris, September 2007.
ACM
TACO
Unified Control Flow and Dependence Traces
S. Tallam and R. Gupta
ACM Transactions on Architecture and Code Optimization,
Vol. 4, No. 3, 31 pages, September 2007.
ISSTA Enabling Tracing of Long-Running Multithreaded Programs via Dynamic Execution Reduction
S. Tallam, C. Tian, X. Zhang, and R. Gupta
International Symposium on Software Testing and Analysis,
pages 207-218, London, July 2007.
PLDI Towards Locating Execution Omission Errors
X. Zhang, S. Tallam, N. Gupta, and R. Gupta
ACM SIGPLAN Conference on Programming Language Design and Implementation,
pages 415-424, San Diego, June 2007.
FSE Dynamic Slicing Long Running Programs through Execution Fast Forwarding
X. Zhang, S. Tallam, and R. Gupta
14th ACM SIGSOFT Symposium on Foundations of Software Engineering,
pages 81-91, Portland, Oregon, November 2006.
PLDI Pruning Dynamic Slices With Confidence
X. Zhang, N. Gupta, and R. Gupta
ACM SIGPLAN Conference on Programming Language Design and Implementation,
pages 169-180, Ottawa, Canada, June 2006.
ICSE Locating Faults Through Automated Predicate Switching
X. Zhang, N. Gupta, and R. Gupta
IEEE/ACM International Conference on Software Engineering,
pages 272-281, Shanghai, China, May 2006.
ASE Locating Faulty Code Using Failure-Inducing Chops
N. Gupta, H. He, X. Zhang, and R. Gupta
IEEE/ACM International Conference on Automated Software Engineering,
pages 263-272, Long Beach, California, Nov. 2005.
ESEC
-FSE
Matching Execution Histories of Program Versions
X. Zhang and R. Gupta
Joint 10th European Software Engineering Conference and
13th ACM SIGSOFT Symposium on the Foundations of Software Engineering
,
pages 197-206, Lisbon, Portugal, September 2005.
ACM
TACO
Whole Execution Traces and their Applications
X. Zhang and R. Gupta
ACM Transactions on Architecture and Code Optimization,
Vol. 2, No. 3, pages 301-334, September 2005.
PACT Extended Whole Program Paths
S. Tallam, R. Gupta, and X. Zhang
International Conference on Parallel Architectures and Compilation Techniques,
pages 17-26, St. Loius, Missouri, September 2005.
ACM
TOPLAS
Cost and Precision Tradeoffs of Dynamic Data Slicing Algorithms
X. Zhang, R. Gupta, and Y. Zhang
ACM Transactions on Programming Languages and Systems,
Vol. 27, No. 4, pages 631-661, July 2005.
MICRO Whole Execution Traces
X. Zhang and R. Gupta
IEEE/ACM 37th International Symposium on Microarchitecture,
pages 105-116, Portland, Oregan, December 2004.
PLDI Cost Effective Dynamic Program Slicing
X. Zhang and R. Gupta
ACM SIGPLAN Conference on Programming Language Design and Implementation,
pages 94-106, Washington D.C., June 2004.
ICSE Effective Forward Computation of Dynamic Slices Using Reduced Ordered Binary Decision Diagrams
X. Zhang, R. Gupta, and Y. Zhang
IEEE/ACM International Conference on Software Engineering,
pages 502-511, Edinburgh, UK, May 2004.
CGO Extending Path Profiling across Loop Backedges and Procedure Boundaries
S. Tallam, X. Zhang, and R. Gupta
Second Annual IEEE/ACM International Symposium on Code Generation and Optimization,
pages 251-262, San Jose, CA, March 2004.
ICSE Recipient of ICSE 2003 Distinguished Paper Award.
Precise Dynamic Slicing Algorithms
X. Zhang, R. Gupta, and Youtao Zhang
IEEE/ACM International Conference on Software Engineering,
pages 319-329, Portland, Oregon, May 2003.
CGO Hiding Program Slices for Software Security
X. Zhang and R. Gupta
First Annual IEEE/ACM International Symposium on Code Generation and Optimization,
pages 325-336, San Francisco, CA, March 2003.
PLDI Timestamped Whole Program Path Representation and its Applications
Y. Zhang and R. Gupta
ACM SIGPLAN Conference on Programming Language Design and Implementation,
pages 180-190, Snowbird, Utah, June 2001.
ESEC
-FSE
Comparison Checking: An Approach to Avoid Debugging of Optimized Code
C. Jaramillo, R. Gupta, and M.L. Soffa
Joint 7th European Software Engineering Conference and
7th ACM SIGSOFT Symposium on the Foundations of Software Engineering
,
LNCS 1687, Springer Verlag, pages 268-284, Toulouse, France, Sept. 1999.
ACM
TOPLAS
A Practical Framework for Demand-Driven Interprocedural Data Flow Analysis
E. Duesterwald, R. Gupta, and M.L. Soffa
ACM Transactions on Programming Languages and Systems,
Vol. 19, No. 6, pages 992-1030, November 1997.
ACM
TOSEM
Hybrid Slicing: Integrating Dynamic Information with Static Analysis
R. Gupta, M.L. Soffa, and J. Howard
ACM Transactions on Software Engineering and Methodology,
Vol. 6, No. 4, pages 370-397, October 1997.
ESEC
-FSE
Refining Data Flow Information using Infeasible Paths
R. Bodik, R. Gupta, and M.L. Soffa
Joint 6th European Software Engineering Conference and
5th ACM SIGSOFT Symposium on the Foundations of Software Engineering
,
LNCS 1301, Springer Verlag, pages 361-377, Zurich, Switzerland, September 1997.
ICSE A Demand-Driven Analyzer for Data Flow Testing at the Integration Level
E. Duesterwald, R. Gupta, and M.L. Soffa
IEEE/ACM International Conference on Software Engineering,
pages 575-586, Berlin, Germany, March 1996.
FSE Hybrid Slicing: An Approach for Refining Static Slices using Dynamic Information
R. Gupta and M.L. Soffa
ACM SIGSOFT 3rd Symposium on the Foundations of Software Engineering,
pages 29-40, Washington, DC, October 1995.
ICSM Priority Based Data Flow Testing
R. Gupta and M.L. Soffa
IEEE-CS International Conference on Software Maintenance,
pages 348-357, Nice, France, October 1995.
ICSM A Framework for Partial Data Flow Analysis
R. Gupta and M.L. Soffa
IEEE-CS International Conference on Software Maintenance,
pages 4-13, Victoria, British Columbia, September 1994.
ACM
TOSEM
A Methodology for Controlling the Size of a Test Suite
M.J. Harrold, R. Gupta, and M.L. Soffa
ACM Transactions on Software Engineering and Methodology,
Vol. 2, No. 3, pages 270-285, July 1993.
ICSM An Approach to Regression Testing using Slicing
R. Gupta, M.J. Harrold, and M.L. Soffa
IEEE-CS International Conference on Software Maintenance,
pages 299-308, Orlando, Florida, November 1992.
POPL Demand-Driven Computation of Interprocedural Data Flow
E. Duesterwald, R. Gupta, and M.L. Soffa
ACM SIGPLAN-SIGACT 22nd Symposium on Principles of Programming Languages,
pages 37-48, San Francisco, California, January 1995.
POPL Generalized Dominators and Post-Dominators
R. Gupta
ACM SIGPLAN-SIGACT 19th Symposium on Principles of Programming Languages,
pages 246-257, Albuquerque, New Mexico, January 1992.
ISTAV Loop Monotonic Computations: An Approach for the Efficient Run-time Detection of Races
R. Gupta and M. Spezialetti
SIGSOFT Symposium on Testing, Analysis, and Verification,
pages 98-111, Victoria, Canada, October 1991.
ICSM A Methodology for Controlling the Size of a Test Suite
M.J. Harrold, R. Gupta, and M.L. Soffa
IEEE-CS International Conference on Software Maintenance,
pages 302-310, San Diego, CA, November 1990.


High-Performance & Embedded Processors: Compiler & Architectural Support



ACM TACO Dynamic Access Distance Driven Cache Replacement
M. Feng, C. Tian, C. Lin, and R. Gupta
ACM Transactions on Architecture and Code Optimization,
Vol. 8, No. 3, Article 14, 30 pages, October 2011.
HiPEAC Compiler-Assisted Memory Encryption for Embedded Processors
V. Nagarajan, R. Gupta, and A. Krishnaswamy
International Conference on High Performance Embedded Architectures and Compilers,
Ghent, Belgium, January 2007.
ACM
TECS
Dynamic Coalescing for 16-bit Instructions
A. Krishnaswamy and R. Gupta
ACM Transactions on Embedded Computing Systems
in special issue of selected LCTES'03 papers, Vol. 4, No. 1, pages 3-37, Feb. 2005.
MICRO Efficient Use of Invisible Registers in Thumb Code
A. Krishnaswamy and R. Gupta
IEEE/ACM 38th International Symposium on Microarchitecture,
pages 30-40, Barcelona, Spain, Nov. 2005.
HiPEAC Exploiting Computation Reuse Cache to Reduce Energy in Network Processors
B. Li, G. Venkatesh, B. Calder, and R. Gupta
International Conference on High Performance Embedded Architectures and Compilers,
LNCS 3793, Springer Verlag, pages 251-265, Barcelona, Spain, Nov. 2005.
ACM
TODAES
Frequent Value Encoding for Low Power Data Buses
J. Yang, R. Gupta, and C. Zhang
ACM Transactions on Design Automation of Electronic Systems,
Vol. 9, No. 3, pages 354-384, July 2004.
20 Years
of PLDI
Retrospective -- Complete Removal of Redundant Expressions
R. Bodik, R. Gupta and M.L. Soffa
20 Years of the ACM/SIGPLAN Conference on Programming Language Design
and Implementation (1979-1999): A Selection
,
ACM SIGPLAN Notices, Vol. 39, No. 4, pages 596-597, April 2004.
CASES Simple Offset Assignment in Presence of Subword Data
B. Li and R. Gupta
International Conference on Compilers, Architecture, and Synthesis of Embedded Systems,
pages 12-23, San Jose, CA, October 2003.
POPL Bitwidth Aware Global Register Allocation
S. Tallam and R. Gupta
30th Annual ACM SIGPLAN-SIGACT Symposium on Principles of Programming Languages,
pages 85-96, New Orleans, LA, January 2003.
ACM
TECS
Frequent Value Locality and its Applications
J. Yang and R. Gupta
ACM Transactions on Embedded Computing Systems (inaugural issue),
Vol. 1, No. 1, pages 79-105, November 2002.
MICRO Energy Efficient Frequent Value Data Cache Design
J. Yang and R. Gupta
IEEE/ACM 35th International Symposium on Microarchitecture,
pages 197-207, Istanbul, Turkey, November 2002.
CASES Bit Section Instruction Set Extension of ARM for Embedded Applications
B. Li and R. Gupta
International Conference on Compilers, Architecture, and Synthesis for Embedded Systems,
pages 69-78, Grenoble, France, October 2002.
ICS Load and Store Reuse Using Register File Contents
S. Onder and R. Gupta
ACM 15th International Conference on Supercomputing,
pages 289-302, Sorrento, Naples, Italy, June 2001.
MICRO Frequent Value Compression in Data Caches
J. Yang, Y. Zhang, and R. Gupta
IEEE/ACM 33rd International Symposium on Microarchitecture,
pages 258-265, Monterey, CA, December 2000.
ASPLOS Frequent Value Locality and Value-Centric Data Cache Design
Y. Zhang, J. Yang, and R. Gupta
ACM 9th International Conference on Architectural Support for Programming
Languages and Operating Systems
, pages 150-159, Cambridge, MA, November 2000.
PLDI ABCD: Eliminating Array Bounds Checks on Demand
R. Bodik, R. Gupta, and V. Sarkar
ACM SIGPLAN Conference on Programming Language Design and Implementation,
pages 321-333, Vancouver B.C., Canada, June 2000.
MICRO Dynamic Memory Disambiguation in the Presence of Out-of-order Store Issuing
S. Onder and R. Gupta
IEEE/ACM 32nd International Symposium on Microarchitecture,
pages 170-176, Haifa, Israel, November 1999.
PACT Caching and Predicting Branch Sequences for Improved Fetch Effectiveness
S. Onder, J. Xu, and R. Gupta
International Conference on Parallel Architectures and Compilation Techniques,
pages 294-302, Newport Beach, California, October 1999.
PLDI Load-Reuse Analysis: Design and Evaluation
R. Bodik, R. Gupta, and M.L. Soffa
ACM SIGPLAN Conference on Programming Language Design and Implementation,
pages 64-76, Atlanta, Georgia, May 1999.
ISCA Value Prediction in VLIW Machines
T. Nakra, R. Gupta, and M.L. Soffa
ACM/IEEE 26th International Symposium on Computer Architecture,
pages 258-269, Atlanta, Georgia, May 1999.
HPCA Global Context-based Value Prediction
T. Nakra, R. Gupta, and M.L. Soffa
IEEE 5th International Symposium on High Performance Computer Architecture,
pages 4-12, Orlando, Florida, January 1999.
PACT Capturing the Effects of Code Improving Transformations
C. Jaramillo, R. Gupta, and M.L. Soffa
International Conference on Parallel Architectures and Compilation Techniques,
pages 118-123, Paris, France, October 1998.
PACT Superscalar Execution with Direct Data Forwarding
S. Onder and R. Gupta
International Conference on Parallel Architectures and Compilation Techniques,
pages 130-135, Paris, France, October 1998.
PLDI Complete Removal of Redundant Expressions
R. Bodik, R. Gupta and M.L. Soffa
ACM SIGPLAN Conference on Programming Language Design and Implementation,
pages 1-14, Montreal, Canada, June 1998.
ICCL Automatic Generation of Microarchitecture Simulators
S. Onder and R. Gupta
IEEE International Conference on Computer Languages,
pages 80-89, Chicago, Illinois, May 1998.
ICCL Path Profile Guided Partial Redundancy Elimination Using Speculation
R. Gupta, D. Berson, and J.Z. Fang
IEEE International Conference on Computer Languages,
pages 230-239, Chicago, Illinois, May 1998.
MICRO Resource-Sensitive Profile-Directed Data Flow Analysis for Code Optimization
R. Gupta, D. Berson, and J.Z. Fang
IEEE/ACM 30th International Symposium on Microarchitecture,
pages 558-568, Research Triangle Park, North Carolina, December 1997.
PACT Path Profile Guided Partial Dead Code Elimination Using Predication
R. Gupta, D. Berson, and J.Z. Fang
International Conference on Parallel Architectures and Compilation Techniques,
pages 102-115, San Francisco, California, November 1997.
PLDI Partial Dead Code Elimination using Slicing Transformations
R. Bodik and R. Gupta
ACM SIGPLAN Conference on Programming Language Design and Implementation,
pages 159-170, Las Vegas, Nevada, June 1997.
PLDI Interprocedural Conditional Branch Elimination
R. Bodik, R. Gupta, and M.L. Soffa
ACM SIGPLAN Conference on Programming Language Design and Implementation,
pages 146-158, Las Vegas, Nevada, June 1997.
PACT Resource Spackling: A Framework for Integrating Register Allocation in Local and Global Schedulers
D. Berson, R. Gupta, and M.L. Soffa
International Conference on Parallel Architectures and Compilation Techniques,
IFIP Transactions A-50, pages 135-146, Montreal, Canada, August 1994.
ACM
LOPLAS
Optimizing Array Bound Checks Using Flow Analysis
R. Gupta
ACM Letters on Programming Languages and Systems,
Vol.2, Nos.1-4, pages 135-150, March-December 1994.
ACM
TOPLAS
Efficient Register Allocation Via Coloring Using Clique Separators
R. Gupta, M.L. Soffa, and D. Ombres
ACM Transactions on Programming Languages and Systems,
Vol. 16, No. 3, pages 370-386, May 1994.
PLDI A Practical Data Flow Framework for Array Reference Analysis and its Application in Optimizations
E. Duesterwald, R. Gupta, and M.L. Soffa
ACM SIGPLAN Conference on Programming Language Design and Implementation,
pages 68-77, Albuquerque, New Mexico, June 1993.
PACT URSA: A Unified ReSource Allocator for Registers and Functional Units in VLIW Architectures
D. Berson, R. Gupta, and M.L. Soffa
Conference on Architectures and Compilation Techniques for Fine and Medium Grain Parallelism,
IFIP Transactions A-23, pages 243-254, Orlando, Florida, January 1993.
SC Improving Instruction Cache Performance by Reducing Cache Pollution
R. Gupta and Chi-Hung Chi
Supercomputing'90,
pages 82-91, New York, November 1990.
PLDI A Fresh Look at Optimizing Array Bound Checks
R. Gupta
ACM SIGPLAN Conference on Programming Language Design and Implementation,
pages 272-282, White Plains, NY, June 1990.
IEEE
TSE
Region Scheduling: An Approach for Detecting and Redistributing Parallelism
R. Gupta and M.L. Soffa
IEEE Transactions on Software Engineering,
Vol. 16, No. 4, pages 421-431, April 1990.
PLDI Register Allocation via Clique Separators
R. Gupta, M.L. Soffa, and T.F. Steele
ACM SIGPLAN Conference on Programming Language Design and Implementation,
pages 264-275, Portland, Oregon, June 1989.
PPEALS Compile-time Techniques for Efficient Utilization of Parallel Memories
R. Gupta and M.L. Soffa
ACM SIGPLAN Symposium on Parallel Programming: Experience with Applications,
Languages and Systems
, pages 235-246, New Haven, July 1988.