Publications in Selected Areas

Parallel Computing: PL/Compiler, Runtime/OS, and Architectural Support

MICRO	JetStream: Graph Analytics on Streaming Data with Event-Driven Hardware Accelerator S. Rahman, M. Afarin, N. Abu-Ghazaleh, and R. Gupta The 54th IEEE/ACM International Symposium on Microarchitecture, pages 1-15, Athens, Greece, October 2021.
EuroSys	Tripoline: Generalized Incremental Graph Processing via Graph Triangle Inequality *X. Jiang, C. Xu, X. Yin, Z. Zhao, and R. Gupta (* co-primary)** European Conference on Computer Systems, pages 17-32, Edinburgh, Scotland, UK, April 2021.
BigData	BEAD: Batched Evaluation of Iterative Graph-Queries with Evolving Analytics Demands A. Mazloumi, C. Xu, Z. Zhao, and R. Gupta IEEE International Conference on Big Data, pages 461-468, Atlanta, Georgia, December 2020.
MICRO	GraphPulse: An Event-Driven Hardware Accelerator for Asynchronous Graph Processing S. Rahman, N. Abu-Ghazaleh, and R. Gupta The 53rd Annual IEEE/ACM International Symposium on Microarchitecture, pages 908-921, Athens, Greece, October 2020.
EuroSys	Subway: Minimizing Data Transfer during Out-of-GPU-Memory Graph Processing A. Nodehi, Z. Zhao, and R. Gupta European Conference on Computer Systems, 16 pages, Heraklion, Crete, Greece, April 2020.
BigData	MultiLyra: Scalable Distributed Evaluation of Batches of Iterative Graph Queries A. Mazloumi, X. Jiang, and R. Gupta IEEE International Conference on Big Data, pages 349-358, Los Angeles, CA, December 2019.
ASPLOS	PnP: Pruning and Prediction for Point-To-Point Iterative Graph Analytics C. Xu, K. Vora, and R. Gupta ACM 24nd International Conference on Architectural Support for Programming Languages and Operating Systems, pages 587-600, Providence, RI, April 2019.
ASPLOS	KickStarter: Fast and Accurate Computations on Streaming Graphs via Trimmed Approximations K. Vora, R. Gupta, and G. Xu ACM 22nd International Conference on Architectural Support for Programming Languages and Operating Systems, 13 pages, Xi'an, China, April 2017.
ASPLOS	CoRAL: Confined Recovery in Distributed Asynchronous Graph Processing K. Vora, C. Tian, R. Gupta, and Z. Hu ACM 22nd International Conference on Architectural Support for Programming Languages and Operating Systems, 13 pages, Xi'an, China, April 2017.
TACO	Synergistic Analysis of Evolving Graphs K. Vora, R. Gupta, and G. Xu ACM Transactions on Architecture and Code Optimization, Volume 13, Issue 4, Article No. 32, 27 pages, October 2016.
USENIX ATC	Load the Edges You Need: A Generic I/O Optimization for Disk-based Graph Processing K. Vora, G. Xu, and R. Gupta USENIX Annual Technical Conference, pages 507-522, Denver, Colorado, June 2016.
HPDC	Efficient Processing of Large Graphs via Input Reduction A. Kusum, K. Vora, R. Gupta, and I. Neamtiu 25th ACM International Symposium on High-Performance Parallel and Distributed Computing, pages 245-257, Kyoto, Japan, May-June 2016.
HPDC	Parallel Execution Profiles Z. Benavides, R. Gupta, and X. Zhang 25th ACM International Symposium on High-Performance Parallel and Distributed Computing, pages 215-218, Kyoto, Japan, May-June 2016.
ICS	CuMAS: Data Transfer Aware Multi-Application Scheduling for Shared GPUs M. Belviranli, F. Khorasani, L.N. Bhuyan, and R. Gupta ACM 30th International Conference on Supercomputing, 12 pages, Istanbul, Turkey, June 2016.
IPDPS	Eliminating Intra-warp Load Imbalance in Irregular Nested Patterns via Collaborative Task Engagement F. Khorasani, B. Rowe, R. Gupta, and L.N. Bhuyan IEEE International Parallel and Distributed Processing Symposium, pages 524-533, May 2016.
TACO	Tumbler: An Effective Load Balancing Technique for MultiCPU Multicore Systems K.K. Pusukuri, R. Gupta, and L.N. Bhuyan ACM Transactions on Architecture and Code Optimization, Volume 12, Issue 4, Article No. 36, 24 pages, January 2016.
MICRO	Efficient Warp Execution in Presence of Divergence with Collaborative Context Collection F. Khorasani, R. Gupta, and L.N. Bhuyan The 48th Annual IEEE/ACM International Symposium on Microarchitecture, pages 204-215, Waikiki, Hawaii, December 2015.
PACT	Scalable SIMD-Efficient Graph Processing on GPUs F. Khorasani, R. Gupta, and L.N. Bhuyan International Conference on Parallel Architectures and Compilation Techniques, pages 39-50, San Francisco, California, October 2015. WS & VR download: https://github.com/farkhor/WS-VR/
PACT	Stadium Hashing: Scalable and Flexible Hashing on GPUs F. Khorasani, M. Belviranli, R. Gupta, and L.N. Bhuyan International Conference on Parallel Architectures and Compilation Techniques, pages 63-74, San Francisco, California, October 2015.
ICS	PeerWave: Exploiting Wavefront Parallelism on GPUs with Peer-SM Synchronization M. Belviranli, P. Deng, L.N. Bhuyan, R. Gupta, and Q. Zhu ACM 29th International Conference on Supercomputing, pages 25-35, Newport Beach, June 2015.
SC	Fence Scoping C. Lin, V. Nagarajan, and R. Gupta ACM/IEEE International Conference for High Performance Computing, Networking, Storage and Analysis, pages 105-116, New Orleans, Louisiana, November 2014.
OOPSLA	ASPIRE: Exploiting Asynchronous Parallelism in Iterative Algorithms using a Relaxed Consistency based DSM K. Vora, S-C. Koduru, and R. Gupta ACM SIGPLAN International Conference on Object Oriented Programming Systems, Languages and Applications, pages 861-878, Portland, Oregon, October 2014.
PACT	Shuffling: A Framework for Lock Contention Aware Thread Scheduling for Multicore Multiprocessor Systems K.K. Pusukuri, R. Gupta, L.N. Bhuyan International Conference on Parallel Architectres and Compilation Techniques, pages 289-300, Edmonton, Alberta, Canada, August 2014.
HPDC	CuSha: Vertex-Centric Graph Processing on GPUs F. Khorasani, K. Vora, R. Gupta, and L.N. Bhuyan 23rd ACM International Symposium on High-Performance Parallel and Distributed Computing, pages 239-251, Vancouver, Canada, June 2014. download: http://farkhor.github.io/CuSha/
ICS	Address-aware Fences C. Lin, V. Nagarajan, and R. Gupta ACM 27th International Conference on Supercomputing, pages 313-324, June 2013.
TACO HiPEAC	ADAPT: A Framework for Coscheduling Multithreaded Programs K.K. Pusukuri, R. Gupta, and L.N. Bhuyan ACM Transactions on Architecture and Code Optimization, special issue of papers presented at HiPEAC, Volume 9, Issue 4, Article No. 45, 25 pages, January 2013.
TACO HiPEAC	A Dynamic Self Scheduling Scheme for Heterogeneous Multiprocessor Architectures M.E. Belviranli, L.N. Bhuyan, and R. Gupta ACM Transactions on Architecture and Code Optimization, special issue of papers presented at HiPEAC, Volume 9, Issue 4, Article No. 57, 20 pages, January 2013.
PLDI	Effective Parallelization of Loops in the Presence of I/O Operations M. Feng, R. Gupta, and I. Neamtiu ACM SIGPLAN Conference on Programming Language Design and Implementation, pages 487-498, Beijing, China, June 2012.
ASPLOS	Efficient Sequential Consistency via Conflict Ordering C. Lin, V. Nagarajan, R. Gupta, and B. Rajaram ACM 17th International Conference on Architectural Support for Programming Languages and Operating Systems, pages 273-286, London, UK, March 2012.
TACO HiPEAC	PLDS: Partitioning Linked Data Structures for Parallelism M. Feng, C. Lin, and R. Gupta ACM Transactions on Architecture and Code Optimization, special issue of papers presented at HiPEAC, Volume 8, Issue 4, Article No. 38, 21 pages, January 2012.
TACO HiPEAC	Thread Tranquilizer: Dynamically Reducing Performance Variation K.K. Pusukuri, R. Gupta, L.N. Bhuyan ACM Transactions on Architecture and Code Optimization, special issue of papers presented at HiPEAC, Volume 8, Issue 4, Article No. 46, 21 pages, January 2012.
PACT	No More Backstabbing... A Faithful Scheduling Policy for Multithreaded Programs K.K. Pusukuri, R. Gupta, L.N. Bhuyan The 20th International Conference on Parallel Architectres and Compilation Techniques, pages 12-21, Galveston Island, Texas, October 2011.
PPoPP	SpiceC: Scalable Parallelism via implicit copying and explicit Commit M. Feng, R. Gupta, and Y. Hu 16th ACM SIGPLAN Symposium on Principles and Practices of Parallel Programming, pages 69-80, San Antonio, Texas, February 2011.
PPoPP	Enhanced Speculative Parallelization Via Incremental Recovery C. Tian, C. Lin, M. Feng, and R. Gupta 16th ACM SIGPLAN Symposium on Principles and Practices of Parallel Programming, pages 189-200, San Antonio, Texas, February 2011.
PACT	Recipient of a PACT 2010 Best Paper Award Efficient Sequential Consistency Using Conditional Fences C. Lin, V. Nagarajan, and R. Gupta The 19th International Conference on Parallel Architectures and Compilation Techniques, pages 295-306, Vienna, Austria, September 2010.
PLDI	Supporting Speculative Parallelization in the Presence of Dynamic Data Structures C. Tian, M. Feng, and R. Gupta ACM SIGPLAN Conference on Programming Language Design and Implementation, pages 62-73, Toronto, Canada, June 2010.
ISMM	Speculative Parallelization Using State Separation and Multiple Value Prediction C. Tian, M. Feng, and R. Gupta Ninth International Symposium on Memory Management, pages 63-72, Toronto, Canada, June 2010.
ISCA	ECMon: Exposing Cache Events for Monitoring V. Nagarajan and R. Gupta ACM/IEEE 36th International Symposium on Computer Architecture, Austin, Texas, June 2009.
SIGOPS	Runtime Monitoring on Multicores via OASES V. Nagarajan and R. Gupta ACM SIGOPS Operating Systems Review, special issue on the interaction among the OS, Compilers, and Multicore Processors, pages 15-24, Vol. 43, No. 2, April 2009 (Invited Paper).
VEE	Architectural Support for Shadow Memory in Multiprocessors V. Nagarajan and R. Gupta ACM SIGPLAN/SIGOPS International Conference on Virtual Execution Environments, pages 1-10, Washington DC, March 2009.
MICRO	Copy Or Discard Execution Model For Speculative Parallelization On Multicores C. Tian, M. Feng, V. Nagarajan, and R. Gupta IEEE/ACM 41th International Symposium on Microarchitecture, pages 330-341, Lake Como, Italy, Nov. 2008.
HPCA	SENSS: Security Enhancement to Symmeteric Shared Memory Multiprocessors Y. Zhang, L. Gao, J. Yang, X. Zhang, and R. Gupta IEEE 11th International Symposium on High Performance Computer Architecture, pages 352-362, San Francisco, California, February 2005.
HPCA	Distributed Path Reservation Algorithms for Multiplexed All-Optical Interconnection Networks X. Yuan, R. Melhem, and R. Gupta IEEE 3rd International Symposium on High-Performance Computer Architecture, pages 38-47, San Antonio, Texas, February 1997.
SC	Compiled Communication for All-Optical TDM Networks X. Yuan, R. Melhem, and R. Gupta Supercomputing'96, Article No. 25, 15 pages, Pittsburgh, Pennsylvania, November 1996.
MICRO	A Shape Matching Approach for Scheduling Fine-Grained Parallelism B. Malloy, R. Gupta, and M.L. Soffa IEEE/ACM 25th International Symposium on Microarchitecture, pages 264-267, Portland, Oregon, December 1992.
JPDC	SPMD Execution of Programs with Pointer-based Data Structures on Distributed-Memory Machines R. Gupta Journal of Parallel and Distributed Computing, special issue on Multicomputer Programming and Application, Vol. 16, No. 2, pages 92-107, October 1992.
ICCL	SPMD Execution of Programs with Dynamic Data Structures on Distributed Memory Machines R. Gupta IEEE 4th International Conference on Computer Languages, pages 232-241, Oakland, California, April 1992.
SC	Techniques for Integrating Parallelizing Transformations and Compiler Based Scheduling Methods, T. Watts, M.L. Soffa, and R. Gupta Supercomputing'92, pages 830-839, Minneapolis, Minnesota, November 1992.
MICRO	Executing Loops on a Fine-Grained MIMD Architecture S. Lee and R. Gupta IEEE/ACM 24th International Symposium on Microarchitecture, pages 199-205, Albuquerque, New Mexico, November 1991.
MICRO	A Fine-grained MIMD Architecture based upon Register Channels R. Gupta IEEE/ACM 23rd Workshop on Microprogramming and Microarchitecture, pages 28-37, Orlando, Florida, December 1990.
SC	The Design of a RISC based Multiprocessor Chip R. Gupta, M. Epstein, and M. Whelan Supercomputing'90, pages 920-929, New York, November 1990.
SC	Loop Displacement: An Approach for Transforming and Scheduling Loops for Parallel Execution R. Gupta Supercomputing'90, pages 388-397, New York, November 1990.
PPoPP	Employing Register Channels for the Exploitation of Instruction Level Parallelism R. Gupta ACM SIGPLAN 2nd Symposium on Principles and Practice of Parallel Programming, pages 118-127, Seattle, Washington, March 1990.
ASPLOS	The Fuzzy Barrier: A Mechanism for High-Speed Synchronization of Processors R. Gupta ACM 3rd International Conference on Architectural Support for Programming Languages and Operating Systems, pages 54-64, Boston, April 1989.

Software Analysis, Debugging, & Testing

ICS	DSGEN: Concolic Testing GPU Implementations of Concurrent Dynamic Data Structures X. Sun and R. Gupta ACM 35th International Conference on Supercomputing, pages 75-87, June 2021.
PACMPL OOPSLA	DProf: Distributed Profiler with Strong Guarantees Z. Benavides, K. Vora, and R. Gupta Proceedings of the ACM on Programming Languages, Volume 3, Issue OOPSLA, Article 156, 24 pages, Athens, Greece, October 2019.
ICSE	Dynamic Slicing for Android T. Azim, A. Alavi, I. Neamtiu, and R. Gupta 41st ACM/IEEE International Conference on Software Engineering, pages 1154-1164, Montreal, Canada, May 2019. download: https://github.com/archer29m/AndroidSlicer
CGO	White-Box Program Tuning W-C. Lee, Y. Liu, P. Liu, S. Ma, H.J. Choi, X. Zhang, and R. Gupta IEEE/ACM International Symposium on Code Generation and Optimization, 12 pages, Washington DC, Feb. 2019.
IPDPS	COMPI: Concolic Testing for MPI Applications H. Li, S. Li, Z. Benavides, Z. Chen, and R. Gupta 32nd IEEE International Parallel and Distributed Processing Symposium, pages 865-874, Vancouver, British Columbia, May 2018.
SC	ParaStack: Efficient Hang Detection for MPI Programs at Large Scale H. Li, Z. Chen, and R. Gupta ACM/IEEE International Conference for High Performance Computing, Networking, Storage and Analysis, Article No. 63, 12 pages, Denver, Colorado, November 2017.
OOPSLA	RAIVE: Runtime Assessment of Floating-Point Instability by Vectorization W-C. Lee, T. Bao, Y. Zheng, X. Zhang, K. Vora, and R. Gupta ACM SIGPLAN International Conference on Object Oriented Programming Systems, Languages and Applications, pages 623-638, Pittsburgh, Pennsylvania, October 2015.
CGO	DrDebug: Deterministic Replay based Cyclic Debugging with Dynamic Slicing Y. Wang, H. Patil, C. Pereira, G. Lueck, R. Gupta, and I. Neamtiu IEEE/ACM International Symposium on Code Generation and Optimization, pages 98-108, Orlando, Florida, February 2014. DrDebug download: Replay & Slicing
CGO	Lightweight Fault Detection in Parallelized Programs L. Tan, M. Feng, and R. Gupta IEEE/ACM International Symposium on Code Generation and Optimization, pages 1-11, Shenzhen, China, February 2013.
TOPLAS	Execution Suppression: An Automated Iterative Technique for locating Memory Errors D. Jeffrey, V. Nagarajan, and R. Gupta ACM Transactions on Programming Languages and Systems, Vol. 32, No. 5, 36 pages, May 2010.
PASTE	Learning Universal Probabilistic Models for Fault Localization M. Feng and R. Gupta Ninth ACM SIGPLAN-SIGSOFT Workshop on Program Analysis for Software Tools and Engineering, pages 81-88, Toronto, Canada, June 2010.
ICSM	Detecting Virus Mutations Via Dynamic Matching M. Feng and R. Gupta 25th IEEE International Conference on Software Maintenance, pages 105-114, Edmonton, Canada, September 2009.
ICSM	Effective and Efficient Localization of Multiple Faults Using Value Replacement D. Jeffrey, N. Gupta, and R. Gupta 25th IEEE International Conference on Software Maintenance, pages 221-230, Edmonton, Canada, September 2009.
ICSM	Identifying the Root Causes of Memory Bugs Using Corrupted Memory Location Suppression D. Jeffrey, N. Gupta, and R. Gupta International Conference on Software Maintenance, pages 356-365, Beijing, China, September 2008.
ICSM	Dynamic Slicing of Multithreaded Programs for Race Detection S. Tallam, C. Tian, and R. Gupta International Conference on Software Maintenance, pages 97-106, Beijing, China, September 2008.
ISSTA	Fault Localization Using Value Replacement D. Jeffrey, N. Gupta, and R. Gupta International Symposium on Software Testing and Analysis, pages 167-178, Seattle, July 2008.
ISSTA	Dynamic Recognition of Synchronization Operations for Improved Data Race Detection C. Tian, V. Nagarajan, R. Gupta, and S. Tallam International Symposium on Software Testing and Analysis, pages 143-154, Seattle, July 2008.
ICSM	ONTRAC: A System for Efficient ONline TRACing for Debugging V. Nagarajan, D. Jeffrey, R. Gupta, and N. Gupta International Conference on Software Maintenance, pages 445-454, Paris, September 2007.
ICSM	Matching Control Flow of Program Versions V. Nagarajan, R. Gupta, X. Zhang, M. Madou, B. De Sutter, and K. De Bosschere International Conference on Software Maintenance, pages 84-93, Paris, September 2007.
ACM TACO	Unified Control Flow and Dependence Traces S. Tallam and R. Gupta ACM Transactions on Architecture and Code Optimization, Vol. 4, No. 3, 31 pages, September 2007.
ISSTA	Enabling Tracing of Long-Running Multithreaded Programs via Dynamic Execution Reduction S. Tallam, C. Tian, X. Zhang, and R. Gupta International Symposium on Software Testing and Analysis, pages 207-218, London, July 2007.
PLDI	Towards Locating Execution Omission Errors X. Zhang, S. Tallam, N. Gupta, and R. Gupta ACM SIGPLAN Conference on Programming Language Design and Implementation, pages 415-424, San Diego, June 2007.
FSE	Dynamic Slicing Long Running Programs through Execution Fast Forwarding X. Zhang, S. Tallam, and R. Gupta 14th ACM SIGSOFT Symposium on Foundations of Software Engineering, pages 81-91, Portland, Oregon, November 2006.
PLDI	Pruning Dynamic Slices With Confidence X. Zhang, N. Gupta, and R. Gupta ACM SIGPLAN Conference on Programming Language Design and Implementation, pages 169-180, Ottawa, Canada, June 2006.
ICSE	Locating Faults Through Automated Predicate Switching X. Zhang, N. Gupta, and R. Gupta IEEE/ACM International Conference on Software Engineering, pages 272-281, Shanghai, China, May 2006.
ASE	Locating Faulty Code Using Failure-Inducing Chops N. Gupta, H. He, X. Zhang, and R. Gupta IEEE/ACM International Conference on Automated Software Engineering, pages 263-272, Long Beach, California, Nov. 2005.
ESEC -FSE	Matching Execution Histories of Program Versions X. Zhang and R. Gupta Joint 10th European Software Engineering Conference and 13th ACM SIGSOFT Symposium on the Foundations of Software Engineering, pages 197-206, Lisbon, Portugal, September 2005.
ACM TACO	Whole Execution Traces and their Applications X. Zhang and R. Gupta ACM Transactions on Architecture and Code Optimization, Vol. 2, No. 3, pages 301-334, September 2005.
PACT	Extended Whole Program Paths S. Tallam, R. Gupta, and X. Zhang International Conference on Parallel Architectures and Compilation Techniques, pages 17-26, St. Loius, Missouri, September 2005.
ACM TOPLAS	Cost and Precision Tradeoffs of Dynamic Data Slicing Algorithms X. Zhang, R. Gupta, and Y. Zhang ACM Transactions on Programming Languages and Systems, Vol. 27, No. 4, pages 631-661, July 2005.
MICRO	Whole Execution Traces X. Zhang and R. Gupta IEEE/ACM 37th International Symposium on Microarchitecture, pages 105-116, Portland, Oregan, December 2004.
PLDI	Cost Effective Dynamic Program Slicing X. Zhang and R. Gupta ACM SIGPLAN Conference on Programming Language Design and Implementation, pages 94-106, Washington D.C., June 2004.
ICSE	Effective Forward Computation of Dynamic Slices Using Reduced Ordered Binary Decision Diagrams X. Zhang, R. Gupta, and Y. Zhang IEEE/ACM International Conference on Software Engineering, pages 502-511, Edinburgh, UK, May 2004.
CGO	Extending Path Profiling across Loop Backedges and Procedure Boundaries S. Tallam, X. Zhang, and R. Gupta Second Annual IEEE/ACM International Symposium on Code Generation and Optimization, pages 251-262, San Jose, CA, March 2004.
ICSE	Recipient of ICSE 2003 Distinguished Paper Award. Precise Dynamic Slicing Algorithms X. Zhang, R. Gupta, and Youtao Zhang IEEE/ACM International Conference on Software Engineering, pages 319-329, Portland, Oregon, May 2003.
CGO	Hiding Program Slices for Software Security X. Zhang and R. Gupta First Annual IEEE/ACM International Symposium on Code Generation and Optimization, pages 325-336, San Francisco, CA, March 2003.
PLDI	Timestamped Whole Program Path Representation and its Applications Y. Zhang and R. Gupta ACM SIGPLAN Conference on Programming Language Design and Implementation, pages 180-190, Snowbird, Utah, June 2001.
ESEC -FSE	Comparison Checking: An Approach to Avoid Debugging of Optimized Code C. Jaramillo, R. Gupta, and M.L. Soffa Joint 7th European Software Engineering Conference and 7th ACM SIGSOFT Symposium on the Foundations of Software Engineering, LNCS 1687, Springer Verlag, pages 268-284, Toulouse, France, Sept. 1999.
ACM TOPLAS	A Practical Framework for Demand-Driven Interprocedural Data Flow Analysis E. Duesterwald, R. Gupta, and M.L. Soffa ACM Transactions on Programming Languages and Systems, Vol. 19, No. 6, pages 992-1030, November 1997.
ACM TOSEM	Hybrid Slicing: Integrating Dynamic Information with Static Analysis R. Gupta, M.L. Soffa, and J. Howard ACM Transactions on Software Engineering and Methodology, Vol. 6, No. 4, pages 370-397, October 1997.
ESEC -FSE	Refining Data Flow Information using Infeasible Paths R. Bodik, R. Gupta, and M.L. Soffa Joint 6th European Software Engineering Conference and 5th ACM SIGSOFT Symposium on the Foundations of Software Engineering, LNCS 1301, Springer Verlag, pages 361-377, Zurich, Switzerland, September 1997.
ICSE	A Demand-Driven Analyzer for Data Flow Testing at the Integration Level E. Duesterwald, R. Gupta, and M.L. Soffa IEEE/ACM International Conference on Software Engineering, pages 575-586, Berlin, Germany, March 1996.
FSE	Hybrid Slicing: An Approach for Refining Static Slices using Dynamic Information R. Gupta and M.L. Soffa ACM SIGSOFT 3rd Symposium on the Foundations of Software Engineering, pages 29-40, Washington, DC, October 1995.
ICSM	Priority Based Data Flow Testing R. Gupta and M.L. Soffa IEEE-CS International Conference on Software Maintenance, pages 348-357, Nice, France, October 1995.
ICSM	A Framework for Partial Data Flow Analysis R. Gupta and M.L. Soffa IEEE-CS International Conference on Software Maintenance, pages 4-13, Victoria, British Columbia, September 1994.
ACM TOSEM	A Methodology for Controlling the Size of a Test Suite M.J. Harrold, R. Gupta, and M.L. Soffa ACM Transactions on Software Engineering and Methodology, Vol. 2, No. 3, pages 270-285, July 1993.
ICSM	An Approach to Regression Testing using Slicing R. Gupta, M.J. Harrold, and M.L. Soffa IEEE-CS International Conference on Software Maintenance, pages 299-308, Orlando, Florida, November 1992.
POPL	Demand-Driven Computation of Interprocedural Data Flow E. Duesterwald, R. Gupta, and M.L. Soffa ACM SIGPLAN-SIGACT 22nd Symposium on Principles of Programming Languages, pages 37-48, San Francisco, California, January 1995.
POPL	Generalized Dominators and Post-Dominators R. Gupta ACM SIGPLAN-SIGACT 19th Symposium on Principles of Programming Languages, pages 246-257, Albuquerque, New Mexico, January 1992.
ISTAV	Loop Monotonic Computations: An Approach for the Efficient Run-time Detection of Races R. Gupta and M. Spezialetti SIGSOFT Symposium on Testing, Analysis, and Verification, pages 98-111, Victoria, Canada, October 1991.
ICSM	A Methodology for Controlling the Size of a Test Suite M.J. Harrold, R. Gupta, and M.L. Soffa IEEE-CS International Conference on Software Maintenance, pages 302-310, San Diego, CA, November 1990.

High-Performance & Embedded Processors: Compiler & Architectural Support

ACM TACO	Dynamic Access Distance Driven Cache Replacement M. Feng, C. Tian, C. Lin, and R. Gupta ACM Transactions on Architecture and Code Optimization, Vol. 8, No. 3, Article 14, 30 pages, October 2011.
HiPEAC	Compiler-Assisted Memory Encryption for Embedded Processors V. Nagarajan, R. Gupta, and A. Krishnaswamy International Conference on High Performance Embedded Architectures and Compilers, Ghent, Belgium, January 2007.
ACM TECS	Dynamic Coalescing for 16-bit Instructions A. Krishnaswamy and R. Gupta ACM Transactions on Embedded Computing Systems in special issue of selected LCTES'03 papers, Vol. 4, No. 1, pages 3-37, Feb. 2005.
MICRO	Efficient Use of Invisible Registers in Thumb Code A. Krishnaswamy and R. Gupta IEEE/ACM 38th International Symposium on Microarchitecture, pages 30-40, Barcelona, Spain, Nov. 2005.
HiPEAC	Exploiting Computation Reuse Cache to Reduce Energy in Network Processors B. Li, G. Venkatesh, B. Calder, and R. Gupta International Conference on High Performance Embedded Architectures and Compilers, LNCS 3793, Springer Verlag, pages 251-265, Barcelona, Spain, Nov. 2005.
ACM TODAES	Frequent Value Encoding for Low Power Data Buses J. Yang, R. Gupta, and C. Zhang ACM Transactions on Design Automation of Electronic Systems, Vol. 9, No. 3, pages 354-384, July 2004.
20 Years of PLDI	Retrospective -- Complete Removal of Redundant Expressions R. Bodik, R. Gupta and M.L. Soffa 20 Years of the ACM/SIGPLAN Conference on Programming Language Design and Implementation (1979-1999): A Selection, ACM SIGPLAN Notices, Vol. 39, No. 4, pages 596-597, April 2004.
CASES	Simple Offset Assignment in Presence of Subword Data B. Li and R. Gupta International Conference on Compilers, Architecture, and Synthesis of Embedded Systems, pages 12-23, San Jose, CA, October 2003.
POPL	Bitwidth Aware Global Register Allocation S. Tallam and R. Gupta 30th Annual ACM SIGPLAN-SIGACT Symposium on Principles of Programming Languages, pages 85-96, New Orleans, LA, January 2003.
ACM TECS	Frequent Value Locality and its Applications J. Yang and R. Gupta ACM Transactions on Embedded Computing Systems (inaugural issue), Vol. 1, No. 1, pages 79-105, November 2002.
MICRO	Energy Efficient Frequent Value Data Cache Design J. Yang and R. Gupta IEEE/ACM 35th International Symposium on Microarchitecture, pages 197-207, Istanbul, Turkey, November 2002.
CASES	Bit Section Instruction Set Extension of ARM for Embedded Applications B. Li and R. Gupta International Conference on Compilers, Architecture, and Synthesis for Embedded Systems, pages 69-78, Grenoble, France, October 2002.
ICS	Load and Store Reuse Using Register File Contents S. Onder and R. Gupta ACM 15th International Conference on Supercomputing, pages 289-302, Sorrento, Naples, Italy, June 2001.
MICRO	Frequent Value Compression in Data Caches J. Yang, Y. Zhang, and R. Gupta IEEE/ACM 33rd International Symposium on Microarchitecture, pages 258-265, Monterey, CA, December 2000.
ASPLOS	Frequent Value Locality and Value-Centric Data Cache Design Y. Zhang, J. Yang, and R. Gupta ACM 9th International Conference on Architectural Support for Programming Languages and Operating Systems, pages 150-159, Cambridge, MA, November 2000.
PLDI	ABCD: Eliminating Array Bounds Checks on Demand R. Bodik, R. Gupta, and V. Sarkar ACM SIGPLAN Conference on Programming Language Design and Implementation, pages 321-333, Vancouver B.C., Canada, June 2000.
MICRO	Dynamic Memory Disambiguation in the Presence of Out-of-order Store Issuing S. Onder and R. Gupta IEEE/ACM 32nd International Symposium on Microarchitecture, pages 170-176, Haifa, Israel, November 1999.
PACT	Caching and Predicting Branch Sequences for Improved Fetch Effectiveness S. Onder, J. Xu, and R. Gupta International Conference on Parallel Architectures and Compilation Techniques, pages 294-302, Newport Beach, California, October 1999.
PLDI	Load-Reuse Analysis: Design and Evaluation R. Bodik, R. Gupta, and M.L. Soffa ACM SIGPLAN Conference on Programming Language Design and Implementation, pages 64-76, Atlanta, Georgia, May 1999.
ISCA	Value Prediction in VLIW Machines T. Nakra, R. Gupta, and M.L. Soffa ACM/IEEE 26th International Symposium on Computer Architecture, pages 258-269, Atlanta, Georgia, May 1999.
HPCA	Global Context-based Value Prediction T. Nakra, R. Gupta, and M.L. Soffa IEEE 5th International Symposium on High Performance Computer Architecture, pages 4-12, Orlando, Florida, January 1999.
PACT	Capturing the Effects of Code Improving Transformations C. Jaramillo, R. Gupta, and M.L. Soffa International Conference on Parallel Architectures and Compilation Techniques, pages 118-123, Paris, France, October 1998.
PACT	Superscalar Execution with Direct Data Forwarding S. Onder and R. Gupta International Conference on Parallel Architectures and Compilation Techniques, pages 130-135, Paris, France, October 1998.
PLDI	Complete Removal of Redundant Expressions R. Bodik, R. Gupta and M.L. Soffa ACM SIGPLAN Conference on Programming Language Design and Implementation, pages 1-14, Montreal, Canada, June 1998.
ICCL	Automatic Generation of Microarchitecture Simulators S. Onder and R. Gupta IEEE International Conference on Computer Languages, pages 80-89, Chicago, Illinois, May 1998.
ICCL	Path Profile Guided Partial Redundancy Elimination Using Speculation R. Gupta, D. Berson, and J.Z. Fang IEEE International Conference on Computer Languages, pages 230-239, Chicago, Illinois, May 1998.
MICRO	Resource-Sensitive Profile-Directed Data Flow Analysis for Code Optimization R. Gupta, D. Berson, and J.Z. Fang IEEE/ACM 30th International Symposium on Microarchitecture, pages 558-568, Research Triangle Park, North Carolina, December 1997.
PACT	Path Profile Guided Partial Dead Code Elimination Using Predication R. Gupta, D. Berson, and J.Z. Fang International Conference on Parallel Architectures and Compilation Techniques, pages 102-115, San Francisco, California, November 1997.
PLDI	Partial Dead Code Elimination using Slicing Transformations R. Bodik and R. Gupta ACM SIGPLAN Conference on Programming Language Design and Implementation, pages 159-170, Las Vegas, Nevada, June 1997.
PLDI	Interprocedural Conditional Branch Elimination R. Bodik, R. Gupta, and M.L. Soffa ACM SIGPLAN Conference on Programming Language Design and Implementation, pages 146-158, Las Vegas, Nevada, June 1997.
PACT	Resource Spackling: A Framework for Integrating Register Allocation in Local and Global Schedulers D. Berson, R. Gupta, and M.L. Soffa International Conference on Parallel Architectures and Compilation Techniques, IFIP Transactions A-50, pages 135-146, Montreal, Canada, August 1994.
ACM LOPLAS	Optimizing Array Bound Checks Using Flow Analysis R. Gupta ACM Letters on Programming Languages and Systems, Vol.2, Nos.1-4, pages 135-150, March-December 1994.
ACM TOPLAS	Efficient Register Allocation Via Coloring Using Clique Separators R. Gupta, M.L. Soffa, and D. Ombres ACM Transactions on Programming Languages and Systems, Vol. 16, No. 3, pages 370-386, May 1994.
PLDI	A Practical Data Flow Framework for Array Reference Analysis and its Application in Optimizations E. Duesterwald, R. Gupta, and M.L. Soffa ACM SIGPLAN Conference on Programming Language Design and Implementation, pages 68-77, Albuquerque, New Mexico, June 1993.
PACT	URSA: A Unified ReSource Allocator for Registers and Functional Units in VLIW Architectures D. Berson, R. Gupta, and M.L. Soffa Conference on Architectures and Compilation Techniques for Fine and Medium Grain Parallelism, IFIP Transactions A-23, pages 243-254, Orlando, Florida, January 1993.
SC	Improving Instruction Cache Performance by Reducing Cache Pollution R. Gupta and Chi-Hung Chi Supercomputing'90, pages 82-91, New York, November 1990.
PLDI	A Fresh Look at Optimizing Array Bound Checks R. Gupta ACM SIGPLAN Conference on Programming Language Design and Implementation, pages 272-282, White Plains, NY, June 1990.
IEEE TSE	Region Scheduling: An Approach for Detecting and Redistributing Parallelism R. Gupta and M.L. Soffa IEEE Transactions on Software Engineering, Vol. 16, No. 4, pages 421-431, April 1990.
PLDI	Register Allocation via Clique Separators R. Gupta, M.L. Soffa, and T.F. Steele ACM SIGPLAN Conference on Programming Language Design and Implementation, pages 264-275, Portland, Oregon, June 1989.
PPEALS	Compile-time Techniques for Efficient Utilization of Parallel Memories R. Gupta and M.L. Soffa ACM SIGPLAN Symposium on Parallel Programming: Experience with Applications, Languages and Systems, pages 235-246, New Haven, July 1988.