Publications in CSRankings.org Venues

High-Performance Computing; Operating Systems; Real-Time Systems; Programming Languages and Compilers; Computer Architecture; Software Engineering

USENIX ATC (1), EuroSys (2), HPDC (3), ICS (5), SC (7), RTSS (2), OOPSLA (3), POPL (3), PLDI (14), ASPLOS (6), ISCA (2), HPCA (3), MICRO (13), ICSE (5), ASE (1), ESEC-FSE/FSE (5), ISSTA (3).

78. USENIX ATC Load the Edges You Need: A Generic I/O Optimization for Disk-based Graph Processing (2016)
77. EuroSys   Tripoline: Generalized Incremental Graph Processing via Graph Triangle Inequality (2021)
76. EuroSys Subway: Minimizing Data Transfer during Out-of-GPU-Memory Graph Processing (2020)
75. HPDC Efficient Processing of Large Graphs via Input Reduction (2016)
74. HPDC Parallel Execution Profiles (2016)
73. HPDC CuSha: Vertex-Centric Graph Processing on GPUs (2014)
72. ICS   DSGEN: Concolic Testing GPU Implementations of Concurrent Dynamic Data Structures (2021)
71. ICS CuMAS: Data Transfer Aware Multi-Application Scheduling for Shared GPUs (2016)
70. ICS PeerWave: Exploiting Wavefront Parallelism on GPUs with Peer-SM Synchronization (2015)
69. ICS Address-aware Fences (2013)
68. ICS Load and Store Reuse Using Register File Contents (2001)
67. SC ParaStack: Efficient Hang Detection for MPI Programs at Large Scale (2017)
66. SC Fence Scoping (2014)
65. SC Compiled Communication for All-Optical TDM Networks (1996)
64. SC Techniques for Integrating Parallelizing Transformations and Compiler Based Scheduling Methods (1992)
63. SC Loop Displacement: An Approach for Transforming and Scheduling Loops for Parallel Execution (1990)
62. SC Improving Instruction Cache Performance by Reducing Cache Pollution (1990)
61. SC The Design of a RISC based Multiprocessor Chip (1990)
60. RTSS Busy-Idle Profiles and Compact Task Graphs: Compile-time Support for ... Scheduling of Real-Time Tasks (1994)
59. RTSS Applying Compiler Techniques to Scheduling in Real Time Systems (1990)
58. OOPSLA DProf: Distributed Profiler with Strong Guarantees (2019)
57. OOPSLA RAIVE: Runtime Assessment of Floating-Point Instability by Vectorization (2015)
56. OOPSLA ASPIRE: Exploiting Asynchronous Parallelism in Iterative Algorithms using a Relaxed Consistency based DSM (2014)
55. POPL Bitwidth Aware Global Register Allocation (2003)
54. POPL Demand-Driven Computation of Interprocedural Data Flow (1995)
53. POPL Generalized Dominators and Post-Dominators (1992)
52. PLDI Effective Parallelization of Loops in the Presence of I/O Operations (2012)
51. PLDI Supporting Speculative Parallelization in the Presence of Dynamic Data Structures (2010)
50. PLDI Towards Locating Execution Omission Errors (2007)
49. PLDI Pruning Dynamic Slices With Confidence (2006)
48. PLDI Cost Effective Dynamic Program Slicing (2004)
47. PLDI Timestamped Whole Program Path Representation and its Applications (2001)
46. PLDI ABCD: Eliminating Array Bounds Checks on Demand (2000)
45. PLDI Load-Reuse Analysis: Design and Evaluation (1999)
44. PLDI Complete Removal of Redundant Expressions (1998)
43. PLDI Partial Dead Code Elimination using Slicing Transformations (1997)
42. PLDI Interprocedural Conditional Branch Elimination (1997)
41. PLDI A Practical Data Flow Framework for Array Reference Analysis and its Application in Optimizations (1993)
40. PLDI A Fresh Look at Optimizing Array Bound Checks (1990)
39. PLDI Register Allocation via Clique Separators (1989)

38. ASPLOS     PnP: Pruning and Prediction for Point-To-Point Iterative Graph Analytics (2019)
37. ASPLOS KickStarter: Fast and Accurate Computations on Streaming Graphs via Trimmed Approximations (2017)
36. ASPLOS CoRAL: Confined Recovery in Distributed Asynchronous Graph Processing (2017)
35. ASPLOS Efficient Sequential Consistency via Conflict Ordering (2012)
34. ASPLOS Frequent Value Locality and Value-Centric Data Cache Design (2000)
33. ASPLOS The Fuzzy Barrier: A Mechanism for High-Speed Synchronization of Processors (1989)
32. ISCA ECMon: Exposing Cache Events for Monitoring (2009)
31. ISCA Value Prediction in VLIW Machines (1999)
30. MICRO JetStream: Graph Analytics on Streaming Data with Event-Driven Hardware Accelerator (2021)
29. MICRO GraphPulse: An Event-Driven Hardware Accelerator for Asynchronous Graph Processing (2020)
28. MICRO Efficient Warp Execution in Presence of Divergence with Collaborative Context Collection (2015)
27. MICRO Copy Or Discard Execution Model For Speculative Parallelization On Multicores (2008)
26. MICRO Efficient Use of Invisible Registers in Thumb Code (2005)
25. MICRO Whole Execution Traces (2004)
24. MICRO Energy Efficient Frequent Value Data Cache Design (2002)
23. MICRO Frequent Value Compression in Data Caches (2000)
22. MICRO Dynamic Memory Disambiguation in the Presence of Out-of-order Store Issuing (1999)
21. MICRO Resource-Sensitive Profile-Directed Data Flow Analysis for Code Optimization (1997)
20. MICRO A Shape Matching Approach for Scheduling Fine-Grained Parallelism (192)
19. MICRO Executing Loops on a Fine-Grained MIMD Architecture (1991)
18. MICRO A Fine-grained MIMD Architecture based upon Register Channels (1990)
17. HPCA SENSS: Security Enhancement to Symmeteric Shared Memory Multiprocessors (2005)
16. HPCA Global Context-based Value Prediction (1999)
15. HPCA Distributed Path Reservation Algorithms for Multiplexed All-Optical Interconnection Networks (1997)
14. ICSE Dynamic Slicing for Android (2019)
13. ICSE Locating Faults Through Automated Predicate Switching (2006)
12. ICSE Effective Forward Computation of Dynamic Slices Using Reduced Ordered Binary Decision Diagrams (2004)
11. ICSE Precise Dynamic Slicing Algorithms (2003), Recipient of ICSE 2003 Distinguished Paper Award
10. ICSE A Demand-Driven Analyzer for Data Flow Testing at the Integration Level (1996)
9. ASE Locating Faulty Code Using Failure-Inducing Chops (2005)
8. FSE Dynamic Slicing Long Running Programs through Execution Fast Forwarding (2006)
7. ESEC-FSE Matching Execution Histories of Program Versions (2005)
6. ESEC-FSE Comparison Checking: An Approach to Avoid Debugging of Optimized Code (1999)
5. ESEC-FSE Refining Data Flow Information using Infeasible Paths (1997)
4. FSE Hybrid Slicing: An Approach for Refining Static Slices using Dynamic Information (1995)
3. ISSTA Fault Localization Using Value Replacement (2008)
2. ISSTA Dynamic Recognition of Synchronization Operations for Improved Data Race Detection (2008)
1. ISSTA Enabling Tracing of Long-Running Multithreaded Programs via Dynamic Execution Reduction (2007)