Zizhong (Jeffrey) Chen

Department of Computer Science and Engineering
University of California, Riverside
900 University Avenue, Riverside, CA 92521

Office:    422 Winston Chung Hall
Telephone: +1 (951) 827 2403
Fax:       +1 (951) 827 4643
Email:     My_Last_Name AT cs.ucr.edu

Announcement: Research opportunities are available for visiting scholars, graduate students, and undergraduate students interested in high performance computing, parallel and distributed systems, or big data processing. If you would like to work with me, please feel free to email me.

Biographical Sketch

    Dr. Zizhong (Jeffrey) Chen is a faculty member in the Department of Computer Science and Engineering at the University of California, Riverside. His research interests include high performance computing, parallel and distributed systems, big data analytics, cluster and cloud computing, algorithm-based fault tolerance (ABFT), power- and energy-efficient computing, numerical algorithms and software, and large-scale computer simulations. His research has been supported by the National Science Foundation, the Department of Energy, the CMG Reservoir Simulation Foundation, the Abu Dhabi National Oil Company, Nvidia, and Microsoft Corporation. He has published over 70 papers, many in highly competitive conferences and journals such as HPDC, PPoPP, SC, ICS, IPDPS, TPDS, TC, JPDC, PARCO, SIMAX, SISC, and IBMRD. He has received a CAREER Award from the U.S. National Science Foundation and a Best Paper Award from the International Supercomputing Conference. Dr. Chen is a Senior Member of the IEEE and a Life Member of the ACM. He currently serves as Subject Area Editor for the Elsevier journal Parallel Computing and as Associate Editor for IEEE Transactions on Parallel and Distributed Systems.

Recent Activities

  • Technical Program Committee: SC'16, ICS'16, IPDPS'16, SC'15, PACT'15.
  • Finance and Registration Chair: ICS'15.
  • General Chair: IEEE NAS'16, Long Beach, CA, August 8-10, 2016.
  • Subject Area Editor: Parallel Computing.
  • Associate Editor: IEEE Transactions on Parallel and Distributed Systems.

Selected Publications (with my students underlined)

SC'16 Jieyang Chen*, Li Tan*, Panruo Wu, Dingwen Tao, Hongbo Li, Xin Liang, Sihuan Li, Rong Ge, Laxmi Bhuyan, and Zizhong Chen
GreenLA: Green Linear Algebra Software for GPU-Accelerated Heterogeneous Computing,
Proceedings of the 28th ACM/IEEE International Conference for High Performance Computing, Networking, Storage and Analysis,
Salt Lake City, Utah, USA, Nov 13-18, 2016. Acceptance Rate: 18.4% (82/446). *Authors contributed equally.
HPDC'16 Panruo Wu, Qiang Guan, Nathan DeBardeleben, Sean Blanchard, Dingwen Tao, Xin Liang, Jieyang Chen, and Zizhong Chen
Towards Practical Algorithm Based Fault Tolerance in Dense Linear Algebra,
Proceedings of the 25th ACM International Symposium on High-Performance Parallel and Distributed Computing,
Kyoto, Japan, May 31 - June 4, 2016. Acceptance Rate: 15.5% (20/129).
HPDC'16 Panruo Wu, Dong Li, Zizhong Chen, Jeffrey S. Vetter, Sparsh Mittal
Algorithm-Directed Data Placement in Hybrid Memory,
Proceedings of the 25th ACM International Symposium on High-Performance Parallel and Distributed Computing,
Kyoto, Japan, May 31 - June 4, 2016. Acceptance Rate: 15.5% (20/129).
HPDC'16 Dingwen Tao, Shuaiwen Leon Song, Sriram Krishnamoorthy, Panruo Wu, Xin Liang, Zheng Eddy Zhang, Darren Kerbyson, and Zizhong Chen
New-Sum: A Novel Online ABFT Scheme For General Iterative Methods,
Proceedings of the 25th ACM International Symposium on High-Performance Parallel and Distributed Computing,
Kyoto, Japan, May 31 - June 4, 2016. Acceptance Rate: 15.5% (20/129).
IPDPS'16 Jieyang Chen, Xin Liang, and Zizhong Chen
Online Algorithm-Based Fault Tolerance for Cholesky Decomposition on Heterogeneous Systems with GPUs,
Proceedings of the 30th IEEE International Parallel & Distributed Processing Symposium,
Chicago, Illinois, USA, May 23-27, 2016. Acceptance Rate: 22.98% (114/496).
TACO'16 Li Tan, Zizhong Chen, and Shuaiwen Leon Song
Scalable Energy Efficiency with Resilience for High Performance Computing Systems: A Quantitative Methodology,
ACM Transactions on Architecture and Code Optimization,
Volume 12, Issue 4, January 2016.
IPDPS'15 Li Tan, Shuaiwen Song, Panruo Wu, Zizhong Chen, Rong Ge, and Darren Kerbyson
Investigating the Interplay between Energy Efficiency and Resilience in High Performance Computing,
Proceedings of the 29th IEEE International Parallel & Distributed Processing Symposium,
Hyderabad, India, May 25-29, 2015. Acceptance Rate: 21.77% (108/496).
TPDS'15 Doug Hakkarinen, Panruo Wu, and Zizhong Chen
Fail-Stop Failure Algorithm-Based Fault Tolerance for Cholesky Decomposition,
IEEE Transactions on Parallel and Distributed Systems,
Volume 26, Issue 5, Page 1323-1335, May, 2015.
HPDC'14 Panruo Wu and Zizhong Chen
FT-ScaLAPACK: Correcting Soft Errors On-Line for ScaLAPACK Cholesky, QR, and LU Factorization Routines,
Proceedings of the 23rd ACM International Symposium on High-Performance Parallel and Distributed Computing,
Vancouver, Canada, June 23-27, 2014. Acceptance Rate: 16.2% (21/130).
PARCO'14 Li Tan, Shashank Kothapalli, Longxiang Chen, Omar Hussaini, Ryan Bissiri, and Zizhong Chen
A Survey of Power and Energy Efficient Techniques for High Performance Numerical Linear Algebra Operations,
Parallel Computing,
Vol. 40, No. 10, pp. 559-573, Dec. 2014.
PPoPP'13 Zizhong Chen
Online-ABFT: An Online Algorithm Based Fault Tolerance Scheme for Soft Error Detection in Iterative Methods,
Proceedings of the 18th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming,
Shenzhen, China, February 23-27, 2013. Acceptance Rate: 17.8% (26/146).
HPDC'13 Teresa Davies and Zizhong Chen
Correcting Soft Errors Online in LU Factorization,
Proceedings of the 22nd ACM International Symposium on High-Performance Parallel and Distributed Computing,
New York City, NY, USA, June 17-21, 2013. Acceptance Rate: 15.3% (20/131).
SC'13 Dong Li, Zizhong Chen, Panruo Wu, and Jeffrey Vetter
Rethinking Algorithm-Based Fault Tolerance with a Cooperative Software-Hardware Approach,
Proceedings of the 25th ACM/IEEE International Conference for High Performance Computing, Networking, Storage and Analysis,
Denver, CO, November 17-22, 2013. Acceptance Rate: 19.7% (90/457).
TC'13 Doug Hakkarinen and Zizhong Chen
Multi-Level Diskless Checkpointing,
IEEE Transactions on Computers,
Vol. 62, No. 4, Page 772-783, April, 2013.
HPDC'11 Zizhong Chen
Algorithm-Based Recovery for Iterative Methods without Checkpointing,
Proceedings of the 20th ACM International Symposium on High-Performance Parallel and Distributed Computing,
San Jose, California, June 8-11, 2011. Acceptance Rate: 12.9% (22/170).
ICS'11 Teresa Davies, Christer Karlsson, Hui Liu, Chong Ding, and Zizhong Chen
High Performance Linpack Benchmark: A Fault Tolerant Implementation without Checkpointing,
Proceedings of the 25th ACM International Conference on Supercomputing,
Tucson, Arizona, May 31 - June 4, 2011. Acceptance Rate: 21.7% (35/161).
IPDPS'10 Doug Hakkarinen and Zizhong Chen
Algorithmic Cholesky Factorization Fault Recovery,
Proceedings of the 24th IEEE International Parallel & Distributed Processing Symposium,
Atlanta, GA, USA, April 19-23, 2010. Acceptance Rate: 24.1% (127/527).
SC'09 Zizhong Chen
Optimal Real Number Codes for Fault Tolerant Matrix Operations,
Proceedings of the 21st ACM/IEEE International Conference for High Performance Computing, Networking, Storage and Analysis,
Portland, OR, November 14-20, 2009. Acceptance Rate: 22.6% (59/261).
TC'09 Zizhong Chen and Jack Dongarra
Highly Scalable Self-Healing Algorithms for High Performance Scientific Computing,
IEEE Transactions on Computers,
Vol. 58, No. 11, November, 2009.
JPDC'09 Qishi Wu, Jinzhu Gao, Zizhong Chen, and Mengxia Zhu
Pipelining Parallel Image Compositing and Delivery for Efficient Remote Visualization,
Journal of Parallel and Distributed Computing,
Vol. 69, No. 3, March, 2009.
TPDS'08 Zizhong Chen and Jack Dongarra
Algorithm-Based Fault Tolerance for Fail-Stop Failures,
IEEE Transactions on Parallel and Distributed Systems,
Vol. 19, No. 12, December, 2008.
SISC'07 Julien Langou, Zizhong Chen, George Bosilca, and Jack Dongarra
Recovery Patterns for Iterative Methods in a Parallel Unstable Environment,
SIAM Journal on Scientific Computing,
Vol. 30, No. 1, pp. 102-116, November, 2007.
IBMRD'06 Jack Dongarra, George Bosilca, Zizhong Chen, Victor Eijkhout, Graham Fagg, Erika Fuentes, Julien Langou, Piotr Luszczek, Jelena Pjesivac-Grbovic, Keith Seymour, Haihang You, and Sathish S. Vadhiyar
Self Adapting Numerical Software (SANS) Effort,
IBM Journal of Research and Development,
Volume 50, Number 2/3, Page 223-238, 2006.
IPDPS'06 Zizhong Chen and Jack Dongarra
Algorithm-Based Checkpoint-Free Fault Tolerance for Parallel Matrix Multiplications on Volatile Resources,
Proceedings of the 20th IEEE International Parallel & Distributed Processing Symposium,
Rhodes Island, Greece, April 25-29, 2006.
SIMAX'05 Zizhong Chen and Jack Dongarra
Condition Numbers of Gaussian Random Matrices,
SIAM Journal on Matrix Analysis and Applications,
Volume 27, Number 3, Page 603-620, 2005.
PPoPP'05 Zizhong Chen, Graham E. Fagg, Edgar Gabriel, Julien Langou, Thara Angskun, George Bosilca, and Jack Dongarra
Fault Tolerant High Performance Computing by a Coding Approach,
Proceedings of the 10th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming,
Chicago, Illinois, USA, June 15-17, 2005.
PARCO'03 Zizhong Chen, Jack Dongarra, Piotr Luszczek, and Kenneth Roche
Self Adapting Software for Numerical Linear Algebra and LAPACK for Clusters,
Parallel Computing,
Volume 29, Number 11-12, Page 1723-1743, November-December, 2003.

Last update: Sept 28, 2015.