Long Zhang (张龙)

Senior SRE at Electrolux (Sweden)
Ph.D. in Software Reliability and Chaos Engineering

Long Zhang (Gluck Zhang) is now a senior SRE at Electrolux working on its IoT system's observability and reliability. Long holds a Ph.D. degree in software reliability from KTH Royal Institute of Technology, Sweden. His research work focuses on self-healing software, chaos engineering, and antifragile systems. During Long's Ph.D. study, he was supervised by Professors Martin Monperrus and Benoit Baudry, and funded by Wallenberg AI, Autonomous Systems and Software Program (WASP). Long received his BE degree and ME degree in software engineering from Harbin Institute of Technology, China. Before his Ph.D. study, Long was hired by Tencent as a software developer and project manager, who was responsible for university-enterprise cooperation projects design and development.

Interests

Publications

[TDSC] Highly Available Blockchain Nodes With N-Version Design (Javier Ron, César Soto-Valero, Long Zhang, Benoit Baudry, Martin Monperrus), Research paper, TDSC, 2023.

[DiVA] Application-level Chaos Engineering (Long Zhang), Ph.D. thesis, DiVA, 2022.

[DLT] Chaos Engineering of Ethereum Blockchain Clients (Long Zhang, Javier Ron, Benoit Baudry and Martin Monperrus), Research paper, ACM DLT, 2023.

[TDSC] Maximizing Error Injection Realism for Chaos Engineering with System Calls (Long Zhang, Brice Morin, Benoit Baudry and Martin Monperrus), Research paper, TDSC, 2021.

[TR] Production Monitoring to Improve Test Suites (Deepika Tiwari, Long Zhang, Martin Monperrus, Benoit Baudry), Research paper, TR, 2021.

[arXiv] Automatic Observability for Dockerized Java Applications (Long Zhang, Deepika Tiwari, Brice Morin, Benoit Baudry and Martin Monperrus), Research paper, 1912.06914, arXiv, 2019.

[ISSRE 2019] TripleAgent: Monitoring, Perturbation And Failure-obliviousness for Automated Resilience Improvement in Java Applications (Long Zhang and Martin Monperrus), Research paper, ISSRE 2019.

[FGCS] Observability and Chaos Engineering on System Calls for Containerized Applications in Docker (Jesper Simonsson, Long Zhang, Brice Morin, Benoit Baudry and Martin Monperrus), Research paper, Future Generation Computer Systems, 2021.

[TSE] A Chaos Engineering System for Live Analysis and Falsification of Exception-handling in the JVM (Long Zhang, Brice Morin, Philipp Haller, Benoit Baudry and Martin Monperrus), Research paper, TSE 2019.

Co-supervision

[DiVA] Cloud native chaos engineering for IoT systems (Björnberg, Adam), Master thesis, DiVA, 2021.

[DiVA] Chaos Engineering for Containerized Applications with Multi-Version Deployments (Knapen, Adriaan), Master thesis, DiVA, 2021.

[DiVA] Self-healing Middleware Support for Django Web Applications (Tu, Yi-Pei), Master thesis, DiVA, 2020.

[DiVA] Distributed Trace Comparisons for Code Review: A System Design and Practical Evaluation (Rabo, Hannes), Master thesis, DiVA, 2020.

[DiVA] Observability and Chaos Engineering for System Calls in Containerized Applications (Simonsson, Jesper), Master thesis, DiVA, 2019.

[DiVA] Simulation of chaos engineering for Internet-scale software with ns-3 (Zubayer Anton, Luong Tai), Bachelor thesis, DiVA, 2018.

Education

Ph.D. student in computer science, KTH Royal Institute of Technology, 2018 - 2022

Master in software engineering, Harbin Institute of Technology, 2013 - 2015

Bachelor in software engineering, Harbin Institute of Technology, 2009 - 2013

Experience

Senior Site Reliability Engineer at ElectroluxSince 2022.10

Manager of University Relations at Tencent2015.07 ~ 2017.11

Assistant Engineer (Intern) at Tencent2014.07 ~ 2015.07 、 2012.07 ~ 2013.07

Contact

Email: gluckzhang[at]gmail.com

LinkedIn: gluckzhang

Address: Electrolux, S:t Göransgatan 143, 112 17 Stockholm, Sweden