Long Zhang (张龙)

Senior SRE at Electrolux (Sweden)
Ph.D. in Software Reliability and Chaos Engineering

Long Zhang (Gluck Zhang) is a senior SRE at Electrolux Group, working on the observability and reliability of its IoT system. He holds a Ph.D. in software reliability from KTH Royal Institute of Technology, Sweden. His research focuses on self-healing software, chaos engineering, and antifragile systems. During his Ph.D. studies, he was supervised by Professors Martin Monperrus and Benoit Baudry and funded by the Wallenberg AI, Autonomous Systems and Software Program (WASP). Long received his BE and ME degrees in software engineering from Harbin Institute of Technology, China. Before his Ph.D. studies, he worked at Tencent as a software developer and project manager, responsible for designing and developing university-enterprise cooperation projects.

Talks

[2024] Beyond Connectivity: Full Life-Cycle Infrastructure Orchestration and Cost Engineering, Code Europe 2024, Kraków, Poland, Jun 10, 2024

[2024] Bring Chaos Engineering to Your Organization in a Fun and Continuous Way (co-speaker: Kristina Kondrashevich), Conf42 Chaos Engineering 2024, Online, Feb 15, 2024

[2024] 3 Steps on How to Bring Chaos Engineering as a daily Responsibility for Highly Complex Systems (co-speaker: Kristina Kondrashevich), Chaos Carnival 2024, Online, Jan 24, 2024

[2022] Chaos Engineering of Ethereum Blockchain Clients, Chaos Carnival 2022, Online, Feb 01, 2022

[2021] Improve Reproducibility of Research Using Docker, WARA-SW Workshop, Online, Dec 22, 2021

[2021] A Chaos Engineering System for Live Analysis and Falsification of Exception-Handling in the JVM, 43rd International Conference on Software Engineering (ICSE 2021), Journal-First Papers Track, Online, May 28, 2021

[2021] Maximizing Error Injection Realism for Chaos Engineering with System Calls, Conf42 Chaos Engineering 2021, Online, Feb 25, 2021

[2020] Application-Level Chaos Engineering in JVM, Conf42 Chaos Engineering 2020, London, Jan 23, 2020

[2019] TripleAgent: Monitoring, Perturbation And Failure-obliviousness for Automated Resilience Improvement in Java Applications, The 2nd Vienna Software Seminar (VSS) on DevOps and Microservice APIs, Vienna, Austria, Aug 29, 2019

Publications

[TDSC] Highly Available Blockchain Nodes With N-Version Design (Javier Ron, César Soto-Valero, Long Zhang, Benoit Baudry, Martin Monperrus), Research paper, TDSC, 2023.

[DiVA] Application-level Chaos Engineering (Long Zhang), Ph.D. thesis, DiVA, 2022.

[DLT] Chaos Engineering of Ethereum Blockchain Clients (Long Zhang, Javier Ron, Benoit Baudry and Martin Monperrus), Research paper, ACM DLT, 2023.

[TDSC] Maximizing Error Injection Realism for Chaos Engineering with System Calls (Long Zhang, Brice Morin, Benoit Baudry and Martin Monperrus), Research paper, TDSC, 2021.

[TR] Production Monitoring to Improve Test Suites (Deepika Tiwari, Long Zhang, Martin Monperrus, Benoit Baudry), Research paper, TR, 2021.

[arXiv] Automatic Observability for Dockerized Java Applications (Long Zhang, Deepika Tiwari, Brice Morin, Benoit Baudry and Martin Monperrus), Research paper, 1912.06914, arXiv, 2019.

[ISSRE 2019] TripleAgent: Monitoring, Perturbation And Failure-obliviousness for Automated Resilience Improvement in Java Applications (Long Zhang and Martin Monperrus), Research paper, ISSRE 2019.

[FGCS] Observability and Chaos Engineering on System Calls for Containerized Applications in Docker (Jesper Simonsson, Long Zhang, Brice Morin, Benoit Baudry and Martin Monperrus), Research paper, Future Generation Computer Systems, 2021.

[TSE] A Chaos Engineering System for Live Analysis and Falsification of Exception-handling in the JVM (Long Zhang, Brice Morin, Philipp Haller, Benoit Baudry and Martin Monperrus), Research paper, TSE 2019.

Co-supervision

[DiVA] Cloud native chaos engineering for IoT systems (Björnberg, Adam), Master thesis, DiVA, 2021.

[DiVA] Chaos Engineering for Containerized Applications with Multi-Version Deployments (Knapen, Adriaan), Master thesis, DiVA, 2021.

[DiVA] Self-healing Middleware Support for Django Web Applications (Tu, Yi-Pei), Master thesis, DiVA, 2020.

[DiVA] Distributed Trace Comparisons for Code Review: A System Design and Practical Evaluation (Rabo, Hannes), Master thesis, DiVA, 2020.

[DiVA] Observability and Chaos Engineering for System Calls in Containerized Applications (Simonsson, Jesper), Master thesis, DiVA, 2019.

[DiVA] Simulation of chaos engineering for Internet-scale software with ns-3 (Zubayer Anton, Luong Tai), Bachelor thesis, DiVA, 2018.

Education

Ph.D. student in computer science, KTH Royal Institute of Technology, 2018 - 2022

Master in software engineering, Harbin Institute of Technology, 2013 - 2015

Bachelor in software engineering, Harbin Institute of Technology, 2009 - 2013

Experience

Senior Site Reliability Engineer at Electrolux, since 2022.10

Manager of University Relations at Tencent, 2015.07 ~ 2017.11

Assistant Engineer (Intern) at Tencent, 2014.07 ~ 2015.07, 2012.07 ~ 2013.07

Contact

Email: gluckzhang[at]gmail.com

LinkedIn: gluckzhang

Address: Electrolux, S:t Göransgatan 143, 112 17 Stockholm, Sweden