ESP Journal of Engineering & Technology Advancements |
© 2021 by ESP JETA |
Volume 1 Issue 1 |
Year of Publication : 2021 |
Authors : Abhishek Gupta |
: 10.56472/25832646/JETA-V1I1P122 |
Abhishek Gupta, 2021. "Scalable Distributed Tracing and Performance Optimization in Microservices", ESP Journal of Engineering & Technology Advancements 1(1): 210-224.
Distributed tracing and performance optimization are two important factors required to keep microservices-based architecture flexible and fault-tolerant. However, as the adoption of microservices grows within organizations, monitoring and dealing with latency, issues, and inter-service coupling arises. This paper discusses the approach to doing distributed tracing or distributed logging in a system with one or more microservices and as explores ways to improve the efficiency of the same. We use Open Telemetry for tracing efforts, Kubernetes for scalable deployment, and machine learning for anomaly detection. Latency and response rate are observed, along with details of service interdependencies. This work provides improved system transparency, MTTR, and resource utilization. Conclusions, approaches, and guidance on its application are offered, thus making the material exceptionally valuable to system architects and developers who work with microservices within cloud platforms.
[1] Ghofrani, J., & Lübke, D. (2018). Challenges of Microservices Architecture: A Survey on the State of the Practice. ZEUS, 2018, 1-8.
[2] Bass, L., Weber, I., & Zhu, L. (2015). DevOps: A software architect’s perspective. Addison-Wesley Professional.
[3] Zvara, Z., Szabó, P. G., Balázs, B., & Benczúr, A. (2019). "Optimizing distributed data stream processing by tracing." Future Generation Computer Systems, 90, 578-591.
[4] Shkuro, Y. (2019). Mastering Distributed Tracing: Analyzing performance in microservices and complex systems. Packt Publishing Ltd.
[5] Chen, Z., & Mace, J. (2020). Towards Energy-Efficient Microservices: Key Performance Indicators and Sustainability Metrics for Cloud Computing. ACM Transactions on Cloud Computing (TOCC), 2020, 1-24.
[6] Baškarada, S., Nguyen, V., & Koronios, A. (2020). Architecting microservices: Practical opportunities and challenges. Journal of Computer Information Systems.
[7] Freitag, F., Caubet, J., & Labarta, J. (2002). On the scalability of tracing mechanisms. In European Conference on Parallel Processing (pp. 97-104). Berlin, Heidelberg: Springer Berlin Heidelberg.
[8] Smit, R. D., & Koster, C. (2020, September). Evaluating the performance of distributed algorithms in large-scale systems. In Proceedings of the International Symposium on Parallel and Distributed Computing (pp. 83-90). Paris, France: Springer.
[9] Las-Casas, P., Papakerashvili, G., Anand, V., & Mace, J. (2019, November). Sifter: Scalable sampling for distributed traces, without feature engineering. In Proceedings of the ACM Symposium on Cloud Computing (pp. 312-324).
[10] Chen, Z., & Mace, J. (2020). Towards Energy-Efficient Microservices: Key Performance Indicators and Sustainability Metrics for Cloud Computing. ACM Transactions on Cloud Computing (TOCC), 2020, 1-24.
[11] Baškarada, S., Nguyen, V., & Koronios, A. (2020). Architecting microservices: Practical opportunities and challenges. Journal of Computer Information Systems.
[12] Munaf, R. M., Ahmed, J., Khakwani, F., & Rana, T. (2019). Microservices architecture: Challenges and proposed conceptual design. In 2019 International Conference on Communication Technologies (ComTech) (pp. 82-87). IEEE.
[13] Chen, Z., & Mace, J. (2020). Towards Energy-Efficient Microservices: Key Performance Indicators and Sustainability Metrics for Cloud Computing. ACM Transactions on Cloud Computing (TOCC), 2020, 1-24.
[14] Munaf, R. M., Ahmed, J., Khakwani, F., & Rana, T. (2019, March). Microservices architecture: Challenges and proposed conceptual design. In 2019 International Conference on Communication Technologies (ComTech) (pp. 82-87). IEEE.
[15] Shkuro, Y. (2019). Mastering Distributed Tracing: Analyzing performance in microservices and complex systems. Packt Publishing Ltd.
[16] Zvara, Z., Szabó, P. G., Balázs, B., & Benczúr, A. (2019). Optimizing distributed data stream processing by tracing. Future Generation Computer Systems, 90, 578-591.
[17] Zvara, Z., Szabó, P. G., Hermann, G., & Benczúr, A. (2017, September). Tracing distributed data stream processing systems. In 2017 IEEE 2nd International Workshops on Foundations and Applications of Self* Systems (FAS* W) (pp. 235-242). IEEE.
[18] Popa, N. M., & Oprescu, A. (2019, December). A data-centric approach to distributed tracing. In 2019 IEEE International Conference on Cloud Computing Technology and Science (CloudCom) (pp. 209-216). IEEE.
[19] Popa, N. M., & Oprescu, A. (2019). "A data-centric approach to distributed tracing." In 2019 IEEE International Conference on Cloud Computing Technology and Science (CloudCom), pp. 209-216. IEEE.
[20] Chen, Z., & Mace, J. (2020). Towards Energy-Efficient Microservices: Key Performance Indicators and Sustainability Metrics for Cloud Computing. ACM Transactions on Cloud Computing (TOCC), 2020, 1-24.
Distributed Tracing, Microservices, Performance Optimization, OpenTelemetry, Kubernetes, Anomaly Detection, Observability, MTTR, Cloud Computing.