The paper examines the features of front-end technology formation in cloud environments based on the client–server model and provides mathematical justification for applications of the Smart City type. A generalized latency model of user requests is proposed, in which the components correspond to network delays, server processing, data access, and serialization/rendering costs in the browser. For multi- endpoint REST interfaces, a weighted evaluation is introduced that accounts for the frequency and criticality of calls, while for aggregated GraphQL queries, a parallel model is proposed that captures the impact of the “slowest” dependency. It is shown how the parameters of user request latency can be reduced by means of cloud technologies, including CDN/Edge networks and HTTP/3, API Gateway/BFF with partial delivery, caching and incremental queries, as well as progressive/streaming rendering and Web Workers on the client side. For capacity planning, the M/M/N queuing system (Erlang-C model) is applied to estimate expected waiting times, and SLO benchmarks are established for urban scenarios. Using the example of a dashboard integrating transportation, air quality, and power grid data, it is demonstrated that GraphQL reduces the number of requests and stabilizes latency compared to multiple REST calls, whereas REST remains effective for simple, well-cached resources. The results provide practical recommendations for designing cloud-based front-end architectures with predictable perfor- mance and manageable operational costs.
- Stünkel, Patrick & Bargen, Ole & Rutle, Adrian & Lamo, Yngve. (2020). GraphQL Federation: A Model-Based Approach.. The Journal of Object Technology. 19. 18:1. 10.5381/jot.2020.19.2.a18.
- Beyer, B., Jones, C., Petoff, J. and Murphy, N. (2016) Site Reliability Engineering: How Google Runs Production Systems. O’Reilly.
- OpenTelemetry: Future-Proofing Your Instrumentation https://www.cncf.io/blog/2020/08/06/opentelemetry- future-proofing-your-instrumentation/
- Fielding, R.T. (2000) Architectural Styles and the Design of Network-based Software Architectures. PhD thesis, UC Irvine.
- Pölöskei, I., & Bub, U. (2021). Enterprise-level migration to micro frontends in a multi-vendor environment. Acta Polytechnica Hungarica, 18(8), 143–160.
- Jain, V. (2023). Web vitals and core metrics for web performance optimization. Journal Name, 7, 198–205. https://doi.org/10.5281/zenodo.14915137
- GraphQL Foundation. (n.d.). What does the GraphQL Foundation do? Retrieved October 6, 2025, from https://graphql.org/faq/foundation/
- Harchol-Balter, Mor. (2013). Performance Modeling and Design of Computer Systems: Queueing Theory in Action. Cambridge University Press 10.1017/CBO9781139226424.
- IETF (2016) Brotli Compressed Data Format. RFC 7932.
- IETF (2021) QUIC: A UDP-Based Multiplexed and Secure Transport. RFC 9000. IETF (2022a) HTTP/3. RFC 9114.
- IETF (2022b) HTTP/2. RFC 9113.
- IETF (2022c) Extensible Prioritization Scheme for HTTP. RFC 9218. IETF (2022d) HTTP Caching. RFC 9111.
- Kleinrock, L. (1976). Queueing Systems Volume 1: Theory. Journal of the American Statistical Association. Wiley https://doi.org/10.2307/2285630
- Kleppmann, M. (2017) Designing Data-Intensive Applications. O’Reilly. https://unidel.edu.ng/focelibrary/ books/Designing%20Data-Intensive%20Applications%20The%20Big%20Ideas%20Behind%20Reliable,%20Scalable,%20and%20Maintainable%20Systems%20by%20Martin%20Kleppmann%20(z-lib.org).pdf
- Mezzalira, L. (2021) Building Micro-Frontends. O’Reilly. https://dl.ebooksworld.ir/books/Building.Micro- Frontends.Luca.Mezzalira.OReilly.-9781492082996.EBooksWorld.ir.pdf
- Shah Sanket (2024) React 18 – Streaming Server Rendering. https://www.dhiwise.com/author/sanket-shah?page=1 W3C (2023) Server-Timing. W3C Working Draft.