Big Data applications

Classifying Serialization Formats for Inter-service Communication in Distributed Systems

This study focuses on classifying serialization formats used in inter-service communication (ISC) within distributed systems and exploring their historical development. We have examined key features of humanreadable formats such as XML, JSON, and YAML, binary formats like Protocol Buffers and Apache Avro, and columnar formats such as Apache Parquet and ORC, among others. Our results have indicated a significant shift toward binary formats optimized for speed and compactness in recent years.