Latency and throughput overhead benchmarks
m5a.4xlarge
machine with 16 vCPUs and 64 GiB memory. Both the client and
server were on the same machine in order to eliminate network noise. In each
benchmark, 16 parallel clients continuously sent requests to a server running
running under Subtrace for 10 seconds.