Commit 2096991
authored
perf: add multiplexing performance tests for AsyncMultiRangeDownloader (#16501)
## Overview
This PR introduces new microbenchmarks to measure and expose the
performance bottleneck caused by lock contention in the
`AsyncMultiRangeDownloader`. It provides a concrete way to compare the
previous serialized implementation against the new multiplexed
architecture.
## Before vs. After: The Performance Gap
### Before (Serialized via Lock)
In the previous implementation, `download_ranges` used a shared lock to
prevent concurrent access to the bidi-gRPC stream. This meant that even
with multiple coroutines, only one could "own" the stream at a time. The
entire download cycle (Send -> Receive All) had to complete before
another task could start.
**Execution Flow:**
```mermaid
sequenceDiagram
participant C1 as Coroutine 1
participant C2 as Coroutine 2
participant S as gRPC Stream
C1->>C1: Acquire Lock
C1->>S: Send Requests
S-->>C1: Receive Data (Streaming...)
S-->>C1: End of Range
C1->>C1: Release Lock
Note over C2: Waiting for Lock...
C2->>C2: Acquire Lock
C2->>S: Send Requests
S-->>C2: Receive Data (Streaming...)
S-->>C2: End of Range
C2->>C2: Release Lock
```
### After (Multiplexed Concurrent)
With the introduction of the `_StreamMultiplexer`, multiple coroutines
can now share the same stream concurrently. Requests are interleaved,
and a background receiver loop routes incoming data to the correct task
using `read_id`.
**Execution Flow:**
```mermaid
sequenceDiagram
participant C1 as Coroutine 1
participant C2 as Coroutine 2
participant M as Multiplexer
participant S as gRPC Stream
C1->>M: Send Requests
M->>S: Forward Req 1
C2->>M: Send Requests
M->>S: Forward Req 2
Note over C1,C2: Tasks wait on their own queues
S-->>M: Data for C1
M-->>C1: Route to Q1
S-->>M: Data for C2
M-->>C2: Route to Q2
S-->>M: Data for C1
M-->>C1: Route to Q1
```
## How the Benchmark Works
This PR adds a `read_rand_multi_coro` workload that:
1. Spawns multiple asynchronous tasks (coroutines).
2. Shares a single `AsyncMultiRangeDownloader` instance across all
tasks.
3. Simulates the old serialized behavior by explicitly passing a
`shared_lock` to `download_ranges`.
4. Measures total throughput (MiB/s) and resource utilization.
## Key Changes
- **`test_reads.py`**: Refactored to support launching concurrent
coroutines within a single worker process.
- **`config.yaml`**: Added `read_rand_multi_coro` with 1, 16 coroutines
to stress the downloader.
- **`config.py`**: Updated naming convention to include coroutine count
(e.g., `16c`) in reports for easier differentiation.1 parent d3d6840 commit 2096991Copy full SHA for 2096991
5 files changed
+45-40Lines changed: 45 additions & 40 deletions
File tree
Expand file treeCollapse file tree
Open diff view settings
Filter options
- packages/google-cloud-storage
- tests/perf/microbenchmarks/time_based
- reads
Expand file treeCollapse file tree
Open diff view settings
Collapse file
packages/google-cloud-storage/output.json
Copy file name to clipboardExpand all lines: packages/google-cloud-storage/output.jsonWhitespace-only changes.
Collapse file
packages/google-cloud-storage/tests/perf/microbenchmarks/time_based/conftest.py
Copy file name to clipboardExpand all lines: packages/google-cloud-storage/tests/perf/microbenchmarks/time_based/conftest.py+1-1Lines changed: 1 addition & 1 deletion
| Original file line number | Diff line number | Diff line change |
|---|---|---|
| ||
17 | 17 | |
18 | 18 | |
19 | 19 | |
20 | | - |
| 20 | + |
21 | 21 | |
Collapse file
packages/google-cloud-storage/tests/perf/microbenchmarks/time_based/reads/config.py
Copy file name to clipboardExpand all lines: packages/google-cloud-storage/tests/perf/microbenchmarks/time_based/reads/config.py+2-2Lines changed: 2 additions & 2 deletions
| Original file line number | Diff line number | Diff line change |
|---|---|---|
| ||
80 | 80 | |
81 | 81 | |
82 | 82 | |
83 | | - |
| 83 | + |
84 | 84 | |
85 | 85 | |
86 | | - |
| 86 | + |
87 | 87 | |
88 | 88 | |
89 | 89 | |
|
Collapse file
packages/google-cloud-storage/tests/perf/microbenchmarks/time_based/reads/config.yaml
Copy file name to clipboardExpand all lines: packages/google-cloud-storage/tests/perf/microbenchmarks/time_based/reads/config.yaml+2-1Lines changed: 2 additions & 1 deletion
| Original file line number | Diff line number | Diff line change |
|---|---|---|
| ||
20 | 20 | |
21 | 21 | |
22 | 22 | |
23 | | - |
| 23 | + |
24 | 24 | |
25 | 25 | |
| 26 | + |
26 | 27 | |
27 | 28 | |
28 | 29 | |
Collapse file
packages/google-cloud-storage/tests/perf/microbenchmarks/time_based/reads/test_reads.py
Copy file name to clipboardExpand all lines: packages/google-cloud-storage/tests/perf/microbenchmarks/time_based/reads/test_reads.py+40-36Lines changed: 40 additions & 36 deletions
| Original file line number | Diff line number | Diff line change |
|---|---|---|
| ||
115 | 115 | |
116 | 116 | |
117 | 117 | |
118 | | - |
119 | | - |
120 | 118 | |
121 | 119 | |
122 | 120 | |
123 | | - |
124 | | - |
125 | | - |
126 | | - |
127 | | - |
128 | | - |
129 | | - |
130 | | - |
131 | | - |
132 | | - |
133 | | - |
134 | | - |
135 | | - |
136 | | - |
137 | | - |
138 | | - |
139 | | - |
140 | | - |
141 | | - |
142 | | - |
143 | | - |
144 | | - |
145 | | - |
146 | | - |
147 | | - |
148 | | - |
149 | | - |
150 | | - |
151 | | - |
152 | | - |
153 | | - |
154 | | - |
155 | | - |
| 121 | + |
| 122 | + |
| 123 | + |
| 124 | + |
| 125 | + |
| 126 | + |
| 127 | + |
| 128 | + |
| 129 | + |
| 130 | + |
| 131 | + |
| 132 | + |
| 133 | + |
| 134 | + |
| 135 | + |
| 136 | + |
| 137 | + |
| 138 | + |
| 139 | + |
| 140 | + |
| 141 | + |
| 142 | + |
| 143 | + |
| 144 | + |
| 145 | + |
| 146 | + |
| 147 | + |
| 148 | + |
| 149 | + |
| 150 | + |
| 151 | + |
| 152 | + |
| 153 | + |
| 154 | + |
| 155 | + |
| 156 | + |
| 157 | + |
| 158 | + |
| 159 | + |
156 | 160 | |
157 | 161 | |
158 | | - |
| 162 | + |
159 | 163 | |
160 | 164 | |
161 | 165 | |
|
0 commit comments