Improve performance of `share_room_key` #2862

richvdh · 2023-11-20T01:42:25Z

Encryption doesn't usually require access to the user identity, so we can skip loading that from the store.

Unfortunately that means a fair bit of refectoring, in the form of replacing Device with ReadOnlyDevice.

This reduces the time taken to encrypt a message in a large room from about 3 seconds to 1 second.

codecov · 2023-11-20T01:52:16Z

Codecov Report

Attention: 1 lines in your changes are missing coverage. Please review.

Comparison is base (7e53c68) 82.37% compared to head (c0ab6a4) 82.37%.
Report is 4 commits behind head on main.

Files	Patch %	Lines
crates/matrix-sdk-crypto/src/identities/device.rs	93.33%	1 Missing ⚠️

Additional details and impacted files

@@           Coverage Diff           @@
##             main    #2862   +/-   ##
=======================================
  Coverage   82.37%   82.37%           
=======================================
  Files         211      211           
  Lines       21604    21607    +3     
=======================================
+ Hits        17796    17799    +3     
  Misses       3808     3808

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

poljar · 2023-11-20T08:50:38Z

When we're trying to improve the performance of things, it would be very nice to apply a bit of engineering and science to the measuring part of the PR:

Especially for things we have benchmarks for, it only takes a couple of minutes to get the report from criterion.

richvdh · 2023-11-20T10:01:11Z

Especially for things we have benchmarks for, it only takes a couple of minutes to get the report from criterion.

To be honest, I did run it through the benchmark, and yes, saw a 5% improvement. But I think that's deceptive; the real wins come when you're using indexeddb which can be a bit slow with thousands of separate transactions, and I didn't feel in the mood to try and set up a benchmark with indexeddb.

poljar · 2023-11-20T10:39:13Z

To be honest, I did run it through the benchmark, and yes, saw a 5% improvement. But I think that's deceptive; the real wins come when you're using indexeddb which can be a bit slow with thousands of separate transactions, and I didn't feel in the mood to try and set up a benchmark with indexeddb.

So this is restricted to the IndexedDB scenario?

This reduces the time taken to encrypt a message in a large room from about 3 seconds to 1 second.

I think we should be explicit about that.

Not sure what's deceptive about the criterion report, more data should always be less deceptive than less data. A 5% improvement for the memory store and SQLite store is great.

Posting the above criterion report with a sentence mentioning that the improvement is even better on IndexedDB (even though we don't have a benchmark for it) would have been my ideal way of doing this.

poljar

I left a question that we should at least ponder on for a bit. The changes make sense though there might be an alternative approach since the changes do make the API a bit more confusing due to the two Device types.

poljar · 2023-11-20T11:02:35Z

crates/matrix-sdk-crypto/src/session_manager/group_sessions.rs

@@ -283,14 +284,21 @@ impl GroupSessionManager {
        let mut share_infos = BTreeMap::new();
        let mut withheld_devices = Vec::new();

-        let encrypt = |device: Device, session: OutboundGroupSession| async move {
-            let encryption_result = device.maybe_encrypt_room_key(session).await?;
+        // XXX is there a way to do this that doesn't involve cloning the


I don't think there is, you are moving the Device into a spawned task. The task itself might be moved between threads.

my thinking went: we have an Arc<CryptoStoreWrapper> which is passed into this method, and which we know will live as long as any of those tasks (because the tasks are all spawned and join_all-ed inside this method). So shouldn't I be able to borrow a reference to the CryptoStoreWrapper contained in that Arc, and just pass a copy of that reference into each spawned task?

poljar · 2023-11-20T11:15:03Z

crates/matrix-sdk-crypto/src/session_manager/group_sessions.rs

-            let user_devices = self.store.get_user_devices_filtered(user_id).await?;
+            let user_devices = self.store.get_readonly_devices_filtered(user_id).await?;
+
+            // We only need the user identity if settings.only_allow_trusted_devices is set.


Presumably this is the part that improved the performance of things? I wonder if a simple cache, so we only load the device_owner_identity once would have given us the same performance improvement as well.

Presumably no, since the memory store shows a 5% improvement as well. Any thoughts on this?

Yes, this is the performance improvement. It's partly not requesting the owning user's identity, and partly not requesting our own identity on each iteration of the loop.

richvdh · 2023-11-20T14:10:00Z

the changes do make the API a bit more confusing due to the two Device types.

I don't disagree that it's confusing having two Device types. But honestly a lot of me feels that horse has already bolted: we have the two types and have to deal with them.

I think I'd argue that the high-level Device type is overcomplicated and is trying to do too much. It's probably possible to make it go quickly with sufficient caching, but caching isn't without its disadvantages and if we can skip all the stuff about identities altogether, it's better to do that.

crates/matrix-sdk-crypto/src/session_manager/group_sessions.rs

richvdh · 2023-11-20T17:07:52Z

Posting the above criterion report with a sentence mentioning that the improvement is even better on IndexedDB (even though we don't have a benchmark for it) would have been my ideal way of doing this.

Noted, by the way. Will try to be more proactive in including benchmark reports in PRs where they are relevant.

Until #2862, `GroupSessionManager::encrypt_session_for` called `Device::encrypt` (via `Device::maybe_encrypt_room_key`. `Device::encrypt` is a thin wrapper for `ReadOnlyDevice::encrypt`, but it also has an `instrument` annotation. #2862 short-cuts `Device::encrypt` and calls `ReadOnlyDevice::encrypt` directly: that was functionally fine but of course means that we no longer benefit from the `instrument` annotation. This PR rectifies the situation by pushing the annotation down to `ReadOnlyDevice::encrypt`.

Until #2862, `GroupSessionManager::encrypt_session_for` called `Device::encrypt` (via `Device::maybe_encrypt_room_key`. `Device::encrypt` is a thin wrapper for `ReadOnlyDevice::encrypt`, but it also has an `instrument` annotation. #2862 short-cuts `Device::encrypt` and calls `ReadOnlyDevice::encrypt` directly: that was functionally fine but of course means that we no longer benefit from the `instrument` annotation. This PR rectifies the situation by pushing the annotation down to `ReadOnlyDevice::encrypt`. It also adds some documentation for that function, since we are using it in more places now. (Longer-term, I think we should probably aim to get rid of `Device::encrypt` altogether, but that's a refactor I don't want to take on today.)

richvdh added 10 commits November 20, 2023 01:38

Push message_id generation down to ReadOnlyDevice

8778882

Move maybe_encrypt_room_key down to ReadOnlyDevice

e1d2a7f

Convert withheld device handling to ReadOnlyDevice

9825df5

Convert encrypt_session_for to ReadOnlyDevice

a989591

encrypt_session_for and encrypt_request

0ae039d

convert is_shared_with to ReadOnlyDevice

d236fe1

convert return of collect_session_recipients

65b6465

Use get_readonly_devices_filtered

2bfb809

Drop unused method

9b43d1b

Changelog

c96cd73

richvdh requested a review from a team as a code owner November 20, 2023 01:42

richvdh requested review from poljar and removed request for a team November 20, 2023 01:42

poljar reviewed Nov 20, 2023

View reviewed changes

poljar approved these changes Nov 20, 2023

View reviewed changes

richvdh commented Nov 20, 2023

View reviewed changes

crates/matrix-sdk-crypto/src/session_manager/group_sessions.rs Outdated Show resolved Hide resolved

richvdh commented Nov 20, 2023

View reviewed changes

crates/matrix-sdk-crypto/src/session_manager/group_sessions.rs Outdated Show resolved Hide resolved

Update crates/matrix-sdk-crypto/src/session_manager/group_sessions.rs

c0ab6a4

richvdh enabled auto-merge (squash) November 20, 2023 17:08

richvdh disabled auto-merge November 20, 2023 18:32

richvdh marked this pull request as draft November 20, 2023 18:32

richvdh marked this pull request as ready for review November 20, 2023 18:32

richvdh enabled auto-merge (squash) November 20, 2023 18:33

richvdh merged commit c536760 into main Nov 20, 2023

richvdh deleted the rav/faster_encrypt branch November 20, 2023 18:46

richvdh mentioned this pull request Nov 21, 2023

Element-R: sending messages takes several seconds element-hq/element-web#26375

Closed

richvdh mentioned this pull request Nov 22, 2023

Reinstate tracing instrumentation on ReadOnlyDevice::encrypt #2873

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Improve performance of `share_room_key` #2862

Improve performance of `share_room_key` #2862

richvdh commented Nov 20, 2023

codecov bot commented Nov 20, 2023 •

edited

Loading

poljar commented Nov 20, 2023

richvdh commented Nov 20, 2023

poljar commented Nov 20, 2023

poljar left a comment

poljar Nov 20, 2023

richvdh Nov 20, 2023

poljar Nov 20, 2023

richvdh Nov 20, 2023

richvdh commented Nov 20, 2023

richvdh commented Nov 20, 2023

Improve performance of share_room_key #2862

Improve performance of share_room_key #2862

Conversation

richvdh commented Nov 20, 2023

codecov bot commented Nov 20, 2023 • edited Loading

Codecov Report

poljar commented Nov 20, 2023

richvdh commented Nov 20, 2023

poljar commented Nov 20, 2023

poljar left a comment

Choose a reason for hiding this comment

poljar Nov 20, 2023

Choose a reason for hiding this comment

richvdh Nov 20, 2023

Choose a reason for hiding this comment

poljar Nov 20, 2023

Choose a reason for hiding this comment

richvdh Nov 20, 2023

Choose a reason for hiding this comment

richvdh commented Nov 20, 2023

richvdh commented Nov 20, 2023

Improve performance of `share_room_key` #2862

Improve performance of `share_room_key` #2862

codecov bot commented Nov 20, 2023 •

edited

Loading