
Option to replicate user profiles to another server #3112

Merged · 14 commits merged into dinsic on Apr 26, 2018

Conversation

@dbkr (Member) commented Apr 16, 2018

No description provided.

} for r in batch_rows
}

url = "https://%s/_matrix/federation/v1/replicate_profiles" % (host,)
Member:

There doesn't seem to be a handler for this. Are profiles intended to be replicated to a non-homeserver?

Member:

Correct; this is an experiment in replicating user profiles over to the identity server (IS) so that, for private IS deployments, the IS can act as a global address book.
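
For context, a minimal sketch of the push this code performs, pieced together from the fragments quoted in this PR; the wrapper's name and the body's field names are assumptions rather than the actual wire format:

```python
from signedjson.sign import sign_json
from twisted.internet import defer


@defer.inlineCallbacks
def _replicate_host_profile_batch(self, host, batchnum, batch_rows):
    # Assumed body shape: the batch number plus a map of user_id -> profile.
    body = {
        "batchnum": batchnum,
        "batch": {
            r["user_id"]: {
                "display_name": r["displayname"],
                "avatar_url": r["avatar_url"],
            } for r in batch_rows
        },
    }
    url = "https://%s/_matrix/federation/v1/replicate_profiles" % (host,)

    # The body is signed with the homeserver's key, as in the diff quoted
    # further down this thread.
    signed_body = sign_json(body, self.hs.hostname, self.hs.config.signing_key[0])
    yield self.http_client.post_json_get_json(url, signed_body)

    # Only record progress once the push succeeded (note the yield -- see the
    # review comment further down).
    yield self.store.update_replication_batch_for_host(host, batchnum)
```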

@@ -108,6 +112,10 @@ def default_config(self, **kwargs):
- vector.im
- riot.im

# If enabled, user IDs, display names and avatar URLs will be replicated
# to this server whenever they change.
Member:

we need to say that this is an experimental addition to the identity server API for supporting global user directories, and that the targets here should be identity server hostnames (otherwise folks will assume it's for HSes)

Member Author:

Done, although it's in the /_matrix/federation path so I've said it's an API that's implemented by sydent rather than part of the IS API as such.
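
A rough sketch of how that documentation might read in default_config(); the wording is an assumption based on this thread rather than the text that actually landed, and the hostname is just a placeholder:

```python
def default_config(self, **kwargs):
    return """\
        # If enabled, user IDs, display names and avatar URLs will be
        # replicated to the servers listed below whenever they change.
        # This is an experimental API currently implemented by sydent to
        # support user directories across a private federation; the entries
        # should be identity server hostnames, not homeservers.
        #
        # replicate_user_profiles_to: my.identity.server.example.com
        """
```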

if len(self.hs.config.replicate_user_profiles_to) > 0:
    reactor.callWhenRunning(self._assign_profile_replication_batches)
    reactor.callWhenRunning(self._replicate_profiles)
    self.clock.looping_call(
Member:

It'd be good to spell out in a comment why we're looping _replicate_profiles. I can see we'll need to do a catchup job at launch, but why do we then keep running it every 2 minutes, given whenever we set profile data it kicks off a sync anyway?

Member Author:

done
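
Roughly, the wiring quoted above with the kind of comment being asked for; the stated rationale is a reading of this thread, not the comment that actually landed:

```python
from twisted.internet import reactor

if len(self.hs.config.replicate_user_profiles_to) > 0:
    reactor.callWhenRunning(self._assign_profile_replication_batches)
    reactor.callWhenRunning(self._replicate_profiles)
    # Besides the catch-up run at startup and the push triggered by each
    # profile change, replicate periodically as well -- presumably so that
    # batches whose push failed (e.g. the target identity server was down)
    # get retried without waiting for the next profile change.
    self.clock.looping_call(self._replicate_profiles, 2 * 60 * 1000)
```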

logger.info("Assigned %d profile batch numbers", total)

@defer.inlineCallbacks
def _replicate_profiles(self):
Member:

How does this handle overlapping calls? (e.g. two _replicate_profiles runs being kicked off by two users setting profile data at the same time, or colliding with the looping syncer)?

Member Author:

They'll get different batch numbers depending on which one got serviced first, then probably the first call to _replicate_profiles will send both batches and the second will do nothing. If it collides with the looping syncer it'll depend which db txn gets done first.
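
For illustration, a sketch of why overlapping runs are harmless under that scheme: each run pushes everything between the host's last acknowledged batch and the current maximum, so a run that finds nothing new does no work. The storage method names other than update_replication_batch_for_host are assumptions:

```python
from twisted.internet import defer


@defer.inlineCallbacks
def _replicate_profiles(self):
    # Highest batch number any profile row has been stamped with so far.
    max_batch = yield self.store.get_latest_profile_replication_batch_number()
    for host in self.hs.config.replicate_user_profiles_to:
        # Batch number this host has already acknowledged, or None.
        sent_up_to = yield self.store.get_replication_batch_for_host(host)
        if sent_up_to is not None and sent_up_to >= max_batch:
            # A concurrent run (or the looping syncer) got here first.
            continue
        # ... push the rows in batches (sent_up_to, max_batch] to `host`,
        # then record progress so the next run skips them:
        yield self.store.update_replication_batch_for_host(host, max_batch)
```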

@ara4n (Member) commented Apr 25, 2018

this looks very plausible - lgtm mod comments.

signed_body = sign_json(body, self.hs.hostname, self.hs.config.signing_key[0])
try:
    yield self.http_client.post_json_get_json(url, signed_body)
    self.store.update_replication_batch_for_host(host, batchnum)
Member:

doesn't this need a yield?

Member Author:

ah, yes it does, good spot
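
i.e. the line inside the try block presumably becomes:

```python
# wait for the DB write so any failure surfaces inside the try block
yield self.store.update_replication_batch_for_host(host, batchnum)
```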

    txn.execute("SELECT MAX(batch) as maxbatch FROM profiles")
    rows = self.cursor_to_dict(txn)
    return rows[0]['maxbatch']
max_batch = yield self.runInteraction(
Member:

you could just return self.runInteraction(...) rather than using inlineCallbacks

Member:

likewise below.

Member Author:

done
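
For reference, a sketch of the suggested shape, with an assumed method name since the surrounding definition isn't shown in this hunk:

```python
def get_latest_profile_replication_batch_number(self):
    def f(txn):
        txn.execute("SELECT MAX(batch) as maxbatch FROM profiles")
        rows = self.cursor_to_dict(txn)
        return rows[0]['maxbatch']
    # Returning the Deferred from runInteraction directly avoids the
    # inlineCallbacks wrapper and the extra yield.
    return self.runInteraction("get_latest_profile_replication_batch_number", f)
```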


CREATE INDEX profiles_batch_idx ON profiles(batch);

CREATE TABLE profile_replication_status (
Member:

please can we get into the habit of commenting our schema files to describe what the tables and the columns therein do.

Member Author:

done

@@ -75,7 +75,7 @@ def register_query_handler(query_type, handler):
@defer.inlineCallbacks
def test_get_my_name(self):
yield self.store.set_profile_displayname(
-    self.frank.localpart, "Frank"
+    self.frank.localpart, "Frank", 1
Member:

it'd be nice to add trailing commas while you're here

Member Author:

ah yep, done (plus the others)

Unnecessary inlineCallbacks, missing yield, SQL comments & trailing commas.
@dbkr merged commit e2adb36 into dinsic on Apr 26, 2018
@hawkowl deleted the dbkr/profile_replication branch on September 20, 2018 13:59