Telemetry #408

visnup · 2023-12-19T21:10:48Z

Resolves #327
Depends on https://github.com/observablehq/observablehq/pull/15524

Questions for reviewers:

Anyone want to copy edit the banner text, balancing disclosure, empathy, and helpfulness?
Are there docs anywhere yet for me to stub out telemetry explanation and so I can set the proper URL for the banner?
Does anyone care deeply about the timestamp format? Current version leverages performance.now which feels more aligned with first-pass questions around "how long does X take?"
Does time zone offset seem anonymous enough to capture? It's always been on my wishlist to better understand what local time of day people are doing things.
Any other environmental information to capture? Node version?
What preview events should we collect?

todo:

Probably in a follow up PR based on everyone's feedback:

more build data
- number of files?
- sizes of files?
preview stop event

Collects and sends telemetry to https://events.observablehq.com/cli. What's currently collected:

type TelemetryIds = {
   device: uuid; // uuid v4 saves to ~/.observablehq to attempt stability
   project: string; // one-way salted + hashed value of a project's git url falling back to current directory name
   session: uuid; // uuid v4 held in memory for the duration of the process
 };
 type TelemetryEnvironment = {
   version: string; // version of cli from package.json
   systemPlatform: string; // darwin, linux, win32...
   systemRelease: string;
   systemArchitecture: string; // arm64, x64...
   cpuCount: number;
   cpuModel: string | null;
   cpuSpeed: number | null;
   memoryInMb: number; // truncated to mb for more anonymity
   isCI: string | boolean;
   isDocker: boolean;
   isWSL: boolean;
 };
 type TelemetryTime = {
   now: number; // performance.now
   timeOrigin: number; // performance.timeOrigin
   timeZoneOffset: number; // minutes from utc, to derive wall clock hour
 };
 type TelemetryData = {
   event: "build" | "deploy" | "preview";
   step: "start" | "finish";
 };

Presents a banner once, when first run with a URL pointing to more information. Showing the banner saves timestamp to ~/.observablehq to know we've already shown it. That means checking to see if we should show it is async and therefore the banner could show up "late" and interspersed with other log messages. Is there a better way to check and save banner state?

.gitignore

mythmon · 2023-12-19T22:06:01Z

src/build.ts

@@ -144,6 +145,7 @@ export async function build(
      await effects.copyFile(sourcePath, outputPath);
    }
  }
+  telemetry.record({event: "build", step: "finish"});


I bet this kind of "start/end" telemetry is going to often be interesting for us. Is it possible to link ends to their starts at all? Maybe telemetry.record could return a message ID that we could note in the end event?

the session id will be stable across tied events.

So the workflow on the analytic side would be to find a start event, and then look for end events with the same session id? Will that be annoying if there is an event that happens multiple times in a telemetry session?

oh, do you mean like in the case of maybe two builds happening concurrently and the events get interleaved?

I mean if we have a start event for something more granular, like one per page. Is that just not what this is for? Having average time-per-page and number-of-pages metrics would be useful.

yeah, we could do something like that. so just easier rolled-up timings of blocks of code, right? maybe a premature idea would be to provide a telemetry.measure({event: "something"}, () => { /* block */ }). I guess I'm still not sure exactly what we'll want to measure so am willing to iterate a bunch and probably throw out old data as we do.

I could return an identifier (a counter probably that just increases with every call to record) if people want to reference these in other events:

const ts = telemetry.record({event: "build", step: "start"}); // ... telemetry.record({event: "build", step: "milestone", start: ts});

The block version is familiar, but it feels heavy handed. I'd really like language support for something like destructors, which makes this very nice, but we don't have those.

I think the ts version makes sense, and I'd be happy with that.

Another idea I had was to have the return value of record be a function or object with methods so you could do something like

const telemetrySpan = telemetry.startSpan({event: "build", step: "start"}); // ... telemetrySpan.note({step: "milestone", foo: "bar"}); // ... telemetrySpan.end({step: "end", status: "success"})

That's probably overkill though. Having .record always give back an auto incrementing ID sounds like a good approach.

src/config.ts

src/deploy.ts

src/telemetry.ts

mythmon

My concerns about the start/end pairing are a theoretical thing that don't apply directly to any telemetry events added in this PR. We can deal with that if we actually add events that need that.

Fil · 2024-01-11T20:27:35Z

❯ yarn dev

↳ http://127.0.0.1:3002/

node:internal/process/promises:289
            triggerUncaughtException(err, true /* fromPromise */);
            ^

[Error: EACCES: permission denied, open '/Users/fil/.observablehq'] {
  errno: -13,
  code: 'EACCES',
  syscall: 'open',
  path: '/Users/fil/.observablehq'
}

Node.js v20.3.0

❯ git bisect bad
c45051d is the first bad commit

Fil · 2024-01-11T20:37:04Z

🛑 Same blocking error for yarn build

visnup · 2024-01-11T20:40:48Z

OBSERVABLE_TELEMETRY_DISABLE=1 yarn dev to unblock you, but now I'm unsure why you can't write to ~/.observablehq. I guess I can just fallback and not care if that happens and not save persistent values anywhere.

Fil · 2024-01-11T20:44:44Z

probably something I tried during an earlier exploration; I like to make things break :)

I had this:

❯ ls -altr /Users/fil/.observablehq
-r--r--r--@ 1 fil  staff  103 20 nov 22:17 /Users/fil/.observablehq

setting it to writeable solved the issue

* Record anonymous usage telemetry * Stricter types * Better persistence * Detect CI and Docker environments * Don't fail tests * Configurable origin * Fix lint * Better time information * Add isWSL heuristic * Some tests * Show a banner on first run * Test debug disables telemetry too * More testable * Ironically, disable telemetry during tests * Base telemetry origin on ui origin * Some documentation of what we're collecting * Manage our own singleton-ness * Initial telemetry documentation

visnup changed the title ~~Record anonymous usage telemetry~~ Telemetry Dec 20, 2023

visnup force-pushed the visnup/telemetry branch from 43212c0 to 1f2ca2a Compare January 9, 2024 22:53

visnup marked this pull request as ready for review January 10, 2024 03:43

visnup requested review from Fil, mbostock and mythmon January 10, 2024 03:44

mbostock reviewed Jan 10, 2024

View reviewed changes

.gitignore Outdated Show resolved Hide resolved

visnup added 12 commits January 9, 2024 21:56

Record anonymous usage telemetry

0bfde1b

Stricter types

fd73dcb

Better persistence

990f970

Detect CI and Docker environments

38031fe

Don't fail tests

4b7d003

Configurable origin

2451e14

Fix lint

8cc2ad8

Better time information

be683fb

Add isWSL heuristic

486dac7

Some tests

fb9a1e3

Show a banner on first run

88d439b

Test debug disables telemetry too

d34e876

visnup force-pushed the visnup/telemetry branch from 1a244ec to d34e876 Compare January 10, 2024 05:56

visnup added 4 commits January 10, 2024 08:57

Merge branch 'main' into visnup/telemetry

1a8b64c

More testable

8e98080

Merge branch 'main' into visnup/telemetry

4f69094

Ironically, disable telemetry during tests

a0f8e84

mythmon reviewed Jan 10, 2024

View reviewed changes

visnup added 4 commits January 10, 2024 20:04

Merge branch 'main' into visnup/telemetry

c84f78d

Base telemetry origin on ui origin

490f28f

Some documentation of what we're collecting

78cb0ea

Manage our own singleton-ness

5b63e88

mythmon approved these changes Jan 11, 2024

View reviewed changes

Initial telemetry documentation

cddf7c2

visnup enabled auto-merge (squash) January 11, 2024 20:00

Merge branch 'main' into visnup/telemetry

aea94ed

visnup merged commit c45051d into main Jan 11, 2024
2 checks passed

visnup deleted the visnup/telemetry branch January 11, 2024 20:01

visnup mentioned this pull request Jan 11, 2024

Be ok if we can't save a ~/.observablehq file for telemetry #507

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Telemetry #408

Telemetry #408

visnup commented Dec 19, 2023 •

edited

Loading

mythmon Dec 19, 2023

visnup Jan 10, 2024

mythmon Jan 10, 2024

visnup Jan 10, 2024

mythmon Jan 10, 2024

visnup Jan 11, 2024

visnup Jan 11, 2024 •

edited

Loading

mythmon Jan 11, 2024

mythmon left a comment

Fil commented Jan 11, 2024 •

edited

Loading

Fil commented Jan 11, 2024

visnup commented Jan 11, 2024

Fil commented Jan 11, 2024 •

edited

Loading

Telemetry #408

Telemetry #408

Conversation

visnup commented Dec 19, 2023 • edited Loading

mythmon Dec 19, 2023

Choose a reason for hiding this comment

visnup Jan 10, 2024

Choose a reason for hiding this comment

mythmon Jan 10, 2024

Choose a reason for hiding this comment

visnup Jan 10, 2024

Choose a reason for hiding this comment

mythmon Jan 10, 2024

Choose a reason for hiding this comment

visnup Jan 11, 2024

Choose a reason for hiding this comment

visnup Jan 11, 2024 • edited Loading

Choose a reason for hiding this comment

mythmon Jan 11, 2024

Choose a reason for hiding this comment

mythmon left a comment

Choose a reason for hiding this comment

Fil commented Jan 11, 2024 • edited Loading

Fil commented Jan 11, 2024

visnup commented Jan 11, 2024

Fil commented Jan 11, 2024 • edited Loading

visnup commented Dec 19, 2023 •

edited

Loading

visnup Jan 11, 2024 •

edited

Loading

Fil commented Jan 11, 2024 •

edited

Loading

Fil commented Jan 11, 2024 •

edited

Loading