This is written for anyone who wants to contribute to the Rerun repository.
ARCHITECTURE.md
BUILD.md
rerun_py/README.md
- build instructions for Python SDKCODE_OF_CONDUCT.md
CODE_STYLE.md
RELEASES.md
-
Examples: We welcome any examples you would like to add. Follow the pattern of the existing examples in the
examples/
folder. -
Bug reports and issues: Open them at https://github.com/rerun-io/rerun/issues.
You can also look at our good first issue
tag.
We use Trunk Based Development, which means we encourage small, short-lived branches. Open draft PR:s to get some early feedback on your work.
All PR:s are merged with Squash and Merge
, meaning they all get squashed to just one commit on the main
branch. This means you don't need to keep a clean commit history on your feature branches. In fact, it is preferable to add new commits to a branch rather than rebasing or squashing. For one, it makes it easier to track progress on a branch, but rebasing and force-pushing also discourages collaboration on a branch.
Our CI will run benchmarks on each merged PR. The results can be found at https://rerun-io.github.io/rerun/dev/bench/.
Pull requests from external contributors require approval for CI runs. This can be done manually, by clicking the Approve and run
button:
Members of the rerun-io
organization and collaborators in the rerun-io/rerun
repository may enable auto-approval of workflow runs for a single PR by commenting with @rerun-bot approve
:
Every CI job would in its ideal state consist of only two steps:
- Install tools and libraries1
- Run a script
In which the script is written and tested locally before being wrapped in a CI workflow file. This does not mean that scripts are merely reproducible locally (though that is also true), it means that they must be written with a local-first mindset, as if they are not supposed to run on CI at all.
This approach has a number of benefits:
- Instead of Bash embedded in YAML, scripts may be written in an Actual Programming Language™
- Significantly lower iteration times when working on CI
- Ability to perform a job manually in case the CI fails
Additionally, always output any artifacts produced by CI to GCS instead of the GHA artifact storage. This can be a serious lifesaver when something breaks, as it allows anyone to download the output of a script and continue from where it failed, instead of being forced to start over from scratch.
Here are some guidelines to follow when writing such scripts:
Local-first means easy for contributors to run.
The following should be documented in each script:
- Dependencies
- Files and directories
- Environment variables
- Usage examples
Inputs should be passed in explicitly via arguments, and use sane defaults. If an input has a default value, it should be documented in its description.
Every input should be checked as early as possible. This includes:
- Checking if authentication credentials are valid
- Validating inputs and parsing into more specific types where possible:
- Numeric ranges
- String character sets/encodings
- Length limits
- Date formats
- etc.
- Checking that input file paths are valid and the files they point to exist
Input and output file paths should also accept GCS paths (gs://bucket/blob/path
) and stdin/stdout (-
), if it makes sense.
Be extra descriptive in error messages, it may be the only piece of information someone debugging a CI failure has available to figure out what went wrong. Print frequently to hint at what is going on and display progress to the user.
Environment variables should only be used for authentication with external services and configuring output (e.g. disabling color). Many SDKs support some form of persistent/default authentication, and scripts should take advantage of this where possible. For example, GCP has Application Default Credentials.
If the script performs destructive or otherwise irreversible actions, then it should support a --dry-run
option if possible.
Be thoughtful when adding dependencies. Each new dependency is a liability which lead to increased compile times, a bigger binary, more code that can break, a larger attack surface, etc. Sometimes it is better to write a hundred lines of code than to add a new dependency.
Whenever you add a new dependency in a PR, make sure you motivate it:
- Why use the dependency instead of rolling our own?
- Why this dependency instead of another?
For Rust, make sure you use default-features = false
if it makes sense, to minimize the amount of new code that is pulled in.
When reviewing a PR, always check the diff of Cargo.lock
(it is collapsed by default in GitHub 😤).
For a guide on picking good dependencies, see https://gist.github.com/repi/d98bf9c202ec567fd67ef9e31152f43f.
Any full cargo update
should be its own stand-alone PR. Make sure you include the output of it in the commit message.
The main crates are found in the crates/
folder, with examples in the examples/
folder.
To get an overview of the crates, read their documentation with:
cargo doc --no-deps --open
To learn about the viewer, run:
cargo run -p rerun -- --help
We use the just
command runner tool for repository automation. See here for installation instructions. To see available automations, use just --list
.
We use cargo cranky and specify our clippy lints in Cranky.toml
. Usage: cargo cranky
.
We use cargo deny to check our dependency tree for copy-left licenses, duplicate dependencies and rustsec advisories. You can configure it in deny.toml
. Usage: cargo deny check
.
Configure your editor to run cargo fmt
on save. Also configure it to strip trailing whitespace, and to end each file with a newline. Settings for VSCode can be found in the .vscode
folder and should be applied automatically. If you are using another editor, consider adding good setting to this repository!
To check everything in one go, run ./scripts/check.sh
. check.sh
should ideally check approximately the same things as our CI.
Prior to pushing changes to a PR, at a minimum, you should always run just fast-lint
. This is designed to run
in a few seconds and should catch the more trivial issues to avoid wasting CI time.
We recommend adding the rerun pre-push hook to your local checkout, which among other-things will run
just fast-lint
for you.
To install the hooks, simply copy them into the .git/hooks
directory of your local checkout.
cp hooks/pre-push .git/hooks/pre-push
chmod +x .git/hooks/pre-push
or if you prefer you can configure git to use this directory as the hooks directory:
git config core.hooksPath hooks
You can use bacon to automatically check your code on each save. For instance, running just bacon
will re-run cargo cranky
each time you change a rust file. See bacon.toml
for more.
You can set up sccache
to speed up re-compilation (e.g. when switching branches). You can control the size of the cache with export SCCACHE_CACHE_SIZE="256G"
.
You can view higher log levels with export RUST_LOG=debug
or export RUST_LOG=trace
.
Footnotes
-
For some larger jobs, we prefer to use a docker image to make managing dependencies simpler, and to keep everything locked to a specific version as much as possible. In this case, it's still good practice to install dependencies, because it ensures the job continues to work even if the docker image is out of date. ↩