Machine-readable output of tests #1284

hauleth · 2015-09-17T11:58:37Z

skade · 2015-09-17T12:58:07Z

text/0000-machine-readable-tests-output.md

+
+### Structure
+
+Output is stream of lines containing JSON objects separated by new line. Each


It would make sense to agree on something like http://dataprotocols.org/ndjson/ or http://jsonlines.org/ for that or is that up to TAP-J to define?

It could also make usage of \1E ASCII code which refers to "Record Separator". This is still under consideration, but 1 record per line is simple enough.

I'd take '\n' or '\r\n', it's a pretty common format in the NoSQL-world (e.g. Elasticsearch bulk uses it) and JSON can be written newline-free very well.

But is inconsistent between platform as Windows still uses \r\n as a line separator and I believe that println! macro respect that. Although this needs reconsideration as I chosen new line as it was simple enough to implement in first draft of specs.

That inconsistency doesn't matter in that case as '{"test": "foo"}\r' is the same JSON value as '{"test": "foo"}'. Even if your IO isn't prepared for that, this is not an issue.

killercup · 2015-09-17T17:54:01Z

I really like this proposal. Thanks for writing this!

alexcrichton · 2015-09-17T18:52:08Z

cc @rust-lang/tools, definitely our territory!

Thanks for the RFC @hauleth! Some thoughts of mine:

Perhaps there could be a record for when a test starts, as well as when it finishes? For interactive displays it could perhaps be nice to see what tests are in progress.
We probably want to start off pretty conservative and not have many fields required by default unless necessary. For example could we drop build and rustc from the suite record?
For a test record, the duration field could be dropped if we had start/stop records.
Could this spec how the libtest library will be changed? E.g. it's good that this is preserving backwards compatibility, but we'll probably want a new flag to test binaries which generates this form of output.
Does this output also affect rustdoc --test? I think it probably should and we'd just want to forward flags to the "test binary" basically.

I personally always find it a good exercise to implement features like this ahead of time to get a good feeling about what's needed to implement the current functionality we have today. Along those lines it'd certainly help weed out what's needed for maintaining the same output in terms of benchmarks and tests, but it may also be a pretty significant chunk of work!

hauleth · 2015-09-17T20:38:52Z

About first point. I assumed that suite object will be the beginning of tests so additional object would be unneeded.

About second. It is completely possible to do so. I've just added that to be sure that we are running valid binary.

About third. Of course it could be solution, but I don't see reason behind providing timestamp of beginning and ending of test, duration will do as well (maybe it's only me, but I doesn't care when my test started, I want to know how long it has been running).

About last two it would be recommended to do so. It could simplify calls.

About implementing that: I had that in mind, but currently I hadn't enough time to dig in rustc to find where and what so I what I loved in Ruby and power it up to work nice with Rust and provide functionality I need in my work. So it is very opinionated.

jgraham · 2015-09-17T20:56:41Z

For comparison we designed a similar format for logging browser test results at Mozilla. You probably don't need everything in that format, and may want some different things, but it's another point in the possible design space to consider. One of the driving considerations there was when you have tests from a third party source (e.g. because you are implementing some specification) there can be tests that you expect to fail but which you don't wish to edit. If that isn't a case you care about here you are likely to make differnt design choices.

emberian · 2015-09-17T22:50:48Z

In general I'm a big fan of anything TAP based, and this JSON encoding removes most of problems with TAP. Human-readable output should definitely be the default, though. Perhaps an environment variable such as RUST_MACHINE_OUTPUT or a universally understood command-line to enable machine output.

These tools are primarily for people, not machines. Changing Cargo to consume the machine-readable format is not an issue.

Looking forward to seeing something like this develop!

diwic · 2015-09-18T13:08:35Z

cargo test should continue to have human readable format, and machine output could be available with cargo test --json or so.

erickt · 2015-09-19T06:16:18Z

This is a great start! I think it could also be useful to capture the rustc command line arguments so that we can observe optimization levels, enabled feature flags, and etc.

For reference, a bunch of other test libraries produce the xUnit XML File Format.

tomjakubowski · 2015-09-19T23:55:12Z

In general I'm a big fan of anything TAP based, and this JSON encoding removes most of problems with TAP. Human-readable output should definitely be the default, though. Perhaps an environment variable such as RUST_MACHINE_OUTPUT or a universally understood command-line to enable machine output.

I don't see why it should be an environment variable; environment variables are much harder to reason about than command line flags (it's a bit like dynamic scope vs. lexical scope), and it's not like we need to adjust some behavior deep within a stack of executing programs.

A flag on the compiled test runner (--format=tap, bikeshedding welcome) combined with a flag on cargo test itself seems like the right fit to me.

alexcrichton · 2015-09-21T17:21:56Z

@hauleth

About first point. I assumed that suite object will be the beginning of tests so additional object would be unneeded.

Ah yeah I meant in addition to the suite object there'd also be an object for "this test has started to run". That gives consumers a notion of what tests are currently running, e.g. those you've seen start records for and haven't seen end records for. Additionally if a consumer of the output wants to provide a progress bar this would be useful information perhaps.

About third. Of course it could be solution, but I don't see reason behind providing timestamp of beginning and ending of test, duration will do as well (maybe it's only me, but I doesn't care when my test started, I want to know how long it has been running).

Ah just in the sense that we don't have a lot of timestamp support in the standard library just yet, so it'd be difficult to implement this in-tree (e.g. whereas it'd be pretty easy to do it out-of-tree), so for ease of implementation we may want to avoid this for now.

About implementing that: I had that in mind, but currently I hadn't enough time to dig in rustc to find where and what so I what I loved in Ruby and power it up to work nice with Rust and provide functionality I need in my work. So it is very opinionated.

Ah yeah no worries! You're not on the hook to implement this or anything like that, just some musings from me!

yoshuawuyts · 2015-09-29T03:28:48Z

What are the limitations on TAP that warrant the invention of another format? To my understanding this format would be TAP-J based, meaning it's something new.

Though being slightly inconvenient to parse, I think that using TAP yields more benefits in terms of tooling, adoption, familiarity and interoperability than creating something new.

It's quite annoying if every language comes with their own custom way of formatting test output. E.g. I don't think Rust should fall into the same trap Golang fell into with their test output:

$ go test -v
=== RUN TestPrintSomething
Say hi
--- PASS: TestPrintSomething (0.00 seconds)
    v_test.go:10: Say bye
PASS
ok      so/v    0.002s

hauleth · 2015-09-29T07:54:13Z

@yoshuawuyts the only problem with TAP is that this is primitive format, that doesn't provide way to message a lot of things (like type of test, performance, etc.) in uniform way. RusTAP is created to fit into Rust testing framework as it has some quirks that original TAP-J doesn't cover (i.e. benches).

IanConnolly · 2015-10-27T05:28:53Z

I'd be happy to help with moving this along, as I've been wanting this myself recently.

brson · 2015-11-04T19:15:00Z

My quick comments:

Never heard of TAP-J. Need to consider it.
Doesn't consider cargo integration
- people do testing through cargo
- cargo has other types of tests and I want to be to analyze their results
Printing to stdout has problems
- cargo also prints to stdout
- test cases can print to stdout when the test runner is not capturing

hauleth · 2016-01-08T22:41:38Z

Closing as I want to rewrite it for TAP13 protocol (wider usage, and already existing tools).

tj · 2016-01-09T03:57:14Z

IMO the problem with Go's is that you can't replace the output generation. I know the Go team has an issue open for considering JSON output, but to me It would be so much nicer if you could just:

import (
  _ "my/fancy/test/output" // register a replacement hook
)

Then you don't have to fight about JSON, TAP, etc, just use whatever you like. Maybe Rust could go that route instead of a base format?

andrew-d · 2016-01-09T07:48:01Z

👍 for that - being able to write a "test output plugin" (or whatever it ends up being called) seems like the best way to handle this.

jtepe · 2016-01-09T08:53:59Z

Then there still has to be a sensible default or fallback for those that do not provide such a plugin.

jtepe · 2016-01-09T08:55:14Z

However, I am also in favor of this plugin approach.

hauleth · 2016-01-09T10:31:36Z

I am in the progress of rewriting libtest to allow writing reporters in sensible way. About TAP - TAP version 13 allow YAML test description after each test which seems IMHO good approach. As YAML is superset of JSON it solves both problems.

Łukasz Jan Niemier

Dnia 9 sty 2016 o godz. 09:55 Jonas Tepe [email protected] napisał(a):

However, I am also in favor of this plugin approach.

—
Reply to this email directly or view it on GitHub.

kamalmarhubi · 2016-01-24T20:28:34Z

@hauleth

Closing as I want to rewrite it for TAP13 protocol (wider usage, and already existing tools).

Do you need any help with this? I'd very much like testing to get better in Rust!

milgner · 2016-10-09T12:41:29Z

Is this still being worked on? I think Rust would greatly benefit from this as it makes integration of tests into CI systems much more elegant. TAP 13 seems like a good format, too, even if it doesn't explicitly contain a differentiation between regular tests and benchmarks. But I guess a # benchmark directive could be added to address this?

dnsco · 2017-09-03T21:09:29Z

Can you reopen this?

First version of RFC

25e6e61

skade reviewed Sep 17, 2015
View reviewed changes

Minor fixes

eb752fd

alexcrichton added the T-dev-tools Relevant to the development tools team, which will review and decide on the RFC. label Sep 17, 2015

withoutboats mentioned this pull request Sep 26, 2015

i10n #1292

Closed

hauleth mentioned this pull request Oct 28, 2015

Publish results somewhere briansmith/crypto-bench#7

Closed

hauleth closed this Jan 8, 2016

nagisa mentioned this pull request Mar 30, 2016

Allow more-easily-parsable output format for libtest benchmark harness rust-lang/rust#32595

Closed

killercup mentioned this pull request Nov 11, 2017

Libtest json output rust-lang/rust#45923

Closed

killercup mentioned this pull request Dec 11, 2017

Added libtest_json_output #2234

Closed

phansch mentioned this pull request Oct 8, 2018

Fail or warn when using an overrestricitve test filter that filters all tests rust-lang/cargo#6151

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Machine-readable output of tests #1284

Machine-readable output of tests #1284

hauleth commented Sep 17, 2015

skade Sep 17, 2015

hauleth Sep 17, 2015

skade Sep 17, 2015

hauleth Sep 17, 2015

skade Sep 17, 2015

killercup commented Sep 17, 2015

alexcrichton commented Sep 17, 2015

hauleth commented Sep 17, 2015

jgraham commented Sep 17, 2015

emberian commented Sep 17, 2015

diwic commented Sep 18, 2015

erickt commented Sep 19, 2015

tomjakubowski commented Sep 19, 2015

alexcrichton commented Sep 21, 2015

yoshuawuyts commented Sep 29, 2015

hauleth commented Sep 29, 2015

IanConnolly commented Oct 27, 2015

brson commented Nov 4, 2015

hauleth commented Jan 8, 2016

tj commented Jan 9, 2016

andrew-d commented Jan 9, 2016

jtepe commented Jan 9, 2016

jtepe commented Jan 9, 2016

hauleth commented Jan 9, 2016

kamalmarhubi commented Jan 24, 2016

milgner commented Oct 9, 2016

dnsco commented Sep 3, 2017


		### Structure

		Output is stream of lines containing JSON objects separated by new line. Each

Machine-readable output of tests #1284

Machine-readable output of tests #1284

Conversation

hauleth commented Sep 17, 2015

skade Sep 17, 2015

Choose a reason for hiding this comment

hauleth Sep 17, 2015

Choose a reason for hiding this comment

skade Sep 17, 2015

Choose a reason for hiding this comment

hauleth Sep 17, 2015

Choose a reason for hiding this comment

skade Sep 17, 2015

Choose a reason for hiding this comment

killercup commented Sep 17, 2015

alexcrichton commented Sep 17, 2015

hauleth commented Sep 17, 2015

jgraham commented Sep 17, 2015

emberian commented Sep 17, 2015

diwic commented Sep 18, 2015

erickt commented Sep 19, 2015

tomjakubowski commented Sep 19, 2015

alexcrichton commented Sep 21, 2015

yoshuawuyts commented Sep 29, 2015

hauleth commented Sep 29, 2015

IanConnolly commented Oct 27, 2015

brson commented Nov 4, 2015

hauleth commented Jan 8, 2016

tj commented Jan 9, 2016

andrew-d commented Jan 9, 2016

jtepe commented Jan 9, 2016

jtepe commented Jan 9, 2016

hauleth commented Jan 9, 2016

kamalmarhubi commented Jan 24, 2016

milgner commented Oct 9, 2016

dnsco commented Sep 3, 2017