Apply differential testing based on a reference implementation #60

danwt · 2022-04-22T18:14:16Z

Idea

Use a reference implementation to generate correct states from actions

It might be feasible to take a property-based testing approach. We could implement the relevant system in very simple, single-threaded, in-memory, imperative code without any regard to performance (this is the 'model') and execute actions against it to get the correct state after each action. The actions could be generated dumb-randomly, smart-randomly (like QuickCheck etc.), or something in between. Then we combine the actions and states into traces and run the traces MBT style.
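A minimal sketch of that loop, assuming a toy staking model: the `StakeModel` class, the `(kind, validator, amount)` action tuples, and the helper names are all hypothetical and only illustrate the shape of the approach, not the real protocol.

```python
import random
from dataclasses import dataclass, field


@dataclass
class StakeModel:
    """Hypothetical reference model: stake per validator, no performance concerns."""
    stake: dict = field(default_factory=dict)

    def apply(self, action):
        """Apply one action and return a snapshot of the resulting state."""
        kind, validator, amount = action
        current = self.stake.get(validator, 0)
        if kind == "delegate":
            self.stake[validator] = current + amount
        elif kind == "undelegate":
            # Assumption: undelegating more than is staked clamps to zero.
            self.stake[validator] = max(0, current - amount)
        else:
            raise ValueError(f"unknown action kind: {kind}")
        return dict(self.stake)


def random_actions(seed, count):
    """'Dumb-random' action generation from a fixed seed, so runs are reproducible."""
    rng = random.Random(seed)
    return [
        (rng.choice(["delegate", "undelegate"]), rng.choice(["v1", "v2"]), rng.randint(1, 10))
        for _ in range(count)
    ]


def build_trace(actions):
    """Run the actions against a fresh model and pair each with its expected state."""
    model = StakeModel()
    return [{"action": action, "expected": model.apply(action)} for action in actions]


def first_divergence(trace, sut):
    """Replay the trace against a system under test; return the first failing step index."""
    for index, step in enumerate(trace):
        if sut.apply(step["action"]) != step["expected"]:
            return index
    return None
```

Here `first_divergence(build_trace(random_actions(0, 25)), sut)` returns `None` when the system under test agrees with the model at every step. A real driver would replace `sut.apply` with calls into the actual chain code.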

Note: there might be some existing tools we could use here, maybe even leveraging an existing PBT framework.

Wider context

I think this could be a good thing to do first: by writing one or more reference implementations I would learn a lot, which would help me write the TLA+ models later. Those models will be used for traditional verification (@mpoke) and also for regular MBT. This work would also complement efforts to create a better test driver (see e.g. #58 (comment)), as a driver will be needed to execute traces. The driver's interface could hook into e.g. a fork of the ibc-go testing framework.

(I discussed this idea with @jtremback earlier today).

Questions

The key question is whether it is cost-effective to write a reference implementation. It must be protocol-correct, of course! My feeling is that it will be very doable.

jtremback · 2022-04-22T21:58:43Z

I think that, for both this issue and #61, the important part will be to implement a driver that takes JSON traces and applies them to either a real network (like our integration tests) or the IBC-Go simulated chains.

This will be the bulk of the work, and from there it will be very easy to test in a variety of ways: handwritten traces, trivially randomized traces that just try to break invariants ("fuzzing"?), or property-based or model-based testing.

I would prioritize writing the driver, unless you think that the driver would be much easier with the knowledge gained from writing a reference implementation.
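A rough sketch of such a driver, assuming a hypothetical trace format: a JSON list of steps, each with an `action` name, optional `args`, and an optional `expected` state. The `handlers` mapping and `read_state` callback stand in for whatever the real network or simulated chains expose.

```python
import json


def run_trace(trace_json, handlers, read_state):
    """Apply each step of a JSON trace to a target and check the expected state.

    handlers:   maps an action name to a callable that executes it on the target.
    read_state: returns the target's current state in the same shape as "expected".
    Returns the index of the first step whose state mismatches, or None.
    """
    steps = json.loads(trace_json)
    for index, step in enumerate(steps):
        handlers[step["action"]](**step.get("args", {}))
        if "expected" in step and read_state() != step["expected"]:
            return index
    return None
```

Because the driver only depends on `handlers` and `read_state`, the same trace could in principle be run against different targets by swapping those two arguments.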

jtremback · 2022-04-22T22:00:36Z

Actually, since we already have a driver for the integration tests, maybe an attempt to drive that with a reference implementation would be a good way to highlight any issues in the driver, and also help us learn what the limitations of tests on a real network are.

mpoke · 2022-04-25T13:07:22Z

I like the idea of a reference implementation, but I don't know how difficult it would be to make one that contains enough detail to be relevant. For example, we could implement the entire IBC communication as a queue, but a straightforward implementation would not cover relaying delays.

danwt · 2022-04-25T13:22:03Z

Cool.

> but a straightforward implementation would not cover relaying delays.

There could be an action 'deliver' that moves a message from the queue to the chain, so not calling that action would simulate a delay.
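A small sketch of that idea, with a hypothetical `Network` class: packets sit in the queue until `deliver` is called, so a trace that contains other actions between `send` and `deliver` models a relaying delay.

```python
from collections import deque


class Network:
    """Hypothetical model of the channel: a FIFO queue plus an explicit deliver action."""

    def __init__(self):
        self.in_flight = deque()  # sent but not yet delivered
        self.delivered = []       # received by the destination chain, in order

    def send(self, packet):
        """The sending chain commits a packet; it is now in flight."""
        self.in_flight.append(packet)

    def deliver(self):
        """Move the oldest in-flight packet to the destination chain, if there is one."""
        if not self.in_flight:
            return None
        packet = self.in_flight.popleft()
        self.delivered.append(packet)
        return packet
```

For example, after `send("a")` and `send("b")` nothing has arrived at the destination until `deliver()` is called, and each call moves exactly one packet.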

> how difficult would be to make one that contains enough details to be relevant

Yeah, that's an open question, but one I'll have to solve anyway in order to write a model-checkable model.

danwt · 2022-04-26T13:45:35Z

@sainoe this is the best issue for describing the methodology we discussed in our meeting

mpoke · 2022-04-26T15:39:54Z

The action space for CCV (i.e., external API):

On provider:
- delegate(D, V, A), where D is a delegator, V is a validator and A is an amount
- undelegate(D, V, A)
- redelegate(D, Vsrc, Vdst, A)
- slash(V, infractionHeight, slashFactor)
- jumpToBlock(h), where h is a height larger than the current height

On consumer:
- slash(V, infractionHeight, slashFactor)
- jumpToBlock(h)

As discussed with @danwt, for now we assume that the channel initialization is established.
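The action space above could be encoded as plain data types plus a random generator, roughly as below. This is only a sketch: the class names, the fixed `DELEGATORS` and `VALIDATORS` pools, the amount and slash-factor ranges, and the assumption that `infractionHeight` is at most the current height are all illustrative choices, not part of the list above.

```python
import random
from dataclasses import dataclass


@dataclass(frozen=True)
class Delegate:
    delegator: str
    validator: str
    amount: int


@dataclass(frozen=True)
class Undelegate:
    delegator: str
    validator: str
    amount: int


@dataclass(frozen=True)
class Redelegate:
    delegator: str
    src: str
    dst: str
    amount: int


@dataclass(frozen=True)
class Slash:
    validator: str
    infraction_height: int
    slash_factor: float


@dataclass(frozen=True)
class JumpToBlock:
    height: int


DELEGATORS = ["d0", "d1"]        # hypothetical fixed pools
VALIDATORS = ["v0", "v1", "v2"]


def random_action(rng, chain, current_height):
    """Pick a random action that is valid for the given chain ('provider' or 'consumer')."""
    kinds = ["slash", "jump"]
    if chain == "provider":
        kinds += ["delegate", "undelegate", "redelegate"]
    kind = rng.choice(kinds)
    if kind == "jump":
        # jumpToBlock(h) requires h to be larger than the current height.
        return JumpToBlock(current_height + rng.randint(1, 10))
    if kind == "slash":
        # Assumption: the infraction happened at or before the current height.
        return Slash(rng.choice(VALIDATORS), rng.randint(0, current_height), rng.choice([0.01, 0.05]))
    delegator = rng.choice(DELEGATORS)
    amount = rng.randint(1, 100)
    if kind == "redelegate":
        src, dst = rng.sample(VALIDATORS, 2)  # two distinct validators
        return Redelegate(delegator, src, dst, amount)
    validator = rng.choice(VALIDATORS)
    if kind == "delegate":
        return Delegate(delegator, validator, amount)
    return Undelegate(delegator, validator, amount)
```

Channel initialization is not represented, matching the assumption that the channel is already established.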

danwt · 2022-05-04T16:50:18Z

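The first application of this methodology is captured by two issues:

1. Write differential testing driver using ibc-go framework #87
2. Write reference implementation to generate traces for differential testing #88

I will do (1) first. In my experience it is better to start with a minimal driver.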

danwt · 2022-05-09T17:23:47Z

> I like the idea of a reference implementation, but I don't know how difficult would be to make one that contains enough details to be relevant. For example, we could implement the entire IBC communication as a queue, but a straightforward implementation would not cover relaying delays.

From looking at what ibc-go/testing and hermes offer in terms of testing, it should be easy to model the network as a queue with pushOne(x) and popAll(), but it might be harder to implement popOne. This is because sendPacket on the sender acts like pushOne, and UpdateClient on the recipient chain does a popAll() (the same goes for hermes tx raw packet-recv).
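A sketch of the queue described above, with a hypothetical `BatchQueue` class. It deliberately has no `pop_one`, reflecting the observation that the testing tools make single-packet delivery harder to drive.

```python
class BatchQueue:
    """Hypothetical network model: packets accumulate and are delivered as a batch."""

    def __init__(self):
        self._pending = []

    def push_one(self, packet):
        """Corresponds to sendPacket on the sending chain."""
        self._pending.append(packet)

    def pop_all(self):
        """Corresponds to UpdateClient on the recipient: deliver everything pending, in order."""
        batch, self._pending = self._pending, []
        return batch
```

With this model, delays can only be expressed at batch granularity: every packet pushed before a `pop_all()` arrives together.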

konnov · 2022-05-20T18:39:57Z

Hey @danwt, it just occurred to me that we might have missed the following option in the discussions. How about:

- Using TLC in the simulation mode to produce random traces,
- converting them to ITF and
- executing them with Atomkraft or your custom driver?

I don't see any drawbacks in comparison to PBT here. You would get the non-determinism and expressiveness of TLA+ for free. Obviously, you would have the spec in TLA+, not a reference model in Python. By using the simulator, you can avoid the slowdown of TLC and Apalache that we are experiencing with full-scale model checking.

danwt · 2022-06-08T16:38:49Z

Closed, as PR "In-memory differential testing for established chains" #126 tackles this partially, but it is now blocked by the TDD dev cycle. That is, the tests in the PR fail because the SUT is not fully implemented. I'm not sure it makes sense to merge the PR currently, but further work is fine-grained and deserves its own issue. Furthermore, this issue is not well written and lacks closing criteria etc.


danwt self-assigned this Apr 22, 2022

mpoke added this to Replicated Security Apr 26, 2022

mpoke moved this to Todo in Replicated Security Apr 26, 2022

mpoke added the scope: testing label (Code review, testing, making sure the code is following the specification) Apr 26, 2022

This was referenced May 4, 2022

Write differential testing driver using ibc-go framework #87

Closed

Write reference implementation to generate traces for differential testing #88

Closed

cosmos deleted a comment from danwt May 25, 2022

danwt changed the title ~~Apply 'property based testing' based on a reference implementation~~ Apply differential testing based on a reference implementation Jun 6, 2022

danwt closed this as completed Jun 8, 2022

Repository owner moved this from Todo to Done in Replicated Security Jun 8, 2022

mpoke added a commit that referenced this issue Jul 25, 2024

Merge pull request #60 from informalsystems/marius/read-only-v50

f254514

chore!: port to SDK v0.50
