Cache for translations #227

jerinphilip · 2021-10-09T11:43:52Z

Mostly copied over from #202 (comment), with a few modifications.

kpu · 2021-10-09T12:02:24Z

I had designed it with the intention you could take an existing shared_ptr object and stuff it in the cache. Is that a useful case?

jerinphilip · 2021-10-09T12:35:54Z

Is that a useful case?

Yes? Start with Ptr<History> don't convert it since you fiercely hate conversions? For us:

using TranslationCache = AtomicCache<size_t, Ptr<History>>;

I'm still suspicious how there won't be any ABA in this cache, but doesn't seem to violate anything in the unit-tests (which are not that big, TBH). We are not editing one memory location, but a whole lot of it. Instructions (which enable lock-free atomics) operate at 1 mem-loc at a time, does it not? If it were this simple, why cant I find this everywhere online, I wonder?

kpu · 2021-10-09T12:49:59Z

I hate unnecessary conversions. I also hate unnecessary memory allocations especially if they are shared_ptr that doubles the memory allocations.

Do I really need shared_ptr<AtomicCache::Record<size_t, shared_ptr<History> > ?

Do not understand your concern about ABA. The object pointed to is kept alive the entire time it's in custody of a shared_ptr so nobody can create a second object there.

jerinphilip · 2021-10-09T13:56:21Z

Do I really need shared_ptr<AtomicCache::Record<size_t, shared_ptr<History> > ?

I ran into std::atomic<..> requires is_trivially_copyable with AtomicCache::Record<size_t, shared_ptr<History>. 15ee836 (#227)

Even without Ptr<History> as value (AtomicCache<int, int>) instead, I am getting linking failures to atomic_load and atomic_store. aa6320a (#227).

I'm not sure where I'm headed, but here's some documentation. Looks like we have conversion, and ProcessedRequestSentence / storage is staying.

https://en.cppreference.com/w/cpp/atomic/atomic#Primary_template

The primary std::atomic template may be instantiated with any TriviallyCopyable type T satisfying both CopyConstructible and CopyAssignable. The program is ill-formed if any of following values is false:

https://en.cppreference.com/w/cpp/types/is_trivially_copyable:

Objects of trivially-copyable types that are not potentially-overlapping subobjects are the only C++ objects that may be safely copied with std::memcpy or serialized to/from binary files with std::ofstream::write()/std::ifstream::read().

kpu · 2021-10-09T16:21:54Z

What was the problem with the version I wrote using an array of std::shared_ptr<History> ?

jerinphilip · 2021-10-09T16:27:05Z

What was the problem with the version I wrote using an array of std::shared_ptr<History>?

I barely tried to get that one to compile (assuming you wrote in a GitHub comment to provide a rough idea, not after testing), which is 2779090 (#227). Record simply came because the following inconsistency.

Find()
      const std::shared_ptr<Entry> &bucket = entries_[hash_(key) % entries_.size()];
                                                      ^

Store()
	        std::shared_ptr<Entry> &bucket = entries_[hash_(*entry) % entries_.size()];
                                                             ^

2779090 (#227) seems to at-least compile with no link errors (although I haven't tried to instantiate a specialization with size_t from marian::Words key and Ptr<History> value).

kpu · 2021-10-09T16:38:19Z

There are two ways to make a store's interface.

Have a key-value struct which should be std::pair just for consistency with the standard library.
Have a single opaque Entry class. The Hash functor is overloaded to accept both Entry and Key. Similarly the Equal functor is overloaded to accept both Entry and Key making a comparison on that. This is what you were missing.

But I think the issue is that the key is not a function of History so much as a function of the input sentence and model used. Therefore you need a size_t key alongside the History object. In that case a key-value store makes sense. It may as well be std::pair<std::size_t, History>. The History object looks moveable. You may as well move (not copy) it out of std::shared_ptr<History> misery into the std::pair<std::size_t, History>. Then we'll have std::vector<std::shared_ptr<std::pair<std::size_t, History> > > as the ultimate type stored by the class (though don't write it that way, use some typedefs).

…etting

…porary setting" This reverts commit 715c9f0.

This reverts commit 15ee836.

This reverts commit aa6320a.

jerinphilip · 2021-10-09T18:01:41Z

Have a single opaque Entry class. The Hash functor is overloaded to accept both Entry and Key. Similarly the Equal functor is overloaded to accept both Entry and Key making a comparison on that. This is what you were missing.

Nothing about this looks pleasant even after multiple re-reads. I genuinely want to know when this works for a solution - could you please provide a reference to better understand the motivations of such an interface?

As for the previous comment, I've done the following:

using Record = std::pair<Key, Value>;

Do I really need shared_ptr<AtomicCache::Record<size_t, shared_ptr<History> > ?

Because History is not TriviallyCopyable, std::pair<size_t, History> is also not TriviallyCopyable. Set that aside, std::pair is not TriviallyCopyable to begin with. Hence we need std::shared_ptr<std::pair<...>> is my understanding as of now.

…ching mechanism

jerinphilip · 2021-10-11T23:46:57Z

@kpu The bergamot-translator library part of cache is added now. I've changed base to a development branch, and to avoid major trouble at reviewing 1000+LoC, for review to be done incrementally. There will be broken/untested intermediates. Further changes will have CLI alterations (I can probably settle for a chaotic "union" config, at the cost of more headhurt) which will get me to test-apps (which contribute to a decent LoC). I request a review happen now so I can work on top of this to generate smaller diffs.

Do I really need shared_ptr<AtomicCache::Record<size_t, shared_ptr<History>> ?

This can be avoided by taking responsibility of mutex-buckets, and is done. (Key, Value) = (size_t(model, words), Ptr<History>). Since Ptr<History> is already allocated and cost incurred, an std::move into a History object only adds the value of hiding things (underlying objects are still Ptr Hypothesis linked list inside Beam). I expect WebAssembly to downgrade gracefully (it only complained at OS specific Semaphores in PCQueue). The atomic_load/share proposition internally had mutexes anyway.

kpu · 2021-10-11T23:54:50Z

src/translator/cache.h

+    std::lock_guard<std::mutex> lock(mutexBuckets_[mutexId]);
+    Record &candidate = records_[index];
+
+    if (!used_[index]) {


Do we actually care about these statistics enough to have an extra vector<bool>?

I have no means to tell if I hit cache or not placing reasonable asserts (at test-apps) without this. Are there alternatives I can use?

The current assert is put in a sentence, get translation. Put same sentence again. assert(cache hits the second time).

The used variable is gone, hit/miss statistics remain.

Also bugfixes.

…ributed

graemenail

Is there some art to arriving at the default values for the cache in the different apps using it. It is clear that the BlockingService should have just a single mutex, but the others?

src/translator/cache.h

src/translator/request.cpp

src/tests/units/cache_tests.cpp

src/translator/cache.h

src/tests/units/cache_tests.cpp

src/translator/cache.h

jerinphilip · 2021-10-26T09:29:50Z

Is there some art to arriving at the default values for the cache in the different apps using it.

My understanding is this is a coarse-to-fine-grained locking control knob. An upper bound could simply be the min(number of processors, workers) simultaneously accessing the shared memory thus providing enough partitions to avoid contention. At 1 bucket, there's max contention. At min(worker, cores) there's hopefully minimum contention.

We did a search for this for L4 (#202 (comment)). Could do if necessary, bit cumbersome to setup.

I see that this should be documented somewhere. Will try to see where best fits.

The base branch was changed.

XapaJIaMnu · 2021-11-01T08:53:25Z

src/translator/cache.h

+    const Record &candidate = records_[index];
+    if (equals_(key, candidate.first)) {
+      value = candidate.second;
+      stats_.hits += 1;


I am a bit late to this party, but shouldn't that be atomic?

There's a mutex above making this section atomic..?

Oh wait, you're right with regards to stats..

std::atomic<int>

Adding @kpu's atomic cache with a unit test

2779090

jerinphilip changed the title ~~Cache with std::atomic on shared-pointers~~ Cache with atomic on shared-pointers Oct 9, 2021

Jerin Philip added 2 commits October 9, 2021 13:43

Trying to shake shared_ptr off

aa6320a

History is not trivially copyable compiler error on CI

15ee836

Jerin Philip added 7 commits October 9, 2021 17:24

Remove history compile issues, advance with -latomic in a temporary s…

715c9f0

…etting

Revert "Remove history compile issues, advance with -latomic in a tem…

cee01a0

…porary setting" This reverts commit 715c9f0.

Revert "History is not trivially copyable compiler error on CI"

64ed124

This reverts commit 15ee836.

Revert "Trying to shake shared_ptr off"

75e1b70

This reverts commit aa6320a.

Trying to instantiate shared_ptr<size_t, History> cache

eb5ca7c

using Record = std::pair<Key, Value> to confirm with convention

5153606

Use query, key to avoid confusion

cf7e5fe

Jerin Philip added 6 commits October 9, 2021 18:03

query == value rename fix

903c86d

Take responsiblity for mutexes, shake shared_ptrs off

910da34

Add Stats, remove obsolete comments

8037433

Wiring TranslationCache in the marian::bergamot layer

6f0fd2a

Style matching rest of bergamot for cache

b0711fb

Further integration bergamot layer - modelId, hash(model, words), bat…

64652bf

…ching mechanism

jerinphilip changed the base branch from main to cache October 11, 2021 23:28

kpu reviewed Oct 11, 2021

View reviewed changes

Jerin Philip added 2 commits October 12, 2021 00:21

Add documentation for model

f9c2eb7

Unit tests: Place some asserts on stats

e53ec86

Also bugfixes.

Jerin Philip added 7 commits October 12, 2021 00:49

Indices randomly distributed, mod on lesser size_t also randomly dist…

b9f7d22

…ributed

A minimal integration test

bece8bc

Removing the used member and unused statistics

b02f1b7

[BRT]: Checking in tests, will fail due to random eviction

f49ae6c

Relax second time hits = first time misses condition

0924499

Fixes after testing cache with WebAssembly codepath

0468af1

Parsing, fix cache.size parsing

2bcd8af

This was referenced Oct 25, 2021

Caching translations implementation #202

Closed

Python bindings and a module #234

Closed

graemenail requested changes Oct 26, 2021

View reviewed changes

jerinphilip commented Oct 26, 2021

View reviewed changes

src/translator/cache.h Outdated Show resolved Hide resolved

Jerin Philip added 5 commits October 26, 2021 10:03

Making using Record = <...> private

fa9353a

Removing extra seed using modelId directly

0916754

Using formerly unused variable equals_ for equality checking

c6039dc

Removing commented code

35d38f3

Document at config parameters to explain what values to choose

56fd072

graemenail self-requested a review October 27, 2021 08:17

graemenail previously approved these changes Oct 27, 2021

View reviewed changes

jerinphilip changed the base branch from cache to main October 27, 2021 18:17

Merge branch 'main' into atomic-cache

2ea2d54

jerinphilip changed the title ~~Cache with atomic on shared-pointers~~ Cache for translations Oct 27, 2021

jerinphilip merged commit 2b98c67 into browsermt:main Oct 27, 2021

jerinphilip mentioned this pull request Oct 31, 2021

Caching translations #201

Closed

XapaJIaMnu reviewed Nov 1, 2021

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Cache for translations #227

Cache for translations #227

jerinphilip commented Oct 9, 2021

kpu commented Oct 9, 2021 •

edited

Loading

jerinphilip commented Oct 9, 2021

kpu commented Oct 9, 2021

jerinphilip commented Oct 9, 2021 •

edited

Loading

kpu commented Oct 9, 2021 •

edited

Loading

jerinphilip commented Oct 9, 2021 •

edited

Loading

kpu commented Oct 9, 2021 •

edited

Loading

jerinphilip commented Oct 9, 2021

jerinphilip commented Oct 11, 2021

kpu Oct 11, 2021

jerinphilip Oct 11, 2021

jerinphilip Oct 20, 2021

graemenail left a comment

jerinphilip commented Oct 26, 2021

XapaJIaMnu Nov 1, 2021

jerinphilip Nov 1, 2021

jerinphilip Nov 1, 2021

XapaJIaMnu Nov 1, 2021

Cache for translations #227

Cache for translations #227

Conversation

jerinphilip commented Oct 9, 2021

kpu commented Oct 9, 2021 • edited Loading

jerinphilip commented Oct 9, 2021

kpu commented Oct 9, 2021

jerinphilip commented Oct 9, 2021 • edited Loading

kpu commented Oct 9, 2021 • edited Loading

jerinphilip commented Oct 9, 2021 • edited Loading

kpu commented Oct 9, 2021 • edited Loading

jerinphilip commented Oct 9, 2021

jerinphilip commented Oct 11, 2021

kpu Oct 11, 2021

Choose a reason for hiding this comment

jerinphilip Oct 11, 2021

Choose a reason for hiding this comment

jerinphilip Oct 20, 2021

Choose a reason for hiding this comment

graemenail left a comment

Choose a reason for hiding this comment

jerinphilip commented Oct 26, 2021

XapaJIaMnu Nov 1, 2021

Choose a reason for hiding this comment

jerinphilip Nov 1, 2021

Choose a reason for hiding this comment

jerinphilip Nov 1, 2021

Choose a reason for hiding this comment

XapaJIaMnu Nov 1, 2021

Choose a reason for hiding this comment

kpu commented Oct 9, 2021 •

edited

Loading

jerinphilip commented Oct 9, 2021 •

edited

Loading

kpu commented Oct 9, 2021 •

edited

Loading

jerinphilip commented Oct 9, 2021 •

edited

Loading

kpu commented Oct 9, 2021 •

edited

Loading