Skip to content

An alternative to Riak Counters with idempotent writes within a client defined window.

License

Notifications You must be signed in to change notification settings

drewkerrigan/riak-ruby-ledger

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

61 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Riak-Ruby-Ledger

Gem Version

An alternative to Riak Counters with idempotent writes within a client defined window.

Summary

Quick Links

Below are a few documents that are relevant to this gem, please read before considering using this gem for anything important.

Riak Ruby Ledger Docs
Document Link Description
[docs/riak_counter_drift.md] Why Riak Counters may or may not work for your use case (Counter Drift).
[docs/implementation.md] Implementation details about this gem as well as some of the reasoning behind the approach.
[docs/usage.md] Suggested usage of this gem from your application, and implications of changing various settings.
[docs/release_notes.md] Information about what changed in each version

Counter Drift

Why shouldn't I use Riak Counters?

CRDT PNCounters (two plain GCounters) such as Riak Counters are non-idempotent and store nothing about a counter request_id other than the final value. This means that if an increment operation fails in any number of ways (500 response from server, process that made the call dies, network connection is interrupted, operation times out, etc), your application now has no idea whether or not the increment actually happened.

What is Counter Drift?

In the above situation of a failed increment operation, your application has two choices:

  1. Retry the operation: This could result in the operation occuring twice causing what is called positive counter drift
  2. Don't retry the operation: This could result in the operation never occuring at all causing negative counter drift

As such it doesn't make sense to use plain GCounters or PNCounters to store any counter that needs to be accurate.

More information about Riak Counters and Drift: [docs/riak_counter_drift.md]

Implementation

The data type implemented is a PNCounter CRDT with an ordered array of request_ids for each GCounter actor. Request ids are stored with the GCounter, so operations against this counter are idempotent while the request_id remains in any actor's array.

High Level API

Function Description
Riak::Ledger.new Creates a new Ledger instance
Riak::Ledger.find! Finds an existing Ledger in Riak, merges it locally, and then writes the merged value back to Riak
#credit!, #debit!, #update! Reads the existing state of the ledger from Riak, merges it locally, and adds a new request_id and positive or negative value

Ledger Options

Name Description
:retry_count[Integer] When a write to Riak is a "maybe" (500, timeout, or any other error condition), resubmit the request :retry_count number of times, and return false if it is still unsuccessful
:history_length[Integer] Keep up to :history_length number of request_ids in each actor's section of the underlying GCounter. When the (:history_length + 1)th request_id is written then merged, add the oldest request_id's value to the actor's total

More information about the implementation and how edge cases can be avoided: [docs/implementation.md]

Suggested Usage and Configuration

Depending on your use case, you may want to tweak the configuration options :history_length and :retry_count.

The default :history_length is 10. This means that if a request_id fails, but your application is unable to determine whether or not the counter was actually incremented, you have a buffer space or window of 9 additional request_ids on that counter before you can no longer retry the original failed request_id without assuming counter drift is happening.

The default :retry_count is also 10. This means that if a request_id fails, the actor that attempted the request_id will continue trying 9 more times. If the request to change the counter still fails after the 10th try, the operation will return false for failure. At this point your application can attempt to try the request_id again, or return a failure to the user with a note that the request_id will be retried in the future.

An example of a failure might look like the following:

  1. request_id1 fails with actor1, and because of the nature of the failure, your application is unsure whether or not the counter was actually incremented.

    1. If your :retry_count is low, you can quickly determine in your application that something went wrong, and inform the user that the request_id was unsuccessful for now, but will be attempted later
    2. If your :retry_count is high, the user will be kept waiting longer, but the odds of the request_id eventually working are higher
  2. If after the initial retries, the request_id was still a failure, your application must decide what to do next

    1. If your :history_length is low, your options are limited. You must continue to retry that same failed request_id for that user (using any available actor) until it is successful. If you allow additional request_ids to take place on the same counter before retrying, you run a high risk of counter drift.
    2. If your :history_length is medium-high, then you have an allowance of (:history_length - 1) additional request_ids for that counter before you run the risk of counter drift.

Note

This gem cannot guarentee request_id idempotence of a counter for greater than :history_length number of request_ids.

More information about configuration and implications of changing various settings: [docs/usage.md]

Additional Reading

Document Link Description
[http://hal.upmc.fr/docs/00/55/55/88/PDF/techreport.pdf] CRDT paper from Shapiro et al. at INRIA
[http://basho.com/counters-in-riak-1-4/] Riak Counters
[github.com/basho/riak_dt] Other Riak Data Types

Installation

Add this line to your application's Gemfile:

gem 'riak-ruby-ledger'

And then execute:

$ bundle

Or install it yourself as:

$ gem install riak-ruby-ledger

Usage

Initialize

require 'riak' # riak-client gem
require 'ledger' # riak-ruby-ledger gem

# Name each of your threads
Thread.current["name"] = "ACTOR1"

# Create a Riak::Client instance
client = Riak::Client.new pb_port: 8087

# Get the Riak::Bucket instance
bucket = client["ledgers"]

# Set allow_mult to true
bucket.allow_mult = true unless bucket.allow_mult

# Default option values
options = {
	:actor => Thread.current["name"], # Actor ID, one per thread or serialized writer
	:history_length => 10, # Number of request_ids to store per actor per type (credit or debit)
	:retry_count => 10 # Number of times to retry Riak requests if they fail
}

# Create the ledger object
ledger = Riak::Ledger.new(bucket, "player_1", options)

Credit and debit

With a request id:

ledger.credit!(50, "request_id1")
ledger.value # 50
ledger.debit!(10, "request_id2")
ledger.value # 40

ledger.debit!(10, "request_id2")
ledger.value # 40

Without a request id:

If no request id is given, a UUID will be generated. If a credit! or debit! call fails, the generated request id will be provided for retries

ledger.credit!(50)
ledger.value # 50
ledger.debit!(10)
ledger.value # 40

ledger.debit!(10)
ledger.value # 40

Finding an exisitng Ledger

ledger = Riak::Ledger.find!(client["ledgers"], "player_1", options)
ledger.value # 40

Request success

If a call to #debit! or #credit! does not return false, then the request_id can be considered saved, because it would have retried otherwise. Still, for debugging, testing, or external failure policies, #has_request_id? is also exposed

ledger.has_request_id? "request_id2" # true
ledger.has_request_id? "request_id1" # true

Merging after history_length is reached

For this example, the :history_length is lowered to 3 so it gets reached faster.

options = {:history_length => 3}
ledger = Riak::Ledger.new(client["ledgers"], "player_2", options)

ledger.credit!(10, "req1")
ledger.credit!(10, "req2")
ledger.credit!(10, "req3")
ledger.credit!(10, "req4")
ledger.credit!(10, "req5")

ledger.value #60

ledger.has_request_id? "req1" #false
ledger.has_request_id? "req2" #false
ledger.has_request_id? "req3" #true

# req3 is still in the history because the most previous write does not trigger a merge of the actor's total
# Performing a find! will trigger the merge however
ledger = Riak::Ledger.find!(client["ledgers"], "player_2", options)

ledger.has_request_id? "req3" #false
ledger.has_request_id? "req4" #true
ledger.has_request_id? "req5" #true
ledger.has_request_id? "req6" #true

Deleting a ledger

ledger.delete()

Contributing

  1. Fork it
  2. Create your feature branch (git checkout -b my-new-feature)
  3. Commit your changes (git commit -am 'Add some feature')
  4. Push to the branch (git push origin my-new-feature)
  5. Create new Pull Request

About

An alternative to Riak Counters with idempotent writes within a client defined window.

Resources

License

Stars

Watchers

Forks

Packages

No packages published

Languages