Skip to content

ActiveColumn is a data management framework for Cassandra. It includes data migrations similar to ActiveRecord, and a data mapping framework for "time line" modeled data.

Notifications You must be signed in to change notification settings

FewKinG/active_column

 
 

Repository files navigation

IMPORTANT: If you are reading this on the main ActiveColumn page on github, please go to the actual README page so that links bring you to the right place.

ActiveColumn

ActiveColumn is a framework for working with data in Cassandra. It currently includes two features:

  • Database migrations
  • "Time line" model data management

Data migrations are very similar to those in ActiveRecord, and are documented in Migrate.

Time line data management is loosely based on concepts in ActiveRecord, but is adapted to saving data in which rows in Cassandra grow indefinitely over time, such as in the oft-used Twitter example for Cassandra. This usage is documented in:

Installation

Add ActiveColumn to your Gemfile:

gem 'active_column'

Install with bundler:

bundle install

Usage

Configuration

ActiveColumn requires Cassandra 0.7 or above, as well as the cassandra gem, version 0.9 or above. You must also be sure to use the Cassandra 0.7 support in the gem, which can be done by adding Cassandra to your Gemfile like this:

gem 'cassandra', '>= 0.9', :require => 'cassandra/0.7'

Data migrations in ActiveColumn are used within a Rails project, and are driven off of a configuration file, config/cassandra.yml. It should look something like this:

config/cassandra.yml

test:
  servers: "127.0.0.1:9160"
  keyspace: "myapp_test"
  thrift:
    timeout: 3
    retries: 2

development:
  servers: "127.0.0.1:9160"
  keyspace: "myapp_development"
  thrift:
    timeout: 3
    retries: 2

You can use embedded ruby code in the YAML file to determine host/machine specific settings.

  production:
    servers: "<%=get_from_file('abc.conf')%>:9160"
    keyspace: "<%=get_from_file('abc.conf')%>"
    disable_node_auto_discovery: true
    thrift:
      timeout: 3
      retries: 2

Node Auto Discovery

You can set disable_node_auto_discovery to off by setting disable_node_auto_discovery flag in your cassandra.yml

In order to get time line modeling support, you must provide ActiveColumn with an instance of a Cassandra object. Since you have your cassandra.yml from above, you can do this very simply like this:

config/initializers/cassandra.rb

config = YAML.load_file(Rails.root.join("config", "cassandra.yml"))[Rails.env]
$cassandra = Cassandra.new(config['keyspace'],
                           config['servers'],
                           config['thrift'])

ActiveColumn.connection = $cassandra

As you can see, I create a global $cassandra variable, which I use in my tests to validate data directly in Cassandra.

Examples

Add column family

  create_column_family :impressions do |cf|
    cf.comment = 'impressions for something'
    cf.comparator_type = :utf8 
    cf.key_validation_class = :utf8 
  end

Drop column family

  drop_column_family :impressions

Rename column family

  rename_column_family :impressions, :showings

Update column family

  update_column_family :impressions do |cf|
    cf.comment = "blah"
    cf.gc_grace_seconds = 3600
  end

One other thing to note is that you obviously must have Cassandra installed and running! Please take a look at the mama_cass gem for a quick way to get up and running with Cassandra for development and testing.

About

ActiveColumn is a data management framework for Cassandra. It includes data migrations similar to ActiveRecord, and a data mapping framework for "time line" modeled data.

Resources

Stars

Watchers

Forks

Packages

No packages published

Languages

  • Ruby 100.0%