With this extension enabled, the manage view for organizations is provided with the additional tab data.world. Within the data.world tab organization admins can specify syncronization options that will apply for that organization.
CKAN version 2.4 or greater (including 2.7).
All versions support celery backend, but version 2.7 will use RQ. There are no changes required to use new backend - just start it using:
paster --plugin=ckan jobs worker -c /config.ini
instead of:
paster --plugin=ckan celeryd run -c /config.ini
Details at http://docs.ckan.org/en/latest/maintaining/background-tasks.html
To install ckanext-datadotworld:
Activate your CKAN virtual environment, for example:
. /usr/lib/ckan/default/bin/activate
If you already have an older version of this extension, remove it first:
pip uninstall -y ckanext-datadotworld
Install the ckanext-datadotworld Python package into your virtual environment:
pip install git+https://github.com/datadotworld/ckanext-datadotworld
Add
datadotworld
to theckan.plugins
setting in your CKAN config file (by default the config file is located at/etc/ckan/default/production.ini
).Create DB tables:
paster --plugin=ckanext-datadotworld datadotworld init -c /config.ini paster --plugin=ckanext-datadotworld datadotworld upgrade -c /config.ini
Start celery daemon either with suprevisor or using paster:
paster --plugin=ckan celeryd run -c /config.ini
Attempts to push failed datasets can be scheduled by adding the following line to cron:
* 8 * * * paster --plugin=ckanext-datadotworld datadotworld push_failed -c /config.ini
A similar solution enables syncronization with remote (i.e. not uploaded) resources with data.world:
* 8 * * * paster --plugin=ckanext-datadotworld datadotworld sync_resources -c /config.ini
Delay option
There is a 1 second delay configured by default. This delay period can be controlled by modifying the "ckan.datadotworld.request_delay" configuration variable within the CKAN ini file.
For example:
ckan.datadotworld.request_delay = 1
To ensure that the delay will work correctly, you also need to configure Celery to work in single thread mode. To do this, add the following flag to the Celery start command:
--concurrency=1
Details at http://celery.readthedocs.io/en/latest/userguide/workers.html#concurrency.
In order to add data.world banner on dataset page(currently it seats at the top of package_resources block) you may add next snippet to template with datadotworld_extras variable that contains object(model) with currently viewed package's datadotworld extras and org_id - owner organization of viewed packaged:
{% snippet 'snippets/datadotworld/banner.html', org_id=pkg.owner_org, datadotworld_extras=c.pkg.datadotworld_extras %}
Sidebar label may be added by placing next snippet to your template(org_id is ID of viewed organization):
{% snippet 'snippets/datadotworld/label.html', org_id=organization.id %}
To install ckanext-datadotworld for development, activate your CKAN virtualenv and do the following:
git clone https://github.com/datadotworld/ckanext-datadotworld.git cd ckanext-datadotworld python setup.py develop paster datadotworld init -c /config.ini
Make sure you follow the CKAN testing guide (http://docs.ckan.org/en/latest/contributing/test.html). To run the tests, do the following:
nosetests --ckan --nologcapture --with-pylons=test.ini
To run the tests and produce a coverage report, first make sure you have coverage installed in your virtualenv (pip install coverage
) then run:
nosetests --ckan --nologcapture --with-pylons=test.ini --with-coverage --cover-package=ckanext.datadotworld --cover-inclusive --cover-erase --cover-tests