Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

(DATASET) Revenue and water production records from seven rural piped water service providers, 2016-2020 #24

Open
uptimeandrew opened this issue Nov 6, 2023 · 1 comment

Comments

@uptimeandrew
Copy link

This dataset has been compiled as part of a PhD project from secondary, longitudinal records maintained by seven agencies that operate piped water services in rural areas of ten countries. Three of the agencies are international nongovernmental organisations that operate as social enterprises. Three agencies are private companies that offer a range of engineering, construction, and management services. One agency is a public utility. Operational data have been extracted, with permission, from proprietary online and offline electronic databases maintained by the participating agencies. Deidentified data have been transferred to a master database where additional transformations and aggregations have been performed. The master database contains separate databases for water service areas, geographic regions, financials, and service levels which are linked with a record identification number corresponding to unique piped water service areas. Transformed data are documented with comments that explain relationships between variables. The full dataset covers roughly 5,500 waterpoints and represents services provided to more than half a million people spanning the years 2016 to 2020.

In general, the dataset consists of the following:

  • Service area characteristics (geolocations and jurisdictions, planning and implementation approach, installation dates)
  • Financials (monthly revenues, tariff rates, payment modalities)
  • Service levels (monthly waterpoint types and counts, volumes, scheme configuration details)

Furthermore, the data adhere to the following criteria:

  1. Services cover rural or a mixture of rural and peri-urban areas
  2. Infrastructure consists of small to medium-sized piped schemes covering one or multiple villages with metered on and off premises connections
  3. Users make regular financial contributions
  4. At least 12 concurrent monthly revenue and water volume records are available

Since the assembled dataset holds long-term value to the public, the master database containing anonymised data has been preserved under controlled access in the Oxford University Research Archive at https://ora.ox.ac.uk/objects/uuid:85bb166a-1065-4c4c-867d-5823f11831c9.

@larnsce
Copy link
Contributor

larnsce commented Nov 8, 2023

Thank you @uptimeandrew for sharing this with us. The dataset sounds very valuable and I had a look at and can see that it also good documentation.

We will discuss in the team how to move forward with a publication process using our R data package workflow.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

2 participants