SQL-based databases that scale horizontally.
See hive.md
See impala.md
See apache-drill.md
- open source
- distributed SQL database
- built on Lucene + Elasticsearch (Elasticsearch used as a library for cluster state, node discovery/management, sharding + replication)
- masterless
- shared nothing
- good for containers
- dynamic schemas
- uses Presto's optimizer
- not ACID-compliant and not a traditional relational database
- full-text search via Lucene
- properly distributed aggregations, faster than Elasticsearch's
- supports JOINs, unlike Cassandra
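
Conceptual sketch of what "properly distributed aggregations" on a shared-nothing cluster can look like: each shard aggregates only its own rows, and a coordinating node merges the partial results. This is an illustration under assumptions, not the database's actual implementation; the function names `partial_aggregate` and `merge_partials` and the sample data are hypothetical.

```python
from collections import defaultdict


def partial_aggregate(rows):
    """Runs on one shard: per-group (sum, count) over local rows only."""
    acc = defaultdict(lambda: [0.0, 0])
    for group, value in rows:
        acc[group][0] += value
        acc[group][1] += 1
    return {g: (s, c) for g, (s, c) in acc.items()}


def merge_partials(partials):
    """Runs on the coordinating node: combine per-shard partials into averages."""
    total = defaultdict(lambda: [0.0, 0])
    for partial in partials:
        for group, (s, c) in partial.items():
            total[group][0] += s
            total[group][1] += c
    return {g: s / c for g, (s, c) in total.items()}


# Hypothetical data: two shards, each holding (region, value) rows.
shards = [
    [("eu", 10.0), ("us", 4.0)],
    [("eu", 20.0), ("us", 6.0), ("us", 8.0)],
]

# Equivalent in spirit to: SELECT region, AVG(value) ... GROUP BY region
averages = merge_partials(partial_aggregate(rows) for rows in shards)
print(averages)
```

Shipping (sum, count) pairs instead of per-shard averages keeps the merged average exact even when shards hold different numbers of rows per group.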