You are browsing a read-only backup copy of Wikitech. The live site can be found at wikitech.wikimedia.org
Obsolete:2015 analytics datastore evaluation
This page will help us decide on a centralized data store to handle analytics at WMF.
- Spark SQL
- this is not a datastore? Hm, I guess Hive isn't really eitiher?
- Hive on Spark
- Not yet released.
- Hive on Tez
- OpenTSDB (HBase)
- Apache Drill
- This is like Impala, but Impala would probably be easier (and maybe better)
- Mondrian + PostgreSQL
- Crate (Elastic Search)
This page is helpful: http://blog.matthewrathbone.com/2014/06/08/sql-engines-for-hadoop.html