You are browsing a read-only backup copy of Wikitech. The primary site can be found at wikitech.wikimedia.org

Analytics/Data Lake/Events

From Wikitech-static
< Analytics‎ | Data Lake
Revision as of 21:42, 21 September 2021 by imported>Shay Nowick (Accessing Event Data #raddocs)
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)
Jump to navigation Jump to search

Accessing Event Data

As of September 2020, you have a choice of three engines that can run SQL queries against the Data Lake: Presto, Hive, and Spark. If you're not sure which to choose, Hive is good to start with. All three engines can be used from the Analytics clients.

  • Event vs event_sanitizied*