You are browsing a read-only backup copy of Wikitech. The primary site can be found at wikitech.wikimedia.org
Analytics/Systems/Presto/Administration
Roll restart the Presto cluster
There is a Spicerack cookbook for it!
Check metrics about Presto
The first thing to check is the Grafana dashboard: https://grafana.wikimedia.org/d/pMd25ruZz/presto
Presto is currently deployed in the following hosts:
- an-coord1001 - Presto Server (acting as coordinator)
- an-presto100x - Presto Server (acting as worker)
All the queries are managed by the Presto Coordinator, that uses the workers to fetch data from HDFS when needed.
Who/What uses Presto?
Presto is a new service that the Analytics team is experimenting with to provide a faster and better read-access to HDFS data compared to other systems like Hive. It is used mostly by Analysts/Researchers via CLI or via Superset.
Presto server down
There is a nagios check to control that the Presto server daemon is up and running on every Presto node. As described above, if one of the daemons is reported down please check metrics and contact the Analytics team.