You are browsing a read-only backup copy of Wikitech. The live site can be found at wikitech.wikimedia.org

Difference between revisions of "User:Razzi"

From Wikitech-static
Jump to navigation Jump to search
imported>Razzi
imported>Razzi
Line 138: Line 138:


https://grafana.wikimedia.org/d/000000258/analytics-hadoop?orgId=1
https://grafana.wikimedia.org/d/000000258/analytics-hadoop?orgId=1
DONE

Revision as of 20:17, 22 June 2021


Learning the Wikimedia stack!

<InputBox> type=create placeholder=Article name prefix=User:Razzi/ buttonlabel=Create user article </InputBox>
<inputbox> type=create prefix=User:Razzi/ default=2021-12-08 buttonlabel=Create article for day </inputbox>
<inputbox> type=commenttitle page=User:Razzi buttonlabel=New section on this page </inputbox>

Documentation

User account "Razzi" is not registered.

No changes were found matching these criteria.

Lists (https://gtdfh.liw.fi/quickie-overview/)

Questions

How does refine use salts? https://gerrit.wikimedia.org/r/c/operations/puppet/+/679939

Is /system a default directory for hadoop, or can we remove it?

Is there a place that lists the vlans?

How to check vlan for a host?

Q: Is it expected that when reimaging a host, we see the old name when running homer?

[edit interfaces interface-range disabled]
-    member ge-1/0/13;
[edit interfaces interface-range vlan-analytics1-d-eqiad]
+    member ge-1/0/13;
     member ge-1/0/43 { ... }
[edit interfaces]
+   ge-1/0/13 {
+       description "db1125 {#2221}";
+   }

^ this is while decommissioning db1125

A: No, I skipped some netbox steps; when I fixed them this didn't show up

Q: How to submit a test job to the yarn queue to test if it is accepting jobs?

Q: What to do about this warning on analytics1068?

May 06 21:03:35 analytics1068 systemd[1]: /run/systemd/generator.late/hadoop-yarn-nodemanager.service:18: PIDFile= late/hadoop-yarn-nodemanager.service:18: PIDFile= references path below legacy directory /var/run/, updating /var/run/hadoop-yarn/yarn-yarn-nodemanager.pid → /run/hadoop-yarn/yarn-yarn-nodemanager.pid → /run/hadoop-yarn/yarn-yarn-nodemanager.pid; please update the unit file accordingly.

Q: Server Lifecycle#Rename while reimaging when to merge homer patch?

A: homer patch is for firewall, not having to do with the reimaging process. Merge after reimage complete

Q: What is the order for creating puppet patches when it comes to server lifecycle? Some things that might need to be avoided: having site.pp for node that is being decommissioned, having site.pp for node that doesn't exist yet

Ideas

Script to show what tickets are currently in progress

Add homer-public to codesearch

Remove legacy analytics-hadoop from grafana

Random notes

sudo lsof -Xd DEL - lists the files that have been deleted but are still held open by a running process

Puppet

https://www.digitalocean.com/community/tutorials/getting-started-with-puppet-code-manifests-and-modules

Why does sshing into mgmt not accept the password?

Because you forgot the `root@` part!

Instead of ssh dbstore1007.mgmt.e

do `ssh root@dbstore1007.mgmt.e`

Or make ssh use the root user in your ~/.ssh/config: https://stackoverflow.com/questions/10197559/ssh-configuration-override-the-default-username

refactor this to run automatically

https://wikitech.wikimedia.org/wiki/Analytics/Systems/AQS#Deploy_new_History_snapshot_for_Wikistats_Backend

Why no homer diff?

TBD

how to check what vlan a host belongs to?

???

Proposal: stop using conda for infrastructure

Why not use standard pip?

How to apply hadoop config changes?

For example https://gerrit.wikimedia.org/r/c/operations/puppet/+/698194/1/hieradata/common.yaml

linux-host-entries.ttyS0-115200 versus linux-host-entries.ttyS1-115200

a mystery

sudo gnt-instance console an-airflow1002.eqiad.wmnet is stuck, is this normal?

Gotta stop and start, the old reboot trick

sudo gnt-instance stop an-airflow1003.eqiad.wmnet

how to restart services on hadoop coordinator?

for https://phabricator.wikimedia.org/T283067

Want to restart services for an-test-coord1001 and an-coord*

But how to do this safely?

Set boot order to disk - "upstream is aware" - any issue to track?

Ganeti#Create a VM

Can we delete the hadoop-analytics grafana section now?

https://grafana.wikimedia.org/d/000000258/analytics-hadoop?orgId=1

DONE