You are browsing a read-only backup copy of Wikitech. The live site can be found at wikitech.wikimedia.org

Analytics/Systems/Cluster/Gobblin: Revision history

Jump to navigation Jump to search

Diff selection: Mark the radio buttons of the revisions to compare and hit enter or the button at the bottom.
Legend: (cur) = difference with latest revision, (prev) = difference with preceding revision, m = minor edit.

17 May 2022

22 March 2022

26 October 2021

1 October 2021

3 August 2021

29 July 2021

  • curprev 15:0715:07, 29 July 2021imported>Ottomata 1,373 bytes +1,373 Created page with "[https://gobblin.apache.org/ Apache Gobblin] is Hadoop ingestion software used at WMF primarily to import data from Kafka into HDFS. == Gobblin jobs == Gobblin jobs are [https://github.com/wikimedia/puppet/blob/production/modules/profile/manifests/analytics/refinery/job/gobblin.pp declared in puppet]. == WMF's Gobblin fork == The Data Engineering team maintains a [https://gerrit.wikimedia.org/g/analytics/gobblin fork of Gobblin]. We use this fork to maintain our own [..."