You are browsing a read-only backup copy of Wikitech. The live site can be found at wikitech.wikimedia.org

Revision history of "Analytics/Systems/Cluster/Gobblin"

Jump to navigation Jump to search

Diff selection: Mark the radio boxes of the revisions to compare and hit enter or the button at the bottom.
Legend: (cur) = difference with latest revision, (prev) = difference with preceding revision, m = minor edit.

  • curprev 22:21, 3 August 2021imported>Neil P. Quinn-WMF 1,554 bytes +181 Add some historical information
  • curprev 15:07, 29 July 2021imported>Ottomata 1,373 bytes +1,373 Created page with "[https://gobblin.apache.org/ Apache Gobblin] is Hadoop ingestion software used at WMF primarily to import data from Kafka into HDFS. == Gobblin jobs == Gobblin jobs are [https://github.com/wikimedia/puppet/blob/production/modules/profile/manifests/analytics/refinery/job/gobblin.pp declared in puppet]. == WMF's Gobblin fork == The Data Engineering team maintains a [https://gerrit.wikimedia.org/g/analytics/gobblin fork of Gobblin]. We use this fork to maintain our own [..."