You are browsing a read-only backup copy of Wikitech. The primary site can be found at wikitech.wikimedia.org

Data Catalog Application Evaluation Rubric/DataHub

From Wikitech-static
< Data Catalog Application Evaluation Rubric
Revision as of 08:47, 26 January 2022 by imported>Razzi
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)
Jump to navigation Jump to search

If you look closely at the datahub dotted box in the middle, you can make out the requirements

- relational database: (mysql / mariadb / postgres)

- graph database: neo4j

- search index: elasticsearch

- ingestion: kafka

- frontend: seems like standard npm-installable react

More notes on requirements:

- java 8 only (java 11 support https://github.com/linkedin/datahub/issues/1699)

- requires confluent's schema registry for kafka (citation needed)

- there is no documented way to run datahub outside of docker... though building locally as though one is developing datahub seems to use local gradle