You are browsing a read-only backup copy of Wikitech. The live site can be found at wikitech.wikimedia.org
Discovery/Analytics/Glent: Difference between revisions
imported>Ebernhardson |
imported>Ebernhardson |
||
(One intermediate revision by one other user not shown) | |||
Line 20: | Line 20: | ||
All deployments of glent code must be accompanied by a release to make the jars available to production services. Releases are handled by the <code>maven-release-plugin</code> and can be run in a two stage process: | All deployments of glent code must be accompanied by a release to make the jars available to production services. Releases are handled by the <code>maven-release-plugin</code> and can be run in a two stage process: | ||
#<code>./mvnw | #<code>./mvnw release:prepare</code> in the source repository which updates the version numbers. If your system username is different that the one in gerrit, use <code>-Dusername=...</code> option. | ||
#<code>./mvnw | #<code>./mvnw release:perform</code> in the source repository - this deploys the artifacts to archiva. | ||
Note that for the above you will need archiva deployment credentials in your [[labsconsole:Archiva#Deploy_to_Archiva|~/.m2/settings.xml file]] for the <code>archiva.releases</code> and <code>archiva.snapshots</code> repositories. | Note that for the above you will need archiva deployment credentials in your [[labsconsole:Archiva#Deploy_to_Archiva|~/.m2/settings.xml file]] for the <code>archiva.releases</code> and <code>archiva.snapshots</code> repositories. | ||
If there is some problem with preparing the release the local repository can be reset with: | |||
<code>./mvnw release:rollback</code> | |||
== Deployment == | == Deployment == | ||
The analytics portion of glent is run by an airflow dag in the gerrit project <code>wikimedia/discovery/analytics</code>. Once a new version of glent has been released you will need to [[Discovery/Analytics#Updating_java_jars|update the deployed jars]] | The analytics portion of glent is run by an airflow dag in the gerrit project <code>wikimedia/discovery/analytics</code>. Once a new version of glent has been released you will need to [[Discovery/Analytics#Updating_java_jars|update the deployed jars]]. | ||
Once the jars are updated, [[Discovery/Analytics#How to deploy|deploy the jars to Airflow production]]. |
Latest revision as of 17:51, 28 September 2020
Glent is a search platform project that generates query suggestions based on search logs in a batch process. The suggestions are regenerated once a week and shipped to the production cirrussearch clusters to be presented to users when their search query is similar to a query glent has suggestions for. This documention is limited to the analytics portion of the glent suggestions pipeline. Querying the glent data and presenting suggestions to users is implemented in the CirrusSearch MediaWiki extension.
Development Environment
You will need java and maven for the analytics portion of glent.
Code
The source code is in gerrit project search/glent
. To start working with glent clone the repository:
git clone https://gerrit.wikimedia.org/r/search/glent
Build
You can build the distribution package by running:
./mvnw package
and the package will be in the target/
directory.
Release
All deployments of glent code must be accompanied by a release to make the jars available to production services. Releases are handled by the maven-release-plugin
and can be run in a two stage process:
./mvnw release:prepare
in the source repository which updates the version numbers. If your system username is different that the one in gerrit, use-Dusername=...
option../mvnw release:perform
in the source repository - this deploys the artifacts to archiva.
Note that for the above you will need archiva deployment credentials in your ~/.m2/settings.xml file for the archiva.releases
and archiva.snapshots
repositories.
If there is some problem with preparing the release the local repository can be reset with:
./mvnw release:rollback
Deployment
The analytics portion of glent is run by an airflow dag in the gerrit project wikimedia/discovery/analytics
. Once a new version of glent has been released you will need to update the deployed jars.
Once the jars are updated, deploy the jars to Airflow production.