Primary data repository

Aus MelaTAMP

Wechseln zu: Navigation, Suche

Primary data repository

The primary corpus data for our project is held and versioned in a git repository, the remote of which is hosted on the GitLab instance of Humboldt-Universität.

The URL is https://scm.cms.hu-berlin.de/gitlab/druskats/melatamp-data/. The repository itself is private and currently only accessible by members of the project team.

The repository’s branch structure is as follows.

  • master holds only releases of our corpus data. This branch must never be worked upon directly. Changes to it must onl come in the form of pull requests from other branches (i.e., development).
  • development holds working copies of the data. This is the branch that shuold be worked on directly, and changes in the data itself (e.g., annotations) should be pushed to this branch only.
  • annis holds the results of conversion to the ANNIS format via Pepper, and related files.