Contributing

Contributors may familiarize themselves with Celery itself by going through the First Steps with Celery tutorial.

Development model

GWCelery operates on a fork-and-merge development model (see GitLab basics for an introduction).

To contribute to GWCelery development, follow these steps:

  1. Create a personal fork of GWCelery.

  2. Make your changes on a branch.

  3. Open a merge request.

Note that GWCelery uses fast-forward merges.

Where new code should go

New code will generally consist of adding Celery tasks. Tasks are organized by functionality into submodules of gwcelery.tasks. If your new task does not match with one of the existing submodules, please create a new submodule.

Guidelines for tasks

  • Tasks should be short. When deciding where a new task should go, start from the following loose rules of thumb:

    1. If it’s less than a screenful of code, and related to functionality in an existing module, then put the code in a new task in that module.

    2. If it’s up to a few screenfuls of code, or not related to functionality in an existing module, then try to break it into a few smaller functions or tasks and put it in a new module.

    3. If it’s more than a few screenfuls of code, or adds many additional dependencies, then it should go in a separate package.

    See also the note on Granularity in the Celery manual’s Tips and Best Practices section.

  • Tasks should avoid saving files to disk. Output should be placed directly in GraceDB. Temporary files that are written in /tmp are OK but should be cleaned up promptly.

    See also the Celery manual’s notes on Data locality and State.

  • Dependencies should be installable by pip. Dependencies of tasks should be listed in the requirements.txt file so that they are installed automatically when GWCelery is installed with pip.

    There are two extra steps involved in making changes to the dependencies:

    1. The Sphinx-generated documentation (that is to say, this manual) is generally built without most of the dependencies installed. Whenever you add a new package to requirements.txt, you should also add any modules that are imported from that package to the autodoc_mock_imports list in the Sphinx configuration file, doc/conf.py.

    2. We use poetry to make the precise versions of packages reproducible in our deployment. If you make changes to requirements.txt, then run poetry update and commit the changes to poetry.lock.

Unit tests

Unit tests and code coverage measurement are run automatically for every branch and for every merge request. New code contributions must have 100% test coverage. Modifications to existing code must not decrease test coverage. To run the unit tests and measure code coverage, run the following commands in the top directory of your local source checkout:

$ poetry install --extras=test
$ poetry shell
$ pytest --cov --cov-report html

This will save a coverage report that you can view in a web browser as htmlcov/index.html.

Eager mode

Most of GWCelery’s unit tests use eager mode, which causes all tasks to execute immediately and synchronously, even if they are invoked via apply_async() or delay(). This simplifies writing unit tests, but sacrifices realism: it may mask concurrency bugs that may only occur when the tasks are executed asynchronously.

It is preferable to write unit tests that use a live worker so that they are subject to realistic, asynchronous task execution. To opt in to using a live worker, simply decorate your test with the live_worker marker, like this:

@pytest.mark.live_worker
def test_some_task():
    async_result = some_task.delay()
    result = async_result.get()
    assert result == 'foobar'
    # etc.

Code style

Code should be written in the PEP 8 style and must pass linting by Flake8. To check code style, run the following commands in the top of your source directory:

$ pip install flake8 pep8-naming
$ flake8 --show-source .

Documentation

Documentation strings should be written in the Numpydoc style.

To build the documentation, first, install the extra test dependencies in the Poetry-managed virtual environment by running this command:

$ poetry install --extras=doc

Then, run these commands to build the docs:

$ poetry shell
$ make -C doc html

Finally, open the file doc/_build/html/index.html in your favorite web browser.