Contributing¶

Stargazer has multiple contributor shapes — researchers writing notebooks, agent users driving the MCP server, code authors adding tasks, image maintainers publishing releases. Notebook authoring, snapshot freezing, and the first step of task promotion all happen in the hosted notebook (see Notebooks → User Archetypes); this guide covers working on the source itself — the SDK, the app tier, and the images — natively against the repo.

Setup¶

git clone https://github.com/StargazerBio/stargazer.git
cd stargazer
mamba install -y -c bioconda -c conda-forge bwa bwa-mem2 samtools gatk4
uv sync --group dev

You now have:

The stargazer package installed in editable mode in a project venv
All Python deps from the lockfile, plus the dev group (pytest, ruff, pre-commit)
Bioconda CLIs on PATH (only needed if you'll run gatk_env / general tasks locally)

If you don't have mamba/conda on your host, install miniforge first. The bioconda step is skippable if you only intend to work on scrna tasks (pure Python) or the MCP server.

Running on the Devbox¶

Use flyte start devbox to spin up a local Flyte cluster for development. See the official devbox docs for setup instructions.

This environment is much closer to production and lets you actually test your task and app environments.

Running Tests¶

pytest tests/

Tests run with no PINATA_JWT. The harness points LocalStorageClient at a temporary directory and never mutates client internals. Tests that need Pinata behavior mock the API or skip.

pytest tests/unit/
pytest tests/integration/

Code Style¶

ruff --fix .

Pre-commit enforces ruff formatting and docstr-coverage (100% module-level docstrings required).

Building Images¶

Image rebuilds are only needed when you change config.py (a Flyte task tool/env) or the Dockerfile (a system tool in the human-runnable note/chat images). Routine code work doesn't need this — uv add covers Python deps via the lockfile, and contributors pulling your branch pick the change up automatically on their next uv sync. See Configuration → Container Images for the split between Flyte task images and human-runnable images.

All builds below stay local — nothing is pushed to a registry, so you don't need docker login or write access to ghcr.io/stargazerbio. CI will handle publishing on merge to main.

Flyte task images (stargazer-scrna, stargazer-gatk):

stargazer-build-images   # builds scrna and gatk into the local docker cache

image.builder: local in .flyte/config.yaml is the default (needs a working Docker daemon). The Flyte images have no registry= set in config.py, so the docker builder uses --load and the results land in docker images rather than being pushed. For Union backends you can flip to image.builder: remote and the build runs on the cluster instead.

Human-runnable images (stargazer-note, stargazer-chat):

docker build --target note -t ghcr.io/stargazerbio/stargazer-note:latest .
docker build --target chat -t ghcr.io/stargazerbio/stargazer-chat:latest .

Tag both with the published ghcr.io/stargazerbio/... URL even though you're not pushing — docker run resolves them from the local cache by that name. (The hosted notebook pods use a different image, notebook-app, built by the admin deploy entrypoint — see App → Images.) The shared base stage (bioconda CLIs + uv + project venv) is reused between targets, so the second docker build is mostly cache hits.