@openproblems-bio

Open Problems in Single-Cell Analysis

Define and benchmark challenges in single-cell data science via open standards and continuous leaderboards to promote method development and guide users in method selection. This organization hosts the code, tasks, datasets, and docs behind the Open Problems platform.


What lives in this org

  • Core platform - openproblems: the living, extensible, community‑guided benchmarking framework.
  • Common datasets - datasets: workflows for managing and processing common datasets.
  • Benchmarks as repositories: task‑specific repos named task_* (e.g., task_batch_integration, task_label_projection, task_spatially_variable_genes).
  • Task template - task_template: scaffolding to start a new benchmark with the correct structure.
  • Shared libraries & images - core: helper R/Python packages and base Docker images used across tasks.

How the platform works (in short)

More detailed documentation below.

  • Tasks define an API, reference datasets, and quantitative metrics.
  • Methods implement that task API.
  • Continuous evaluation runs standardized workflows to score methods and update leaderboards.
  • Reproducibility is enforced via containers and declarative workflows.
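
The relationship between tasks, methods, and metrics can be sketched in a few lines of plain Python. This is a minimal illustration only, not the Open Problems API: the names Dataset, majority_vote, nearest_neighbor, accuracy, and evaluate are all hypothetical, and the real platform exchanges data through AnnData files and runs components through Viash and Nextflow rather than in-process function calls.

```python
from collections import Counter
from dataclasses import dataclass
from typing import Callable, Dict, List

Features = List[List[float]]
Labels = List[str]


@dataclass
class Dataset:
    """A toy reference dataset (hypothetical structure, for illustration only)."""
    name: str
    train_features: Features
    train_labels: Labels
    test_features: Features
    test_labels: Labels  # ground truth, read only by metrics


# The "task API" in this sketch: a method maps a dataset to predicted test
# labels, and a metric maps (truth, predicted) to a single score.
Method = Callable[[Dataset], Labels]
Metric = Callable[[Labels, Labels], float]


def majority_vote(dataset: Dataset) -> Labels:
    """Baseline method: predict the most common training label everywhere."""
    most_common_label, _ = Counter(dataset.train_labels).most_common(1)[0]
    return [most_common_label] * len(dataset.test_features)


def nearest_neighbor(dataset: Dataset) -> Labels:
    """Method: copy the label of the closest training cell."""
    def squared_distance(a: List[float], b: List[float]) -> float:
        return sum((x - y) ** 2 for x, y in zip(a, b))

    predictions = []
    for cell in dataset.test_features:
        distances = [squared_distance(cell, ref) for ref in dataset.train_features]
        closest = distances.index(min(distances))
        predictions.append(dataset.train_labels[closest])
    return predictions


def accuracy(truth: Labels, predicted: Labels) -> float:
    """Metric: fraction of test cells with a correct label."""
    return sum(t == p for t, p in zip(truth, predicted)) / len(truth)


def evaluate(
    datasets: List[Dataset],
    methods: Dict[str, Method],
    metrics: Dict[str, Metric],
) -> List[dict]:
    """Score every method on every dataset with every metric."""
    rows = []
    for dataset in datasets:
        for method_name, method in methods.items():
            predicted = method(dataset)
            for metric_name, metric in metrics.items():
                rows.append({
                    "dataset": dataset.name,
                    "method": method_name,
                    "metric": metric_name,
                    "score": metric(dataset.test_labels, predicted),
                })
    return rows


# Made-up data: one feature per cell, two cell types.
toy = Dataset(
    name="toy",
    train_features=[[0.0], [0.1], [1.0]],
    train_labels=["T cell", "T cell", "B cell"],
    test_features=[[0.05], [0.9]],
    test_labels=["T cell", "B cell"],
)

results = evaluate(
    [toy],
    {"majority_vote": majority_vote, "nearest_neighbor": nearest_neighbor},
    {"accuracy": accuracy},
)
for row in results:
    print(row)
```

Because every method shares one signature, adding a method in this sketch means adding one entry to the methods dictionary; the evaluation loop and the metrics stay unchanged. On this toy dataset the nearest-neighbor method scores higher than the majority-vote baseline.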

Tech stack highlights:

  • Nextflow for portable workflows.
  • Viash for modular components.
  • AnnData for standardized I/O.
  • GitHub Actions for CI.
  • Nextflow Tower for deployment, with cloud backends (e.g., AWS Batch/S3).

See the Technology stack for details.

Quick start

  1. Explore current leaderboards. Check out live tasks and results on the Benchmarks page.

  2. Run a benchmark locally. Read the platform Documentation for install requirements and common commands. Components are containerized, and workflows run on a laptop, an HPC cluster, or the cloud.

  3. Add your method to a task. Follow the docs ("Create component → Add a method") and open a PR in the corresponding task_* repo. See repo READMEs for task‑specific APIs.

  4. Propose or start a new task. Start from task_template and the docs ("Create a new task"). Open an issue to coordinate scope and maintainership.

  5. Join the community


Governance & community expectations

  • Governance: Open, consensus‑seeking model with defined roles (Core team, Task teams, Infrastructure, etc.). Read the current Governance.
  • Code of Conduct: We follow the Contributor Covenant. Report issues to community@openproblems.bio. See the full Code of Conduct.

Citation

If you use Open Problems, please cite:

Luecken, M.D., Gigante, S., Burkhardt, D.B. et al. Defining and benchmarking open problems in single‑cell analysis. Nature Biotechnology (2025). https://doi.org/10.1038/s41587-025-02694-w

To reference specific tasks or datasets, please cite the corresponding task or dataset publications mentioned in the task descriptions and dataset pages. For example, to reference our Open Problems multimodal BMMC datasets, please cite Luecken et al., NeurIPS 2021.

Also see earlier NeurIPS challenge reports and proceedings referenced on the Events page.


Licensing

Open Problems is free and open-source software distributed under the MIT License. However, Open Problems tasks may include references to data or code distributed by a third party under a different license. If any question about licensing arises, please consult the specific repository in which the asset is hosted. If further clarification is needed, please open a GitHub Issue referencing the asset.

  • Code in this org is MIT unless stated otherwise in the repo.
  • The website repo uses mixed licensing: Markdown/JSON content under CC‑BY‑4.0 and code under MIT (see that repo's LICENSE files).
  • Datasets retain their original source licenses/terms; check dataset pages before downstream use.

Acknowledgments

Open Problems is supported by a growing community and sponsors including the Chan Zuckerberg Initiative, Data Intuitive, Helmholtz Munich, Saturn Cloud, and Seqera. See the website for the latest list.


Development notes

  • Primary languages: Python and R (task repos may include Bash/Nextflow/TeX for workflows and docs).
  • CI builds and unit tests run via GitHub Actions; component images are maintained centrally in core.

Maintainers & contact
