PanODE-Topic

Topic-flow-matching models for interpretable single-cell representation learning.

The repository currently focuses on the Topic-FM family:

Topic-FM-Base
Topic-FM-Transformer
Topic-FM-Contrastive
Topic-FM-GAT (optional, when graph dependencies are installed)

Matched baselines are included through the Pure-VAE family.

Repository Layout Highlights

This intentionally non-exhaustive overview lists the main public-facing areas and omits local data paths, generated outputs, and registry-specific details.

PanODE-Topic/
├── models/                   # Topic-FM, Topic, and Pure-VAE implementations
├── benchmarks/               # Training, evaluation, and validation workflows
├── eval_lib/                 # Baseline wrappers and evaluation utilities
├── article/                  # Manuscript-facing materials
├── experiments/              # External comparison and analysis workflows
├── refined_figures/          # Figure refinement utilities
├── scripts/                  # Maintenance and workflow helpers
├── utils/                    # Shared training, data, and visualization helpers
├── src/                      # Visualization helpers
└── vcd/                      # Visual consistency diagnostics

Model Families

Model	Encoder	Prior	Flow Matching
`Topic-FM-Base`	MLP	Dirichlet / logistic-normal	Yes
`Topic-FM-Transformer`	Self-attention	Dirichlet / logistic-normal	Yes
`Topic-FM-Contrastive`	MLP + MoCo	Dirichlet / logistic-normal	Yes
`Topic-FM-GAT`	GAT over kNN	Dirichlet / logistic-normal	Yes
`Pure-VAE`	MLP	Gaussian	No
`Pure-Transformer-VAE`	Self-attention	Gaussian	No
`Pure-Contrastive-VAE`	MLP + MoCo	Gaussian	No

Default Settings

Parameter	Value
Learning rate	`1e-3`
Batch size	`128`
Topics / latent dim	`10`
Epochs	`1000`
KL weight	`0.01` for Topic-FM
Flow warmup	`50` epochs
Flow weight	`0.1`
HVGs	`3000`
Max cells	`3000`

Running

python benchmarks/runners/benchmark_base.py --series topic --data-path <dataset-path>
python benchmarks/runners/benchmark_base.py --models Topic-FM-Transformer Pure-VAE --data-path <dataset-path>
python benchmarks/runners/benchmark_crossdata.py --datasets <dataset-key> <dataset-key>

Notes

For base or single-dataset runs, pass a local dataset with --data-path.
For cross-dataset runs, --datasets selects configured registry keys; align your local registry/config before running those workflows.
README examples use placeholders for local data and registry-specific values.
The maintained benchmark registry targets the Topic-FM and Pure-VAE families.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

PanODE-Topic

Repository Layout Highlights

Model Families

Default Settings

Running

Notes

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 14 Commits
benchmarks		benchmarks
eval_lib		eval_lib
experiments		experiments
model-arch-viewer		model-arch-viewer
models		models
refined_figures		refined_figures
scripts		scripts
src		src
utils		utils
vcd		vcd
.gitignore		.gitignore
AUDIT_REPORT.md		AUDIT_REPORT.md
README.md		README.md
merge_external_results.py		merge_external_results.py

Folders and files

Latest commit

History

Repository files navigation

PanODE-Topic

Repository Layout Highlights

Model Families

Default Settings

Running

Notes

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages