Fix #401: [Model] ComparativeContainment by GiggleLiu · Pull Request #662 · CodingThrust/problem-reductions

GiggleLiu · 2026-03-16T10:04:51Z

Summary

add an implementation plan for the ComparativeContainment model
prepare execution for issue [Model] ComparativeContainment #401

Fixes #401

codecov · 2026-03-16T10:07:15Z

Codecov Report

✅ All modified and coverable lines are covered by tests.
✅ Project coverage is 97.11%. Comparing base (302d54d) to head (171b96b).
⚠️ Report is 1 commits behind head on main.

Additional details and impacted files

@@            Coverage Diff             @@
##             main     #662      +/-   ##
==========================================
+ Coverage   97.09%   97.11%   +0.02%     
==========================================
  Files         292      294       +2     
  Lines       38984    39220     +236     
==========================================
+ Hits        37851    38088     +237     
+ Misses       1133     1132       -1

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

🚀 New features to boost your workflow:

❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.

GiggleLiu · 2026-03-16T10:42:37Z

Implementation Summary

Changes

Added the new ComparativeContainment set model with One, i32, and f64 variants, schema registration, example-db coverage, library exports, and dedicated unit tests.
Added pred create ComparativeContainment support, including --r-sets / --s-sets, weight parsing for i32 and f64, /One validation, no-flag help output, and CLI regression tests.
Added the Typst paper entry and figure, then regenerated the checked-in schema, reduction-graph, and example-fixture exports.

Deviations from Plan

During review, I fixed two issues discovered after the initial implementation: the /f64 CLI path was still parsing weights as integers, and the schema-driven help path advertised --universe-size instead of the actual --universe flag.
I also rewrote one sentence in the paper entry to avoid implying reduction edges that are not yet present in the exported catalog.

Open Questions

This PR follows the existing weighted-model convention and does not add explicit positivity validation for non-unit weights. If maintainers want stricter enforcement for ComparativeContainment, that can be added consistently in a follow-up.

Copilot

Pull request overview

Adds a new ComparativeContainment set-based satisfaction problem to the core library and wires it through the CLI, example DB, and generated documentation so it can be created/serialized consistently like existing models.

Changes:

Introduces the ComparativeContainment<W> model (schema registration, variants, example-db spec, and unit tests).
Extends pred create with new flags (--r-sets/--s-sets/--r-weights/--s-weights) and help formatting tweaks.
Regenerates example fixtures and docs artifacts (schemas + reduction graph) and adds a paper entry.

Reviewed changes

Copilot reviewed 13 out of 13 changed files in this pull request and generated 2 comments.

Show a summary per file

File	Description
src/unit_tests/trait_consistency.rs	Adds the new model to the trait-consistency smoke test suite.
src/unit_tests/models/set/comparative_containment.rs	New unit tests covering creation, evaluation, brute-force solving, and serialization.
src/models/set/mod.rs	Registers and re-exports the new set model; adds example-db specs hook.
src/models/set/comparative_containment.rs	Implements the ComparativeContainment model, schema entry, variants, and example-db spec.
src/models/mod.rs	Re-exports `ComparativeContainment` at the models module level.
src/lib.rs	Re-exports `ComparativeContainment` via the crate prelude.
src/example_db/fixtures/examples.json	Adds a canonical model example and updates regenerated fixtures.
problemreductions-cli/tests/cli_tests.rs	Adds CLI tests for creating ComparativeContainment instances across weight variants.
problemreductions-cli/src/commands/create.rs	Adds ComparativeContainment creation logic; generalizes set/weight parsing helpers; improves help flag naming.
problemreductions-cli/src/cli.rs	Adds new `create` flags for ComparativeContainment and updates CLI help banner.
docs/src/reductions/reduction_graph.json	Regenerates reduction graph nodes/indices including ComparativeContainment variants.
docs/src/reductions/problem_schemas.json	Regenerates exported schema list to include ComparativeContainment.
docs/paper/reductions.typ	Adds Comparative Containment entry to the paper (definition + example).

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

src/models/set/comparative_containment.rs

+    /// Create a new instance with explicit weights.
+    pub fn with_weights(
+        universe_size: usize,
+        r_sets: Vec<Vec<usize>>,
+        s_sets: Vec<Vec<usize>>,
+        r_weights: Vec<W>,
+        s_weights: Vec<W>,
+    ) -> Self {
+        assert_eq!(
+            r_sets.len(),
+            r_weights.len(),
+            "number of R sets and R weights must match"
+        );
+        assert_eq!(
+            s_sets.len(),
+            s_weights.len(),
+            "number of S sets and S weights must match"
+        );
+        validate_set_family("R", universe_size, &r_sets);
+        validate_set_family("S", universe_size, &s_sets);


src/models/set/comparative_containment.rs

+        self.valid_config(config)
+            && config
+                .iter()
+                .enumerate()
+                .all(|(element, &selected)| selected == 0 || set.contains(&element))


…01-comparative-containment # Conflicts: # docs/paper/reductions.typ # docs/src/reductions/reduction_graph.json # problemreductions-cli/src/commands/create.rs # problemreductions-cli/tests/cli_tests.rs # src/lib.rs # src/models/mod.rs # src/unit_tests/trait_consistency.rs

…01-comparative-containment # Conflicts: # docs/src/reductions/problem_schemas.json # docs/src/reductions/reduction_graph.json # problemreductions-cli/src/cli.rs # problemreductions-cli/src/commands/create.rs # src/example_db/fixtures/examples.json # src/unit_tests/trait_consistency.rs

GiggleLiu · 2026-03-17T15:45:17Z

Agentic Review Report

Structural Check

Structural Review: model ComparativeContainment

Structural Completeness

#	Check	Status
1	Model file exists (`src/models/set/comparative_containment.rs`)	PASS
2	`inventory::submit!` present	PASS
3	`#[derive(...Serialize, Deserialize)]` on struct	PASS
4	`Problem` trait impl	PASS
5	`SatisfactionProblem` impl	PASS
6	`#[cfg(test)]` + `#[path = "..."]` test link	PASS
7	Test file exists (`src/unit_tests/models/set/comparative_containment.rs`)	PASS
8	Test file has >= 3 test functions	PASS — 13 test functions found
9	Registered in `set/mod.rs`	PASS
10	Re-exported in `models/mod.rs`	PASS
11	`declare_variants!` entry exists	PASS — 3 variants (One, i32/default, f64), all marked `sat`
12	CLI `resolve_alias` entry	PASS — uses catalog-based resolution via `inventory::submit!`
13	CLI `create` support	PASS — full parsing for all 3 weight variants
14	Canonical model example registered	PASS — via `canonical_model_example_specs()` in model file, chained through `set/mod.rs` into `model_builders.rs`; fixture present in `examples.json`
15	Paper `display-name` entry	PASS — `"ComparativeContainment": [Comparative Containment]`
16	Paper `problem-def` block	PASS — `#problem-def("ComparativeContainment")` with full example, data-driven diagram, and satisfying-solution enumeration

Build Status

make test: PASS — all tests passed, 0 failures
make clippy: PASS — no warnings with -D warnings

Semantic Review

evaluate() correctness: OK — For each candidate subset Y (encoded as a binary config), it computes R-weight sum and S-weight sum by iterating over sets and checking containment. Returns true if R-total >= S-total. Empty set correctly treated as contained in every set. Manual verification confirms correctness.
dims() correctness: OK — Returns vec![2; self.universe_size], matching the binary configuration space.
Size getter consistency: OK — universe_size(), num_r_sets(), num_s_sets() provided as inherent methods. Complexity string "2^universe_size" references valid getter.
Weight handling: OK — Weights managed via inherent methods. Weight validation uses Any downcasting for i32 and f64.
Variant configuration: OK — variant_params![W] with single weight dimension.

Issue Compliance

#	Check	Status
1	Problem name matches issue	OK
2	Mathematical definition matches	OK
3	Problem type (opt/sat) matches	OK — Satisfaction, `Metric = bool`
4	Type parameters match	OK — `W: WeightElement`
5	Configuration space matches	OK — `vec![2; universe_size]`
6	Feasibility check matches	OK
7	Objective function matches	OK — `r_total >= s_total`
8	Complexity matches	OK — `"2^universe_size"`

Summary

16/16 structural checks passed
8/8 issue compliance checks passed
0 FAIL/ISSUE items found
Note: merge conflict with main in problemreductions-cli/src/commands/create.rs (merge logistics, not structural deficiency)

Quality Check

Design Principles

DRY: ISSUE — is_valid_solution() (line 152-157) and evaluate() (line 190-195) in src/models/set/comparative_containment.rs contain identical logic: both pattern-match on (r_weight_sum, s_weight_sum) and return r_total >= s_total. The is_valid_solution method should delegate to the shared weight comparison (or just call evaluate if the trait bound can be satisfied). If the different trait bounds make delegation impractical, at least extract the shared comparison into a private method that both call.
DRY: OK — The validate_set_family and validate_weight_family helper functions are well-factored and reused for both R and S families. The sum_containing_weights helper avoids duplicating the weight-accumulation loop. CLI helper functions properly generalize existing patterns.
KISS: ISSUE — The validate_weight_family function (line 224-239) uses dyn Any downcasting to validate weight positivity at runtime. This approach is brittle — adding a new weight type would silently skip validation. A trait-based approach would be more idiomatic. That said, the current weight type set (One, i32, f64) is closed, so it works correctly even if inelegant.
KISS: OK — Overall model structure is straightforward. Clean separation of concerns.
HC/LC: OK — Each function has a single responsibility. No god functions.

HCI (CLI/MCP changed)

Error messages: OK — Actionable error messages with usage examples.
Discoverability: OK — --help table documents required flags.
Consistency: OK — --r-sets/--s-sets/--r-weights/--s-weights naming follows model conventions.
Least surprise: OK — Default weights are all-ones when flags are omitted.
Feedback: OK — --universe flag naming tested to avoid confusion.

Test Quality

Naive test detection: ISSUE — r_weight_sum() and s_weight_sum() public methods (lines 142-149) are never directly tested. Only exercised indirectly through evaluate() and BruteForce::find_all_satisfying().
Validation coverage gap: ISSUE — Only the R-family side of i32 weight validation is tested (test_comparative_containment_rejects_nonpositive_i32_weights). No #[should_panic] test for S-family with nonpositive i32 weights, nor for S-family count mismatch.
Assert count: OK — Tests have sufficient assertions across both yes/no instances.
Adversarial cases: OK — Tests include invalid configs, no-instance, nonpositive weights, non-finite weights, out-of-range elements. Paper example verifies exact solution count (3).
Instance sizes: OK — yes_instance uses universe_size=4 with 2 R-sets and 2 S-sets (16 configurations to enumerate).

Issues

Important (Should Fix):

DRY: is_valid_solution duplicates evaluate — src/models/set/comparative_containment.rs:152-157 and :190-195 contain identical logic. Extract shared comparison into a private method or delegate.
KISS: validate_weight_family uses dyn Any downcasting — src/models/set/comparative_containment.rs:224-239. Brittle pattern; adding a new weight type silently skips validation. Consider trait-based approach or at minimum a documenting comment.
Test gap: r_weight_sum/s_weight_sum not directly tested — src/unit_tests/models/set/comparative_containment.rs. Direct value assertions would catch bugs masked by the comparison in evaluate.
Test gap: S-family i32 validation and S-family count mismatch not tested — Missing #[should_panic] tests for nonpositive i32 weight on S side and mismatched S-set/S-weight count.

Minor (Nice to Have):

validate_weight_family silently passes for unknown types — Not a current bug but could become one.
Redundant use std::any::Any import — Only needed for the downcasting pattern.
No test for empty set families — Edge case: r_sets = vec![] or s_sets = vec![] not tested.

Agentic Feature Tests

Summary

Overall verdict: PASS — no issues found. All CLI workflows function correctly from a downstream user's perspective.

Tests Performed

#	Test	Result
1	Discoverability (`pred list`)	PASS — All 3 variants listed, i32 marked default, correct complexity
2	Problem Information (`pred show`)	PASS — 5 fields with correct types, 0 reductions (expected for new model)
3	Example Creation (`pred create --example`)	PASS — Canonical example produces valid JSON
4	Custom Instance Creation (`pred create ...`)	PASS — All weight variants, default weights, error handling verified
5	Solving (`pred solve`)	PASS — Correct solutions for yes/no instances across all weight variants
6	Reductions (`pred reduce`)	N/A — No reductions exist (expected for new isolated model)
7	Serialization Round-Trip	PASS — JSON output from `pred create` correctly consumed by `pred solve`
8	Test Suite	PASS — 13 unit tests + 6 CLI integration tests pass
9	Paper Entry	PASS — Complete `problem-def` with example, diagram, and solution enumeration

Semantic Correctness Verification

Manually verified evaluate() semantics:

"Containing" relation correctly implemented: every selected element of Y must appear in set S
Empty set Y correctly handled (contained in every set)
Weight sums and >= comparison correct
Canonical example [1,0,0,0] (Y={0}) verified: R-sum=7 >= S-sum=3
No-instance verified: all 4 configs checked, none satisfying

Issues Found

None. The implementation is complete and correct from a user perspective.

Generated by review-pipeline

- evaluate() now delegates to is_valid_solution() instead of duplicating the r_weight_sum >= s_weight_sum comparison - Add missing #[should_panic] test for S-family i32 nonpositive weights Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

GiggleLiu · 2026-03-17T16:58:35Z

Follow-up items (recorded during final review):

validate_weight_family in src/models/set/comparative_containment.rs uses dyn Any downcasting for weight validation. If a new weight type is added to WeightElement, validation silently passes. Consider adding trait-based weight validation (e.g., fn validate_positive(&self) on WeightElement) as a cross-cutting improvement across all models that validate weights.

…e coverage - Replace `validate_weight_family` dyn Any downcasting with WeightElement trait: use `weight.to_sum() > zero` via `partial_cmp`, which correctly rejects non-positive and NaN values for all weight types - Remove `use std::any::Any` import - Tighten impl block bound from `W: 'static` to `W: WeightElement` - Add direct r_weight_sum/s_weight_sum test with value assertions - Add S-family weight count mismatch should_panic test - Update should_panic messages to match unified validation error format Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

…elper The empty_args() helper in create.rs tests constructs CreateArgs manually and was missing the r_sets/s_sets/r_weights/s_weights fields added by this PR, causing a compilation error in CI coverage builds. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

Add plan for #401: [Model] ComparativeContainment

60d8fd1

GiggleLiu added 2 commits March 16, 2026 18:41

Implement #401: [Model] ComparativeContainment

06d08e9

chore: remove plan file after implementation

3876914

GiggleLiu requested a review from Copilot March 16, 2026 10:43

Copilot started reviewing on behalf of GiggleLiu March 16, 2026 10:43 View session

Copilot AI reviewed Mar 16, 2026

View reviewed changes

GiggleLiu added 4 commits March 16, 2026 21:41

fix: address ComparativeContainment review comments

2147e31

fix: validate ComparativeContainment CLI inputs

bb69a45

GiggleLiu and others added 2 commits March 18, 2026 00:28

Merge origin/main into issue-401-comparative-containment

d6bac5f

GiggleLiu and others added 2 commits March 18, 2026 01:20

GiggleLiu merged commit 9640b88 into main Mar 17, 2026
5 checks passed

GiggleLiu deleted the issue-401-comparative-containment branch March 17, 2026 17:36

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix #401: [Model] ComparativeContainment#662

Fix #401: [Model] ComparativeContainment#662
GiggleLiu merged 11 commits intomainfrom
issue-401-comparative-containment

GiggleLiu commented Mar 16, 2026

Uh oh!

codecov bot commented Mar 16, 2026 •

edited

Loading

Uh oh!

GiggleLiu commented Mar 16, 2026

Uh oh!

Copilot AI left a comment

Uh oh!

GiggleLiu commented Mar 17, 2026

Uh oh!

GiggleLiu commented Mar 17, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

GiggleLiu commented Mar 16, 2026

Summary

Uh oh!

codecov bot commented Mar 16, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

GiggleLiu commented Mar 16, 2026

Implementation Summary

Changes

Deviations from Plan

Open Questions

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

GiggleLiu commented Mar 17, 2026

Agentic Review Report

Structural Check

Structural Review: model ComparativeContainment

Structural Completeness

Build Status

Semantic Review

Issue Compliance

Summary

Quality Check

Design Principles

HCI (CLI/MCP changed)

Test Quality

Issues

Agentic Feature Tests

Summary

Tests Performed

Semantic Correctness Verification

Issues Found

Uh oh!

GiggleLiu commented Mar 17, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

codecov bot commented Mar 16, 2026 •

edited

Loading