feat: overhead-aware ILP path selection (fixes #780) by zazabap · Pull Request #785 · CodingThrust/problem-reductions

zazabap · 2026-03-27T06:59:24Z

Summary

Replace MinimizeSteps with MinimizeStepsThenOverhead in best_path_to_ilp — when two paths have the same step count, the one producing smaller intermediate/final problems wins (e.g., HC→HP→ILP over HC→QAP→ILP)
Add source_size_fn to ReductionEntry (auto-generated by #[reduction] proc macro) so the ILP path selector can compute actual input sizes instead of using empty ProblemSize
Add ReductionGraph::compute_source_size() and evaluate_path_overhead() for overhead-aware path comparison

Context

PR #779 added HC→QAP, creating a second 2-step path to ILP alongside HC→HP→ILP. MinimizeSteps cannot distinguish them (both cost 2), so the winner depends on iteration order. The QAP path produces ~1,332 ILP variables vs ~144 for the HP path — a 9× increase that destabilizes the solver (#780).

	Old path (main)	New path (tier1b)
Route	HC → HP → ILP	HC → QAP → ILP
Steps	2	2
ILP vars (prism, n=6)	~144	~1,332

With this fix, the prism graph HC example works reliably again regardless of which reduction rules are registered.

Test plan

All existing tests pass (make check — 3647 tests)
New tests for MinimizeStepsThenOverhead, MinimizeOutputSize, ProblemSize::total()
New tests for compute_source_size and evaluate_path_overhead
model_specs_are_optimal passes with prism graph HC example
CI green

🤖 Generated with Claude Code

Replace MinimizeSteps with MinimizeStepsThenOverhead in ILP path selection. When two paths have the same step count, the one producing smaller intermediate/final problems wins (e.g., HC→HP→ILP over HC→QAP→ILP). Key changes: - Add source_size_fn to ReductionEntry for extracting source problem dimensions from &dyn Any instances - Add MinimizeStepsThenOverhead cost function (step count dominates, log(output_size) breaks ties) - Add MinimizeOutputSize cost function for pure overhead minimization - Add ReductionGraph::compute_source_size() and evaluate_path_overhead() - Update best_path_to_ilp to compute actual input sizes and compare paths by final ILP output size - Add ProblemSize::total() and Default derive Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

codecov · 2026-03-27T07:05:45Z

Codecov Report

❌ Patch coverage is 97.40260% with 4 lines in your changes missing coverage. Please review.
✅ Project coverage is 97.82%. Comparing base (772d001) to head (48f48f2).
⚠️ Report is 13 commits behind head on tier1b-batch-rules.

Files with missing lines	Patch %	Lines
src/unit_tests/rules/registry.rs	72.72%	3 Missing ⚠️
src/unit_tests/rules/graph.rs	98.41%	1 Missing ⚠️

Additional details and impacted files

@@                  Coverage Diff                   @@
##           tier1b-batch-rules     #785      +/-   ##
======================================================
+ Coverage               97.81%   97.82%   +0.01%     
======================================================
  Files                     601      601              
  Lines                   66527    66672     +145     
======================================================
+ Hits                    65073    65225     +152     
+ Misses                   1454     1447       -7

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

🚀 New features to boost your workflow:

❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.

isPANN · 2026-03-27T13:05:56Z

Agentic Review Report

Structural Check

Build: PASS (fmt-check, clippy, test all pass)
Blacklisted files: PASS

Semantic Review:

Component	Status	Notes
`source_size_fn` proc macro generation	OK	Correctly collects variables from overhead expressions, generates getter calls
`ReductionEntry::source_size_fn` field	OK	All existing manual entries updated
`ProblemSize::total()` + `Default`	OK	Sum of components; empty default returns 0
`MinimizeOutputSize` cost fn	OK	Correct use of `evaluate_output_size().total()`
`MinimizeStepsThenOverhead` cost fn	OK	Sound in practice despite misleading comment (see issues)
`compute_source_size()`	OK	Uses `catch_unwind` for type safety; `HashSet` dedup is functionally correct
`evaluate_path_overhead()`	OK	Sequential overhead composition with correct error handling
`best_path_to_ilp()`	OK	Core fix -- uses actual instance sizes for overhead-aware tiebreaking

Issue Compliance (#780):

Bug	Status
Bug 1: Non-deterministic path selection	FIXED -- overhead-aware tiebreaking produces deterministic results
Bug 2: QAP->ILP invalid permutation extraction	NOT ADDRESSED -- this PR avoids the problematic path via overhead preference, but does not fix the underlying QAP->ILP `extract_solution` bug

Test Coverage: 7 new tests covering cost functions, ProblemSize::total(), compute_source_size, and evaluate_path_overhead. Adequate for new features.

Quality Check

Design Principles: DRY OK, KISS OK, HC/LC OK -- responsibilities cleanly separated across cost.rs, graph.rs, solver.rs.

Issues Found:

Severity	Issue	Location
Important	Misleading comment: claims `ln(1+x)` is "normalized to [0, 1)" but it's unbounded. Code is correct (1e9 step weight dwarfs realistic values), but comment is wrong.	`src/rules/cost.rs:55-56`
Important	Double cost evaluation with inconsistent metrics: Dijkstra uses per-edge `STEP_WEIGHT + log(output)` sums, but cross-variant comparison uses end-to-end composed output size. Benign for short paths but could pick suboptimal paths for longer chains.	`src/solvers/ilp/solver.rs:256-271`
Important	`test_evaluate_path_overhead` only tests identity path (MIS->MVC where output~=input). Missing multi-step test where sizes actually change -- the assertion would pass even if the function were a no-op.	`src/unit_tests/rules/graph.rs:1614`
Minor	`compute_source_size` scans all registry entries linearly -- fine at current call frequency but consider documenting.	`src/rules/graph.rs:935`
Minor	Silent zero fallback for missing overhead fields in `Minimize::edge_cost` could mask spec bugs.	`src/rules/cost.rs:17`

Agentic Feature Tests

Test	Result	Details
Build CLI	PASS	Builds successfully with ILP features
`pred list`	PASS	116 types, 144 reductions, 140 variant nodes
`pred path HamiltonianCircuit ILP`	PASS	Correctly selects HC->HP->ILP (2-step) over HC->TSP->ILP; `--all` shows 4 total paths
Solve HC via ILP (prism graph)	PASS	Returns `Or(true)` with valid circuit `[0, 2, 1, 4, 5, 3]`
Solve HC via ILP (path graph, no HC)	PASS	Returns `Or(false)` correctly
`model_specs_are_optimal`	PASS	1 test, 0 failures
All ILP tests	PASS	484 passed, 0 failed -- no regressions
Cost function tests	PASS	41 passed including new `MinimizeStepsThenOverhead` tests

Verdict: All checklist items pass. No issues found in feature testing.

Generated by review-pipeline

Fix misleading comment, document two-level path selection strategy, and add multi-step test for evaluate_path_overhead. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

* feat: add 12 Tier 1b high-confidence reduction rules (#770) Implement 12 verified reduction rules from Garey & Johnson (30-80 lines each): - KSatisfiability(K3) → MinimumVertexCover (#197): truth-setting + clause triangles - Partition → SequencingWithinIntervals (#205): enforcer task gadget - MinimumVertexCover → MinimumFeedbackArcSet (#208): vertex-splitting with penalty arcs - KSatisfiability(K3) → KClique (#229): Karp's non-contradictory edge construction - HamiltonianCircuit → BiconnectivityAugmentation (#252): {1,2}-weighted potential edges - HamiltonianCircuit → StrongConnectivityAugmentation (#254): {1,2}-weighted potential arcs - HamiltonianCircuit → StackerCrane (#261): vertex-splitting with mandatory arcs - HamiltonianCircuit → RuralPostman (#262): vertex-splitting with required edges - Partition → ShortestWeightConstrainedPath (#360): +1 offset layered graph - MaximumIndependentSet → IntegralFlowBundles (#366): Sahni's flow-bundle construction - HamiltonianCircuit → QuadraticAssignment (#373): cycle cost + penalty distance matrices - HamiltonianPath → ConsecutiveOnesSubmatrix (#432): vertex-edge incidence matrix Each rule includes full test coverage (closed-loop, edge cases, extraction). Fixes #197, Fixes #205, Fixes #208, Fixes #229, Fixes #252, Fixes #254, Fixes #261, Fixes #262, Fixes #360, Fixes #366, Fixes #373, Fixes #432 Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * fix: address PR #779 review comments - MIS→IntFlowBundles: remove BruteForce::solve() from reduce_to(), set requirement=1 (any IS of size ≥ 1 gives a feasible flow) - Partition→SWCP: use checked_add for a_i+1 and weight_bound overflow - MVC→FAS: use checked_add for big_m overflow - HP→ConsecOnesSub: use .get() instead of indexing for Tucker fallback safety - Partition→SeqIntervals: fix odd-sum test (forward-only reduction is correct) - MIS→IFB tests: update all requirement assertions from optimal to 1 Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * ci: retrigger CI (flaky HC model spec test) * fix: use 4-cycle for HC model example (fixes flaky CI) The prism graph example produced a non-deterministic ILP solution on CI that failed HC validation. The 4-cycle has fewer valid permutations, making the ILP solution more predictable. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * Revert "fix: use 4-cycle for HC model example (fixes flaky CI)" This reverts commit 9038b53. * fix: use K4 for HC model example to avoid ILP solver non-determinism The prism graph (6 vertices) produced different ILP solutions on CI vs locally due to HiGHS version differences. The QAP→ILP reduction path (introduced by HC→QAP in this PR) sometimes extracted an invalid permutation on CI. K4 (complete graph on 4 vertices) makes every permutation a valid HC, eliminating solver non-determinism as a failure source. See #780 for the underlying QAP→ILP investigation. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * Revert "fix: use K4 for HC model example to avoid ILP solver non-determinism" This reverts commit 2366d66. * fix: disable HiGHS presolve to avoid incorrect MIP solutions HiGHS presolve has known bugs that can return suboptimal solutions for certain MIP instances (see ERGO-Code/HiGHS#2173, scipy/scipy#24141). On CI (Ubuntu 24.04), presolve deterministically returns obj=18 instead of the optimal obj=6 for the QAP→ILP formulation of HC on the prism graph. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * ci: add --locked to all cargo commands to enforce Cargo.lock versions CI was resolving good_lp 1.15.0 instead of lockfile's 1.14.2, potentially causing different solver behavior. Pin all dependencies via --locked flag. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * fix: pin good_lp to =1.14.2 and revert --locked CI flags CI resolved good_lp 1.15.0 (vs lockfile's 1.14.2) since Cargo.lock is gitignored. Pin the exact version in Cargo.toml instead. Revert --locked flags since Cargo.lock is not committed. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * update * feat: overhead-aware ILP path selection (fixes #780) (#785) * feat: overhead-aware ILP path selection (fixes #780) Replace MinimizeSteps with MinimizeStepsThenOverhead in ILP path selection. When two paths have the same step count, the one producing smaller intermediate/final problems wins (e.g., HC→HP→ILP over HC→QAP→ILP). Key changes: - Add source_size_fn to ReductionEntry for extracting source problem dimensions from &dyn Any instances - Add MinimizeStepsThenOverhead cost function (step count dominates, log(output_size) breaks ties) - Add MinimizeOutputSize cost function for pure overhead minimization - Add ReductionGraph::compute_source_size() and evaluate_path_overhead() - Update best_path_to_ilp to compute actual input sizes and compare paths by final ILP output size - Add ProblemSize::total() and Default derive Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * fix: address review findings for overhead-aware path selection Fix misleading comment, document two-level path selection strategy, and add multi-step test for evaluate_path_overhead. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> --------- Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com> Co-authored-by: Xiwei Pan <90967972+isPANN@users.noreply.github.com> Co-authored-by: Xiwei Pan <xiwei.pan@connect.hkust-gz.edu.cn> --------- Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com> Co-authored-by: Xiwei Pan <xiwei.pan@connect.hkust-gz.edu.cn> Co-authored-by: Xiwei Pan <90967972+isPANN@users.noreply.github.com>

Merge branch 'main' into overhead-aware-path-cost

8e71a50

isPANN added this to Good issues Mar 27, 2026

isPANN moved this to Review pool in Good issues Mar 27, 2026

isPANN moved this from Review pool to Under review in Good issues Mar 27, 2026

isPANN moved this from Under review to Final review in Good issues Mar 27, 2026

fix: address review findings for overhead-aware path selection

48f48f2

Fix misleading comment, document two-level path selection strategy, and add multi-step test for evaluate_path_overhead. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

isPANN changed the base branch from main to tier1b-batch-rules March 27, 2026 13:27

isPANN merged commit 333200e into tier1b-batch-rules Mar 27, 2026
3 checks passed

isPANN mentioned this pull request Mar 27, 2026

[Enhancement] ILP path selector over-prioritizes step count over final output size #788

Open

GiggleLiu moved this from Final review to Done in Good issues Mar 28, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: overhead-aware ILP path selection (fixes #780)#785

feat: overhead-aware ILP path selection (fixes #780)#785
isPANN merged 3 commits intotier1b-batch-rulesfrom
overhead-aware-path-cost

zazabap commented Mar 27, 2026 •

edited by isPANN

Loading

Uh oh!

codecov bot commented Mar 27, 2026 •

edited

Loading

Uh oh!

isPANN commented Mar 27, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

zazabap commented Mar 27, 2026 • edited by isPANN Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Context

Test plan

Uh oh!

codecov bot commented Mar 27, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

isPANN commented Mar 27, 2026

Agentic Review Report

Structural Check

Quality Check

Agentic Feature Tests

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

zazabap commented Mar 27, 2026 •

edited by isPANN

Loading

codecov bot commented Mar 27, 2026 •

edited

Loading