Calibration Benchmarks: Uncertainty And Less Data#
Note
This page and its static assets are auto-generated by python -m tools.doc_gallery. The Sphinx build only reads committed PNG and JSON artifacts.
These calibration benchmarks deliberately reduce information content by using fewer observations and adding uncertainty, so the methods are tested on weaker inverse constraints.
See also
Open the calibration benchmark category for the case-by-case pages and their configuration figures.
Coverage#
Cases compared: 1
Method rows: 6
Timing now separates candidate runtime from calibration-method overhead, and further splits actualize, launcher preparation, runtime patch, simulation, output selection, and objective scoring.
Summary Figures#





Linked Cases#
Calibration Twin: Recharge-Step Flux-Only K+Sy 1D
Transient modflow6 twin calibration benchmark with K_global, Sy_global.
Method Rows#
Case |
Method |
Metric |
Target |
Cost |
Eval |
Calibration (s) |
Candidate runtime (s) |
Algorithm overhead (s) |
Actualize (s) |
Launcher prep (s) |
Patch (s) |
Sim (s) |
Output select (s) |
Objective score (s) |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
random_search |
best_fit_or_distribution |
1 |
0.410218 |
24 |
31.46 s |
31.1 s |
0.3578 s |
0 s |
0 s |
0 s |
1.296 s |
0 s |
0 s |
|
optuna |
best_fit_or_distribution |
1 |
0.407764 |
48 |
105.2 s |
103.3 s |
1.859 s |
0 s |
0 s |
0 s |
2.152 s |
0 s |
0 s |
|
cma_es |
best_fit |
0 |
0.407793 |
56 |
125.9 s |
124.3 s |
1.554 s |
0 s |
0 s |
0 s |
2.221 s |
0 s |
0 s |
|
scipy_nelder_mead |
best_fit |
0 |
0.407768 |
16 |
26.37 s |
26.04 s |
0.331 s |
0 s |
0 s |
0 s |
1.627 s |
0 s |
0 s |
|
gp_mapping |
best_fit_or_distribution |
1 |
0.415671 |
20 |
56.85 s |
37.88 s |
18.96 s |
0 s |
0 s |
0 s |
1.894 s |
0 s |
0 s |
|
da_mh_gp |
best_fit_or_distribution |
1 |
0.649032 |
12 |
23.3 s |
22.74 s |
0.5585 s |
0 s |
0 s |
0 s |
1.895 s |
0 s |
0 s |
Artifacts#
docs/source/_static/capability_gallery/calibration/intercomparison/calibration_uncertain_less_data/calibration_intercomparison_summary.jsondocs/source/_static/capability_gallery/calibration/intercomparison/calibration_uncertain_less_data/benchmark_target_success_rates.pngdocs/source/_static/capability_gallery/calibration/intercomparison/calibration_uncertain_less_data/benchmark_cost_vs_budget.pngdocs/source/_static/capability_gallery/calibration/intercomparison/calibration_uncertain_less_data/benchmark_time_vs_cost.pngdocs/source/_static/capability_gallery/calibration/intercomparison/calibration_uncertain_less_data/benchmark_calibration_time_closure.pngdocs/source/_static/capability_gallery/calibration/intercomparison/calibration_uncertain_less_data/benchmark_candidate_timing_breakdown.png