Unify the model parameter validation code introduced in PR 2092 with the DataValidator classes by GernotMaier · Pull Request #2107 · gammasim/simtools

GernotMaier · 2026-04-02T08:26:09Z

This PR should provide exactly the same functionality as PR #2092, but with model parameter validation entirely moved to the data_validator module. This should ensure the model parameters are validated with the same code - independently if it is a parameter dict from file, from DB, or provided through the overwrite model parameter mechanism.

The handling of values without units was inconsistent ("", None, dimensionless) and I tried to fix this - introduced a single function with the logic in value_conversion.normalize_dimensionless_unit.

Copilot

Pull request overview

This PR refactors model-parameter validation so that the same DataValidator-based logic is used regardless of whether parameters originate from files, the DB, or the overwrite mechanism—aiming to preserve PR #2092 behavior while consolidating validation in simtools.data_model.

Changes:

Moved/centralized overwrite-time model-parameter validation to DataValidator.validate_model_parameter() and added schema helpers to fetch per-parameter type/unit by schema version.
Standardized dimensionless-unit handling via value_conversion.is_dimensionless_unit() / normalize_dimensionless_unit() and updated unit-conversion behavior/tests accordingly.
Expanded unit tests for heterogeneous-list validation and schema helper behavior; updated mock DB parameter fixtures to match schema expectations.

Reviewed changes

Copilot reviewed 13 out of 13 changed files in this pull request and generated 3 comments.

Show a summary per file

File	Description
`src/simtools/model/model_parameter.py`	Routes overwrite validation through `DataValidator` and resolves schema versions/types/units from `data_model.schema`.
`src/simtools/data_model/validate_data.py`	Adds heterogeneous-list validation flow and adjusts unit conversion outputs (now returning unit strings).
`src/simtools/data_model/schema.py`	Adds helpers to load per-parameter schemas and extract type/unit with normalization.
`src/simtools/utils/value_conversion.py`	Introduces shared helpers to detect/normalize dimensionless unit markers.
`src/simtools/simtel/simtel_config_reader.py`	Uses the new dimensionless-unit helper when applying schema units.
`src/simtools/db/db_handler.py`	Normalizes dimensionless units to `None` when inserting parameters.
`src/simtools/data_model/model_data_writer.py`	Normalizes dimensionless units for model-parameter outputs and before writing.
`tests/unit_tests/*`	Updates expectations for new validation/error behavior; adds coverage for new helpers and heterogeneous lists.
`tests/resources/mock_db/mock_parameters.json`	Updates mock parameter units/values to align with schema-driven validation (but currently contains an inconsistency; see comments).

tests/resources/mock_db/mock_parameters.json

src/simtools/model/model_parameter.py

src/simtools/data_model/validate_data.py

orelgueta

Thanks for this!

Minor comments, but at least the num_gains tests should be looked at, unless I really misunderstood something.

src/simtools/data_model/model_data_writer.py

src/simtools/data_model/schema.py

orelgueta · 2026-04-02T13:21:25Z

src/simtools/data_model/schema.py

+def get_parameter_type_from_schema(par_name, schema_version):
+    """
+    Get parameter type from schema file for a specific schema version.
+
+    Parameters
+    ----------
+    par_name: str
+        Name of the parameter.
+    schema_version: str
+        Schema version to look up.
+
+    Returns
+    -------
+    str or list
+        Type of the parameter (string for simple types, list for heterogeneous types).
+    """
+    return _get_parameter_attribute_from_schema(par_name, schema_version, "type")
+
+
+def get_parameter_unit_from_schema(par_name, schema_version):
+    """
+    Get parameter unit from schema file for a specific schema version.
+
+    Parameters
+    ----------
+    par_name: str
+        Name of the parameter.
+    schema_version: str
+        Schema version to look up.
+
+    Returns
+    -------
+    str or list or None
+        Unit of the parameter (string for simple types, list for heterogeneous types,
+        None for dimensionless parameters).
+    """
+    return _get_parameter_attribute_from_schema(par_name, schema_version, "unit")


These are quite trivial functions with a lot of documentation just to avoid opening the API to using the private function _get_parameter_attribute_from_schema. Is there a good reason not to make that one public and give as examples the type and unit in the documentation?

Agree - changed it to a public get_parameter_attribute_from_schema function only.

(story behind is that these where two long function until I realised that they are doing essentially the same).

src/simtools/data_model/validate_data.py

orelgueta · 2026-04-02T13:42:37Z