Implementing multistart version of theta_est using multiple sampling methods #3575

sscini · 2025-04-23T20:32:06Z

Fixes # .

Summary/Motivation:

Currently, the optimization is only done from a single initial value. This implementation adds the ability to specify multiple initial values using selected sampling techniques: from a random uniform distribution, using Latin Hypercube Sampling, or using Sobol Quasi-Monte Carlo sampling.

Changes proposed in this PR:

All changes made adding pseudocode in comments
Added inputs needed for multistart simulation
Added a function to generate points using the selected method
Added theta_est_multistart to work for the multistart process

TODO before converting from draft:

Receive feedback from collaborators on logical setup
Convert finalized pseudocode
Test and debug
Confirm function with examples

Legal Acknowledgement

By contributing to this software project, I have read the contribution guide and agree to the following terms and conditions for my contribution:

I agree my contributions are submitted under the BSD license.
I represent I am authorized to make the contributions and grant the license. If my employer has rights to intellectual property that includes these contributions, I represent that I have received permission to make contributions and grant the required license on behalf of that employer.

sscini · 2025-04-23T20:32:39Z

@djlaky @adowling2 Please provide early feedback.

sscini · 2025-04-30T13:58:25Z

Dynamic saving using flush, add.

adowling2

Notes from our in-person discussion/informal code review

pyomo/contrib/parmest/parmest.py

adowling2 · 2025-04-30T13:48:44Z

pyomo/contrib/parmest/parmest.py

+    #         # If only one restart, return an empty list
+            # return []
+
+    #         return {theta_names[i]: initial_theta[i] for i in range(len(theta_names))}


We discussed adding a "dataframe" sampling method that uses multistart points defined by the user. This is helpful if we want to try the same set of multistart points for multiple experiments.

adowling2 · 2025-04-30T13:50:22Z

pyomo/contrib/parmest/parmest.py

+                "Multistart is not supported in the deprecated parmest interface")
+            )
+
+        assert isinstance(n_restarts, int)


Also check that this is > 1

Please look at other Pyomo code fgor exampels of throwing exceptions

Agree with @adowling2 here, you need to throw an exception so you can test the exception is caught.

pyomo/contrib/parmest/parmest.py

adowling2 · 2025-04-30T13:52:43Z

pyomo/contrib/parmest/parmest.py

+            )
+
+
+            results = []


It might make more sense to create a dataframe and then add rows as you go. Or you could preallocate the dataframe size because you know how many restarts.

You could even have your generate_samples function generate this empty dataframe.

pyomo/contrib/parmest/parmest.py

sscini · 2025-04-30T14:01:34Z

Extend existing tests for parmest to include multistart, add.

sscini · 2025-04-30T14:41:13Z

Models provided need to include bounds, add exception

adowling2

Here are some more comments for you to consider are you continue to refine this.

pyomo/contrib/parmest/parmest.py

adowling2 · 2025-05-01T21:58:28Z

pyomo/contrib/parmest/parmest.py

+        upper_bound = np.array([parmest_model.find_component(name).ub for name in theta_names])
+        # Check if the lower and upper bounds are defined
+        if np.any(np.isnan(lower_bound)) or np.any(np.isnan(upper_bound)):
+            raise ValueError(


You probably already know this, but you will need to check all the errors are raised when expected.

adowling2 · 2025-05-01T21:59:25Z

pyomo/contrib/parmest/parmest.py

+            )
+
+        if self.method == "random":
+            np.random.seed(seed)


Do you want to skip setting the random seed if seed=None (default)?

The default is none for all the functions I use that set seed, so if it receives seed = None, it would work as expected. Would skipping it still be best practice?

adowling2 · 2025-05-01T22:00:52Z

pyomo/contrib/parmest/parmest.py

+        elif self.method == "latin_hypercube":
+            # Generate theta values using Latin hypercube sampling
+            sampler = scipy.stats.qmc.LatinHypercube(d=len(theta_names), seed=seed)
+            samples = sampler.random(n=self.n_restarts+1)[1:]  # Skip the first sample


Why are you skipping the first sample? Please explain in the comments.

I will add a comment in code to explain as well. The first sample generated using qmc.sobol is always the origin (zero vector). I thought logic applied to all qmc methods, but no only sobol. So to get nonzero points, you need to skip first sample

pyomo/contrib/parmest/parmest.py

adowling2 · 2025-05-01T22:07:05Z

pyomo/contrib/parmest/parmest.py

@@ -921,6 +1020,116 @@ def theta_est(
            cov_n=cov_n,
        )

+    def theta_est_multistart(
+        self,
+        buffer=10,


Need to explain the buffer in the doc string.

adowling2 · 2025-05-01T22:07:52Z

pyomo/contrib/parmest/parmest.py

+                "Multistart is not supported in the deprecated parmest interface"
+            )
+
+        assert isinstance(self.n_restarts, int)


Replace all of these with more descriptive error messages. Remember that we need tests for each error message.

pyomo/contrib/parmest/parmest.py

djlaky · 2025-05-05T12:14:59Z

pyomo/contrib/parmest/parmest.py

+    # optimization. It will take the theta names and the initial theta values
+    # and return a dictionary of theta names and their corresponding values.
+    def _generate_initial_theta(self, parmest_model, seed=None, n_restarts=None, multistart_sampling_method=None, user_provided=None):
+        if n_restarts == 1:


I like just sending a warning, and not returning. For example, n_restarts might be 1 by default. You should check if n_restarts is an int as well. Then, if n_restarts is 1, you should send a warning that the tool is intended for this number to be greater than one and solve as normal.

djlaky · 2025-05-05T12:16:21Z

pyomo/contrib/parmest/parmest.py

+
+        # Get the theta names and initial theta values
+        theta_names = self._return_theta_names()
+        initial_theta = [parmest_model.find_component(name)() for name in theta_names]


Is it better to use the suffix for this? The suffix value shouldn't change, but the theta value may if the model has been solved for some reason. I don't know if this is a potential issue but I think that grabbing these values from the suffixes would be more dummy-proof.

pyomo/contrib/parmest/parmest.py

djlaky · 2025-05-05T12:29:17Z

pyomo/contrib/parmest/parmest.py

+
+        # Add the output info values to the dataframe, starting values as nan
+        for i in range(len(theta_names)):
+            df_multistart[f'converged_{theta_names[i]}'] = np.nan        


Are all string characters legal for pandas names? I would think so but this line seems (maybe) dangerous? For instance, what if we start getting into block structuring with multi-index parameters? We should make sure there is a test to ensure the system is robust. I believe I have this on the back burner to make one for Pyomo.DoE as well.

pyomo/contrib/parmest/parmest.py

sscini added 2 commits April 23, 2025 11:50

Work on multistart implement 4/23 morning

cdd7d52

Finished first draft of pseudocode for multistart

eca0ba8

Fixed logical errors in pseudocode

2160aec

blnicho added the ParmEst label Apr 29, 2025

github-project-automation bot added this to ParmEst & Pyomo.DoE Development Apr 29, 2025

blnicho moved this to Todo in ParmEst & Pyomo.DoE Development Apr 29, 2025

adowling2 moved this from Todo to Development in ParmEst & Pyomo.DoE Development Apr 30, 2025

adowling2 reviewed Apr 30, 2025

View reviewed changes

sscini added 4 commits April 30, 2025 11:07

Started implementing review comments 4/30

266beea

Merge branch 'Pyomo:main' into multistart-in-parmest

b877ada

Work on edits, 5/1/25

9f1ffe5

Merge branch 'Pyomo:main' into multistart-in-parmest

43f1ab3

adowling2 reviewed May 1, 2025

View reviewed changes

Made edits, still debugging

ea067c8

djlaky reviewed May 5, 2025

View reviewed changes

pyomo/contrib/parmest/parmest.py Outdated Show resolved Hide resolved

djlaky reviewed May 5, 2025

View reviewed changes

pyomo/contrib/parmest/parmest.py Show resolved Hide resolved

djlaky reviewed May 5, 2025

View reviewed changes

pyomo/contrib/parmest/parmest.py Show resolved Hide resolved

djlaky reviewed May 5, 2025

View reviewed changes

pyomo/contrib/parmest/parmest.py Outdated Show resolved Hide resolved

djlaky reviewed May 5, 2025

View reviewed changes

pyomo/contrib/parmest/parmest.py Show resolved Hide resolved

djlaky reviewed May 5, 2025

View reviewed changes

pyomo/contrib/parmest/parmest.py Show resolved Hide resolved

djlaky reviewed May 5, 2025

View reviewed changes

pyomo/contrib/parmest/parmest.py Show resolved Hide resolved

djlaky reviewed May 5, 2025

View reviewed changes

pyomo/contrib/parmest/parmest.py Show resolved Hide resolved

Addressed some comments in code. Still working through example to debug

3b839ef

Merge branch 'Pyomo:main' into multistart-in-parmest

3a7aa1d

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Implementing multistart version of theta_est using multiple sampling methods #3575

Implementing multistart version of theta_est using multiple sampling methods #3575

sscini commented Apr 23, 2025 •

edited

Loading

sscini commented Apr 23, 2025

sscini commented Apr 30, 2025

adowling2 left a comment

adowling2 Apr 30, 2025

adowling2 Apr 30, 2025

adowling2 Apr 30, 2025

djlaky May 5, 2025

adowling2 Apr 30, 2025

adowling2 Apr 30, 2025

sscini commented Apr 30, 2025

sscini commented Apr 30, 2025

adowling2 left a comment

adowling2 May 1, 2025

adowling2 May 1, 2025

sscini May 1, 2025 •

edited

Loading

adowling2 May 1, 2025

sscini May 1, 2025

adowling2 May 1, 2025

adowling2 May 1, 2025

djlaky May 5, 2025

djlaky May 5, 2025

djlaky May 5, 2025 •

edited

Loading

Implementing multistart version of theta_est using multiple sampling methods #3575

Are you sure you want to change the base?

Implementing multistart version of theta_est using multiple sampling methods #3575

Conversation

sscini commented Apr 23, 2025 • edited Loading

Fixes # .

Summary/Motivation:

Changes proposed in this PR:

TODO before converting from draft:

Legal Acknowledgement

sscini commented Apr 23, 2025

sscini commented Apr 30, 2025

adowling2 left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

sscini commented Apr 30, 2025

sscini commented Apr 30, 2025

adowling2 left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

sscini May 1, 2025 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

djlaky May 5, 2025 • edited Loading

Choose a reason for hiding this comment

sscini commented Apr 23, 2025 •

edited

Loading

sscini May 1, 2025 •

edited

Loading

djlaky May 5, 2025 •

edited

Loading