Improve some thermo calculation bottlenecks by refactoring how units are handled by jthielen · Pull Request #2064 · Unidata/MetPy

jthielen · 2021-08-28T21:13:20Z

Description Of Changes

I noticed that there have been two major performance bottlenecks caused by the existing approaches to unit handling: iteration in moist_lapse/lcl (#1169) and looping of thickness_hydrostatic in io.gempak's _interp_* functions (#2062). In an attempt to improve this situation, this PR takes the approach of refactoring the relevant thermo calculations to have private versions that only use base units (no pint.Quantity handling), and wrapping those while handling units. No robust benchmarks were evaluated, but this did speed up pytest tests/calc/test_thermo.py on my workstation from about 3.5 seconds to 0.69 seconds (similar to #1980), and @akrherz's test file in #2041 (comment) from 35.88 seconds to 1.78 seconds.

I tried to be careful to not touch any tests in this refactor (other than the fix to the flake8 checker), but do let me know if any tests should be added.

Future work building on this PR could examine if all/most of the calculations should be refactored in this way (which may also reveal some cleaner implementation approaches) and if numba jit/guvectorize could meaningfully accelerate any of these private routines.

@sgdecker, could you evaluate how much performance improvement this gives you relative to your earlier tests in #2062?

@nawendt, these changes caused some failing tests in the GEMPAK reader due to what looks to be marginally different output values. Would you be willing to take a look and see if these differences are significant or not?

Checklist

Closes #1169 (thermo.py: (hack) Optimise dt function in moist_lapse by eliminating …), closes #2062 (snxarray is slow)
Tests added
Fully documented

src/metpy/io/gempak.py

nawendt · 2021-08-29T17:31:30Z

I added a comment to convert_degC_to_K. This function is the reason the tests started failing. It did not account for missing values which were most likely -9999. The reference data has a nan value whereas the decoded soundings ended up having some nonsense values in the same array position. Glad it was an easy fix.
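To make the failure mode concrete, here is a small sketch of a Celsius-to-Kelvin conversion that masks missing values before applying the offset. It is not the actual `convert_degC_to_K` from `gempak.py`: the function name, the list-based interface, and the `-9999` sentinel are assumptions (the comment above only says the missing values were "most likely -9999"), and the real reader presumably works on NumPy arrays.

```python
import math

MISSING = -9999.0  # assumed missing-value sentinel; not confirmed by the PR


def degc_to_kelvin_sketch(values, missing=MISSING):
    """Convert Celsius to Kelvin, mapping the missing sentinel to NaN.

    Without the check, a sentinel of -9999 would become -9725.85 K, which
    is the kind of nonsense value described in the comment above.
    """
    return [math.nan if v == missing else v + 273.15 for v in values]


converted = degc_to_kelvin_sketch([0.0, -9999.0, 26.85])
```

Here the second element of `converted` is NaN rather than a large negative temperature, so it can line up with NaN entries in reference data.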

sgdecker · 2021-08-30T03:54:22Z

On an old laptop, I am going from 3:00 to 0:13 with this branch and my test file. Very nice!

jthielen · 2021-08-31T17:38:51Z

I wasn't all that happy with the private routine + public wrapper approach...it would get really messy if applied to the whole calculation suite. So, I went back and came up with a decorator-based alternative: https://gist.github.com/jthielen/93ec9fcd8f3892a83215309fe5cda2c6. Doing it this way moves all unit specification into the decorator (which does carry with it the slight issue of default arguments in the signature needing to be in base units and not Quantities), and leaves the unitless version attached to the parent function on the ._nounit attribute (for use internally).

The existing approach is passing all checks, but let me know if it'd be useful to go back and redo it in the decorator-based way. If going that route, I'd want to clean it up to allow the existing check_units and this new process_units to better share code, and to think through if I'm missing any edge cases in the design. Otherwise, if we want to get these performance improvements in as soon as possible, the decorator alternative can be left to a future refactor when we need to optimize more routines.
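The decorator idea in this comment can be sketched roughly as follows. This is not the gist's code or MetPy's `process_units`: the decorator name `process_units_sketch`, its positional-only signature, the stand-in `Quantity` class, and the conversion table are all hypothetical. Only the `_nounit` attribute name is taken from the comment.

```python
import functools
from dataclasses import dataclass


@dataclass(frozen=True)
class Quantity:
    """Stand-in for pint.Quantity: a magnitude plus a unit string."""
    magnitude: float
    units: str


# unit name -> (base unit name, conversion to base units); illustrative only
_TO_BASE = {
    'Pa': ('Pa', lambda v: v),
    'hPa': ('Pa', lambda v: v * 100.0),
    'K': ('K', lambda v: v),
    'degC': ('K', lambda v: v + 273.15),
}


def process_units_sketch(input_base_units, output_units):
    """Wrap a base-unit function so the public version accepts Quantity objects.

    input_base_units: expected base unit for each positional argument.
    output_units: unit string attached to the returned Quantity.
    """
    def decorator(func):
        @functools.wraps(func)
        def wrapper(*args):
            magnitudes = []
            for arg, expected in zip(args, input_base_units):
                base, convert = _TO_BASE[arg.units]
                if base != expected:
                    raise ValueError(f'expected {expected}, got {arg.units}')
                magnitudes.append(convert(arg.magnitude))
            return Quantity(func(*magnitudes), output_units)

        # Keep the undecorated, unit-free function reachable for internal use.
        wrapper._nounit = func
        return wrapper
    return decorator


RD = 287.05  # approximate dry-air gas constant in J / (kg K)


@process_units_sketch(('Pa', 'K'), 'kg / m^3')
def density(pressure, temperature):
    """Ideal-gas density of dry air; the body only sees floats in Pa and K."""
    return pressure / (RD * temperature)


with_units = density(Quantity(1000.0, 'hPa'), Quantity(300.0, 'K'))
without_units = density._nounit(100000.0, 300.0)
```

Because the function body never sees a `Quantity`, any default argument in its signature has to be a plain number in base units, which is the caveat noted in the comment.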

jthielen · 2021-09-17T01:37:04Z

This has now all been rewritten to use the decorator-based approach! Performance improvements from my tests are about the same as before.

jthielen · 2022-01-14T20:50:38Z

This should be ready for final review once it is adapted after the merge of #2263

jthielen · 2022-01-18T19:26:17Z

@dcamron I've rebased this on main now that #2263 is in. I'm not on my primary workstation at the moment, so wasn't able to check full lint, test suite, and performance, but at least we can see what CI thinks. Also, should be good enough for a review.

dopplershift · 2022-01-18T22:18:10Z

We can ignore Code Climate.

src/metpy/units.py

dopplershift

Wish we had some actual benchmarks in place, but alas.

dopplershift

Wish we had a benchmark, but alas.

jthielen requested review from dcamron and dopplershift August 28, 2021 21:13

jthielen requested a review from a team as a code owner August 28, 2021 21:13

jthielen added the Area: Calc, Area: IO, Area: Units, and Type: Maintenance labels Aug 28, 2021

jthielen added this to the 1.2.0 milestone Aug 28, 2021

jthielen mentioned this pull request Aug 28, 2021

Handle units external to iteration in moist_lapse and lcl #1980

Closed

1 task

nawendt reviewed Aug 29, 2021

jthielen force-pushed the refactor-unit-handling-bottlenecks branch from be2a1f2 to 52600c2 Compare August 30, 2021 20:30

This was referenced Aug 30, 2021

Option to return full (cumulative) profile from thickness_hydrostatic functions #1312

Open

Enhancements to GEMPAK reader(s) #2067

Open

jthielen force-pushed the refactor-unit-handling-bottlenecks branch from 52600c2 to e39fdbb Compare August 30, 2021 21:41

jthielen force-pushed the refactor-unit-handling-bottlenecks branch 2 times, most recently from 01592ce to 25b3567 Compare September 17, 2021 01:36

jthielen force-pushed the refactor-unit-handling-bottlenecks branch from 17d124c to 00745ba Compare September 19, 2021 19:28

Refactor constants module and check units decorator in preparation for unit handling on decorator

fab9996

jthielen force-pushed the refactor-unit-handling-bottlenecks branch from 00745ba to 3289182 Compare January 18, 2022 19:23

jthielen added 3 commits January 18, 2022 12:23

Handle units externally for moist_lapse and lcl and those they rely on

f404d50

Refactor GEMPAK reader interp functions to use nounit versions of calculations

963faf8

fix lint

3289182

dopplershift reviewed Jan 21, 2022

dopplershift previously approved these changes Jan 21, 2022

MNT: Change square back to use power rather than multiplication

588f4e2

dopplershift dismissed their stale review via 588f4e2 January 21, 2022 21:29

dopplershift enabled auto-merge January 21, 2022 21:30

dopplershift approved these changes Jan 21, 2022

dopplershift merged commit 94ef0e0 into Unidata:main Jan 21, 2022

dopplershift merged 5 commits into Unidata:main from jthielen:refactor-unit-handling-bottlenecks

4 participants
