Support serving in container without tensorflow, torch, or cudf
#326
Conversation
merlin/systems/dag/ops/pytorch.py
Outdated
    try:
        import torch
    except ImportError:
        torch = None
These should probably import from merlin.core.compat.torch and merlin.core.compat.tensorflow
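A minimal sketch of the suggested change, assuming the merlin.core.compat modules re-export each framework as None when the package is not installed:

    from merlin.core.compat.tensorflow import tensorflow as tf
    from merlin.core.compat.torch import torch

    # Both names are None in a serving container without the frameworks,
    # so operator code can guard on them instead of wrapping imports in
    # try/except blocks at every call site.
    if torch is not None:
        ...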
Updated. There is a side effect in merlin.core.compat.tensorflow that does more than import tensorflow, which I'm not sure about: the configure_tensorflow function. It may be worth making the device configuration something that is called explicitly instead of something that runs as a result of importing that module.
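A hypothetical sketch of the explicit style being proposed here (whether configure_tensorflow would keep this exact name and signature is an assumption):

    from merlin.core.compat.tensorflow import configure_tensorflow, tensorflow as tf

    if tf is not None:
        # Called by the application at startup, rather than running as a
        # side effect of the import above; an explicit call site would also
        # leave room to pass configuration parameters.
        configure_tensorflow()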
There are arguments for both sides here. We have had customer complaints before about TF reserving more GPU memory than required. The issue manifests as an OOM error, which takes a while to investigate given the back and forth required with the customer. We then have to walk them through the whole explanation that they need to use the configure_tensorflow method to clamp down TF's GPU memory reservation. That ends up confusing the customer, so we wanted to handle it for the user.
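For context, a sketch of the kind of clamping being described, using TensorFlow's public device-configuration API (that configure_tensorflow uses exactly these calls is an assumption):

    import tensorflow as tf

    # By default TF reserves nearly all free GPU memory at startup; enabling
    # memory growth makes it allocate on demand instead, which is the behavior
    # customers were previously being asked to opt into manually.
    for gpu in tf.config.list_physical_devices("GPU"):
        tf.config.experimental.set_memory_growth(gpu, True)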
We used to do it explicitly, but it was a giant pain to remember to do it everywhere, and if you didn't remember to do it before importing tensorflow then things would break. I'm honestly not sure that doing it this way is entirely better, because you can't really set the parameters anymore, but 🤷🏻 six of one, half dozen of the other.
It would be nice to have a test that checks this. One option would involve configuring a workflow that uses Triton as a service container, so that we can test with a more minimal Triton docker image that doesn't have tensorflow/torch or cudf installed.
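A hypothetical sketch of what such a test might look like inside a minimal image (the skip guard and test name are illustrative, not part of this PR):

    import importlib.util

    import pytest

    @pytest.mark.skipif(
        any(importlib.util.find_spec(pkg) is not None for pkg in ("tensorflow", "torch", "cudf")),
        reason="only meaningful in a minimal serving image",
    )
    def test_serving_imports_without_frameworks():
        # Importing the serving operators should succeed even when the
        # optional frameworks are absent from the environment.
        import merlin.systems.dag.ops.pytorch  # noqa: F401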
Support serving in container without tensorflow, torch, or cudf.

tensorflow/torch

The python packages tensorflow and torch are not required to be installed to serve a Systems Ensemble with Triton, since we're using the tensorflow and pytorch Triton backends, which don't require the python packages.

cudf

Adding a check for cudf to convert_format enables us to run an NVTabular workflow in an environment without cudf installed.
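A minimal sketch of the kind of check described (the helper shape is an assumption; the real convert_format logic is more involved):

    from merlin.core.compat import cudf  # None when cudf is not installed

    def _supports_gpu_dataframes():
        # Hypothetical guard illustrating the check added to convert_format:
        # only consider the cudf/GPU representation when cudf is importable,
        # so NVTabular workflows can run in a CPU-only container.
        return cudf is not None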