
[Functionalization] The inplace decompose op is meaningless #130021

@hanwen-sun

Description


Question:

When I use torch_xla to accelerate model training, I find that in-place ops go through PyTorch's functionalization. The in-place op is dispatched to the Python-level decompose implementation registered here:

def _make_inplace(fn):

However, after the Python-level implementation runs, execution then moves to the C++-level implementation:

void FunctionalTensorWrapper::replace_(const Tensor& other, bool from_lazy_regenerate) {

and the output is determined by this C++-level implementation. It seems that the output of the decomposed op is meaningless.
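To make sure I understand the flow, here is a minimal runnable sketch of what I think functionalization does with an in-place op (FakeFunctionalWrapper and functionalized_mul_ are hypothetical names I made up for illustration; the real logic is split between the _make_inplace decomposition in Python and FunctionalTensorWrapper::replace_ in C++):

import torch

class FakeFunctionalWrapper:
    # Hypothetical stand-in for FunctionalTensorWrapper: holds the
    # underlying "real" tensor.
    def __init__(self, value):
        self.value = value

    def replace_(self, new_value):
        # Analogue of FunctionalTensorWrapper::replace_: the mutation
        # becomes a swap of the wrapped tensor, not an in-place write.
        self.value = new_value

def functionalized_mul_(a, b):
    out = torch.mul(a.value, b.value)  # out-of-place equivalent
    a.replace_(out)                    # wrapper now points at the result
    return a                           # callers observe the wrapper, not
                                       # the decomposition's return value

a = FakeFunctionalWrapper(torch.tensor([1.0, 2.0, 3.0]))
b = FakeFunctionalWrapper(torch.tensor([1.0, 2.0, 3.0]))
functionalized_mul_(a, b)
print(a.value)  # tensor([1., 4., 9.])

If this model is right, it matches what I observe: the value that ends up visible comes from replace_ in C++, not from whatever the Python-level decomposition returns.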
I found a PR related to the Python-level implementation:

and a PR related to the C++-level implementation:

I guess that any backend relying on functionalization will run into this problem with in-place ops.

System info:

PyTorch version: 2.2.2
torch_xla version: 2.3.0
Python version: 3.10.13

Reproducible code:

import torch
import torch_xla.core.xla_model as xm

dev = xm.xla_device()

tensor = torch.tensor([1.0, 2.0, 3.0])

xla_tensor_a = tensor.to(dev)
xla_tensor_b = tensor.to(dev)

# the in-place op below is what goes through functionalization on XLA
xla_tensor_a.mul_(xla_tensor_b)

You can check the timeline, or add some logging in the Python-level and C++-level implementations. Here is the timeline of the mul_ op:

[screenshot: timeline of the mul_ op]
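The same rewrite can also be observed on CPU without an XLA device, using the public torch.func.functionalize API together with make_fx (a sketch to show the decomposition only; it does not go through the XLA path):

import torch
from torch.func import functionalize
from torch.fx.experimental.proxy_tensor import make_fx

def f(a, b):
    a.mul_(b)
    return a

a = torch.tensor([1.0, 2.0, 3.0])
b = torch.tensor([1.0, 2.0, 3.0])

# The traced graph shows mul_ rewritten into an out-of-place mul
# followed by a copy back into the input buffer.
print(make_fx(functionalize(f))(a, b).code)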

I want to know: is there a problem with this implementation? Or can someone explain why PyTorch's functionalization does this? Thank you!

cc @bdhirsh @ezyang
