Inductor - IR

Update: 2024-01-16

Description

Inductor IR is an intermediate representation that sits between ATen FX graphs and the final Triton code generated by Inductor. It was designed to faithfully represent PyTorch semantics and accordingly models views, mutation, and striding. When you write a lowering from ATen operators to Inductor IR, you get a TensorBox for each Tensor argument, which holds a reference to the underlying IR (via a StorageBox, and then a Buffer/ComputedBuffer) that says how the Tensor was computed. The inner computation is represented via define-by-run, which keeps the IR definition compact while still allowing you to extract an FX graph if you desire. Scheduling then takes buffers of Inductor IR and decides what can be fused. Inductor IR may have too many nodes; this would be a good thing to refactor in the future.
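To make the TensorBox / StorageBox / Buffer layering and the define-by-run idea more concrete, here is a minimal toy sketch in plain Python. It is not Inductor's implementation: the class names only mirror the ones mentioned above, and `InputBuffer`, `Pointwise`, `make_loader`, `realize`, `lower_relu`, and `lower_add_one` are illustrative assumptions. Strings stand in for generated code, and composing closures stands in for fusion.

```python
from dataclasses import dataclass
from typing import Callable

# Toy stand-ins only. None of these classes are imported from PyTorch.


@dataclass
class InputBuffer:
    """A named buffer that already exists in memory (hypothetical)."""
    name: str

    def make_loader(self) -> Callable[[str], str]:
        # Reading an element of a materialized buffer is a plain load.
        return lambda index: f"load({self.name}, {index})"


@dataclass
class Pointwise:
    """Define-by-run body: calling inner_fn with an index emits the computation."""
    inner_fn: Callable[[str], str]

    def make_loader(self) -> Callable[[str], str]:
        # Not materialized, so a consumer inlines the body directly.
        return self.inner_fn


@dataclass
class ComputedBuffer:
    """A named buffer together with the computation that fills it."""
    name: str
    data: Pointwise

    def make_loader(self) -> Callable[[str], str]:
        return lambda index: f"load({self.name}, {index})"


@dataclass
class StorageBox:
    """Mutable indirection: holds either an unrealized body or a buffer."""
    data: object

    def realize(self, name: str) -> None:
        # Force the computation into its own named buffer.
        if isinstance(self.data, Pointwise):
            self.data = ComputedBuffer(name, self.data)

    def make_loader(self) -> Callable[[str], str]:
        return self.data.make_loader()


@dataclass
class TensorBox:
    """What a lowering receives for each Tensor argument in this toy."""
    data: StorageBox

    def make_loader(self) -> Callable[[str], str]:
        return self.data.make_loader()


def lower_relu(x: TensorBox) -> TensorBox:
    load = x.make_loader()
    return TensorBox(StorageBox(Pointwise(lambda i: f"relu({load(i)})")))


def lower_add_one(x: TensorBox) -> TensorBox:
    load = x.make_loader()
    return TensorBox(StorageBox(Pointwise(lambda i: f"add({load(i)}, 1)")))


arg0 = TensorBox(StorageBox(InputBuffer("arg0")))

# Unrealized intermediate: the relu body is inlined into its consumer.
fused = lower_add_one(lower_relu(arg0)).make_loader()("i0")

# Realized intermediate: the consumer loads from the named buffer instead.
mid = lower_relu(arg0)
mid.data.realize("buf0")
split = lower_add_one(mid).make_loader()("i0")
buf0_body = mid.data.data.data.inner_fn("i0")

print(fused)      # add(relu(load(arg0, i0)), 1)
print(split)      # add(load(buf0, i0), 1)
print(buf0_body)  # relu(load(arg0, i0))
```

The point of the indirection in this sketch is that a lowering never needs to know whether its input has been written to memory: it asks the box for a loader and composes on top of it. Whether an intermediate becomes its own buffer is decided separately, which is the kind of decision the description attributes to buffers and scheduling.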
