Add option to use torch._inductor.standalone_compile #17057

zou3519 · 2025-04-23T14:08:53Z

This PR adds the option to use torch._inductor.standalone_compile to perform compilation instead of compile_fx. The goal of standalone_compile is to remove the hacks around vLLM's usage of compile_fx; we want to migrate to it in PyTorch 2.8.

standalone_compile replaces how vLLM interacts with the torch.compile caches. Instead of vLLM trying to redirect them into its torch_compile_cache folder, vLLM can pass standalone_compile a filepath that is inside of the torch_compile_cache folder and standalone_compile will write the full precompiled artifact to it.
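
To illustrate the caching pattern described above, here is a minimal sketch in plain Python. It does not call torch._inductor.standalone_compile (its signature is not shown in this description); `compile_fn`, `compile_with_artifact_path`, the `artifact.bin` filename, and the use of pickle are all hypothetical stand-ins. The point is only that the caller chooses a path inside its own cache folder and the compiler writes the whole artifact there, so nothing needs to be redirected.

```python
import pickle
import tempfile
from pathlib import Path
from typing import Any, Callable, Tuple


def compile_with_artifact_path(
    cache_root: Path,
    key: str,
    compile_fn: Callable[[], Any],
) -> Tuple[Any, bool]:
    """Return (artifact, cache_hit).

    Hypothetical helper: `compile_fn` stands in for the real compiler,
    and pickle stands in for whatever serialization the compiler uses.
    """
    # The caller owns the location: a path inside its own cache folder.
    artifact_path = cache_root / "torch_compile_cache" / key / "artifact.bin"

    if artifact_path.exists():
        # Warm start: load the full precompiled artifact from the known path.
        return pickle.loads(artifact_path.read_bytes()), True

    # Cold start: compile, then write the full artifact to the chosen path.
    artifact = compile_fn()
    artifact_path.parent.mkdir(parents=True, exist_ok=True)
    artifact_path.write_bytes(pickle.dumps(artifact))
    return artifact, False


with tempfile.TemporaryDirectory() as tmp:
    root = Path(tmp)
    # First call compiles and saves; second call loads the saved artifact.
    first, hit_first = compile_with_artifact_path(
        root, "graph0", lambda: {"kernel": "compiled"}
    )
    second, hit_second = compile_with_artifact_path(
        root, "graph0", lambda: {"kernel": "recompiled"}
    )
```

The second call never invokes its `compile_fn`, because the artifact written by the first call is found at the path the caller picked.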

Right now this option is hidden behind a config flag. It is also not tested in vLLM CI (vLLM CI only tests against PyTorch 2.6). This option also needs more testing before we turn it on by default for PyTorch 2.8+. I am putting this PR out so that we can merge something that we can keep developing on top of.

Test Plan:

Run https://gist.github.com/zou3519/aebb622714e80f4cd4c369472f2372cd

github-actions · 2025-04-23T14:09:08Z

👋 Hi! Thank you for contributing to the vLLM project.

💬 Join our developer Slack at https://slack.vllm.ai to discuss your PR in #pr-reviews, coordinate on features in #feat- channels, or join special interest groups in #sig- channels.

Just a reminder: PRs do not trigger a full CI run by default. Instead, only the fastcheck CI runs, which covers a small, essential subset of CI tests to catch errors quickly. You can run other CI tests on top of those by going to your fastcheck build in the Buildkite UI (linked in the PR checks section) and unblocking them. If you do not have permission to unblock, ping simon-mo or khluu to add you to our Buildkite org.

Once the PR is approved and ready to go, your PR reviewer(s) can run CI to test the changes comprehensively before merging.

To run CI, PR reviewers can either add the ready label to the PR or enable auto-merge.

🚀

Signed-off-by: rzou

zou3519 force-pushed the standalone_compile branch 2 times, most recently from de70e86 to 031b8a1 · April 23, 2025 15:41

zou3519 force-pushed the standalone_compile branch from 031b8a1 to 24dc355 · April 23, 2025 18:32

zou3519 marked this pull request as ready for review · April 24, 2025 00:42

zou3519 requested review from tlrmchlsmth, mgoin, youkaichao and houseroad · April 24, 2025 00:43
