Refactoring & improvements to algorithmic complexity of inline.rs. by ElectronicRU · Pull Request #811 · EmbarkStudios/rust-gpu

ElectronicRU · 2021-11-29T14:24:44Z

inline.rs was a sore spot for me reading the codebase because of how many TODOs and quadratic algorithms it had sitting there.

Looking at it, I was able to see some nicer algorithms trying to poke out. There is also a little more uniformity in how the same things are done (like finding variable split points), less cloning, and more caching :)

Tests aren't affected at all, which I'd consider a success.

P.S. Idle hands shave yaks, or so they say.

ElectronicRU · 2021-11-29T14:27:06Z

I've tried to explain what exactly I am doing in each commit, and the rationale behind it. This is a second draft, but if some parts seem unclear or I missed something simple, be sure to tell me.

khyperia

Thanks! This is a whole heck of a lot, and my burnt-out brainthinker can barely cope with reviewing it, but hopefully we can get this through. (I'd definitely like @eddyb to take a look here, too)

crates/rustc_codegen_spirv/src/linker/inline.rs

EmbarkStudios#811 (review)

khyperia

LGTM, just want to wait a bit to see if eddyb wants to review!

…c inlining. By inlining in callee -> caller order, we avoid the need to continue inlining the code we just inlined. A simple reachability test from one of the entry points helps avoid unnecessary work as well. The algorithm, however, remains quadratic in the case where OpAccessChains repeatedly find their way into function parameters. There are two ways out: either a more complex control flow analysis, or conservatively inlining all function calls which reference FunctionParameters as arguments. I don't think either option is really worth it.
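A minimal sketch of the callee -> caller ordering described above, not the code from this PR. It assumes a hypothetical call graph in which functions are plain indices and `calls[f]` lists the callees of `f`, and it assumes the graph has no recursion. A depth-first post-order walk from the entry points yields every callee before its callers, and functions that are never reached are simply absent from the result.

```rust
/// Hypothetical helper: depth-first walk that records a function only after
/// all of its callees have been recorded (post-order).
fn visit(func: usize, calls: &[Vec<usize>], visited: &mut [bool], order: &mut Vec<usize>) {
    if visited[func] {
        return;
    }
    visited[func] = true;
    for &callee in &calls[func] {
        visit(callee, calls, visited, order);
    }
    // Every callee of `func` is already in `order` at this point.
    order.push(func);
}

/// Returns the functions reachable from `entry_points`, callees first.
/// Assumption: the call graph is acyclic (no recursion).
fn inline_order(calls: &[Vec<usize>], entry_points: &[usize]) -> Vec<usize> {
    let mut visited = vec![false; calls.len()];
    let mut order = Vec::new();
    for &entry in entry_points {
        visit(entry, calls, &mut visited, &mut order);
    }
    order
}
```

Processing functions in this order means that by the time a call site is inlined, the callee body contains no further calls that still need inlining, so nothing has to be revisited.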

Since the only escaping value during inlining is the return value, we can calculate and update whether it has an invalid-to-call-with value as well. (Note that this is, strictly speaking, more rigor than get_invalid_values() applies, because it doesn't look behind OpPhis.) As a nice bonus, we got rid of OpLoad/OpStore in favor of OpPhi, which means no type mucking and no work created for mem2reg.

Originally, this algorithm walked a linked list by the back-edges, copying and skipping. It is easier to just go with front-edges and gobble up a series of potential blocks at once. The predecessor finding algorithm really just wanted to find 1-to-1 edges (it was split between `compute_all_preds` and `fuse_trivial_branches`), so I made it do exactly that.
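The forward-edge approach can be sketched roughly as follows. This is an illustration under stated assumptions, not the actual `fuse_trivial_branches`: blocks are hypothetical indices, `succs[b]` lists the branch targets of block `b`, block 0 is the entry and is never a branch target, and every block is reachable from the entry. An edge is "1-to-1" when the source has exactly one successor and the target has exactly one predecessor.

```rust
/// Groups blocks into chains that could be fused, following forward edges.
/// Each returned chain starts at a head block and lists, in order, the
/// blocks that would be merged into it.
fn fuse_chains(succs: &[Vec<usize>]) -> Vec<Vec<usize>> {
    // Count incoming edges for every block.
    let mut pred_count = vec![0usize; succs.len()];
    for targets in succs {
        for &target in targets {
            pred_count[target] += 1;
        }
    }

    // A block is absorbed when it is the target of a 1-to-1 edge.
    let mut absorbed = vec![false; succs.len()];
    for targets in succs {
        if let &[only] = targets.as_slice() {
            if pred_count[only] == 1 {
                absorbed[only] = true;
            }
        }
    }

    // Start at every non-absorbed block and gobble up its chain at once.
    let mut chains = Vec::new();
    for head in 0..succs.len() {
        if absorbed[head] {
            continue;
        }
        let mut chain = vec![head];
        let mut current = head;
        while let &[next] = succs[current].as_slice() {
            if !absorbed[next] {
                break;
            }
            chain.push(next);
            current = next;
        }
        chains.push(chain);
    }
    chains
}
```

Only the predecessor count is needed to recognize a 1-to-1 edge, which is why a full predecessor list does not have to be computed in this sketch.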

EmbarkStudios#811 (review)

ElectronicRU · 2022-05-04T09:56:17Z

Updated the MR and resolved conflicts. @eddyb, do you wish to take a look?

ElectronicRU requested review from eddyb and khyperia as code owners November 29, 2021 14:24

khyperia suggested changes Nov 30, 2021

ElectronicRU added a commit to ElectronicRU/rust-gpu that referenced this pull request Nov 30, 2021

linker/inline: code review

fa653c1

EmbarkStudios#811 (review)

khyperia approved these changes Dec 2, 2021

ElectronicRU added 12 commits May 4, 2022 12:53

linker/inline: add pointer type caching

37e659d

We need pointer types, and re-checking all the types to see if we already have one is rather slow; it's better to keep track of them.
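One way such tracking could look, as a sketch only: the `StorageClass` enum, `PointerTypeCache`, and its fields below are hypothetical stand-ins and not the types used in inline.rs or rspirv. The cache is seeded once from the pointer types already present in the module, after which each lookup is a single hash-map access instead of a scan over all types.

```rust
use std::collections::HashMap;

/// Hypothetical stand-in for a SPIR-V storage class.
#[derive(Clone, Copy, PartialEq, Eq, Hash, Debug)]
enum StorageClass {
    Function,
    Private,
}

/// Maps (storage class, pointee type id) to the id of the pointer type.
struct PointerTypeCache {
    known: HashMap<(StorageClass, u32), u32>,
    next_id: u32,
    /// Pointer types created through the cache, as (id, class, pointee),
    /// which the caller would still have to emit into the module.
    created: Vec<(u32, StorageClass, u32)>,
}

impl PointerTypeCache {
    /// `existing` lists pointer types already declared, as (id, class, pointee).
    /// `next_id` is the first unused result id.
    fn new(existing: Vec<(u32, StorageClass, u32)>, next_id: u32) -> Self {
        let known = existing
            .into_iter()
            .map(|(id, class, pointee)| ((class, pointee), id))
            .collect();
        PointerTypeCache { known, next_id, created: Vec::new() }
    }

    /// Returns the id of the pointer type, allocating a new one only if needed.
    fn get_or_create(&mut self, class: StorageClass, pointee: u32) -> u32 {
        if let Some(&id) = self.known.get(&(class, pointee)) {
            return id;
        }
        let id = self.next_id;
        self.next_id += 1;
        self.known.insert((class, pointee), id);
        self.created.push((id, class, pointee));
        id
    }
}
```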

linker/inline: reuse information about which functions should be inli…

99f25a6

…ned. The functions we are going to delete definitely either need to be inlined, or are never called (so we don't care what to decide about them).

linker/inline: make a proper closure in fuse_trivial_branches

a7cc8db

linker/inline: fix test regression

b360321

Just inlining entry points deletes functions from tests and makes everyone sad.

linker/inline: make Clippy happy

c5d3abe

linker/inline: use variables instead of OpPhis to unify branches.

b473f2a

This partially reverts commit 990425b.

linker/inline: code review

6e4d191

EmbarkStudios#811 (review)

linker/inline: added test for cascade inlining.

dcd3512

Make clippy happy

3722fc6

ElectronicRU force-pushed the inline-2 branch from 05d4709 to 3722fc6 Compare May 4, 2022 09:55

eddyb added the "s: waiting on review" label May 21, 2022

ElectronicRU wants to merge 12 commits into EmbarkStudios:main from ElectronicRU:inline-2

3 participants
