
Consolidate together Bevy's TaskPools [adopted] #18163


Draft: hymm wants to merge 31 commits into main from task-pool-consolidation
Conversation

@hymm (Contributor) commented Mar 5, 2025

This is an adoption of #12090

Changes from #12090

  • Added spawn_blocking and spawn_blocking_async to Scope. These let the glTF plugin avoid blocking the compute task pool when running long tasks (see the sketch below).
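
A minimal sketch of how a loader might use the new Scope methods. The helpers (decode_texture, fetch_bytes) are hypothetical, and the exact signatures are whatever this PR lands on:

```rust
ComputeTaskPool::get().scope(|scope| {
    // CPU-heavy, non-yielding work: hand it to the blocking pool so it
    // doesn't tie up a compute thread (decode_texture is hypothetical).
    scope.spawn_blocking(|| {
        decode_texture(&bytes);
    });

    // Work that mixes `.await` points with blocking stretches can use the
    // async variant (fetch_bytes is a hypothetical async helper).
    scope.spawn_blocking_async(async move {
        let raw = fetch_bytes().await;
        decode_texture(&raw);
    });
});
```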

Todo

  • Figure out what to do with the environment variable that limits the number of blocking threads. set_var is now unsafe, and the API in the old PR could potentially set the environment variable from multiple threads. I may just document setting the environment variable manually instead.
  • Test loading a large glTF.
  • Think a little more about the transmute in Scope::spawn_blocking and make sure that it is sound.
  • Review the migration guide and check for any changes needed after the merge with main.

Objective

Fixes #1907 (using ComputeTaskPool for a par_for_each query only uses half of the available logical cores). Spiritual successor to #4740.

Bevy currently creates a 50%/25%/25% split between the Compute, AsyncCompute, and IO task pools, meaning that any given operation can only be scheduled onto its subset of threads. For example, on a 16-core machine the Compute pool gets 8 threads and the other two pools get 4 each. This is suboptimal whenever an app is not using any IO or async compute (or vice versa): available parallelism goes underutilized because the split does not reflect the actual workload. This PR aims to fix that underutilization.

Solution

  • Do away with the IO and AsyncCompute task pools and allocate all of the threads to the ComputeTaskPool.
  • Move all of the non-blocking IO tasks to the Compute task pool.
  • Add TaskPool::spawn_blocking as an alternative for blocking IO operations and any task that would previously have been spawned on the AsyncCompute task pool. This is backed by blocking's dynamically scaled thread pool, which spins threads up and down on demand (see the sketch below).
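
For reference, a minimal sketch of the pattern spawn_blocking wraps, using the blocking crate directly (which Bevy already pulls in via async-fs). The load_config name and path are illustrative:

```rust
use blocking::unblock;

// Run a blocking filesystem read on blocking's dynamically scaled thread
// pool, returning a future a cooperative executor can await without stalling.
async fn load_config() -> std::io::Result<String> {
    unblock(|| std::fs::read_to_string("assets/config.toml")).await
}
```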

This allows ECS systems and parallel iterators to be scheduled onto all CPU cores instead of being artificially constrained to half of the available logical cores, and lets typical async IO tasks interleave onto any thread.
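
For instance, a parallel query like the sketch below (the Velocity and Position components are illustrative) can now fan out across every logical core rather than only the Compute slice:

```rust
use bevy::prelude::*;

#[derive(Component)]
struct Velocity(Vec3);

#[derive(Component)]
struct Position(Vec3);

// par_iter_mut splits the query across the ComputeTaskPool's threads,
// which after this PR means every available logical core.
fn integrate(mut query: Query<(&Velocity, &mut Position)>) {
    query.par_iter_mut().for_each(|(velocity, mut position)| {
        position.0 += velocity.0;
    });
}
```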

Using spawn_blocking for CPU-bound operations relies on the OS's preemptive scheduler to schedule the threads properly while the main task pool's threads sit idle. This comes with potential context-switching costs, but it is generally preferable to choking the entire app because a task does not cooperatively yield its thread, or to artificially limiting available parallelism.

Note: We're already using blocking through async-fs for loading assets from disk. This change primarily moves the remaining blocking IO tasks and the async compute tasks into blocking's thread pool.

Changelog

  • Added: TaskPool::spawn_blocking
  • Added: TaskPool::spawn_blocking_async
  • Removed: IoTaskPool
  • Removed: AsyncComputeTaskPool
  • Changed: ComputeTaskPool by default now spawns a thread for every available logical CPU core.

Migration Guide

IoTaskPool and AsyncComputeTaskPool have been removed and merged into ComputeTaskPool. Replace calls to IoTaskPool::get and AsyncComputeTaskPool::get with ComputeTaskPool::get.

If you were spawning futures that rely on non-blocking IO (i.e. the task spends most of its time yielding via await), just spawn the task onto the ComputeTaskPool:

// in 0.13
IoTaskPool::get().spawn(async move {
    while let Some(item) = stream.next().await {
        // process the item here
    }
});

// in 0.14
ComputeTaskPool::get().spawn(async move {
    while let Some(item) = stream.next().await {
        // process the item here
    }
});

If you were spawning futures that rely on blocking IO (e.g. std::fs::File::read_to_string), use ComputeTaskPool::spawn_blocking instead. This will spawn or reuse one of the dynamically scaled threads explicitly made for blocking operations.

// in 0.13
use std::{fs::File, io::Read};

IoTaskPool::get().spawn(async move {
    let mut contents = String::new();
    File::open(path).unwrap().read_to_string(&mut contents).unwrap();
    // process the file contents here
});

// in 0.14
ComputeTaskPool::get().spawn_blocking(move || {
    let mut contents = String::new();
    File::open(path).unwrap().read_to_string(&mut contents).unwrap();
    // process the file contents here
});

If you were spawning futures for async compute, use ComputeTaskPool::spawn_blocking instead. This will likewise spawn or reuse one of the dynamically scaled threads explicitly made for blocking operations.

// in 0.13
AsyncComputeTaskPool::get().spawn(async move {
     solve_traveling_salesman();
});
// in 0.14
ComputeTaskPool::get().spawn_blocking(move || {
     solve_traveling_salesman();
});

If you were spawning long-running futures that mix blocking and non-blocking work, use ComputeTaskPool::spawn_blocking_async instead.

// in 0.13
use futures_lite::AsyncReadExt;

AsyncComputeTaskPool::get().spawn(async move {
    let mut contents = String::new();
    async_fs::File::open("assets/traveling_salesman.json")
        .await
        .unwrap()
        .read_to_string(&mut contents)
        .await
        .unwrap();
    solve_traveling_salesman(contents);
});

// in 0.14
ComputeTaskPool::get().spawn_blocking_async(async move {
    let mut contents = String::new();
    async_fs::File::open("assets/traveling_salesman.json")
        .await
        .unwrap()
        .read_to_string(&mut contents)
        .await
        .unwrap();
    solve_traveling_salesman(contents);
});

hymm force-pushed the task-pool-consolidation branch from 22ec886 to d0a19e9 (March 5, 2025)
hymm force-pushed the task-pool-consolidation branch from 8baead2 to d3bb78e (March 5, 2025)
TimJentzsch added the C-Performance, A-Tasks, and S-Waiting-on-Author labels (March 5, 2025)
hymm force-pushed the task-pool-consolidation branch from 58725af to 78b78ec (March 6, 2025)
@hymm (Contributor, Author) commented Mar 8, 2025

So I looked into the regression with loading assets, and it seems the issue is that ImageLoader does long-running operations in load. Specifically, the call to Image::from_buffer takes a decent amount of time. On top of that, loading a large glTF queues up a large number of image loading tasks, so we end up starving the schedules of any threads to run tasks.

There are a few possible solutions here:

  1. Change all asset loaders to use blocking threads. Probably not a good idea, as the blocking threads don't have a proper cooperative async executor. Might be worth trying to see what the extra overhead is. We'd also be spawning a new thread for every task, up to the max thread limit, so it could be expensive memory-wise too.

  2. Keep the IoTaskPool, but change the TaskPoolBuilder so that it allows overprovisioning threads. We would set the ComputeTaskPool to provision threads for all the logical processors and set the IoTaskPool to some percentage of that (see the sketch below).

  3. Just fix the image loader to use blocking threads for its longer-running tasks. This requires creating an async scope, as we need to free up the thread that load is called on. This would put the burden on asset loader authors to make their loaders behave properly.

My inclination here is to go with (2) and rework this PR to remove the AsyncComputeTaskPool. Long-running async tasks should use separate threads from the task pool. (3) isn't great because it assumes loaders will always be written to behave, and (1) seems like a bad idea for the reasons listed above.

There might be a (4) with some type of task priority system, but that requires more extensive executor changes. I'll probably explore overprovisioning threads and put up a PR for it if it looks OK.
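
A minimal sketch of what (2) could look like with the existing TaskPoolBuilder API; the 25% figure and the overlap policy are illustrative, not decided:

```rust
use bevy_tasks::TaskPoolBuilder;

let logical = std::thread::available_parallelism().map(|n| n.get()).unwrap_or(1);

// Compute gets a thread per logical core...
let compute = TaskPoolBuilder::new()
    .thread_name("Compute Task Pool".to_string())
    .num_threads(logical)
    .build();

// ...while IO overprovisions some percentage on top of that, relying on
// the OS scheduler whenever both pools are busy at once.
let io = TaskPoolBuilder::new()
    .thread_name("IO Task Pool".to_string())
    .num_threads((logical / 4).max(1))
    .build();
```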

@Elabajaba (Contributor) commented Mar 9, 2025


Fresh trace of Bistro with compressed textures for this PR, for context on the asset loading regression.

edit: #17914 might improve the blocking in prepare assets after the images are loaded.

@cart (Member) commented Mar 12, 2025

Yeah the "separate pools" model was solving a real problem that I don't think we can just fully ignore. I like (2).

@cart (Member) commented Mar 12, 2025

Although I see no reason why IO is any different from "async compute". Seems like plenty of "async compute" cases could also benefit from a pooled model?
