RFC for Waiting for Individual Tasks in task_group by kboyarinov · Pull Request #1862 · uxlfoundation/oneTBB

kboyarinov · 2025-09-29T16:13:11Z

Description

Add an RFC for new functions that allow waiting for Individual Tasks in task_group:

namespace oneapi {
namespace tbb {
enum task_group_status { // Existing API
    not_complete,        // Existing API
    complete,               // Existing API
    canceled,                // Existing API
    task_complete        // Proposed API 
}

class task_group {
    task_group_status wait_for_task(task_completion_handle&);
    task_group_status run_and_wait_for_task(task_handle&&);
};

class task_arena {
   task_group_status wait_for(task_completion_handle&);
};

} // namespace tbb
} // namespace oneapi

Fixes # - issue number(s) if exists

Type of change

Choose one or multiple, leave empty if none of the other choices apply

Add a respective label(s) to PR if you have permissions

bug fix - change that fixes an issue
new feature - change that adds functionality
tests - change in tests
infrastructure - change in infrastructure and CI
documentation - documentation update

Tests

added - required for new features and some bug fixes
not needed

Documentation

updated in # - add PR number
needs to be updated
not needed

Breaks backward compatibility

Yes
No
Unknown

Notify the following users

List users with @ to send notifications

Other information

rfcs/proposed/task_group_wait_single_task/README.md

vossmjp · 2025-09-29T20:01:01Z

rfcs/proposed/task_group_wait_single_task/README.md

+Since the new waiting functions track the progress of a single task, returning a ``task_group_status`` may be misleading.
+If the group execution is cancelled, the tracked task may still execute, and returning ``canceled`` does not accurately reflect the task's 
+completion status.
+If execution is not cancelled, the function would need to track whether other tasks remain in the group and return ``not_complete` if any are still pending.


Will transfer of a task_completion lead to any confusion around cancellation, if for example, the original task has executed but the task that was transfered to is canceled? The user will see canceled as a status even though the initial task did complete. I think it's fine and will just need to be well documented in transfer.

I think that if the completion of the task was transferred to another task, the task status after the wait is also transferred. I agree that it should be clearly documented. Added explicit mention in the RFC.

akukanov · 2025-09-30T14:34:38Z

rfcs/proposed/task_group_wait_single_task/README.md

+To address this, a new enum ``task_status`` is proposed to track the status of the awaited task. ``task_status::complete`` indicates that the tracked 
+task was executed and completed, while ``task_status::canceled`` signifies that the task was not executed due to group cancellation.


I would consider extending the existing enum with one more value, named e.g. task_complete, which would be returned by the single-task waiting functions instead of complete.

When other waiting functions return task_group_status::canceled, they rely on the cancellation of the group (i.e. cancellation of the associated task_group_context). Individual tasks can be both executed or canceled.
For single-task waiting functions, this flag would mean different thing - that the task we are waiting for was canceled.

And even if the task group execution was canceled, the single-task wait can return task_complete. And the returned task_group_status knows nothing about the status of task group.

I think separating the statuses would be more obvious for users.

For single-task waiting functions, this flag would mean different thing - that the task we are waiting for was canceled.

No, it would mean the same thing - that the task group, which task we are waiting for, was cancelled. There is no way to cancel a single task.

And even if the task group execution was canceled, the single-task wait can return task_complete. And the returned task_group_status knows nothing about the status of task group.

Sure, and I see no problem with that. The task was complete, while the status of the whole group is unknown. On the other hand, if the task was cancelled, then the whole group was cancelled.

For canceled, I agree, for run_and_wait_task it means the awaited task was not executed because the cancellation of the task_group execution.
For complete, I find it a bit misleading to have a status of task_group that knows nothing about the actual status of the group and serves the status of the task instead.

Talking with @kboyarinov earlier today, we discussed that for an individual task, a good set of status values would be "executed" and "canceled", instead of complete and canceled. A task might finish before a task group is canceled. Or a long-running task could execute and then discover while executing, by querying its task_group_context, that its group has been canceled and short-cut its execution. So for a specific task, "executed" simply means that the scheduler executed the task but does not imply anything about completion of the work or other work in the task group. And then "canceled" means the task never started to execute.

I would not object to a different status name, but then I think we would need to adjust task_completion_handle and the transfer method name accordingly - to me, these all are aligned and speak of the same thing from different angles.

What do you mean by "short-circuited"? If you mean a user action within the task to check the group status and adjust the task behavior, there is no way for task_group to know that - for all we know, it was executed and either signaled completion or transferred it to another task. And I do not see it different for the task group complete status, really - it also does not mean "a full and valid result", it only means all tasks in the group have been executed - even though they may have short-circuited due to some external condition.

What I think makes sense to clarify is when/how the group cancellation status is checked during the task waiting call. For example, can the implementation signal all the pending task waiting calls as "canceled" once the group is cancelled - even if a particular task is in the middle of execution? What if the task transfers its completion, and then the group is canceled and the "continuation" task is not executed?

I don't believe it is correct for semantics and implementation to "early-exit" the wait call with the canceled returned state if the group execution is cancelled. It seems for me that since the state of the particular task (or task tree) is important for the user, the implementation should wait to get cancelation signal from the task, not a group. It may be important for the user to know that the task is completed even if the whole group is canceled.
The proposed implementation follows this pattern - if task::execute was called for the awaited task, the wait function returns "task_complete", if task::cancel - "canceled". If the task's completion was transferred, it just waits for such a signal from the task, receiving the completion.

I think it is important to distinguish between whether a task (or its transferred completion) has been skipped due to cancellation or has executed. My concern comes from the existence of task_group_status::complete that means the work in a task_group is done and was not interrupted by an exception or cancellation that set the task group status to canceled. The status task_group_status::task_complete means the task or transferred completion was executed and was not interrupted by an exception. Its logic may have been affected (which I have called short-circuited) by cancellation since it can query cancellation and bail out early. Querying of the task_group cancellation status and the performing an early exit is a pattern we endorse. I do agree that we should NOT override the result in the task waiting call by looking at the task group's status after the task has returned. I just think it would be better if we didn't call it task_complete. But if I'm the only one who thinks that task_complete might be confusing, then I won't press it.

There can be three situations:

the task group is not canceled at the moment the last task in the transfer chain has executed

it can be additionally subdivided, depending on whether the task group is canceled later.

the task group has been canceled to the moment the last task in the transfer chain has executed

the task group is canceled and at least one task in the transfer chain has not executed

It's only the second case where short-circuiting via task_group status query is in theory possible. And I expect it to be a rare case, comparing to the normal execution. So if we want to indicate a possibility of this case, I think we need a third status value for that, rather than deciding that the case 2 defines the status name also for 1.

I prefer to use task_complete for both, but I would also not object to distinct statuses.

rfcs/proposed/task_group_wait_single_task/README.md

akukanov · 2025-09-30T14:49:27Z

rfcs/proposed/task_group_wait_single_task/README.md

+```cpp
+task_status wait_for(task_completion_handle& comp_handle);
+```
+
+Waits for the completion of the task represented by ``comp_handle``.
+If completion was transferred to another task using ``tbb::task_group::transfer_completion_to``, the function waits for completion of that task.
+
+This is semantically equivalent to: ``execute([&] { tg.wait_for(comp_handle); })``.


Are we able to get the task group out of a completion handle? If not, are we able to implement the function without requiring a task group to be also provided by the caller?

With our current implementation, task_group is not required for a single-task waiting. All we need is a task_dynamic_state and the associated task_group_context, both can be obtained from the task object.

In theory, we can implement task_group::wait_task (and even run_and_wait_task if we omit the "same task group" check) as a static function. But I have proposed member functions for consistency with other waiting functions.

I am not sure if implementing task_arena::wait_for wihout the task group is possible for any other TBB implementation. From the perspective of the further inclusion into oneTBB specification, it may make sense to add task_group argument into this function and keep it unused in our implementation. What do you think?

Sure it is possible for any implementation, under the assumption that a task_completion_handle (and perhaps also task_handle) is always "bound" to a certain task group (that is, can keep a pointer/reference to the group). We just need to be clear about that, as well as at which point the binding happens (I guess that is at task creation, and not at submission). Otherwise, it is a mystery where tg comes from in the "equivalent" expression.

I have added the clarification about what tg is in this case.

I agree that this can be approached by binding the handle to the task_group. But I think that we should not specify how binding should be implemented (to ensure the validity of our implementation).

True, we should not specify how the binding is to be implemented. When I said a task handle "can keep a pointer/reference", I did not mean it should, just that an implementation that takes this approach is valid and safe.

Specifying when binding happens is different, as it affects which task group to wait for, Waiting on a wrong task group could result in a program hang.

rfcs/proposed/task_group_wait_single_task/README.md

akukanov · 2025-09-30T18:06:47Z

rfcs/proposed/task_group_wait_single_task/README.md

+An alternative approach to address this limitation is to implement a general mechanism within the scheduler that forces the thread to exit the
+bypass loop and spawn the returned task if further execution should not be continued (i.e., ``waiter.continue_execution()`` returns ``false``).
+
+## Alternative Implementation Approaches


What are benefits and downsides of the alternative approaches to the recommended one?

The main benefit of the recommended implementation approach is that waiting for completion is guaranteed to be implemented using a single r1::wait or r1::run_and_wait in case of transferring.

For both alternative approaches, we will need to switch to another waiting in case of transferring:

task_dynamic_state* state = comp_handle.get_dynamic_state(); r1::wait(state->get_wait_context()); while (state->was_transferred()) { state = state->get_new_completion_point(); r1::wait(state->get_wait_context()); }

With the recommended approach, in case of transferring the wait context pointer is migrating between tasks, hence we don't need to double check if we completion was transferred. If the wait context was released, the completion is guaranteed to happen (no matter of which "final" task in the transfer chain).

Another benefit is that the wait context is created only when the wait was requested (that is not true for the first alternative approach).

I will add more details on the benefits and downsides into the RFC.

Updated each section with the pros and cons.

So, are the alternative implementation approaches considered and rejected, or is a further discussion necessary?

I believe that the natural support for completion transferring in the proposed approach provides a strong rationale for its implementation over other alternatives. If reviewers see it differently, we can discuss further.

akukanov

It's a well-elaborated proposal. I still have some questions and suggestions, though.

rfcs/proposed/task_group_wait_single_task/README.md

akukanov · 2025-10-16T11:48:21Z

rfcs/proposed/task_group_wait_single_task/README.md

+An alternative approach to address this limitation is to implement a general mechanism within the scheduler that forces the thread to exit the
+bypass loop and spawn the returned task if further execution should not be continued (i.e., ``waiter.continue_execution()`` returns ``false``).
+
+## Alternative Implementation Approaches


So, are the alternative implementation approaches considered and rejected, or is a further discussion necessary?

Co-authored-by: Alexey Kukanov <alexey.kukanov@intel.com>

…ub.com/oneapi-src/oneTBB into dev/kboyarinov/rfc-tg-wait-single-task

rfcs/proposed/task_group_wait_single_task/README.md

vossmjp · 2026-01-07T23:32:55Z

rfcs/proposed/task_group_wait_single_task/README.md

+};
+
+class task_group {
+    task_status wait_task(task_completion_handle& comp_handle);


I can't remember if we previously discussed this but wait_task seems to me like a getter that returns the wait task. Shouldn't this be something like wait_for_task.

See #1862 (comment)
I am OK with wait_for_task

Agree, renamed to wait_for_task

vossmjp · 2026-01-07T23:33:51Z

rfcs/proposed/task_group_wait_single_task/README.md

+
+class task_group {
+    task_status wait_task(task_completion_handle& comp_handle);
+    task_status run_and_wait_task(task_handle&& handle);


Same issue here, why not run_and_wait_for_task

Co-authored-by: Alexey Kukanov <alexey.kukanov@intel.com>

Co-authored-by: Mike Voss <michaelj.voss@intel.com>

…ub.com/oneapi-src/oneTBB into dev/kboyarinov/rfc-tg-wait-single-task

…tg-wait-single-task

vossmjp · 2026-01-08T16:57:17Z

rfcs/proposed/task_group_wait_single_task/README.md

+
+// Defined in <oneapi/tbb/task_arena.h>
+class task_arena {
+    task_group_status wait_for(task_completion_handle& comp_handle);


Ok so, the task_arena::wait_for that receives a task completion handle is an overload of task_arena::wait_for that receives a task group. So a consistent name there. Do we want consistency with names in task_group, which now has task_group::wait_for_task and task_group::wait. Even though I recently suggested task_group::wait_for_task, would it be better as task_group::wait_for? And then task_group::run_and_wait_for?

As Alexey noted in #1862 (comment), run_and_wait_for and the existing run_and_wait appear quite similar, so using a more explicit name run_and_wait_for_task would help avoid misinterpretation. I agree with this suggestion.

Thanks, Mike! I also wanted to suggest such consistent naming.
In my opinion that this _for suffix is what helps to differentiate. Without that suffix, it waits for all the tasks inside an entity the method is called on - task_group::wait() without parameters signifies just that and helps to understand the semantics of run_and_wait. While the _for suffix indicates that it is the passed argument the method is going to wait for.
Besides becoming inconsistent with the task_arena, the structures like tg.wait_for_task(task) is too repetitive on the word task.

In my opinion that this _for suffix is what helps to differentiate.

Maybe for wait vs. wait_for, which also differ in their arguments, but not quite enough for run_and_wait vs. run_and_wait_for, which look the same except for the name.

Besides becoming inconsistent with the task_arena

Names in task_group and task_arena were never consistent :)
The former class has run, run_and_wait, and wait, while in the latter we have execute, enqueue, wait_for.

the structures like tg.wait_for_task(task) is too repetitive on the word task.

Remember the argument of it is not a task but a completion handle, and so is quite unlikely to be called just task; I would expect either some meaningful name or an abbreviation like tch.

I guess we will have to just disagree there and find the way to resolve that disagreement. I strongly prefer semantical clarity over consistency with a loosely related method of another class.

kboyarinov added 2 commits September 29, 2025 19:09

Add first version of the RFC

14b7574

Increase picture sizes, remove unnecessary assets

db90a80

kboyarinov requested review from akukanov, aleksei-fedotov, dnmokhov, isaevil and vossmjp September 29, 2025 16:16

kboyarinov added the RFC label Sep 29, 2025

vossmjp reviewed Sep 29, 2025

View reviewed changes

rfcs/proposed/task_group_wait_single_task/README.md Outdated Show resolved Hide resolved

vossmjp reviewed Sep 29, 2025

View reviewed changes

rfcs/proposed/task_group_wait_single_task/README.md Show resolved Hide resolved

vossmjp reviewed Sep 29, 2025

View reviewed changes

Address review comments

72b08f4

akukanov reviewed Sep 30, 2025

View reviewed changes

kboyarinov added 2 commits October 8, 2025 16:22

Start addressing feedback

fbb1e14

Address comments

9778371

akukanov reviewed Oct 16, 2025

View reviewed changes

kboyarinov and others added 8 commits October 20, 2025 17:36

Apply part of comments

ff2c12f

Update rfcs/proposed/task_group_wait_single_task/README.md

71806f1

Co-authored-by: Alexey Kukanov <alexey.kukanov@intel.com>

Update rfcs/proposed/task_group_wait_single_task/README.md

59e0c6b

Co-authored-by: Alexey Kukanov <alexey.kukanov@intel.com>

Update rfcs/proposed/task_group_wait_single_task/README.md

d46788d

Co-authored-by: Alexey Kukanov <alexey.kukanov@intel.com>

Update rfcs/proposed/task_group_wait_single_task/README.md

cfb7089

Co-authored-by: Alexey Kukanov <alexey.kukanov@intel.com>

Update rfcs/proposed/task_group_wait_single_task/README.md

b9e4862

Co-authored-by: Alexey Kukanov <alexey.kukanov@intel.com>

Merge branch 'dev/kboyarinov/rfc-tg-wait-single-task' of https://gith…

1e3975b

…ub.com/oneapi-src/oneTBB into dev/kboyarinov/rfc-tg-wait-single-task

Save progress

8c9158e

kboyarinov mentioned this pull request Oct 28, 2025

POC for Waiting for Individual tasks in task_group #1887

Open

4 tasks

kboyarinov added this to the 2022.3.0 milestone Dec 2, 2025

vossmjp reviewed Jan 7, 2026

View reviewed changes

rfcs/proposed/task_group_wait_single_task/README.md Outdated Show resolved Hide resolved

vossmjp reviewed Jan 7, 2026

View reviewed changes

Add missed open question

0d209ae

kboyarinov and others added 6 commits January 8, 2026 15:15

Update rfcs/proposed/task_group_wait_single_task/README.md

8d70040

Co-authored-by: Alexey Kukanov <alexey.kukanov@intel.com>

Update rfcs/proposed/task_group_wait_single_task/README.md

1cdbae3

Co-authored-by: Mike Voss <michaelj.voss@intel.com>

Merge branch 'dev/kboyarinov/rfc-tg-wait-single-task' of https://gith…

efcb59f

…ub.com/oneapi-src/oneTBB into dev/kboyarinov/rfc-tg-wait-single-task

wait_task->wait_for_task

2814c4d

Merge remote-tracking branch 'origin/master' into dev/kboyarinov/rfc-…

8e29965

…tg-wait-single-task

Merge remote-tracking branch 'origin/master' into dev/kboyarinov/rfc-…

a4900d8

…tg-wait-single-task

vossmjp reviewed Jan 8, 2026

View reviewed changes

Add another open question about another task_group_status

af50c06

		To address this, a new enum ``task_status`` is proposed to track the status of the awaited task. ``task_status::complete`` indicates that the tracked
		task was executed and completed, while ``task_status::canceled`` signifies that the task was not executed due to group cancellation.

Conversation

kboyarinov commented Sep 29, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description

Type of change

Tests

Documentation

Breaks backward compatibility

Notify the following users

Other information

Uh oh!

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

akukanov Oct 6, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

vossmjp Oct 13, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

akukanov Jan 9, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

akukanov Jan 13, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

akukanov Oct 14, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

akukanov Oct 16, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

akukanov left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

kboyarinov commented Sep 29, 2025 •

edited

Loading

akukanov Oct 6, 2025 •

edited

Loading

vossmjp Oct 13, 2025 •

edited

Loading

akukanov Jan 9, 2026 •

edited

Loading

akukanov Jan 13, 2026 •

edited

Loading

akukanov Oct 14, 2025 •

edited

Loading

akukanov Oct 16, 2025 •

edited

Loading

akukanov Oct 16, 2025 •

edited

Loading

vossmjp Jan 8, 2026 •

edited

Loading