Is it possible to use alloca for C++ coroutines?

Question

C++ uses global operator new to allocate coroutines by default. But this potentially leaves a lot of performance on the floor compared to Rust, which can stack allocate them. This is disappointing because for most coroutine use this would be fine -- often you co_await a coroutine right when creating it, or co_await a join/combinator of it and several others immediately. You can override the operator new and operator delete for a promise type and create a custom allocator that does does strict FIFO allocation over some preallocated heap area, but it would still generally be better to reuse the thread's already existing, already hot in cache stack.

AFAICT it is impossible to use alloca on the fly for this -- any call to it in operator new would be freed when the operator function returns. You could preallocate a big chunk of space with alloca in some top level function and then define operator new for the promise type to allocate out of that region, but this is effectively the same as having the separate heap allocated area from a cache-hotness perspective (all of your coroutines using a separate special otherwise cold area instead of being intermingled with your regular calls using the real top of the stack).

Is there any way to make alloca work?

There is a related question here about whether you can call alloca inside a coroutine, but I am asking about using it to back the allocation of the coroutine itself (which necessarily happens outside it) before running it.

There is also this question that open endedly asks if stackless C++ coroutines are a problem, where some answers try to justify the design but doesn't mention alloca at all and doesn't address that Rust model is an existence proof for it being possible in principle to use stack allocations.

Is it possible to use alloca for C++ coroutines?

Answers (1)

Related Questions