Reputation: 21
I am working with LangChain and FastAPI. Basically I am creating a StreamingResponse, which streams back JSON diffs. The diffs are created by comparing a JSON document to its previous version. The various attributes of the JSON are generated from information received from OpenAI through LangChain chains built with LCEL (https://python.langchain.com/docs/expression_language/), which are also streamed.
To come up with all the attributes, I run a function called run that triggers other (async) functions. These async functions add a JSON diff to an asyncio.Queue. The __anext__ method unloads the queue.
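For context, this is roughly what such a diff looks like with the jsonpatch library (the object shapes here are made up for illustration):

from jsonpatch import JsonPatch

previous = {"meta": {"title": "Draft"}, "sub_objects": []}
current = {"meta": {"title": "Final"}, "sub_objects": [{"name": "a"}]}

# from_diff produces an RFC 6902 patch describing how to turn
# `previous` into `current`, e.g.:
# [{'op': 'replace', 'path': '/meta/title', 'value': 'Final'},
#  {'op': 'add', 'path': '/sub_objects/0', 'value': {'name': 'a'}}]
patch = JsonPatch.from_diff(previous, current).patch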
The code is quite complex; I tried to condense the relevant parts here.
This is the code that creates the async generator. The run method of RunAgent actually builds the document and fills the queue.
from abc import abstractmethod
import asyncio

from jsonpatch import JsonPatch
from pydantic import BaseModel, ConfigDict

from src.llm.chain import meta_chain, sub_object_meta_chain, sub_object_details_chain
from src.schema import MyObject


class IterableAgent(BaseModel):
    # allow non-pydantic field types such as asyncio.Queue
    model_config = ConfigDict(arbitrary_types_allowed=True)

    queue: asyncio.Queue | None = None
    done: bool = False
    timeout: int = 10

    async def __anext__(self):
        if self.done and self.queue.empty():
            raise StopAsyncIteration
        try:
            return await asyncio.wait_for(self.queue.get(), timeout=self.timeout)
        except asyncio.TimeoutError:
            raise StopAsyncIteration

    def __aiter__(self):
        self.done = False
        self.queue = asyncio.Queue()
        asyncio.create_task(self.run())
        return self

    @abstractmethod
    async def run(self):
        raise NotImplementedError
class RunAgent(IterableAgent):
    object: MyObject | None = None
    _last_object: str = "{}"

    def _get_patch(self) -> list:
        new_object = self.object.model_dump_json()
        patch = JsonPatch.from_diff(self._last_object, new_object).patch
        self._last_object = new_object
        return patch

    def _continue_response(self) -> None:
        patch = self._get_patch()
        if patch:
            self.queue.put_nowait(patch)

    async def _add_meta(self):
        inputs = dict()  # Any inputs I need here for the chain...
        async for result in meta_chain.astream(inputs):
            self.object.meta = result
            self._continue_response()

    async def _add_sub_objects_meta(self):
        inputs = dict()  # Any inputs I need here for the chain...
        async for result in sub_object_meta_chain.astream(inputs):
            self.object.sub_objects = result
            self._continue_response()

    async def _add_sub_objects_details(self, so):
        inputs = dict(so=so)  # Any inputs I need here for the chain...
        async for result in sub_object_details_chain.astream(inputs):
            so.details = result
            self._continue_response()

    async def run(self):
        self.object = MyObject()
        await self._add_meta()
        await self._add_sub_objects_meta()
        cors = [self._add_sub_objects_details(so) for so in self.object.sub_objects]
        # I tried to create tasks instead of just having coroutines, no luck there...
        tasks = [asyncio.create_task(x) for x in cors]
        await asyncio.gather(*tasks)
        self.done = True
Then there is also a small piece of code in my FastAPI router that uses the generator to feed the StreamingResponse.
@router.post("/new")
async def new_document(req: MyRequest):
\# The agent is implemented to be an AsyncIterator
agent = RunAgent() # I would typically unpack MyRequest and feed the relevant data into the agent initialization
return StreamingResponse(agent, media_type="application/json")
Now this works quite well; however, sometimes it just stops halfway. The queue.get just times out. It seems as if tasks are just not started, while any tasks/coroutines that have started do complete. For instance, the metadata of the base object gets generated, but the metadata of the sub-objects does not. Or the details of one of the sub-objects are missing. I think there is some race condition, I just don't know where...
Upvotes: 2
Views: 329
Reputation: 110516
Python asyncio has a peculiar design choice that leads to a problem which is likely what you hit here: tasks created with .create_task are not hard-referenced by the event loop (it keeps only a weak reference to them). This sometimes does not show up in straightforward runs, or when there are few tasks, since weakly-referenced tasks are actually started on the first iteration of the asyncio loop core. But when there are lots of tasks (~2000 in some tests I made), or possibly fewer when task creation takes place in more than one place like in this code, tasks may just vanish without a trace.
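To make the failure mode concrete, here is an illustrative sketch (the work coroutine is made up, and whether an unreferenced task actually gets collected depends on garbage-collector timing, so treat this as the pattern rather than a guaranteed reproduction):

import asyncio

async def work(i):
    await asyncio.sleep(0.1)
    return i

async def main():
    # Risky: nothing holds a strong reference to these tasks, so the
    # garbage collector is allowed to destroy them before they run.
    for i in range(10):
        asyncio.create_task(work(i))

    # Safe: keep a strong reference until each task is done.
    background = set()
    for i in range(10):
        task = asyncio.create_task(work(i))
        background.add(task)
        task.add_done_callback(background.discard)

    await asyncio.sleep(1)  # give the tasks time to finish

asyncio.run(main())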
That is likely your problem, and the solution is simply to keep a reference to every task created by .create_task. A simple class-level set can hold those (or an instance-level one, if you have an __init__ method there):
...
class IterableAgent(BaseModel):
    queue: asyncio.Queue | None = None
    done: bool = False
    timeout: int = 10
    running_tasks: set[asyncio.Task] = set()
    ...

    def __aiter__(self):
        self.done = False
        self.queue = asyncio.Queue()
        task = asyncio.create_task(self.run())
        self.running_tasks.add(task)
        task.add_done_callback(self.running_tasks.discard)
        return self
    ...
(of course, use the same pattern if there are other calls to .create_task, such as the ones in RunAgent.run; see the sketch below).
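A sketch of that same pattern applied to the fan-out in the question's RunAgent.run, reusing the running_tasks set from above (gather already awaits the tasks, so this just guarantees strong references exist for their whole lifetime):

    async def run(self):
        self.object = MyObject()
        await self._add_meta()
        await self._add_sub_objects_meta()
        tasks = [
            asyncio.create_task(self._add_sub_objects_details(so))
            for so in self.object.sub_objects
        ]
        # Keep strong references so the tasks cannot be garbage-collected
        # before they finish.
        self.running_tasks.update(tasks)
        try:
            await asyncio.gather(*tasks)
        finally:
            self.running_tasks.difference_update(tasks)
        self.done = True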
As for where this behavior is documented, note the "important" note at: https://docs.python.org/3/library/asyncio-task.html#asyncio.create_task
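That same section of the docs also mentions asyncio.TaskGroup (Python 3.11+) as an alternative, since a task group holds strong references to its tasks and awaits them on exit; applied to the fan-out above, roughly:

        # Python 3.11+: TaskGroup holds strong references to its tasks
        # and implicitly awaits them all when the block exits.
        async with asyncio.TaskGroup() as tg:
            for so in self.object.sub_objects:
                tg.create_task(self._add_sub_objects_details(so))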
Upvotes: 2