Python multithreading, thread suddenly stop

Question

My program is supposed to get a dataset, split into chunks and do a couple of calculations per chunk.

The program runs great without any type of parallel calculations and also when I use multiprocess per chunk, and in the chunk multiprocessing per calculation.

But when I tried using multiprocessing per chunk and inside each chunk use multithreading per calculation type, I can see that sometimes not all of threads completed before the program exit. (with exit code 0)

The code is not really complicated, I do use aws clients to use S3 and query from Athena, but the fact that everything works great with only multiprocessing leads me to believe that its not something with the connections.

Working code snippet:

ChunkDataset(AthenaDataset):
    def _run_per_chunk(self, chunks: List[pd.DataFrame]) -> List[str]:
        created_tables_names_futures = []
        with ProcessPoolExecutor(max_workers=self.config[NUM_OF_WORKERS]) as process_executor:
            for chunk_num, chunk in enumerate(chunks):
                created_tables_names_futures.append(process_executor.submit(self.get_athena_datasets, chunk_num))

class AthenaDataset:
    def get_athena_datasets(self, chunk_num: int) -> str:
        historical_table_name = self._get_historical_table_name(chunk_num)
        self._get_datasets(historical_table_name, chunk_num)

    def _get_datasets(self, historical_table_name: str, chunk_num: int):
        with ProcessPoolExecutor(max_workers=len(self.config['data_types'])) as process_executor:
            for dataset_type in self.config[DATASET_TYPES]:
                process_executor.submit(self._get_dataset, historical_table_name, dataset_type, chunk_num)

When I switch the ProcessPoolExecutor in AthenaDataset to ThreadPoolExecutor its failing.

Does someone has any idea on why it something like that could happen? (the function _get_dataset saves the results to a csv file I don't need to get .result())

Python multithreading, thread suddenly stop

Answers (1)

Related Questions