andy

Reputation: 4281

Why does a Python dict consume more memory when storing more than 550k keys?

I use Python's dict type to store a data file with more than 550k keys; the file is almost 29M on disk. However, after reading the data file, the memory used is more than 70M, which seems abnormal.

So, how does this happen?

Below is the function to read the data file.

def _update_internal_metrics(self, signum, _):
    """Read the dumped metrics file"""
    logger.relayindex('reload dumped file begins')
    dumped_metrics_file_path = os.path.join(settings.DATA_DIR,
                                            settings.DUMPED_METRICS_FILE)
    epoch = int(time.time())
    try:
        new_metrics = {}
        with open(dumped_metrics_file_path) as dumped_metrics_file:
            for line in dumped_metrics_file:
                line = line.strip()
                new_metrics[line] = epoch
    except Exception:
        if not signum:
            self._reload_dumped_file()
        logger.relayindex("Dumped metrics file does not exist or can"
                          "not be read. No update")
    else:
        settings["metrics"] = new_metrics

    instrumentation.increment('dumped.Reload')
    logger.relayindex('reload dumped file ends')

Upvotes: 2

Views: 76

Answers (1)

Karoly Horvath

Reputation: 96258

First of all, top isn't the right way to check this, as it tells you the memory consumption of the whole process. You can use getsizeof from the sys module:

sys.getsizeof(new_metrics)
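Note that getsizeof is shallow: for a dict it reports only the dict object and its slot table, not the key and value objects it references. A rough sketch for a fuller estimate (the helper name is just for illustration):

import sys

def rough_dict_size(d):
    """Shallow dict size plus the sizes of its keys and values.

    getsizeof() does not follow references, so the container and its
    contents are summed separately here; objects shared between entries
    (such as a single epoch int reused as every value) are counted once
    per entry, so treat the result as a rough upper estimate.
    """
    total = sys.getsizeof(d)
    for key, value in d.items():
        total += sys.getsizeof(key) + sys.getsizeof(value)
    return total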

Second, there is some overhead associated with both strings and hash tables:

sys.getsizeof('')

On my system this is 24 bytes of overhead, and that overhead is constant regardless of the string's length. With 550k keys that's about 13M of overhead.
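You can verify the constant per-string overhead on your own interpreter; the exact constant differs between Python versions and 32- vs 64-bit builds, but it does not grow with the string:

import sys

payload = 'x' * 100
overhead = sys.getsizeof(payload) - len(payload)   # e.g. 24 on the system above
print(overhead == sys.getsizeof(''))               # True: the overhead is constant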

Python tries to keep its hash tables from getting too dense, as that would kill the lookup time. AFAIK the CPython implementation uses a 2x growth factor, with 2^k table sizes. As your key count is just above a power of two (math.log(550000, 2) # 19.06), the table is relatively sparse, with 2 ** 20 = 1048576 slots. On your 64-bit system, with an 8-byte object pointer per string, that's an additional 8M of overhead. You also store integers, which weren't in the original file (another 8M), and each hash table slot also stores the cached hash value (another 8M). See the source of PyDictEntry.
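Following that reasoning, a quick back-of-the-envelope check of the table cost (this assumes the classic open-addressing dict layout with one hash, one key pointer and one value pointer per slot, as in the PyDictEntry struct mentioned above):

import math

keys = 550000
print(math.log(keys, 2))                  # ~19.07, just past 2**19

slots = 2 ** 20                           # next power of two the table grows to
entry_bytes = 3 * 8                       # hash + key pointer + value pointer, 64-bit
print(slots * entry_bytes / 2.0 ** 20)    # ~24 (MB) for the table: roughly 8M per field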

That's 66M total, and of course you need some space for the rest of your Python app. It all looks fine to me.
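For reference, the tally of the round figures above, in MB:

raw_key_data    = 29   # the characters read from the file
string_overhead = 13   # ~24 bytes of per-string bookkeeping for 550k keys
key_pointers    = 8    # 2**20 slots * 8-byte key pointer
value_pointers  = 8    # 2**20 slots * 8-byte value pointer (the epoch ints)
stored_hashes   = 8    # 2**20 slots * 8-byte cached hash value

print(raw_key_data + string_overhead + key_pointers
      + value_pointers + stored_hashes)   # 66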

Upvotes: 1
