dhs4402

Reputation: 143

Read Large Json in Python and take a slice as a sample

I'm dealing with a really large JSON file (6.5 GB); on my local machine, it's impossible to read it all at once. So I want to read a chunk as a test sample and write code based on this sample before running on the entire dataset.

import pandas as pd


file_dir = 'D://yelp_dataset/yelp_academic_dataset_review.json'

df_review_sample = pd.read_json(file_dir, lines=True, chunksize=1000)

I made the attempt above, but df_review_sample becomes a JsonReader object. Is there a way to show the first chunk as a DataFrame?

Upvotes: 2

Views: 5362

Answers (1)

Max

Reputation: 185

I ran into the same issue yesterday afternoon, and I finally understood what's going on.

Using the arguments lines=True and chunksize=X creates a reader that yields a specific number of lines at a time.

Then you have to loop over the reader to get each chunk.

Here is a piece of code to illustrate:

import pandas as pd

chunks = pd.read_json('../input/data.json', lines=True, chunksize=10000)
for chunk in chunks:
    print(chunk)  # each chunk is a regular DataFrame
    break         # stop after the first chunk

The reader produces a number of chunks according to the length of your JSON file (counted in lines). For example, if I have a 100,000-line JSON file and set chunksize=10000, I will get 10 chunks.

In the code above I added a break so that only the first chunk is printed, but if you remove it, you will get all 10 chunks one by one.
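If you only want the first chunk as a DataFrame (as the question asks), you can also call next() on the reader instead of looping, since JsonReader is an iterator. Here is a minimal self-contained sketch; it generates a small sample file (sample.json is a stand-in for the real dataset path):

```python
import json
import pandas as pd

# Generate a tiny line-delimited JSON file (stand-in for the real dataset).
with open("sample.json", "w") as f:
    for i in range(100):
        f.write(json.dumps({"review_id": i, "stars": i % 5 + 1}) + "\n")

# read_json with chunksize returns a JsonReader that yields DataFrames.
reader = pd.read_json("sample.json", lines=True, chunksize=10)

# next() pulls only the first chunk; the rest of the file is not loaded yet.
first_chunk = next(reader)
print(type(first_chunk).__name__)  # DataFrame
print(len(first_chunk))            # 10

# The remaining chunks are still available: 100 lines / 10 per chunk = 10 total.
remaining = sum(1 for _ in reader)
print(remaining + 1)               # 10
```

This way df_review_sample from the question would be a real DataFrame holding just the first 1000 lines.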

Upvotes: 2

Related Questions