Python Pandas: How do I read only every nth line of a CSV file?

Question

The data file is so large that I want to read it only at certain intervals, to reduce parsing time. I'm using pandas.read_csv. How can I read just one line out of every n lines?

Pablo C · Accepted Answer

Try ignoring rows by their indices:

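import pandas as pd

n = 5
# Skip every line whose index is not a multiple of n; line 0 (the header) is kept.
skip_func = lambda x: x % n != 0
df = pd.read_csv("data.csv", skiprows=skip_func)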

When skiprows is a callable, pandas.read_csv calls it with each row's index (0-based, counting the header line as index 0) and skips every row for which it returns True. Here the header survives because 0 % n == 0.
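To see which rows actually survive, here is a small self-contained sketch that runs the same idea on an in-memory CSV instead of data.csv. The helper name read_every_nth and the sample data are made up for illustration; only pd.read_csv and its skiprows parameter come from the answer above.

```python
import io

import pandas as pd


def read_every_nth(source, n):
    """Read a CSV, keeping line 0 (the header) and every n-th line after it."""
    # The callable receives the 0-based line index and returns True to skip.
    return pd.read_csv(source, skiprows=lambda i: i % n != 0)


# Hypothetical sample: a header line plus 20 data lines holding 1..20,
# so the value on each line equals that line's index in the file.
csv_text = "value\n" + "\n".join(str(i) for i in range(1, 21)) + "\n"

df = read_every_nth(io.StringIO(csv_text), 5)
print(df["value"].tolist())  # [5, 10, 15, 20]
```

Note that with this predicate the first data row kept is the one at line index n, not the first data row of the file. If you would rather keep the first data row and every nth row after it, a predicate such as lambda i: i > 0 and (i - 1) % n != 0 should do that while still keeping the header.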
