user8162
user8162

Reputation: 221

Histogram plot for very large data

I have very large data of size (1 x 23750811). I would like to visualise this data in histogram-Matlab.

As the data is very large, I am getting only a single dot in my plot. But I could visualise them separately, the first 1/4th of data and so on.

Any suggestion to visualise the entire data in a single plot at once.

Thanks !

Upvotes: 1

Views: 391

Answers (1)

Eduardo
Eduardo

Reputation: 1413

Loading all your data into MatLab is inefficient; you can try using DuckDB; it allows you to use SQL to query very large datasets in several formats like CSV or Parquet; you can pre-compute the bins and heights, then export them and plot them using matlab.

This is a snippet you can use:

select
  floor(column/bin_size)*bin_size,
  count(*) as count
from "path/to/file.csv"
group by 1
order by 1;

Alternatively, you can try sampling your data.

Upvotes: 1

Related Questions