Reputation: 221
I have very large data of size (1 x 23750811). I would like to visualise this data in histogram-Matlab.
As the data is very large, I am getting only a single dot in my plot. But I could visualise them separately, the first 1/4th of data and so on.
Any suggestion to visualise the entire data in a single plot at once.
Thanks !
Upvotes: 1
Views: 391
Reputation: 1413
Loading all your data into MatLab is inefficient; you can try using DuckDB; it allows you to use SQL to query very large datasets in several formats like CSV or Parquet; you can pre-compute the bins and heights, then export them and plot them using matlab.
This is a snippet you can use:
select
floor(column/bin_size)*bin_size,
count(*) as count
from "path/to/file.csv"
group by 1
order by 1;
Alternatively, you can try sampling your data.
Upvotes: 1