Reputation: 1890
I am on intel dev cloud and using Intel OneAPI. This is my code till now:
# first block of jupyter notebook
import modin.pandas as pd
# second block of jupyter notebook
df = pd.read_csv('dataset/dataset.csv')
df.head()
# output of second block
UserWarning: Ray execution environment not yet initialized. Initializing...
To remove this warning, run the following python code before doing dataframe operations:
import ray
ray.init()
2023-09-01 12:00:16,471 INFO worker.py:1636 -- Started a local Ray instance.
The first block is running properly but, when I am reading my dataset, it is giving me this warning and server unavailable error.
If I use import pandas as pd
, the code is running fine, but modin.pandas
is not working. My dataset is ~ 1 GB csv file. Why is this happening???
How to Reproduce this?
System Information
Upvotes: 0
Views: 171
Reputation: 11
Installing Ray 2.6.1 by running
pip uninstall ray
pip install ray==2.6.1
and then re-exporting the ipykernel to run the notebook which has
import ray
ray.shutdown()
ray.init(_memory=16000 * 1024 * 1024, object_store_memory=500 * 1024 * 1024,_driver_object_store_memory=500 * 1024 * 1024)
as first block of the notebook and then desired code in the next block
import modin.pandas as pd
# third block of jupyter notebook
df = pd.read_csv('dataset/dataset.csv')
df.head()
should help to avoid the issue
You can also check Intel DevCloud support for the discussion
Upvotes: -1