Reputation: 1088
I'm trying to use turbodbc with Pyarrow support on Databricks - runtime 7.2. Turbodbc works without the pyarrow support well on the same same instance. From Databricks 7.2 release page it says that Pyarrow is already which I've verified to be true. It comes with 0.15.1.
However, when i try running turbodbc with pyarrow, I'm thrown the below error.
This installation of turbodbc does not support Apache Arrow extensions. Please install the
pyarrow
package. If you have built turbodbc from source, you may also need to reinstall turbodbc to compile the extensions.
The databricks is bases on Ubuntu 18.04. I've made sure that the init script installs
libboost-all-dev
unixodbc-dev
python-dev
I also tried specifying pyarrow again in the libraries list so that it gets installed via pip, but no luck.
Any help is greatly appreciated.
Update 1:
Tried creating an ML cluster where python package installation happens with conda. I'm still getting the same error.
Upvotes: 0
Views: 534