user3693309

Reputation: 343

SparkSQL, Thrift Server and Tableau

I am wondering if there is a way to make a SparkSQL table registered in sqlContext directly visible to other processes, for example Tableau.

I did some research on the Thrift server, but I couldn't find a clear explanation of it. Is it middleware between Hive (the database) and the application (the client)? If so, do I need to write to a Hive table from my Spark program?

When I use Beeline to check the tables exposed by the Thrift server, there is a field called isTempTable. What does it mean? My guess is that it refers to a temporary table in the sqlContext of the Thrift server, because I read that the Thrift server is a Spark driver program and all cached tables are visible to multiple programs through it. My confusion here is: if it is a driver program, where are the workers?

To summarize,

  1. Where should I write my DataFrame, or the tables in sqlContext, to? Which method should I use (for example dataFrame.write.mode(SaveMode.Append).saveAsTable() — see the sketch after this list)?
  2. Are the default settings fine for the Thrift server, or are changes necessary?
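To make question 1 concrete, this is roughly the kind of write I mean (just a sketch, with a placeholder input path and table name, assuming sqlContext is a HiveContext):

import org.apache.spark.sql.SaveMode

// Placeholder DataFrame; in my program it comes from earlier transformations.
val dataFrame = sqlContext.read.parquet("/path/to/input")

// Write the DataFrame as a table in the metastore so that other
// processes (e.g. Tableau through the Thrift server) can see it.
dataFrame.write.mode(SaveMode.Append).saveAsTable("results")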

Thanks

Upvotes: 8

Views: 1066

Answers (1)

Ewan Leith

Reputation: 1665

I assume you've moved on by now, but for anyone who comes across this answer, the Thrift server is effectively a broker between a JDBC connection and SparkSQL.

Once you've got Thrift running (see the Spark docs for a basic intro), you connect to it over JDBC using the Hive JDBC drivers, and it in turn relays your SQL queries to Spark using a HiveContext.
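For example, a bare-bones JDBC connection from Scala looks roughly like this (a sketch only; the host, port, credentials and table name are placeholders, assuming the default unauthenticated Thrift setup on port 10000):

import java.sql.DriverManager

// Register the Hive JDBC driver (the same driver Beeline uses).
Class.forName("org.apache.hive.jdbc.HiveDriver")

// Connect to the Thrift server; placeholder host/port and empty credentials.
val conn = DriverManager.getConnection("jdbc:hive2://localhost:10000/default", "user", "")
val stmt = conn.createStatement()

// The SQL is relayed to Spark and executed via a HiveContext.
val rs = stmt.executeQuery("SELECT COUNT(*) FROM my_table")
while (rs.next()) println(rs.getLong(1))

conn.close()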

If you have a full Hive metastore up and running, you should be able to see the Hive tables in your JDBC client immediately, otherwise you can create tables on demand by running commands like this in your JDBC client:

CREATE TABLE data1 USING org.apache.spark.sql.parquet OPTIONS (path "/path/to/parquetfile");
CREATE TABLE data2 USING org.apache.spark.sql.json OPTIONS (path "/path/to/jsonfile");

Hope this helps a little.

Upvotes: 8
