Reputation: 73
I am testing Spark-3.3.0-without-Hadoop with TPCDS, following spark-tpcds-datagen.
This Spark runs on my Hadoop-3.2 cluster.
The data was generated and put (hdfs dfs -put)
to hdfs://xxx/tpcds/data330
When I run:
./SPARK/bin/spark-submit \
  --master yarn \
  --deploy-mode client \
  --queue tpcdsqueue \
  --class org.apache.spark.sql.execution.benchmark.TPCDSQueryBenchmark \
  ~/tpcds/spark-sql_2.12-3.3.0-tests.jar \
  --data-location hdfs://xxx/tpcds/data330 --query-filter "q1"
it runs and returns the expected timing results:
Stopped after 2 iterations, 2691 ms
Java HotSpot(TM) 64-Bit Server VM 1.8.0_211-b12 on Linux 3.10.0-862.el7.x86_64
Intel(R) Xeon(R) CPU E5-2630 v4 @ 2.20GHz
TPCDS Snappy: Best Time(ms) Avg Time(ms) Stdev(ms) Rate(M/s) Per Row(ns) Relative
------------------------------------------------------------------------------------------------------------------------
q1 1167 1346 253 0.0 Infinity 1.0X
but it does not actually run on YARN, which means the following three settings
are being ignored:
--master yarn \
--deploy-mode client \
--queue tpcdsqueue \
Upvotes: 0
Views: 87
Reputation: 73
It turns out the source code hardcodes the master to local mode,
which overrides the --master option passed to spark-submit.
Changing getSparkSession
in org.apache.spark.sql.execution.benchmark.TPCDSQueryBenchmark
as below makes it work:
override def getSparkSession: SparkSession = {
  val conf = new SparkConf()
    // .setMaster(System.getProperty("spark.sql.test.master", "local[1]"))
    .setAppName("test-sql-context")
    // .set("spark.sql.parquet.compression.codec", "snappy")
    // .set("spark.sql.shuffle.partitions", System.getProperty("spark.sql.shuffle.partitions", "4"))
    // .set("spark.driver.memory", "3g")
    // .set("spark.executor.memory", "3g")
    // .set("spark.sql.autoBroadcastJoinThreshold", (20 * 1024 * 1024).toString)
    // .set("spark.sql.crossJoin.enabled", "true")
  SparkSession.builder.config(conf).getOrCreate()
}
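Commenting out setMaster works because an explicit SparkConf.setMaster takes precedence over the --master flag from spark-submit. A less invasive variant (just a sketch, assuming the same benchmark class) is to use SparkConf.setIfMissing, so "local[1]" is only applied when spark-submit has not already set spark.master:

```scala
import org.apache.spark.SparkConf
import org.apache.spark.sql.SparkSession

override def getSparkSession: SparkSession = {
  val conf = new SparkConf() // loadDefaults=true picks up spark-submit's properties
    // Only fall back to local[1] when spark.master was not set by spark-submit,
    // so --master yarn / --deploy-mode client / --queue are respected.
    .setIfMissing("spark.master",
      System.getProperty("spark.sql.test.master", "local[1]"))
    .setAppName("test-sql-context")
  SparkSession.builder.config(conf).getOrCreate()
}
```

Alternatively, since the original code reads the spark.sql.test.master system property, passing --driver-java-options "-Dspark.sql.test.master=yarn" to spark-submit may work in client mode without any code change (the property is read on the driver JVM); I have not verified that route.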
Upvotes: 0