Hortonworks Oozie Spark Action - NullPointerException

Question

I am running HDP 2.5.3 with Oozie 4.2.0. The Spark action is set to run in yarn-client mode. The Spark job reads data from a Hive table, processes it, and stores the result in HDFS. When I submit the Spark application through the Spark action, I get a NullPointerException.

workflow.xml


   
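<workflow-app name="Spark_Test" xmlns="...">
   <global>
      <job-tracker>${job_tracker}</job-tracker>
      <name-node>${name_node}</name-node>
   </global>
   <credentials>
      <credential name="..." type="...">
         <property>
            <name>hive2.jdbc.url</name>
            <value>${hive_beeline_server}</value>
         </property>
         <property>
            <name>hive2.server.principal</name>
            <value>${hive_kerberos_principal}</value>
         </property>
      </credential>
   </credentials>
   <start to="..."/>
   <action name="SparkTest">
      <spark xmlns="...">
         <job-tracker>${job_tracker}</job-tracker>
         <name-node>${name_node}</name-node>
         <master>yarn-client</master>
         <name>Spark Hive Example</name>
         <class>com.fbr.genjson.exec.GenExecJson</class>
         <jar>${jarPath}/fedebomrpt_genjson.jar</jar>
         <spark-opts>--jars /usr/hdp/current/spark-client/lib/datanucleus-api-jdo-3.2.6.jar,/usr/hdp/current/spark-client/lib/datanucleus-rdbms-3.2.9.jar,/usr/hdp/current/spark-client/lib/datanucleus-core-3.2.10.jar --files /etc/hive/conf/hive-site.xml --conf spark.sql.hive.convertMetastoreOrc=false --driver-memory 2g --executor-memory 16g --executor-cores 4 --conf spark.ui.port=5051 --queue fbr</spark-opts>
         <arg>${arg1}</arg>
         <arg>${arg2}</arg>
      </spark>
      <ok to="..."/>
      <error to="..."/>
   </action>
   <kill name="...">
      <message>Spark Java PatentCitation failed, error message[${wf:errorMessage(wf:lastErrorNode())}]</message>
   </kill>
   <end name="..."/>
</workflow-app>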

Exception:

SERVER[xxx.hpc.xx.com] USER[prxtcbrd] GROUP[-] TOKEN[] APP[Spark_Test] JOB[0004629-170625082345353-oozie-oozi-W] ACTION[0004629-170625082345353-oozie-oozi-W@SparkTest] Error starting action [SparkTest]. ErrorType [ERROR], ErrorCode [NullPointerException], Message [NullPointerException: null]
org.apache.oozie.action.ActionExecutorException: NullPointerException: null
    at org.apache.oozie.action.ActionExecutor.convertException(ActionExecutor.java:446)
    at org.apache.oozie.action.hadoop.JavaActionExecutor.submitLauncher(JavaActionExecutor.java:1202)
    at org.apache.oozie.action.hadoop.JavaActionExecutor.start(JavaActionExecutor.java:1373)
    at org.apache.oozie.command.wf.ActionStartXCommand.execute(ActionStartXCommand.java:232)
    at org.apache.oozie.command.wf.ActionStartXCommand.execute(ActionStartXCommand.java:63)
    at org.apache.oozie.command.XCommand.call(XCommand.java:287)
    at org.apache.oozie.service.CallableQueueService$CompositeCallable.call(CallableQueueService.java:331)
    at org.apache.oozie.service.CallableQueueService$CompositeCallable.call(CallableQueueService.java:260)
    at java.util.concurrent.FutureTask.run(FutureTask.java:266)
    at org.apache.oozie.service.CallableQueueService$CallableWrapper.run(CallableQueueService.java:178)
    at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1142)
    at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:617)
    at java.lang.Thread.run(Thread.java:745)
Caused by: java.lang.NullPointerException
    at org.apache.oozie.action.hadoop.SparkActionExecutor.setupActionConf(SparkActionExecutor.java:85)
    at org.apache.oozie.action.hadoop.JavaActionExecutor.submitLauncher(JavaActionExecutor.java:1091)
    ... 11 more

I don't know where I am going wrong. Do I need to add any config XML files other than hive-site.xml?

Chetan Tayade · Accepted Answer

In your example you pass the jars and files (hive-site.xml) explicitly. I don't think that is necessary, because Oozie already provides them. Can you try the Spark action below? I think it might solve your problem.


    
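<action name="...">
    <spark xmlns="...">
        <job-tracker>${jobTracker}</job-tracker>
        <name-node>${nameNode}</name-node>
        <configuration>
            <property>
                <name>mapred.compress.map.output</name>
                <value>true</value>
            </property>
            <property>
                <name>mapred.job.queue.name</name>
                <value>${queueName}</value>
            </property>
        </configuration>
        <master>yarn</master>
        <mode>cluster</mode>
        <name>Spark Hive Example</name>
        <class>com.fbr.genjson.exec.GenExecJson</class>
        <jar>${jarPath}/fedebomrpt_genjson.jar</jar>
        <spark-opts>--queue queue_name --executor-memory 28G --num-executors 70 --executor-cores 5</spark-opts>
    </spark>
    <ok to="..."/>
    <error to="..."/>
</action>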

Also set the following Oozie properties for the job. They belong in the job.properties file submitted with the workflow rather than in workflow.xml itself:

oozie.use.system.libpath=true
oozie.libpath=${jarPath}

Make sure all of your own libraries and files are placed in the directory that oozie.libpath points to (${jarPath}).
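
For illustration, the two properties above might sit in a job.properties file along these lines. This is only a minimal sketch, not the poster's actual file: the host names, ports, and HDFS directories are hypothetical placeholders, the queue name is taken from the --queue option in the question, and the variable names (nameNode, jobTracker, queueName, jarPath) simply mirror the ones referenced in the Spark action above.

```properties
# Hypothetical cluster endpoints; replace with the values for your cluster.
nameNode=hdfs://namenode.example.com:8020
jobTracker=resourcemanager.example.com:8050

# YARN queue; "fbr" is the queue named in the question's --queue option.
queueName=fbr

# Hypothetical HDFS directory holding the application jar and its dependencies.
jarPath=${nameNode}/user/${user.name}/apps/spark-hive/lib

# Make the Oozie ShareLib available to the action.
oozie.use.system.libpath=true

# Extra HDFS directory whose jars are added to the action's classpath.
oozie.libpath=${jarPath}

# Hypothetical HDFS directory that contains workflow.xml.
oozie.wf.application.path=${nameNode}/user/${user.name}/apps/spark-hive
```

The idea behind this layout is that dependencies are resolved from the ShareLib and from ${jarPath} in HDFS, so the action does not need to point at jar or config files on a node's local filesystem.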
