user1482350
user1482350

Reputation: 43

Hadoop Map Reduce reference static objects

I have a static object in my map reduce job class that I want to initialize once (in the main method), then call a function on it in every mapping. So I have this object, MyObject that I declare as a variable:

static MyObject obj;

And in my main function, before I start the job I call:

obj = new MyObject();
obj.init();

And then in my map function I want to call:

obj.execute();

But for some reason I get a null pointer exception when I try this (it says obj is null). If I initialize it in my main function, shouldn't the mapper see it as initialized? Does the mapper see static variables?

Upvotes: 2

Views: 2200

Answers (3)

user1482350
user1482350

Reputation: 43

Since my object was really loading a library, I ended up using the distributed cache and just instantiating the object in the M/R methods.

Upvotes: 0

stinkymatt
stinkymatt

Reputation: 1678

Your main() doesn't get invoked on every node, it only runs where you start up the job. In order to have access to your static object, it needs to be initialized at the instantiation of the mapper. That way the initialization will happen on every node that runs a map task.

But there may be another way to do what you're trying to accomplish, so the question is, what does this static object do?

Upvotes: 3

banjara
banjara

Reputation: 3890

static object resides in memory. now your system is distributed one so object you had created is in memory of node on which your jobtracker is running not on other systems.

now you cannot pass object from job to mapper because config is written as xml, but there is a workaround, Serialize your object into JSON and then put it as string in your configuration and in mappers deserialize this json object

for job

job.getConfiguration().set("some key", "json string")

for mapper

Configuration conf = context.getConfiguration();
conf.get("some key");

Upvotes: 7

Related Questions