You need to run the same job many times with minor variations. Rather than hardcoding all job configuration options in your drive code, you’ve decided to have your Driver subclass org.apache.hadoop.conf.Configured and implement the org.apache.hadoop.util.Tool interface.
Indentify which invocation correctly passes.mapred.job.name with a value of Example to Hadoop?
A . hadoop “mapred.job.name=Example” MyDriver input output
B . hadoop MyDriver mapred.job.name=Example input output
C . hadoop MyDrive CD mapred.job.name=Example input output
D . hadoop setproperty mapred.job.name=Example MyDriver input output
E . hadoop setproperty (“mapred.job.name=Example”) MyDriver input output
Answer: C
Explanation:
Configure the property using the -D key=value notation:
-D mapred.job.name=’My Job’
You can list a whole bunch of options by calling the streaming jar with just the -info argument
Reference: Python hadoop streaming: Setting a job name
Leave a Reply