Which format should you use to store this data in HDFS?
You want to perform analysis on a large collection of images. You want to store this data in HDFS and process it with MapReduce but you also want to give your data analysts and...
You want to perform analysis on a large collection of images. You want to store this data in HDFS and process it with MapReduce but you also want to give your data analysts and...
To process input key-value pairs, your mapper needs to lead a 512 MB data file in memory. What is the best way to accomplish this?A . Serialize the data file, insert in it the...
Examine the following Pig commands: Which one of the following statements is true?A . The SAMPLE command generates an "unexpected symbol" errorB . Each MapReduce task will terminate after executing for 0.2 minutesC ....
Given the following Hive command: INSERT OVERWRITE TABLE mytable SELECT * FROM myothertable; Which one of the following statements is true?A . The contents of myothertable are appended to mytableB . Any existing data...
Given a directory of files with the following structure: line number, tab character, string: Example: 1abialkjfjkaoasdfjksdlkjhqweroij 2kadfjhuwqounahagtnbvaswslmnbfgy 3kjfteiomndscxeqalkzhtopedkfsikj You want to send each line as one record to your Mapper. Which InputFormat should you...
In a MapReduce job with 500 map tasks, how many map task attempts will there be?A . It depends on the number of reduces in the job.B . Between 500 and 1000.C . At...
You have a directory named jobdata in HDFS that contains four files: _first.txt, second.txt, .third.txt and #data.txt. How many files will be processed by the FileInputFormat.setInputPaths () command when it's given a path object...
Which Hadoop component is responsible for managing the distributed file system metadata?A . NameNodeB . MetanodeC . DataNodeD . NameSpaceManagerView AnswerAnswer: A
How are keys and values presented and passed to the reducers during a standard sort and shuffle phase of MapReduce?A . Keys are presented to reducer in sorted order; values for a given key...
Your client application submits a MapReduce job to your Hadoop cluster. Identify the Hadoop daemon on which the Hadoop framework will look for an available slot schedule a MapReduce operation.A . TaskTrackerB . NameNodeC...