Given a directory of files with the following structure: line number, tab character, string:
Example:
1abialkjfjkaoasdfjksdlkjhqweroij
2kadfjhuwqounahagtnbvaswslmnbfgy
3kjfteiomndscxeqalkzhtopedkfsikj
You want to send each line as one record to your Mapper.
Which InputFormat should you use to complete the line: conf.setInputFormat (____.class) ; ?
A . SequenceFileAsTextInputFormat
B . SequenceFileInputFormat
C . KeyValueFileInputFormat
D . BDBInputFormat
Answer: C
Explanation:
http://stackoverflow.com/questions/9721754/how-to-parse-customwritable-from-text-in-hadoop