What is InputSplit in hadoop?

January 10, 2016 Gyantoday Hadoop interview questions Leave a comment

InputSplit represents the data to be processed by an individual Mapper. it presents a byte-oriented view on the input and is the responsibility of RecordReader of the job to process this and present a record-oriented view.

In simple way we can say when a Hadoop job is run, it splits input files into chunks and assign each split to a mapper to process. This is called InputSplit.

This site uses Akismet to reduce spam. Learn how your comment data is processed.

Share this:

Leave a Reply Cancel reply

Leave a Reply