What is InputSplit in hadoop?
InputSplit represents the data to be processed by an individual
Mapper. it presents a byte-oriented view on the input and is the responsibility of
RecordReader of the job to process this and present a record-oriented view.
In simple way we can say when a Hadoop job is run, it splits input files into chunks and assign each split to a mapper to process. This is called InputSplit.