As part of this topic, we will see how to filter data in an RDD.
- filter is the API used to filter data from an input RDD
- It takes a function, usually an anonymous function, that returns true or false for each element
- Every element for which the function returns true is copied to the output RDD; all other elements are dropped
- The output RDD is a subset of the input RDD: it has the same number of elements or fewer, and never any element that was not in the input
- filter does not modify records; each record in the output RDD is unchanged from the input. Transforming records is a separate step, done with an API such as map.
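
The points above can be illustrated with a minimal sketch. It models the behaviour of filter on a plain Python list so that it runs without a Spark installation; the sample records, the comma-separated layout, and the names `orders` and `is_closed` are assumptions made for illustration only. With PySpark, the corresponding call on an RDD would be `orders_rdd.filter(lambda rec: ...)`, where `orders_rdd` is likewise a hypothetical name.

```python
# Plain-Python model of filter semantics (no Spark required).
# Hypothetical sample records: order_id,order_date,customer_id,status
orders = [
    "1,2013-07-25,11599,CLOSED",
    "2,2013-07-25,256,PENDING_PAYMENT",
    "3,2013-07-25,12111,COMPLETE",
    "4,2013-07-25,8827,CLOSED",
]

def is_closed(record):
    # Predicate: returns True or False for a single record.
    return record.split(",")[3] == "CLOSED"

# Keep only the records for which the predicate returns True.
# On a PySpark RDD this would be: orders_rdd.filter(is_closed)
closed_orders = list(filter(is_closed, orders))

for record in closed_orders:
    print(record)
```

The records that pass the predicate come through unchanged, and the result never contains more elements than the input. An anonymous function works the same way, for example `filter(lambda rec: rec.split(",")[3] == "CLOSED", orders)`.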