As far as processing of data is concerned from the initial file only two columns were used the trip duration and the birth date. I created an age column for my analysis.