Comparison of Uber Fare with NYC fare ! Plotting the Cumulative distribution function(CDF) !
First the intersection of all the Timezone 6,10,16,20 locations is taken called the Candidate Set
Next,we make a json file which stores like key is location and values is difference between the fare
uber_fare = (high_estimate +low_estimate)/2 - nyc_fare;
DO this for all services:
Example Result "40.690346_-73.960293_40.6904473_-73.9787192": { "diff\_black": 8.016, "diff\_family": 17.516, "diff\_pool": 7.516, "diff\_suv": 18.016, "diff\_x": 7.516, "diff\_xl": 11.516 }

  1. Next task is finding the cdf for all the services.

  2. DO this for each service of Uber.

  3. round off the differences and convert to int.

  4. Find the frequency of each difference. Sum it up.

  5. Divide the freq/sumofallfreq in the list.

  6. Find the cumulative sum yaxis[i] = yaxis[i] + yaxis[i-1]

  7. And now plot the line with

  8. Differnece in Fare (X axis)

  9. CDF (Y axis) (Ranging between 0 to 1)

  10. Doing this for the four Timezones Gives us four plots with 6 cdf lines each for each service of uber.

The Four plots are given below: