
Analyzing the NYC Subway dataset

I am currently following Udacity's data analyst course, which is really interesting. I have done some data analysis with R, but not much with Python. Along the way I found out about Anaconda, which offers some great packages such as numpy, pandas (and pandasql) and scipy. In project 2, the goal was to analyse NYC subway ridership behaviour, and in particular to answer the question of whether more people take the subway when it is raining. My analysis can be found on GitHub.
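To give an idea of what such a comparison can look like, here is a minimal sketch using pandas and scipy. It is not the code from my project: the column names entries_hourly and rain, the helper compare_ridership and the demo data are all made up for illustration, and the Mann-Whitney U test is just one reasonable choice when the ridership numbers are not normally distributed.

```python
import numpy as np
import pandas as pd
from scipy import stats


def compare_ridership(df, entries_col="entries_hourly", rain_col="rain"):
    """Compare hourly entries between rainy and dry observations.

    Column names are hypothetical; adjust them to the real dataset.
    """
    rainy = df.loc[df[rain_col] == 1, entries_col]
    dry = df.loc[df[rain_col] == 0, entries_col]
    # Two-sided Mann-Whitney U test: does not assume normally distributed data
    u_stat, p_value = stats.mannwhitneyu(rainy, dry, alternative="two-sided")
    return {
        "mean_rain": rainy.mean(),
        "mean_dry": dry.mean(),
        "u_stat": u_stat,
        "p_value": p_value,
    }


# Synthetic data, only there to exercise the function (not real subway data)
rng = np.random.default_rng(0)
demo = pd.DataFrame({
    "rain": rng.integers(0, 2, size=500),
    "entries_hourly": rng.exponential(scale=1000.0, size=500),
})
result = compare_ridership(demo)
print(result)
```

A small p-value would suggest that the two groups differ, but it says nothing about the size of the difference, so it is worth looking at the two means as well.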

[Figures: histogram, rain vs. no rain; residual plot; average number of entries per hour]
