'Tis true: I have 'jumped the thread,' skipping from item 1 to item 7. I have every intention of completing thread items #2-6 with 10 months' worth of insights gained while pursuing my Professional Data Science Certification from IBM-Coursera.
In the mean time, however, I have an assignment to complete. See below - second week, item number #2, last bullet point. In the interest of completing my course of study, I am hereby producing a blog post containing the deliverables for my final capstone project. When I have cleared that hurdle, I'll come back and fill in the blanks.
Coursera Capstone Project
This is the final project for my Coursera-IBM Professional Data Science certificate program.
Here's the assignment. (Quoted directly from the Class Web site):
Now that you have been equipped with the skills and the tools to use location data to explore a geographical location, over the course of two weeks, you will have the opportunity to be as creative as you want and come up with an idea to leverage the Foursquare location data to explore or compare neighborhoods or cities of your choice or to come up with a problem that you can use the Foursquare location data to solve. If you cannot think of an idea or a problem, here are some ideas to get you started:
- In Module 3, we explored New York City and the city of Toronto and segmented and clustered their neighborhoods. Both cities are very diverse and are the financial capitals of their respective countries. One interesting idea would be to compare the neighborhoods of the two cities and determine how similar or dissimilar they are. Is New York City more like Toronto or Paris or some other multicultural city? I will leave it to you to refine this idea.
- In a city of your choice, if someone is looking to open a restaurant, where would you recommend that they open it? Similarly, if a contractor is trying to start their own business, where would you recommend that they setup their office?
These are just a couple of many ideas and problems that can be solved using location data in addition to other datasets. No matter what you decide to do, make sure to provide sufficient justification of why you think what you want to do or solve is important and why would a client or a group of people be interested in your project.
Review criteria
This capstone project will be graded by your peers. This capstone project is worth 70% of your total grade. The project will be completed over the course of 2 weeks. Week 1 submissions will be worth 30% whereas week 2 submissions will be worth 40% of your total grade.
For this week, you will required to submit the following:
- A description of the problem and a discussion of the background. (15 marks)
- A description of the data and how it will be used to solve the problem. (15 marks)
For the second week, the final deliverables of the project will be:
- A link to your Notebook on your Github repository, showing your code. (15 marks)
- A full report consisting of all of the following components (15 marks):
- Introduction where you discuss the business problem and who would be interested in this project.
- Data where you describe the data that will be used to solve the problem and the source of the data.
- Methodology section which represents the main component of the report where you discuss and describe any exploratory data analysis that you did, any inferential statistical testing that you performed, if any, and what machine learnings were used and why.
- Results section where you discuss the results.
- Discussion section where you discuss any observations you noted and any recommendations you can make based on the results.
- Conclusion section where you conclude the report.
- Your choice of a presentation or blogpost. (10 marks)
And off we go!
Here's a link to the blog post I submitted as part of my assignment.