Find the 10 users who have traveled the least distance. Output their id, name, and total distance traveled.
Problem Difficulty Level: Medium
Data Structure
ride_log
- id
- user_id
- travel
user
- id
- name
Data for the ride_log and user tables
In Spark, we will solve this problem in two ways:
- Using PySpark Functions
- Using Spark SQL
Use the notebook below for the solution.
In MySQL, we will load the data from CSV using MySQL's import functionality and then solve the problem.
Output Query
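A query for this step might look like the sketch below. To keep it self-contained, it runs through Python's built-in sqlite3 module as a stand-in for MySQL, with made-up sample rows in place of the imported CSV data; the `total_distance` alias and the inner join are assumptions. The SELECT statement itself uses only syntax that MySQL also accepts.

```python
import sqlite3

# In-memory SQLite database standing in for MySQL (assumption for the sketch).
conn = sqlite3.connect(":memory:")
conn.executescript(
    """
    CREATE TABLE `user` (id INTEGER PRIMARY KEY, name TEXT);
    CREATE TABLE ride_log (id INTEGER PRIMARY KEY, user_id INTEGER, travel INTEGER);
    -- Made-up sample rows; the post loads the real rows from CSV.
    INSERT INTO `user` VALUES (1, 'Alice'), (2, 'Bob'), (3, 'Cara');
    INSERT INTO ride_log VALUES (1, 1, 10), (2, 2, 5), (3, 1, 7), (4, 3, 2);
    """
)

# Sum travel per user, sort ascending, keep the first 10 rows.
QUERY = """
    SELECT u.id, u.name, SUM(r.travel) AS total_distance
    FROM `user` AS u
    JOIN ride_log AS r ON r.user_id = u.id
    GROUP BY u.id, u.name
    ORDER BY total_distance ASC, u.id ASC
    LIMIT 10
"""

rows = conn.execute(QUERY).fetchall()
for row in rows:
    print(row)
```

The secondary sort on `u.id` is only there to make ties deterministic; it can be dropped if tie order does not matter.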
Please also follow the blog below to understand this problem.
