MY PROJECT: Uber DE Project ETL Pipeline

# Watch Youtube video for more detail understanding


# My Links
> DE Project : GitHub Link
> Looker Studio: Report Link

# Reference: Darshil Parmar Youtube Links
>Youtube Video Of Reference : Video (Project By Darshil Parmar)
    > Darshil Parmar GitHub Link : GitHub

    Introduction

    The goal of this project is to perform data analytics on Uber data using various tools and technologies, including GCP Storage, Python, Compute Instance, Mage Data Pipeline Tool, BigQuery, and Looker Studio.


    Architecture


    Technology Used

    • Programming Language - Python

    Google Cloud Platform

    1. Google Storage
    2. Compute Instance
    3. BigQuery
    4. Looker Studio

    Modern Data Pipeine Tool - https://www.mage.ai/


    Dataset Used

    TLC Trip Record Data Yellow and green taxi trip records include fields capturing pick-up and drop-off dates/times, pick-up and drop-off locations, trip distances, itemized fares, rate types, payment types, and driver-reported passenger counts.

    More info about dataset can be found here:

    1. Website - https://www.nyc.gov/site/tlc/about/tlc-trip-record-data.page
    2. Data Dictionary - https://www.nyc.gov/assets/tlc/downloads/pdf/data_dictionary_trip_records_yellow.pdf

    Data Model




    # Project Steps : 

    1. Dataset (uber.csv)
    2. We built Data model (Fact and dimensional format)
    3. Tranformation Code (python)
    4. Upload (Cloud Instance)
    5. Installed Mage UI (Pipeline Tool)
      • Data Exporter => Exported Data to Google BIQ QUERY
    6. Created Final Dashboard (Looker Studio)

    Comments

    Popular posts from this blog

    MyCertificate: AI Online WorkShop