Earthquake-Data-Engineering-Project-with-Microsoft-Fabric

Project Overview

Learn to build an end to end data engineering and analysis pipelineutilising Microsoft Fabric’s Data Factory, Data Engineering, and Power BI experiences.

Ingesting Earthquake events data from usgs.

Technologies Used: Python, PySpark, Fabric (Data Engineering, Data Factory, Power BI)

Getting Started

To get started with this project, downalod the notebooks in the repository and follow the guidance provided in the YouTube tutorial.

Repository Contents

Worldwide Earthquake Events API - Bronze Layer Processing: This notebook focuses on ingesting raw earthquake data from the USGS API. It performs minimal processing to store data in its original format, serving as the foundational layer for further refinement.

Worldwide Earthquake Events API - Silver Layer Processing: This notebook enhances the data from the Bronze layer by cleaning, transforming, and consolidating the earthquake data. It prepares the data for more analytical processing.

Worldwide Earthquake Events API - Gold Layer Processing: In this final processing stage, the notebook refines the data to create business-ready datasets. These are optimized for high-value insights and are tailored for specific analytical purposes, such as reporting and visualization in tools like Power BI.

Data Attribute Definitions

id: A string identifier for each data record.

latitude: The latitude of the event, stored as a double.

longitude: The longitude of the event, also stored as a double.

elevation: The elevation at which the event occurred, expressed in meters, stored as a double.

title: A string representing the title or name of the event.

place_description: A string describing the location of the event.

sig: A bigint (large integer) representing the significance score of the event.

mag: A double indicating the magnitude of the earthquake.

magType: A string describing the type of magnitude scale used.

time: A timestamp marking the exact time of the event.

updated: A timestamp indicating the last update time for the event data.

Prerequisites

Microsoft Fabric Account.
Fabric Administrator (or access to individual with Admin account).
Familiarity with Python, Spark, and basic data engineering concepts.
Basic Power BI skills.

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
01 Worldwide Earthquake Events API - Bronze Layer Processing.ipynb		01 Worldwide Earthquake Events API - Bronze Layer Processing.ipynb
02 Worldwide Earthquake Events API - Silver Layer Processing.ipynb		02 Worldwide Earthquake Events API - Silver Layer Processing.ipynb
03 Worldwide Earthquake Events API - Gold Layer Processing.ipynb		03 Worldwide Earthquake Events API - Gold Layer Processing.ipynb
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Earthquake-Data-Engineering-Project-with-Microsoft-Fabric

Project Overview

Getting Started

Repository Contents

Data Attribute Definitions

Prerequisites

About

Releases

Packages

Languages

License

ADNANVL/Earthquake-Data-Engineering-Project-with-Microsoft-Fabric

Folders and files

Latest commit

History

Repository files navigation

Earthquake-Data-Engineering-Project-with-Microsoft-Fabric

Project Overview

Getting Started

Repository Contents

Data Attribute Definitions

Prerequisites

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages