US Emissions Analysis Dashboard (Databricks)

Project Description
This project performs Exploratory Data Analysis (EDA) on United States emissions data using Databricks SQL. The goal is to explore, understand, and visualize emission patterns across states and categories, and to identify key contributors and trends through interactive dashboards.
The analysis emphasizes data-driven exploration over prediction, enabling stakeholders to quickly interpret emission distributions and gain comparative insights.

Project Creation Process
1. Data Understanding & Preparation
- Reviewed the structure and attributes of the US emissions dataset
- Validated data consistency and handled aggregations using SQL
2. Exploratory Data Analysis (EDA)
- Analyzed total emissions by year and state
- Compared emissions across sectors and categories
- Examined relationships between population and emissions
3. SQL-Based Analysis
- Used Spark SQL in Databricks for grouping, filtering, and aggregations
- Created derived metrics to support comparative analysis
4. Visualization & Dashboarding
- Designed an interactive Databricks SQL Dashboard
- Built charts, maps, and comparisons for intuitive exploration
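
The grouping and derived-metric work described in steps 2 and 3 might look roughly like the sketch below. The table name `us_emissions` and the columns `year`, `state`, `emissions`, and `population` are hypothetical placeholders, since the dataset schema is not listed here. The second query also assumes that each row carries the state's population for that year.

```sql
-- Hypothetical table and column names; adjust to the actual dataset schema.

-- Total emissions by year and state (step 2).
SELECT
  year,
  state,
  SUM(emissions) AS total_emissions
FROM us_emissions
GROUP BY year, state
ORDER BY year, total_emissions DESC;

-- Derived metric: per-capita emissions by year and state (step 3).
-- Assumes population is repeated on every row for a given state and year,
-- so MAX(population) recovers it; NULLIF guards against division by zero.
SELECT
  year,
  state,
  SUM(emissions) AS total_emissions,
  MAX(population) AS population,
  SUM(emissions) / NULLIF(MAX(population), 0) AS emissions_per_capita
FROM us_emissions
GROUP BY year, state
ORDER BY year, emissions_per_capita DESC;
```

Queries of this shape can back a dashboard tile directly, with the year exposed as a filter so viewers can compare states within a single period.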