In the project we will research the topic to identify all the relevant components and
measures for the topic. Create data governance, dimensional data model and normalized dimensional data model using MySQL
Based on the model created create a mock database and ETL pipeline for your topic, and create all necessary Python and SQL code
In the project we using Excel and MySQL to perform Data Cleaning,
Data Exploration and merge datasets to caculate nedded measures and contruct new datasets prepare for future analysis.
Then utillized Tableau to create several dashboards to produce visualization of the data based on datasets created determine the correlation between covid effects and various life factors and effect of vaccination.
In this project we will build an R Shiny web app to predict the TTC subway delay information using the forecasting methods like ARIMA model, ETS, STL and Tbats.
Where delay information about each lines is in time series variables format and create a selector/drop down for which data set to choose along with a a slider bar for the number of forecasts to produce
In this project we take raw housing data and transform it in MySQL change the format and delete missing and duplicate data to make it more usable for analysis.
In this project we inspect the website HTML source prasing and explore the structure of the website
using Beautiful soup library create function automate downloading and parsing the information saving as CSV, and send notification email when price change.
In this project we utillized Python to perform Data cleaning by change data format and filtering out data, look at both numeric and non numeric data determine which factor is more correlated with gross revenue than other.
In this research we explore the elementary ideas of simplificial complexes, boundary and
homology theory that all together lead to homology groups.