Lecture 3: Structured Data Processing (Part I)
(Last updated: Feb 2, 2024)
This lecture gives a tutorial about the Smell Pittsburgh project using a Jupyter Notebook. The learning goals can be found on this link.
Preparation
Do the preparation for the structured data processing module.
Materials
Below is the link to the online notebook:
Follow the steps below to set up the notebook:
- Have the JupyterLab environment ready.
 - Download the structured data processing module from the GitHub repository. Or you can also download the zip file from this link.
 - Open the notebook file (
docs/tutorial-structured-data.ipynb) and start working on the tasks. 
Additional Resources
Below are the videos that students found useful in understanding more about the course materials:
- 
    
[Pandas Resample pd.DataFrame.resample()](https://www.youtube.com/watch?v=l4dvMiSDBzs)  - Python Rolling Window Functions explained in 4 minutes
 
Below are websites related to this tutorial:
- The Smell Pittsburgh Dataset
 - The scikit-learn API
 - The seaborn API
 - The plotly API
 - The pandas API
 - The numpy API
 
Below are papers related to this tutorial: