Data Analysis with Python
Summary
In this course we will provide an extensive overview on various aspect of data manipulation and analysis with the help of the Python language. In the first part of the course we will start with an introduction of the Python programming language, with a particular stress on what can be achieved in terms of data analysis without using any external framework, in order to provide the basics for more advanced programming techniques. Then, we will focus on different Python frameworks (notably, numpy and pandas) to tackle larger datasets: from data cleaning (outliers detection, duplicates, and so on), missing value management (interpolation, substitution, removal) and basic data analysis (statistical and quantitative).
Dates: September-December 2025
Material
07-10-2025: Intro and Basic data analysis (download data ~255MB: expires on 20-10-2025)
14-10-2025: Data cleaning Notebook and Data (~100MB: expires on 27-10-2025)
21-10-2025: Data cleaning (complete)
22-10-2025: Data cleaning (continued) and Intro to ML