Introduction
Welcome to the practical session webpage on Exploratory Data Analysis (EDA) and Unsupervised Learning! This session is designed to equip you with the essential skills and techniques to explore and understand your data before diving into more complex modeling tasks. By the end of this session, you will be adept at uncovering crucial insights hidden within the data and leveraging this knowledge to guide your analysis effectively. You will approach data analysis with a clear objective, applying a variety of techniques to understand, interpret, and make informed decisions based on the data. Additionally, you will be able to present your findings visually, making them accessible for both technical and general audiences.
Course Criteria
Criteria |
Percentage |
---|---|
Attendance | 10% |
Participation & quiz | 10% |
Midterm Exam or/and Project | 15%+15% |
Final Exam | 20% |
Final Project & Presentation | 30% |
Programming:
You are free you use your favorite programming language
Python
or …
TP Sections
Topic | TP | Solution | Remark |
---|---|---|---|
Univariate Analysis | TP1-Gapminder | TP1-solution | ✅ Completed |
Bivariate Analysis | TP2-Bivariate-Gapminder | TP2-solution | ✅ Completed |
Data Preprocessing | TP3-Preprocessing | TP3-solution | ✅ Completed |
Clustering | TP4-Clustering | TP4-solution | ✅ Completed |
Gaussian Mixture Models & EM Algorithm | TP5-GMM_EM | TP5-solution | ✅ Completed |
Principal Component Analysis (PCA) | TP6-PCA | TP6-Solution | ...Loading |
Midterms, Exams and Projects
In this section, you will find all the information related to the midterms, exams and projects including instructions, starting dates and the deadlines.
✏️ Project:
- Deadline for the report:
...Loading
. - Where to submit:
...Loading
- Your report should be in (your favorite) PDF format and include the following criteria:
- Members’ names & contributions.
- Clearly state each member’s contribution to the report.
- For example:
- Luffy: Introduction, Data Preprocessing
- Gojo: Exploratory Data Analysis, Model Development
- Naruto: Results and Evaluation, Conclusion
- Introduction:
- Clearly state the objectives and purpose of the analysis.
- Provide a brief descriptive statistics of the dataset.
- Data Preprocessing:
- Detail steps taken to clean and preprocess the data.
- Explain handling of missing values and outliers.
- Describe any feature engineering techniques applied.
- Exploratory Data Analysis (EDA):
- Include visualizations (charts, graphs) and descriptive statistics.
- Identify key patterns, trends, and anomalies in the data.
- Unsupervised/supervised Tasks:
- Exploring structures or connections of the data according the flow of your analysis.
- Provide details on clustering methods or model training, validation, and testing processes.
- Include any choice of parameters of unsupervised tasks or hyperparameter tuning and model optimizations.
- Results and Evaluation:
- Present the resulting performance (e.g., obtained structures or accuracy, precision, recall, F1-score, RMSE).
- Compare performance across different methods if multiple were used.
- Highlight any insights or conclusions drawn from the implemented methods/models.
- Conclusion:
- Summarize key findings and their implications.
- Discuss any limitations of the analysis.
- Suggest potential areas for future research or improvement.
- References:
- Cite all sources of data and any external libraries or tools used.
- Include academic references for any theoretical concepts or methods applied.
- Appendix (Optional):
- Provide additional figures, tables, or code snippets that support the analysis.
- Members’ names & contributions.
- Presentation:
- A possible dates:
...Loading
.
- A possible dates:
Resources and Further Reading
Here, you will find additional resources, including books, research papers, and online courses, to further your understanding of Exploratory Data Analysis and Unsupervised Learning.
📚 You will find these books/links helpful…