Lecture 01 (Oct 2nd '12): Introduction to Information Management and Data Mining (slides in LibreOffice and (slides in pdf format);
motivation and examples; main issues in data mining.
Lecture 02 (Oct 3rd '12): Know your data (slides in LibreOffice and slides in pdf format);
data objects and attribute types, basic statistical descriptions of data, measuring data proximity.
Lecture 03 (Oct 9th '12): Data preprocessing - part 1 -
(slides in LibreOffice and slides in pdf format);
the quality of data; data cleaning, integration and reduction.
Lecture 06 (Oct 25th '12): Principal Component Analysis
(slides in LibreOffice and slides in pdf format);
Exercise session 02: PCA in practice using R ( dataset for Lab experiments).
Lecture 08 (Nov 7th '12): Principal Component Analysis
(slides in LibreOffice and slides in pdf format);
Exercise session 03: understanding PCA results in R, components selection criteria ( forest fires dataset for Lab experiments).
Lecture 12 (Nov 21st '12): Excercise session 05 - Feature Selection Algorithms in R; implementation of Sequential (Forward and Backward), Floating Sequential and FOCUS Algorithms; empirical and experimental analysis of FSAs
(slides in LibreOffice and slides in pdf format).
Lecture 13 (Nov 27th '12):
Mining frequent patterns: FPGrowth and Vertical Data Format Algorithms; Pattern evaluation measures.
(slides in LibreOffice and slides in pdf format).
Lecture 15 (Dec 4th '12):
Classification: basic concepts and ideas; classifying using decision tree induction models and algorithms.
(slides in LibreOffice and slides in pdf format).
Lecture 16 (Dec 5th '12): Excercise session 07 -
Decision tree induction: information gain indices; inducing decision trees in R
( computing Information measures).