CM2203: Informatics
School | Cardiff School of Computer Science and Informatics |
Department Code | COMSC |
Module Code | CM2203 |
External Subject Code | 100370 |
Number of Credits | 10 |
Level | L5 |
Language of Delivery | English |
Module Leader | Dr Sylwia Polberg-Riener |
Semester | Spring Semester |
Academic Year | 2025/6 |
Outline Description of Module
The aim of this module is to provide the student with and understanding of the role data mining and data quality techniques play in our lives. The students will develop a basic toolbox allowing them to use methods for learning and evaluation information. This will be paired with a consideration of the ethical implications surrounding gathering and using information in an automated manner.
On completion of the module a student should be able to
-
Execute and evaluate various techniques related to knowledge discovery and data mining
-
Analyse and critically evaluate methods for assuring quality of information
-
Appraise the ethical implications and societal risks associated with data mining and data quality assurance
How the module will be delivered
The module will be delivered through a combination of lectures and supervised lab sessions/tutorials as appropriate. You will be expected to attend all timetabled sessions and engage with online materials. Online drop-in sessions will be scheduled for students who miss sessions or require further assistance.
Skills that will be practised and developed
Please refer to the learning outcomes.
How the module will be assessed
The module will be assessed with two individual portfolios. Portfolio 1 will consist of online mini-tests and coded tasks relating to data mining. Portfolio 2 will consist of online mini-tests and written/coded tasks pertaining to data quality and data ethics.
Assessment Breakdown
Type | % | Title | Duration(hrs) |
---|---|---|---|
Portfolio | 60 | Informatics Portfolio 1 | N/A |
Portfolio | 40 | Informatics Portfolio 2 | N/A |
Syllabus content
Similarity measures
Knowledge discovery process
Data mining
Classification
Clustering
Association rule learning
Data quality dimensions, activities and methodologies
Ethical considerations concerning gathering and using information