In the era of big data, organizations collect massive, rapidly growing datasets comprising text, images, graphs, and vectors. Turning stored data into reliable, actionable knowledge does not happen automatically; it needs principled methods that go beyond basic querying. This course addresses that gap by teaching algorithms and systems that convert raw data into insight at scale. Manual analysis does not scale and often misses structure in large, complex datasets. Modern data analytics combines statistics, machine learning, and database systems to automate this process. We will study methods that work effectively on large datasets, including scalable clustering, classification, frequent pattern mining, graph analytics, and nearest-neighbor search, along with practical tools such as Apache Spark and MapReduce. This elite course offering is up-to-date and research-aware. Students will examine recent award-level papers, analyze real research scenarios, and practice end-to-end workflows from data cleaning to evaluation. The aim is to equip you to reason about methods, build scalable pipelines, and communicate findings with rigor.
| Academic Units | 3 |
| Exam Schedule | Thu May 07 2026 00:00:00 GMT+0000 (Coordinated Universal Time) 13:00-15:00 |
| Grade Type | Letter Graded |
| Department Maintaining | CSC(CE) |
| Prerequisites | Must be a Turing AI Scholar SC2301 |
| Index | Type | Group | Day | Time | Venue | Remark |
|---|---|---|---|---|---|---|
| 10201 | LEC/STUDIO | STA1 | MON | 1630-1820 | TR+2 | |
| 10201 | TUT | STA1 | TUE | 1330-1420 | TR+2 | Teaching Wk2-13 |
0930
1030
1130
1230
1330
1430
1530
1630
1730
SC2320
LEC/STUDIO | TR+2
SC2320
TUT | TR+2
Teaching Wk2-13
We would encourage you to review with the following template.
AY Taken: ...
Assessment (Optional): ...
Topics (Optional): ...
Lecturer (Optional): ...
TA (Optional): ...
Review: ...
Final Grade (Optional): ...