CISC7201

CISC7201 Data Science Programming

3 credits

Course Description

This course is designed for students who are new to the world of data science. After the introduction of some basic arithmetic, variables, and data structures in Python, students will start to learn how to collect and extract data from real datasets. Some data analytical skills using the control flows and Python packages (e.g., NumPy, SciPy, Pandas, etc.) will be introduced. To address the needs of big data processing, some distributed computing frameworks (e.g., Spark) and visualization tools with Python will be discussed. Students may apply some basic learning algorithms with Python packages (e.g., scikit-learn) to extract knowledge from data. In response to the evolving demands of the AI era, some AI assisted programming tools will also introduced.

Intended Learning Outcomes (ILO)

Apply the Python language fundamentals, including basic syntax, variables, and process flows, to write a program.
Apply functions and import packages to work with complex and/or large data sets.
Apply scientific packages (e.g., NumPy and SciPy) to perform useful computations.
Process different file types using external packages.
Apply stunning data visualization tools to visualize large data sets.

Fundamental Courses:

CISC7201 Data Science Programming

CISC7204 Data Science and Data Visualization

CISC7203 Database and Data Analysis

CISC7202 Practical Machine Learning