AMS 524, Modern Computational Data Analytics
This course introduces the tools for the analysis of big data sets on server machines.
It teaches
how to store, preprocess, analyze and visualize data arriving at high volume and velocity.
In the
first part of the course, we will cover programming in Python, from its basic libraries
to more
advanced methods for big data analytics, and machine learning. Emphasis will be on
the
implementation in Python and practical hands-on examples. Next, we will learn essential
Shell
scripting and terminal window commands for computations on server machines. We will
introduce database management systems and SQL querying. In the second part of the
course,
we will discuss code version control and collaboration solutions in GitHub and GitHub
Actions,
microservices, containers (Docker and Kubernetes), API gateways, and other tools necessary
in a professional data science pipeline.
Note: Instructor consent
Summer, 3 credits, ABCF grading
May be repeated for credit