Data Science Courses
DATA 110: Mathematical Foundations of Data Science
This course provides an introduction to several areas of mathematics which are needed to understand data science. The principal aim throughout is to provide the necessary skills to enable you to solve real-world problems involving data. In order to do this, we learn the mathematics underlying many problems in data science, machine learning, and statistical analysis. Topics of study will include basic calculus, vectors and matrices, probability, and inferential statistics.
DATA 210: Principles of Data Science
This course is an introduction to the principles and practice of Data Science. Students will learn the skills, tools, and techniques to extract valuable insights and make predictions from data using a high-level programming language. They will learn about data manipulation and handling, inferential statistics, data visualization, exploratory data analysis, concepts such as supervised learning (classification and regression), unsupervised learning (clustering), various machine learning models, training vs. testing error, hyper parameter tuning, ensemble methods and the use of associated software libraries. Real-world examples will be selected from a variety of domains – examples include textual analysis, stock price predictions, document clustering based on similarity, and handwritten digit classification. The course will culminate in a final project.
DATA 310: Communicating with Data
The course engages students in projects that require the use of data science and analytics methods. Students gain expertise in data science by working on real-life projects from the natural sciences, the social sciences, and the humanities involving complex data. Using no-code or low-code data science and business analytics and visualization tools such as Tableau and KNIME, students learn how to access data from various repositories, explore raw data from different domains and formats, clean the data, perform exploratory data analysis, and apply statistical methods and predictive modeling. The course enables students to make data-driven decisions and communicate results clearly and professionally.
SOCI 250: Data Ethics
This course exposes students to critical analyses of the role of data in society, governance and policies surrounding data, and ethical considerations around inequality, rights, and responsibilities in the context of data. In the course, students will examine ethical considerations around data collection, processing, analysis, and presentation of results. The course will culminate in a project in which students will create and present a report advising their peers of best ethical practices for various data sources and data science methods.