INFO-I 223 Data Fluency
Pervasive, vast, and growing describe data in today’s environment. This course introduces fundamental skills for extracting from data actionable knowledge. Students create, access, munge, analyze, and visualize data to draw inferences and make predictions. The course uses real datasets from a variety of disciplines including healthcare, business, and the humanities.
This course is approved for the Analytical Reasoning, List B, component of the General Education core.
- Store, structure, and access data of different types using simple relational models and tables.
- Munge data to prepare raw data for further analysis.
- Analyze large, complex datasets with supervised learning methods, including linear regression and k-nearest neighbors for functional approximation and naïve Bayesian classifiers and decision trees for classification and predictive modeling.
- Analyze large, complex datasets with unsupervised learning methods, including k-means clustering.
- Calculate probabilities by applying additive and multiplicative laws, permutations and combinations, and conditional probability.
- Calculate expectation and variance from the probability distribution of a random variable.
- Assess model fit (e.g., overfitting or underfitting).
- Create visualizations of data to communicate and persuade.
- Derive information from data and support conclusions or recommendations based on evidence existing in the data.