This course gives an overview of the statistical methods used by data scientists, with an emphasis on the applicability to business problems. Attendees do not need access to software for this course, and the mathematical details are kept to a minimum.
Learn How To
Identify the need for feature engineering such as sampling, variable transformations, imputation, and variable selection. Describe applications of supervised models such as decision trees, neural networks, support vector machines, factorization machines, and logistic regression. Describe the usefulness of unsupervised models such as clustering, text mining, network analysis, and path analysis. Implement model assessment and deployment such as visualization and monitoring. Describe the AI lIfe cycle and landscape.Who Should Attend
Business analysts who want an overview of the types of methods used in data science
Prerequisites
Before attending this course, you should have some exposure to model building and quantitative analyses.
Course Outline
Introduction to Data Science
Introductory concepts.Analytical life cycle.Preparing the Data for ModelingAnalytical challenges to predictive modeling.Feature engineering.Supervised ModelsIntroduction to supervised machine learning models. Regression.Decision trees.Neural networks.Support vector machines. Factorization machines.Ensemble models.Two-stage models.Unsupervised ModelsIntroduction to unsupervised models.Hierarchical clustering.K-means clustering. Self-organizing maps.Market basket analysis.Path analysis.Network analysis. Text analytics.Model AssessmentIntroduction to model assessment, model ops, and special topics.