Learn how to identify data science problems or opportunities, choose the best-fit modeling approach, select the correct features to model, evaluate the results and deploy the model.
Description
Data Science projects are not typical BI projects. Data Science projects start with a business problem or opportunity to be explored and result in gaining new insights as well as producing analytical models – meaning data science projects have different deliverables, pitfalls and challenges. Attempting to deliver a data science project with traditional BI project methodologies contributes to the high rate of data science project failures. CRISP-DM (Cross-industry standard process for data mining) is the accepted methodology for data science projects and addresses the success factors required for data science projects.
This hands-on workshop will expose Business Intelligence practitioners, data analysts, and those looking to get started in data science to an applied experience where they learn to identify the data science problem or opportunity, choose the modeling approach, select the correct features to model, evaluate the results and deploy the model. The workshop covers a wide range of data preparation and modeling exercises—from data sandbox construction to the creation of training, test, and validation data sets for model development.
We will provide a few data sets, jump-start workflows and final solutions for the exercises.
Please note that this workshop is NOT for data science developers or IT developers looking to write analytics programs in Java, Python, R or Scala.
Why attend
You will learn to:
- Understand a data science methodology and end-to-end workflow of problem solution including data understanding and preparation, model building and validation, and model deployment.
- Match data science problems and opportunities to the best-fit models
- Prepare data for different kind of models
- Handle missing data, outliers and quirks
- Train, test and validate data sets for model development
- When to apply supervised or unsupervised machine learning models
- Build prediction, classification and clustering models
- Deploy machine learning models into the cloud
Who should attend
- Analytics professionals, including business intelligence and data management professionals
- Business analysts, data analysts and functional analysts
- IT professionals and consultants
- Analytics leaders
- Program and project leaders
- Anyone who aspires to become a data scientist
Prerequisites
You should have some coding experience and basic knowledge of statistics.