Introduction to Data Curation for SAS® Data Scientists
Duration: 3.5 hours
This is the first course in the Data Curation Professional, SAS Academy for Data Science program. The program is required to earn your SAS data science certification. Designed for SAS data scientists, this program covers SAS topics for data curation techniques, including big data preparation with Hadoop. This course introduces data curation and provides prerequisite material for the SAS Data Curation Professional training.
Learn How ToUse data curation as a data scientist.Identify what SAS tools are available for implementing the data curation life cycle.
SAS Products Covered
SAS Data Integration Studio
Introduction to Data CurationExplore the utility of data for organizations and business in the 21st century.Introduce the field of data science.Define data curation and the data curation life cycle.An Overview of the Computing Environment Define the main components of the computing environment.Describe different types of data storage, including relational databases, Hadoop, data lakes, and cloud storage. Explore parallel processing, grid computing, and cloud computing.Define and discuss SAS 9 and SAS Viya. The Role of Data Science and Data ScientistsExplore the individual tasks that make up data curation.Introduce the SAS tools and applications for data curation.Define artificial intelligence and explain the necessity of data curation for machine learning.The Roadmap to SAS Data CurationIntroduce the SAS data management tools for data curation. Discover SAS data curation tools and applications for interacting with data in Hadoop.Present SAS data curation tools and applications for data federation, event stream processing, and data governance.