Available in:
Leveraging the Power of the SAS® Quality Knowledge Base
DQKB : DQKB21
This course is for programmers who want to learn how to leverage the power of the SAS Quality Knowledge Base, or QKB. The QKB and the associated SAS Data Quality functions, routines, procedures, and CAS actions enable data cleansing, transformation, deduplication, and fuzzy matching using simplified code in DATA step, SQL, and CASL. The course builds on concepts presented in the SAS® Programming 1: Essentials, SAS® Programming 2: Data Manipulation Techniques, and SAS® SQL 1: Essentials courses, and is not recommended for beginning SAS software users.
Learn How To
- Understand the structure of the SAS Quality Knowledge Base (QKB)
- Configure the QKB in SAS 9.4 and in SAS Viya
- Work with QKBs and Locales in SAS 9, the Viya Compute Server, and Cloud Analytic Services (CAS) to:
- Parse data from complex text
- Standardize data values
- Conduct a fuzzy join for tables that do not share primary keys
- Deduplicate data
Who Should Attend
Data scientists, data engineers, data analysts, and SAS programmers.
Prerequisites
Before attending this course, you should be able to do the following:
- Write DATA step code to subset rows and columns, compute new columns, perform conditional processing, and use a BY statement for merging and processing data in groups with FIRST./LAST. logic.
- Write SQL queries, including inner and outer joins and creating computed columns
- Apply SAS formats to data values using the PUT function
- Execute basic SAS procedures, like PROC PRINT and PROC MEANS
SAS Products Covered
SAS Viya
Course Outline
Introduction to the SAS Quality Knowledge Base (QKB)
- Introduction to the SAS Quality Knowledge Base.
- Quality Knowledge Base data types and tokens.
- Quality Knowledge Base definition types.
- Overview of SAS 9 system operations.
- Setting system options for accessing the QKB.
- Interacting with the QKB programmatically.
- Overview of SAS Viya data quality system options.
- Setting session options for accessing the QKB.
- Interacting with the QKB on the SAS Compute Server.
- Overview of QKBs and locales.
- Using the QKB procedure.
- Using the DQLOCLIST procedure.
- Using locale functions.
- Introduction to SAS Data Quality.
- SAS Data Quality procedures.
- SAS Data Quality functions.
- SAS Data Quality CALL routines.
- Data standardization overview.
- Using the DQSCHEME procedure.
- Using standardization functions.
- Using the Standardization CALL routine.
- Overview of entity resolution.
- Using the DQMATCH procedure.
- Using match code functions.
- Parsing text and working with tokens.
- Extracting values from text.
- Identification analysis and gender analysis.
- Casing and pattern analysis.
- SAS Data Quality functions in CAS.
- SAS Data Quality CAS action sets.
- The QKB action set.
- The Data Discovery action set.
- The Entity Resolution action set.
Live Class Schedule
Duration: 14 hours
Step into our live classes and experience a dynamic learning environment where you can ask questions, share ideas, and connect with your instructor and classmates. With on-demand lab hours, you can explore the material at your own pace. Our globally acclaimed instructors will motivate you to think bigger, so you can take what you've learned and achieve your biggest goals.
This course isn't publicly scheduled, but private training and mentoring may be available. Contact us to explore options.
Private Training
Get training tailored specifically for your team, led by expert SAS instructors. Choose from virtual sessions, or training at your location (or ours). Perfect for teams seeking a customized curriculum and plenty of interaction with a SAS specialist. We'll schedule it at a time that works for you.
Mentoring Services
Take your training to the next level with personalized mentoring. While private training offers structured coursework, mentoring provides hands-on, real-time support from a subject matter expert. As you work with your own data, you'll receive expert guidance to help you uncover insights, unlock the full potential of your data, and make faster progress. Perfect for those looking to apply what they’ve learned and see quicker results.