Amazon SageMaker Clarify Cheat Sheet
- Amazon SageMaker Clarify is a SageMaker AI feature for detecting bias and explaining model predictions.
- Supports both pre-training and post-training bias analysis.
- Provides feature attribution to explain how input features influence predictions.
- Can monitor deployed models for bias drift and feature attribution drift over time.
Key Capabilities
Bias Detection
- Pre-training bias: Analyzes datasets before model training.
- Post-training bias: Evaluates model predictions for fairness across facets.
- Supports binary, multiclass, and regression tasks.
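A minimal sketch of how the two analyses map onto the SageMaker Python SDK; it assumes a SageMakerClarifyProcessor and the configuration objects described under Core Components below are already defined.

```python
# Sketch: the two bias analyses map to separate SageMakerClarifyProcessor calls.
# Assumes clarify_processor, data_config, bias_config, model_config, and
# predictions_config are built as shown in the Core Components section below.

# Pre-training bias: needs only the dataset plus the facet/label definition.
clarify_processor.run_pre_training_bias(
    data_config=data_config,
    data_bias_config=bias_config,
    methods="all",            # or a subset such as ["CI", "DPL"]
)

# Post-training bias: additionally needs the model to score the data.
clarify_processor.run_post_training_bias(
    data_config=data_config,
    data_bias_config=bias_config,
    model_config=model_config,
    model_predicted_label_config=predictions_config,
    methods="all",            # or a subset such as ["DI", "DPPL"]
)
```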
Interpreting Model Behavior
- Offers feature attributions via SHAP (SHapley Additive exPlanations), Partial Dependence Plots (PDP), etc.
- Explains individual predictions and global feature importance.
- Works with tabular, text, and image data.
Monitoring
- Detects bias drift and feature attribution drift in production models on a recurring monitoring schedule.
- Integrates with SageMaker Model Monitor for continuous evaluation (see the sketch below).
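A hedged sketch of scheduling bias-drift monitoring for a live endpoint via SageMaker Model Monitor; the role ARN, endpoint name, and S3 paths are placeholders, and it assumes a baseline was already suggested for the monitor.

```python
from sagemaker.model_monitor import (
    CronExpressionGenerator,
    EndpointInput,
    ModelBiasMonitor,
)

# Sketch only: role ARN, endpoint name, and S3 URIs are placeholders.
model_bias_monitor = ModelBiasMonitor(
    role="arn:aws:iam::123456789012:role/SageMakerExecutionRole",
    instance_count=1,
    instance_type="ml.m5.xlarge",
    max_runtime_in_seconds=1800,
)

# Hourly bias-drift checks against captured endpoint traffic plus uploaded ground truth.
# Assumes a baseline was suggested beforehand; otherwise supply analysis_config explicitly.
model_bias_monitor.create_monitoring_schedule(
    endpoint_input=EndpointInput(
        endpoint_name="my-endpoint",                     # placeholder
        destination="/opt/ml/processing/input/endpoint",
        start_time_offset="-PT1H",
        end_time_offset="-PT0H",
    ),
    ground_truth_input="s3://my-bucket/ground-truth/",   # placeholder
    output_s3_uri="s3://my-bucket/bias-drift-reports/",  # placeholder
    schedule_cron_expression=CronExpressionGenerator.hourly(),
)
```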
Integrations
- Integration with SageMaker Autopilot: Clarify-based explanations for AutoML models.
- Integration with SageMaker Data Wrangler: helps address detected bias through data balancing techniques (random undersampling, random oversampling, SMOTE).
Core Components
Configuration Objects
- DataConfig: Specifies the source dataset and the destination path for output artifacts.
- ModelConfig: Identifies the model container or endpoint to be evaluated during the analysis.
- BiasConfig: Facets and label information for bias analysis.
- SHAPConfig: Parameters for SHAP-based explainability.
- ModelPredictedLabelConfig: Specifies how to extract predicted labels (all five are sketched below).
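A minimal sketch of the five objects for a tabular CSV dataset with a binary target label and a gender facet; the S3 paths, column names, SHAP baseline, and model name are placeholders.

```python
from sagemaker import clarify

# All S3 paths, column names, and the model name below are placeholders.
data_config = clarify.DataConfig(
    s3_data_input_path="s3://my-bucket/train.csv",      # source dataset
    s3_output_path="s3://my-bucket/clarify-output/",    # where reports land
    label="target",
    headers=["target", "gender", "age", "income"],
    dataset_type="text/csv",
)

model_config = clarify.ModelConfig(
    model_name="my-model",          # model Clarify spins up as a shadow endpoint
    instance_type="ml.m5.xlarge",
    instance_count=1,
    content_type="text/csv",
    accept_type="text/csv",
)

predictions_config = clarify.ModelPredictedLabelConfig(
    probability_threshold=0.5,      # scores above this count as the positive label
)

bias_config = clarify.BiasConfig(
    label_values_or_threshold=[1],  # favorable outcome
    facet_name="gender",
    facet_values_or_threshold=[0],  # sensitive group to compare
)

shap_config = clarify.SHAPConfig(
    baseline=[[0, 35, 50000]],      # one baseline row of non-label features
    num_samples=100,
    agg_method="mean_abs",          # aggregate local values into global importance
    save_local_shap_values=True,
)
```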
Processing Job Setup
- Use SageMakerClarifyProcessor from the SageMaker Python SDK.
- Define ProcessingInput and ProcessingOutput (the high-level run_* methods build these for you from DataConfig).
- Launch via the run_* methods (e.g., run_bias_and_explainability) with the config objects (see the sketch below).
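A hedged sketch of launching the job, reusing the configuration objects above; the IAM role ARN is a placeholder, and run_bias_and_explainability is used here to run bias metrics and SHAP in a single processing job.

```python
from sagemaker import Session, clarify

# Placeholder role ARN; reuses data_config, model_config, bias_config,
# predictions_config, and shap_config from the sketch above.
clarify_processor = clarify.SageMakerClarifyProcessor(
    role="arn:aws:iam::123456789012:role/SageMakerExecutionRole",
    instance_count=1,
    instance_type="ml.m5.xlarge",
    sagemaker_session=Session(),
)

# One processing job that runs pre-/post-training bias metrics and SHAP together.
clarify_processor.run_bias_and_explainability(
    data_config=data_config,
    model_config=model_config,
    explainability_config=shap_config,
    bias_config=bias_config,
    model_predicted_label_config=predictions_config,
    pre_training_methods="all",
    post_training_methods="all",
)
```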
Configuration Components
Analysis Configuration File (JSON)
- Defines bias or explainability parameters.
- Supports CSV, JSON Lines, and JSON datasets.
- Compatible with tabular, text, image, and time-series data.
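For jobs launched without the SDK helpers, the same parameters go into an analysis configuration JSON. A hedged sketch written from Python; the key names follow the documented CSV schema, while the column names, facet, SHAP baseline, and model name are placeholders.

```python
import json

# Placeholder columns, facet, baseline, and model name; see the
# clarify-processing-job-configure-parameters page for the full schema.
analysis_config = {
    "dataset_type": "text/csv",
    "headers": ["target", "gender", "age", "income"],
    "label": "target",
    "label_values_or_threshold": [1],
    "facet": [{"name_or_index": "gender", "value_or_threshold": [0]}],
    "methods": {
        "pre_training_bias": {"methods": "all"},
        "post_training_bias": {"methods": "all"},
        "shap": {
            "baseline": [[0, 35, 50000]],
            "num_samples": 100,
            "agg_method": "mean_abs",
        },
        "report": {"name": "report", "title": "Clarify analysis"},
    },
    "predictor": {
        "model_name": "my-model",
        "instance_type": "ml.m5.xlarge",
        "initial_instance_count": 1,
    },
}

# Write the file that gets passed to the Clarify processing container.
with open("analysis_config.json", "w") as f:
    json.dump(analysis_config, f, indent=2)
```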
SageMakerClarifyProcessor (Python SDK)
- High-level API to run Clarify jobs.
- Key methods: run_bias_and_explainability, run_post_training_bias, run_explainability (SHAP / PDP).
- Supports combined SHAP + PDP jobs (see the sketch below).
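A hedged sketch of a combined job: run_explainability accepts a list of explainability configs, so passing a SHAPConfig together with a PDPConfig produces both analyses in one run. It reuses the processor and configs from the sketches above, and the PDP feature names are placeholders.

```python
from sagemaker import clarify

# Reuses clarify_processor, data_config, model_config, and shap_config
# from the sketches above; feature names in PDPConfig are placeholders.
pdp_config = clarify.PDPConfig(
    features=["age", "income"],  # features to sweep for partial dependence
    grid_resolution=15,          # number of grid points per feature
)

# explainability_config accepts a list, so SHAP and PDP run in one job.
clarify_processor.run_explainability(
    data_config=data_config,
    model_config=model_config,
    explainability_config=[shap_config, pdp_config],
)
```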
Bias Metrics Overview
Pre-training Bias Metrics
- Class Imbalance (CI): Imbalance in the number of examples between facet groups.
- Difference in Proportions of Labels (DPL): Difference in positive label proportions across facet groups.
Post-training Bias Metrics
- Disparate Impact: Ratio of favorable outcomes between groups.
- Equal Opportunity: True positive rate parity.
- Predictive Parity: Positive predictive value parity.
- Overall Accuracy Equality: Accuracy parity across groups.
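Two of the headline metrics reduce to simple ratios. An illustration with made-up counts (Clarify computes these, and many more, in the analysis report):

```python
# Illustration only, with made-up counts; Clarify computes these metrics for you.

# Class Imbalance (pre-training): (n_a - n_d) / (n_a + n_d),
# where n_a / n_d are the number of examples in the advantaged / disadvantaged facet.
n_a, n_d = 800, 200
class_imbalance = (n_a - n_d) / (n_a + n_d)   # 0.6 -> facet d is under-represented

# Disparate Impact (post-training): q_d / q_a,
# the ratio of predicted-positive rates between the two facets.
q_a = 320 / 800   # favorable predictions / members, advantaged facet
q_d = 60 / 200    # favorable predictions / members, disadvantaged facet
disparate_impact = q_d / q_a                  # 0.75 -> below the common 0.8 rule of thumb

print(f"CI = {class_imbalance:.2f}, DI = {disparate_impact:.2f}")
```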
SHAP Explainability
- Computes local explanations for each prediction.
- Aggregates to global feature importance.
- Outputs include:
- SHAP values per feature
- Summary plots
- Feature importance rankings
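When save_local_shap_values is enabled, the per-example values are written as a CSV under the job's S3 output path. A hedged sketch of aggregating those local values into a global ranking using the same mean-absolute aggregation Clarify applies; the local file path is a placeholder.

```python
import pandas as pd

# Placeholder path to the downloaded local SHAP values
# (one column per feature, one row per example).
local_shap = pd.read_csv("clarify-output/explanations_shap/out.csv")

# Global importance = mean absolute SHAP value per feature ("mean_abs" aggregation).
global_importance = local_shap.abs().mean().sort_values(ascending=False)
print(global_importance)
```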
Validate Your Knowledge
Question 1
A retail company leverages machine learning models to predict quarterly sales and optimize inventory management. In response to stakeholder requests, the data science team has been tasked with providing a comprehensive report that ensures transparency and explains the rationale behind the models’ decisions.
What should the data science team present to clearly explain the model’s recommendation process?
- Hyperparameter tuning results
- Partial dependence plots (PDPs)
- Feature engineering scripts
- Model convergence tables
Amazon SageMaker Clarify Cheat Sheet Resources:
https://docs.aws.amazon.com/sagemaker/latest/dg/clarify-configure-processing-jobs.html
https://docs.aws.amazon.com/sagemaker/latest/dg/clarify-shapley-values.html
https://docs.aws.amazon.com/sagemaker/latest/dg/clarify-processing-job-configure-parameters.html
https://docs.aws.amazon.com/sagemaker/latest/dg/clarify-processing-job-run.html