Scalable Data Processing and Transformation using SageMaker Processing (Part 2 of 2)

2023-04-28T02:18:41+00:00

In this 2-part tutorial, we focus on SageMaker Processing and how to use it for our data processing needs. Our overall goal is to demonstrate how SageMaker Processing lets us perform Min-Max scaling on a dataset in its own dedicated and easily scalable environment. If you are looking for Part 1, you can find it here. As mentioned in Part 1, we have divided this tutorial into 4 sections:

[1] Synthetic Data Generation (found in Part 1)
[2] Using MinMaxScaler without the managed infrastructure support of SageMaker Processing (found in Part 1)
[3] Using [...]