(23-05-25) - AWS, IBM Workshop

Created: =dateformat(this.file.ctime,"dd MMM yyyy, hh:mm a") | Modified: =dateformat(this.file.mtime,"dd MMM yyyy, hh:mm a") Tags: knowledge

Setup:

TeamRole:~/environment/ibm-aws-quickstart-immersionday-main/scripts $ ./create_users.sh
# initial_admin_password
Users created successfully.
Password for both user21756 and datascientist21756 is: password
providing access to service instances
 
 
TeamRole:~/environment/ibm-aws-quickstart-immersionday-main/scripts $ oc get route -n zen-46 |awk 'NR==2 {print $2}'
cpd-zen-46.apps.mod-a06dd33aea0643bb.cpd3qlr3x.ibmworkshops.com

IBM Cloud Pak for Data (CP4D) on AWS Modernization Workshop :: Immersion Day Workshop IBM Cloud Pak for Data (CP4D) on AWS Modernization Workshop :: English

IBM Data and AI Singapore Team

data governance

on prem vs on cloud data sources
challenges
- distributed storage, ownership
- regulatory req, sensitivity
- collab
- moving data

IBM + AWS

collaboration to run IBM software on AWS services as a SaaS
IBM consulting
- close relationship
Red Hat
- many customers run redhat openshift on AWS (red hat is by )
IBM technology
- working together to generate products

ai governance

AI Policies in Singapore regulation
- Singapore Model Governance Framework (2019, 2020)
- AI Verify - AI Governance Testing Framework + Toolkit (2022)
  - Technical model testing toolkit for explainability, fairness
  - https://file.go.gov.sg/aiverify.pdf
  - https://file.go.gov.sg/aiverify-primer.pdf
AI biases leading to regulations
SR11-7
- risk model assets
- financial services AI in the EU

IBM Cloud Pak for Data

cpd-zen-xx
has a notebook interface similar to sagemaker
what is the diff between sagemaker and cloudpak?
- ai governance ⇒ they added things for explainability etc
- explainability ⇒ using SHAP and LIME

in RI-SageMaker-Deploy-Wstudio.ipynb:

using sagemaker.LinearLearner to train on sagemaker
does IBM have their own model, and way to conduct training too? why use sagemaker via cloudpak?
can deploy it on other places also, currently uses sagemaker to deploy on the endpoint, then using ibm openscale to do drift detection

GitHub - IBM/watson-openscale-samples: Watson Openscale sample assets, notebooks and apps.

Trusted AI

explainability ⇒ LIME, SHAP
drift monitoring ⇒ IBM Documentation
fairness ⇒ disparate impact
quality ⇒ standard metrics - Quality metrics overview - IBM Documentation

Governance

seems more on tools than any specific method / theory to do data governance

they mainly support tabular form

if unstructured data - they need to do a transform first into the tabular form

charging / fees / liscencing

based on worker nodes

They have some work with federated learning too IBM Documentation

Darius Knowledge Hub

Explorer

(23-05-25) - AWS, IBM Workshop

(23-05-25) - AWS, IBM Workshop

Graph View