(23-05-25) - AWS, IBM Workshop


Created: =dateformat(this.file.ctime,"dd MMM yyyy, hh:mm a") | Modified: =dateformat(this.file.mtime,"dd MMM yyyy, hh:mm a") Tags: knowledge


Setup:

TeamRole:~/environment/ibm-aws-quickstart-immersionday-main/scripts $ ./create_users.sh
# initial_admin_password
Users created successfully.
Password for both user21756 and datascientist21756 is: password
providing access to service instances
 
 
TeamRole:~/environment/ibm-aws-quickstart-immersionday-main/scripts $ oc get route -n zen-46 |awk 'NR==2 {print $2}'
cpd-zen-46.apps.mod-a06dd33aea0643bb.cpd3qlr3x.ibmworkshops.com
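
For reference, the `awk 'NR==2 {print $2}'` filter above keeps only the second field of the second output line, which is the route host. A rough Python sketch of the same selection is below. The helper name `awk_field` and the sample row (route name, service, port) are made up for illustration; only the host value and the `oc get route` column headers are real.

```python
def awk_field(text: str, record: int, field: int) -> str:
    """Mimic awk 'NR==record {print $field}' with default whitespace splitting."""
    lines = text.splitlines()
    if record > len(lines):
        return ""                      # awk prints nothing if the record is missing
    fields = lines[record - 1].split() # awk splits on runs of whitespace by default
    return fields[field - 1] if field <= len(fields) else ""

# Hypothetical output shaped like `oc get route`; only the host comes from these notes.
sample_output = (
    "NAME           HOST/PORT                                                         PATH   SERVICES      PORT    TERMINATION   WILDCARD\n"
    "example-route  cpd-zen-46.apps.mod-a06dd33aea0643bb.cpd3qlr3x.ibmworkshops.com          example-svc   https   passthrough   None\n"
)

host = awk_field(sample_output, record=2, field=2)
print(host)
```

Line 1 is the header row, so `NR==2` is the first actual route.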
 

IBM Cloud Pak for Data (CP4D) on AWS Modernization Workshop :: Immersion Day Workshop

IBM Data and AI Singapore Team

Data governance

  • on-prem vs on-cloud data sources
  • challenges
    • distributed storage, ownership
    • regulatory requirements, data sensitivity
    • collaboration
    • moving data

IBM + AWS

  • collaboration to run IBM software on AWS services as a SaaS
  • IBM consulting
    • close relationship
  • Red Hat
    • many customers run Red Hat OpenShift on AWS (Red Hat is by )
  • IBM technology
    • working together to generate products

AI governance

IBM Cloud Pak for Data

  • cpd-zen-xx
  • has a notebook interface similar to SageMaker
  • what is the difference between SageMaker and Cloud Pak for Data?
    • for AI governance, they added features such as explainability
    • explainability uses SHAP and LIME
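
To remind myself what SHAP is built on: it attributes a prediction to features using Shapley values. Below is a minimal pure-Python sketch that computes exact Shapley values for a tiny toy model by averaging marginal contributions over all feature orderings. This is not the `shap` library and not what Cloud Pak for Data runs internally; `toy_model`, the instance, and the all-zero baseline are assumptions made up for illustration.

```python
from itertools import permutations
from math import factorial

def shapley_values(predict, instance, baseline):
    """Exact Shapley attributions: average each feature's marginal
    contribution over every possible feature ordering."""
    n = len(instance)
    totals = [0.0] * n
    for order in permutations(range(n)):
        current = list(baseline)       # start with every feature "absent"
        previous = predict(current)
        for i in order:
            current[i] = instance[i]   # switch feature i to its real value
            value = predict(current)
            totals[i] += value - previous
            previous = value
    return [t / factorial(n) for t in totals]

# Hypothetical toy model with one interaction term (illustration only).
def toy_model(x):
    x0, x1, x2 = x
    return 0.5 * x0 - 0.8 * x1 + 0.1 * x2 + 0.2 * x0 * x1

instance = [3.0, 2.0, 4.0]
baseline = [0.0, 0.0, 0.0]   # assumed reference point meaning "feature absent"

phi = shapley_values(toy_model, instance, baseline)
print(phi)
```

The attributions sum to `toy_model(instance) - toy_model(baseline)`, and the interaction term is split evenly between the two features involved. Exact enumeration grows factorially with the number of features, which is why practical tools approximate it.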

in RI-SageMaker-Deploy-Wstudio.ipynb:

  • uses sagemaker.LinearLearner to train on SageMaker
  • does IBM have its own models and its own way to run training too? why use SageMaker via Cloud Pak?
  • the model can be deployed elsewhere too; currently it is deployed to a SageMaker endpoint, and IBM Watson OpenScale then does drift detection
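
The workshop did not go into how OpenScale computes drift. As a generic illustration of the idea (comparing live feature values against the training distribution), here is a sketch using the Population Stability Index. PSI is swapped in here as a common, simple drift measure; it is not claimed to be OpenScale's method, and the function name, bin count, and sample data are all assumptions.

```python
import math

def population_stability_index(expected, actual, bins=10):
    """PSI between a reference sample (e.g. training data) and a live sample,
    using equal-width bins derived from the reference sample."""
    lo, hi = min(expected), max(expected)
    width = (hi - lo) / bins or 1.0    # avoid zero width for constant data

    def fractions(values):
        counts = [0] * bins
        for v in values:
            idx = int((v - lo) / width)
            idx = min(max(idx, 0), bins - 1)   # clamp out-of-range values
            counts[idx] += 1
        eps = 1e-6                             # floor to keep log() defined
        return [max(c / len(values), eps) for c in counts]

    e, a = fractions(expected), fractions(actual)
    return sum((ai - ei) * math.log(ai / ei) for ei, ai in zip(e, a))

# Made-up data: a reference feature, an identical live sample, and a shifted one.
training = [i / 100 for i in range(100)]
live_same = list(training)
live_shifted = [0.5 + i / 200 for i in range(100)]

psi_same = population_stability_index(training, live_same)
psi_shifted = population_stability_index(training, live_shifted)
print(psi_same, psi_shifted)
```

Identical samples give a PSI of zero, and the score grows as the live distribution moves away from the reference.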

GitHub - IBM/watson-openscale-samples: Watson Openscale sample assets, notebooks and apps.

Trusted AI

Governance

  • seems to be more about tools than any specific method or theory of data governance

They mainly support tabular data

  • unstructured data needs to be transformed into tabular form first
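
One simple way to picture such a transform is a bag-of-words table: each text becomes a row and each distinct word a column of counts. The sketch below is only an illustration of the idea, not the transform IBM uses; the function name `texts_to_table` and the sample sentences are made up.

```python
import re
from collections import Counter

def texts_to_table(texts):
    """Turn free text into a table: one row per text, one count column per word."""
    tokenized = [re.findall(r"[a-z0-9]+", t.lower()) for t in texts]
    vocabulary = sorted({tok for toks in tokenized for tok in toks})
    rows = []
    for toks in tokenized:
        counts = Counter(toks)
        rows.append([counts[word] for word in vocabulary])  # missing words count as 0
    return vocabulary, rows

columns, rows = texts_to_table([
    "data moves to cloud",
    "cloud data stays in cloud",
])
print(columns)
for row in rows:
    print(row)
```

The resulting rows have a fixed set of columns, so they can be fed to tooling that expects tabular input.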

charging / fees / licensing

  • based on worker nodes

They have some work on federated learning too (see IBM Documentation)