When is AWS Sagemaker not the best choice?

1 . Cost

2 . Flexibility

  • Sagemaker experiments requires all of the training jobs to be done using the sagemaker training api (meaning spending $$$ for them).
  • This could be an issue if you didn’t want to only use sagemaker supported algorithm.
  • sagemaker experiments is not very useful after I was told of its limitation.

3 . Cloud Vendor limitation

  • if cross cloud vendor tracking needed, then ML-flow is the natural choice.
  • for example tracking metrics from Azure or GCP

4 . Documentation & Community Support

  • limited documentation, or issue solutions => unless you pay for AWS premium support for help

MLFlow Automatic logging

https://www.mlflow.org/docs/latest/tracking.html#automatic-logging

The following libraries support autologging:

  • Scikit-learn
  • TensorFlow and Keras
  • Gluon
  • XGBoost
  • LightGBM
  • Statsmodels
  • Spark
  • Fastai
  • Pytorch

References: