Skip to main content

Michelangelo User Guides

Build, train, and deploy machine learning models at scale using Michelangelo's unified ML platform.

Getting Started

New to Michelangelo? Follow this path:

  1. Getting Started with Pipelines — Build your first pipeline in 30 minutes
  2. Prepare Your Data — Load, clean, and split datasets using Ray and Spark
  3. Train & Deploy Models — Train locally or at scale, then register and serve

Quick Navigation

ML Pipelines

GuideDescription
ML Pipelines OverviewTasks, workflows, pipelines, and the Uniflow framework
Running Uniflow PipelinesRun pipelines locally and remotely
Pipeline Running ModesLocal Run, Remote Run, Pipeline Dev Run, and Pipeline Run
Project ManagementOrganize pipelines, models, and resources into projects
Pipeline ManagementCreate, update, and manage pipelines
Set Up TriggersSchedule pipelines on a cron schedule
Backfill PipelinesReprocess historical data windows
Pipeline NotificationsEmail and Slack alerts on pipeline run outcomes

Train & Deploy Models

GuideDescription
Train and Register a ModelTrain at scale and register artifacts
Model Registry GuideVersion, track, and manage trained models
Deploy a ModelBind a registered model to an inference server

Reference

GuideDescription
Type SystemUniflow data types and serialization
Data Passing and ReferencesHow data flows between pipeline tasks
CLI ReferenceCommand-line tools for pipeline and project management

Examples

ExampleDescription
California Housing RegressionPredict house prices with XGBoost
BERT Text ClassificationClassify text with pre-trained transformers
GPT Fine-tuningFine-tune large language models with LoRA
Amazon Books RecommendationDual-encoder recommendation system