Please enable JavaScript.

Coggle requires JavaScript to display documents.

On-premises MLOps - Coggle Diagram

- - - - A framework for efficient distributed multi-GPU training
        based on MPI
      - Supports: Pytorch, Tensorflow, MXnet
    - - Artifact lineage
        
        Model
        
        Dataset
- - - - Web-apps
        
        LoadBalancer
        
        access that bypasses k8s cluster interaction
        
        K8s ingress
        
        uses k8s primitive - the ingress controller
- - - - Custom components must define CRDs
        CRD - custom resource definition - state of the resource
        that is appropriate
      - Access via Kubeflow API
    - - Jupyter Notebooks, Hyperparameter
        Training (Katib), Pipelines, and others
      - Access via these apps
    - - Python proficient users with containerization skills could
        use extension CRDs. They allow for larger flexibility and reuse
        Others may use Jupyter Notebooks atop and built pipes there.
        Security is type-specific without clear leader