Verta can deploy models either using the Verta Inference system or export models registered within Verta for use in other systems (e.g., Spark, SageMaker.) This tutorial focuses on deploying models using the Verta Deployment infrastructure.
- 1.Create an endpoint
- 2.Update the endpoint with a model
We'll look at these in turn.
endpoint = client.create_endpoint(path="/some-path")
Once an endpoint has been created, we update the endpoint with a deployed model. Verta supports two common paths to deploy models:
- Deploy a Registered Model Version
- Deploy Experiment Run (typically for testing)
The Verta Model Registry is a staging area for models that are to be deployed into production. As a result, deploying Registered Model Versions is the recommended way for deploying models in a production setting.
Once an endpoint has finished updating, you can make REST calls to the endpoint via your language of choice. The Verta Python library provides convenience functions to enable making predictions against the endpoint.
deployed_model = endpoint.get_deployed_model()
While deployment functionality similar to RMVs is available for deploying Experiment Runs, it does not have the same safeguards as deploying RMVs. So Experiment Run deployment is only recommended in interactive testing scenarios.