This repository was archived by the owner on Jun 3, 2025. It is now read-only.
sparsezoo.deployment_package
#261
Merged
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
sparsezoo.deployment_package
This utility provides an easy to use
python-api
and acommand-line-interface
to find pre-trained, packaged models across both NLP and CV domains that are
ready for deployment out-of-the-box and are optimized for metrics that are most important to a user like accuracy, throughput, latency,
etc.
python-api
The python-api can be used to quickly fetch a model from SparseZoo, the
model.onnx
path printed the end of the following code snippet can be directlyused with the
Dockerfile
provided with this diffFor deployment instructions refer here
Output:
cli
The
sparsezoo.deployment_package
cli provides an interface to the python-apiand also prints out instructions to deploy the saved model directory with
Dockerfile present here
(sparsezoo3.9) ~ sparsezoo.deployment_package \ --dataset imagenette \ --optimizing_metric compression
Output:
Usage
Examples:
sparsezoo.deployment_package --dataset imagenette --optimizing_metric compression
sparsezoo.deployment_package --dataset imagenette --optimizing_metric accuracy
sparsezoo.deployment_package --task qa --dataset squad --optimizing_metric latency
sparsezoo.deployment_package --task qa --dataset squad \ --optimizing_metric "compression, accuracy"
Deployment
Assuming we know the path to the directory containing
model.onnx
eitherwith or without configuration files, the following instructions can be used to
deploy a deepsparse container serving the onnx model using the
Dockerfile
,exact instructions are also shown with the
cli