I’ve recently tried to improve model deployement and test different approaches.
After trying Tensorflow Serving
and Torch Serve
, I decided to take a look
at Nvidia Triton
. Its high performance and multiple model backends is very
appealing. However I wanted to integrate it to my rust backend stack. Therefore,
I decided to implement a rust version of the GRPC Client.