I think TF Serving is the gold standard for deploying deep learning models. It is lean, memory efficient, and supports a number of non-TensorFlow frameworks like JAX, scikit-learn, or XGBoost.
Here are some notes I constantly refer to for details beyond the Google documentation: