Michal M

Reputation: 31

Proper way of making my python module available to the mlflow during mlflow models build-docker

I am trying to build a Docker image that would host my model behind an endpoint, but I am running into issues making my code available during the build so that the image can later run and serve the model.

The model is an sklearn-style classifier, but it is wrapped in my modified version of the sklearn.Pipeline class; let's call it BypassablePipeline. The model trains fine, I can log it to the MLflow tracking server, and I am also able to pull it back and execute it locally. This is how I log the model:

    model_info = mlflow.sklearn.log_model(model,
                                          artifact_path='model',
                                          registered_model_name=model_registry_name,
                                          signature=model_signature,
                                          pip_requirements=my_pip_requirements)

I have the BypassablePipeline in mymodule, which is installed in development mode. Here is an example of my requirements.txt, omitting other packages for simplicity's sake:

scikit_learn==1.2.0
mlflow==2.1.1
-e .

`.` is the directory where I locally have the setup.py for mymodule, and it installs fine. When I execute pip list, this is what I get:
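For context, the `-e .` line resolves against a setup.py in the repo root. A minimal sketch of what such a file might look like (the package name and version match the pip list output below; everything else is an assumption about the layout):

```python
# setup.py — minimal sketch for an editable-installable package
from setuptools import find_packages, setup

setup(
    name="mymodule",
    version="0.0.1",
    packages=find_packages(),  # picks up the mymodule package directory
)
```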

scikit_learn 1.2.0
mlflow       2.1.1
mymodule     0.0.1 /home/my_user/my_repo/

plus it is working locally.

These requirements are passed as my_pip_requirements to the log_model call above. I am creating the local environment using pyenv. Next, I activate the environment and run:

    mlflow models build-docker -m "path_to_model" --enable-mlserver -n "image_name" --env-manager=virtualenv

The model server build runs fine. When I later try to test whether the server stands up using:

    docker run "image_name"

this is where the problems start, and I get an error:

    ModuleNotFoundError: No module named 'my_module'

Clearly the module, although present in my environment, is not properly transferred at build time. So far I have tried the following:

  1. Prior to the Docker build with the above command, I created a fresh environment and built a wheel package out of mymodule, then replaced the `-e .` line with the path to the wheel. I relogged the model so that my_pip_requirements would be correct, but the error is still there.
  2. Tried `--env-manager=local` instead of virtualenv, but then it also omits much more (for instance the Python version), which shows up as a warning, and the error is still there.
  3. Wrapped the model with PythonModel per the examples in the mlflow repo:
from typing import Any, Dict, Optional

import mlflow


class PyfuncModelWrapper(mlflow.pyfunc.PythonModel):
    def load_context(self, context):
        self.model = mlflow.sklearn.load_model(context.artifacts['original_model_path'])

    def predict(self, context, model_input, params: Optional[Dict[str, Any]] = None):
        return self.model.predict(model_input)
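For what it's worth, the wrapper in step 3 does nothing more than delegate predict to the inner model that load_context loads. Stripped of MLflow, the delegation pattern looks like this (DummyModel and the dict-based context are illustrative stand-ins, not MLflow APIs):

```python
from typing import Any, Dict, Optional


class DummyModel:
    """Stand-in for the sklearn pipeline; purely for illustration."""

    def predict(self, model_input):
        return [x * 2 for x in model_input]


class DelegatingWrapper:
    """Mirrors the load_context/predict shape of mlflow.pyfunc.PythonModel."""

    def load_context(self, context):
        # In MLflow this would resolve context.artifacts['original_model_path'];
        # here the "context" is just a plain dict.
        self.model = context["original_model"]

    def predict(self, context, model_input, params: Optional[Dict[str, Any]] = None):
        return self.model.predict(model_input)


wrapper = DelegatingWrapper()
wrapper.load_context({"original_model": DummyModel()})
print(wrapper.predict(None, [1, 2, 3]))  # → [2, 4, 6]
```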

Can you point me in the right direction here? What has worked for you? Is there something obvious that I am missing?

Upvotes: 3

Views: 822

Answers (1)

TylerH

Reputation: 21072

Converting solution response by OP from an edit to the question to an answer:

I did not identify why the model cannot find the module even when I build a package out of it and install it in the environment from which I build the Docker image, using the commands above. However, I got the model working using PyfuncModelWrapper(mlflow.pyfunc.PythonModel). I had to wrap the model following the mlflow docs (it needs load_context and predict methods). mlflow.pyfunc.log_model then has a code_path argument where I could pass a list of modules, or folders containing the modules (i.e. a package). When the model is logged, it copies the code as an artifact, which I think is a less elegant solution, but it works. The code is carried over into the Docker image with the served model, and when starting it I no longer get the error.

I register both models, the original sklearn flavor and the wrapped one; note that model_info comes from the snippet earlier.

mlflow.pyfunc.log_model(
    artifact_path='pyfunc_model',
    code_path=['./my_package/'],
    artifacts={'original_model_path': model_info.model_uri},
    python_model=PyfuncModelWrapper(),
    signature=model_signature,
    pip_requirements=my_pip_requirements,
    await_registration_for=100,
    registered_model_name=model_registry_name,
)

Upvotes: 0
