How to save sklearn model on s3 using joblib.dump?

前端 未结 3 1602
暖寄归人
暖寄归人 2021-02-15 16:09

I have a sklearn model and I want to save the pickle file on my s3 bucket using joblib.dump

I used joblib.dump(model, \'model.pkl\') to save the model local

3条回答
  •  别那么骄傲
    2021-02-15 16:20

    Here's a way that worked for me. Pretty straight forward and easy. I'm using joblib (it's better for storing large sklearn models) but you could use pickle too.
    Also, I'm using temporary files for transferring to/from S3. But if you want, you could store the file in a more permanent location.

    import tempfile
    import boto3
    import joblib
    
    bucket_name = "my-bucket"
    key = "model.pkl"
    
    # WRITE
    with tempfile.TemporaryFile() as fp:
        joblib.dump(model, fp)
        fp.seek(0)
        s3_resource.put_object(Body=fp.read(), Bucket=bucket_name, Key=key)
    
    # READ
    with tempfile.TemporaryFile() as fp:
        s3_resource.download_fileobj(Fileobj=fp, Bucket=bucket_name, Key=key)
        fp.seek(0)
        model = joblib.load(fp)
    
    # DELETE
    s3_resource.delete_object(Bucket=bucket_name, Key=key)
    

提交回复
热议问题