[MLOps]Continuous Learning

Serendipityยท2023๋…„ 5์›” 2์ผ
0
post-custom-banner

What is ๐—–๐—ผ๐—ป๐˜๐—ถ๐—ป๐˜‚๐—ผ๐˜‚๐˜€ ๐—ง๐—ฟ๐—ฎ๐—ถ๐—ป๐—ถ๐—ป๐—ด (๐—–๐—ง) in MLOps and what steps are needed to achieve it?

CT is the process of automated ML Model retraining in Production Environments on a specific trigger. Letโ€™s look into some prerequisites for this:

1๏ธโƒฃ Automation of ML Pipelines.

๐Ÿ‘‰ Pipelines are orchestrated.
๐Ÿ‘‰ Each pipeline step is developed independently and is able to run on different technology stacks.
๐Ÿ‘‰ Pipelines are treated as a code artifact.

โœ… You deploy Pipelines instead of Model Artifacts allowing Continuous Training In production.
โœ… Reuse of components allows for rapid experimentation.

2๏ธโƒฃ Introduction of strict Data and Model Validation steps in the ML Pipeline.

๐Ÿ‘‰ Data is validated before training the Model. If inconsistencies are found - Pipeline is aborted.
๐Ÿ‘‰ Model is validated after training. Only after it passes the validation is it handed over for deployment.

โœ… Short circuits of the Pipeline allow for safe CT in production.

3๏ธโƒฃ Introduction of ML Metadata Store.

๐Ÿ‘‰ Any Metadata related to ML artifact creation is tracked here.
๐Ÿ‘‰ We also track performance of the ML Model.

โœ… Experiments become reproducible and comparable between each other.
โœ… Model Registry acts as glue between training and deployment pipelines.

4๏ธโƒฃ Different Pipeline triggers in production.

๐Ÿ‘‰ Ad-hoc.
๐Ÿ‘‰ Cron.
๐Ÿ‘‰ Reactive to Metrics produced in Model Monitoring System.
๐Ÿ‘‰ Arrival of New Data.

โœ… This is where the Continuous Training is actually triggered.

5๏ธโƒฃ Introduction of Feature Store (Optional).

๐Ÿ‘‰ Avoid work duplication when defining features.
๐Ÿ‘‰ Reduce risk of Training/Serving Skew.

๐— ๐˜† ๐˜๐—ต๐—ผ๐˜‚๐—ด๐—ต๐˜๐˜€ ๐—ผ๐—ป ๐—–๐—ง:

โžก๏ธ Introduction of CT is not straightforward and you should approach it iteratively. The following could be good Quarterly Goals to set:

๐Ÿ‘‰ Experiment Tracking is extremely important at any level of ML Maturity and the least invasive in the process of ML Model training - I would start with ML Metadata Store introduction.
๐Ÿ‘‰ Orchestration of ML Pipelines is always a good idea, there are many tools supporting this (Airflow, Kubeflow, VertexAI etc.). If you are not doing it yet - grab this next, also make the validation steps part of this goal.
๐Ÿ‘‰ The need for Feature Store will wary on the types of Models you are deploying. I would prioritize it if you have Models that perform Online predictions as it will help with avoiding Training/Serving Skew.
๐Ÿ‘‰ Donโ€™t rush with Automated retraining. Ad-hoc and on-schedule will bring you a long way.

Let me know your thoughts! ๐Ÿ‘‡


Follow Aurimas Griciลซnas to upskill inย #MLOps,ย #MachineLearning,ย #DataEngineering,ย #DataScienceย and overallย #Dataย space.

๐——๐—ผ๐—ปโ€™๐˜ ๐—ณ๐—ผ๐—ฟ๐—ด๐—ฒ๐˜ ๐˜๐—ผ ๐—น๐—ถ๐—ธ๐—ฒ ๐Ÿ‘, ๐˜€๐—ต๐—ฎ๐—ฟ๐—ฒ ๐—ฎ๐—ป๐—ฑ ๐—ฐ๐—ผ๐—บ๐—บ๐—ฒ๐—ป๐˜!

Join a growing community of Data Professionals by subscribing to my ๐—ก๐—ฒ๐˜„๐˜€๐—น๐—ฒ๐˜๐˜๐—ฒ๐—ฟ:ย https://lnkd.in/e5d3GuJe

์ถœ์ฒ˜ : Aurimas, Machine Learning Community

profile
I'm an graduate student majoring in Computer Engineering at Inha University. I'm interested in Machine learning developing frameworks, Formal verification, and Concurrency.
post-custom-banner

0๊ฐœ์˜ ๋Œ“๊ธ€