Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/labrijisaad/axa-direct-ml-apprenticeship
Repository showcasing my Machine Learning Engineering Apprenticeship at AXA-Direct Assurance, contributing to the development and implementation of Machine Learning solutions.
https://github.com/labrijisaad/axa-direct-ml-apprenticeship
azure-devops data-drift databricks jupyter-notebook kedro machine-learning machine-learning-engineering mlflow python
Last synced: about 1 month ago
JSON representation
Repository showcasing my Machine Learning Engineering Apprenticeship at AXA-Direct Assurance, contributing to the development and implementation of Machine Learning solutions.
- Host: GitHub
- URL: https://github.com/labrijisaad/axa-direct-ml-apprenticeship
- Owner: labrijisaad
- Created: 2024-01-03T09:23:40.000Z (about 1 year ago)
- Default Branch: main
- Last Pushed: 2024-04-03T11:54:53.000Z (10 months ago)
- Last Synced: 2024-11-06T09:08:36.206Z (3 months ago)
- Topics: azure-devops, data-drift, databricks, jupyter-notebook, kedro, machine-learning, machine-learning-engineering, mlflow, python
- Homepage: https://test.pypi.org/project/kedro-table-drift/
- Size: 6.84 KB
- Stars: 1
- Watchers: 1
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
## ( Ongoing π§ )
AXA - Direct Assurance / UniversitΓ© Paris CitΓ©
Apprenticeship - Machine Learning Engineer
---
*Welcome* to this repository that captures my year as a **Machine Learning Engineer Apprentice** at **AXA - Direct Assurance**.
## Overview π
- *Company*: AXA - Direct Assurance
- *Duration*: October 2023 - October 2024 (1 year)
- *Location*: Paris, France
- *Role*: Machine Learning Engineer / Data Scientist## Objective π
My primary goal is to enhance legacy ML systems by implementing MLOps best practices. This includes the development of automated ML pipelines that incorporate training, feature selection, data drift detection, and model deployment, all based on existing production ML codes.
## Core Projects and Responsibilities π
Here's a snapshot of what I've been working on:
1. **`Development of a Kedro-Based Tool for Monitoring Model Drift on Databricks`**: Implementing a pipeline in Kedro on Databricks to track model/data drift in Machine Learning models in production.
2. **`RAG with Longchain for Contextual Querying`**: Creating a Retrieval Augmented Generation (RAG) model using Longchain to provide contextual responses to email queries within the LLM framework (e.g., GPT, LLAMA).
3. **`Evaluation of Data/ML Hackathons and Creating Jupyter Notebook Guides`**: Reviewing Data/ML hackathons and developing Jupyter Notebook guides that provide theoretical insights and practical walkthroughs.
4. **`Kedro Model Deployment on Databricks`**: Collaborating with internal ML/Data teams and external service providers to deploy machine learning model training pipelines using Kedro on Databricks.
## Tools and Technologies π
During the Apprenticeship, I have been engaged with the following tools and technologies:
- **Azure**: Used as the primary cloud provider.
- **Longchain**: Used for enhancing LLMs with the ability to handle contextual inquiries effectively.
- **Azure DevOps**: Used for code version control and to implement CI/CD pipelines, facilitating automated testing and deployment.
- **Kedro**: For constructing robust, maintainable, and scalable Data Engineering/ML pipelines.
- **Scikit-Learn / Jupyter Notebook / Python**: Utilized for developing, testing, and prototyping ML models.
- **MLflow**: To manage the ML lifecycle, including experimentation, reproducibility, and deployment.
- **Azure Databricks**: Platform for building and deploying data and ML workflows.## Connect π