Skip to content

The Kidney Disease Classification project is a comprehensive machine learning pipeline designed to classify kidney disease using advanced tools like MLflow and DVC. It emphasizes reproducibility, scalability, and deployment readiness. The project leverages a modular design, enabling efficient experimentation and deployment in cloud environments.

License

Notifications You must be signed in to change notification settings

RPramodh/Production-Grade-Disease-Classifier-with-MLOps

Repository files navigation

Production-Grade-Disease-Classifier-with-MLOps

This project leverages machine learning to classify kidney disease efficiently, incorporating advanced tools like MLflow for experiment tracking and DVC for lightweight orchestration and pipeline management. Designed for seamless deployment and scalability, this system ensures an efficient and secure environment for data scientists and healthcare professionals.


  1. 🚀 Introduction
  2. ⚙️ Tech Stack
  3. 🎨 Features
  4. 🖥️ Quick Start
  5. 📜 About MLflow & DVC

  • Python 3.8: Core language for development and machine learning.
  • TensorFlow/PyTorch: Frameworks for model development and training.
  • MLflow: Production-grade experiment tracking and model logging.
  • DVC: Lightweight experiment tracker and pipeline orchestrator.
  • Dagshub: Simplified ML experiment tracking and collaboration platform.
  • Docker: Containerization for deployment.
  • AWS (ECR & EC2): Cloud-based infrastructure for scalable model deployment.

🕀 Experiment Tracking: Manage and track all experiments with MLflow and DVC.

🕀 Pipeline Management: Lightweight orchestration with dvc.yaml for modular and reproducible workflows.

🕀 Secure Cloud Deployment: Integration with AWS services like EC2 and ECR for reliable deployment.

🕀 Data Version Control: Maintain reproducibility with DVC for datasets and experiments.

🕀 Dockerized Deployment: Ensure portability with Docker containers.

🕀 User-Friendly Environment: Simplified setup and configuration for seamless adoption.


Prerequisites

Ensure the following are installed on your system:

Installation

Step 1: Clone the Repository

git clone https://github.com/krishnaik06/Kidney-Disease-Classification-Deep-Learning-Project

Step 2: Create a Conda Environment

conda create -n cnncls python=3.8 -y
conda activate cnncls

Step 3: Install Requirements

pip install -r requirements.txt

Step 4: Create a .env file in the root directory and add:

# MLflow Configuration
MLFLOW_TRACKING_URI=
MLFLOW_TRACKING_USERNAME=
MLFLOW_TRACKING_PASSWORD=

# AWS Configuration
AWS_ACCESS_KEY_ID=
AWS_SECRET_ACCESS_KEY=
AWS_REGION=us-east-1
AWS_ECR_LOGIN_URI=
ECR_REPOSITORY_NAME=

Step 5: Run the Application

python app.py

About

The Kidney Disease Classification project is a comprehensive machine learning pipeline designed to classify kidney disease using advanced tools like MLflow and DVC. It emphasizes reproducibility, scalability, and deployment readiness. The project leverages a modular design, enabling efficient experimentation and deployment in cloud environments.

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published