PANDA ML Package

=====================

Overview

The PANDA ML package is designed to facilitate the integration of machine learning models with the PANDA (Production and Distributed Analysis) system. This package provides tools for data preprocessing, model training, prediction, and error handling, making it easier to deploy and manage ML models in a production environment.

Features

Data Preprocessing: Includes modules for handling historical and live data, with support for categorical encoding and data splitting.
Model Management: Offers classes for managing model pipelines, including training and prediction capabilities.
Prediction Utilities: Provides a set of utilities for fetching tasks, processing predictions, and handling errors.
Logging and Validation: Includes custom logging and data validation tools to ensure robustness and reliability.

Structure

The package is organized into the following submodules:

data: Contains classes for data preprocessing and fetching.
- data_manager.py: Base data preprocessors and specific data processors.
- fetch_db_data.py: Database fetcher for retrieving task parameters.
model: Includes classes for model pipelines and management.
- base_model.py: Basic model classes.
- model_pipeline.py: Model pipelines for training and prediction.
utils: Utility functions for logging, plotting, and prediction handling.
- logger.py: Custom logging setup.
- plotting.py: Plotting utilities for metrics.
- prediction_utils.py: Prediction processing and error handling.
- validator.py: Data validation tools.
tests: Unit tests for ensuring package functionality.
live_prediction.py: Script for running live predictions.

Installation

To install the package, ensure you have Python 3.12 or later installed. Then, follow these steps:

Clone the repository:

git clone git@github.com:PanDAWMS/pandaml.git

Navigate into the project directory
Create a virtual environment (recommended):

Create the virtual environment:
```
python -m venv venv
```
Activate it on Linux/Mac:
```
source venv/bin/activate
```

Install dependencies:

pip install -r requirements.txt

Execution:

python -m scout_ml_package.live_prediction

Name		Name	Last commit message	Last commit date
Latest commit History 66 Commits
.github/workflows		.github/workflows
src		src
.gitignore		.gitignore
.pre-commit-config.yaml		.pre-commit-config.yaml
LICENSE		LICENSE
README.md		README.md
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

PANDA ML Package

Overview

Features

Structure

Installation

About

Releases

Packages

Contributors 3

Languages

License

PanDAWMS/pandaml

Folders and files

Latest commit

History

Repository files navigation

PANDA ML Package

Overview

Features

Structure

Installation

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Contributors 3

Languages

Packages