Author: Tanmay Singh
This repository presents a meta-learning-based solution for the NumerAI tournament, involving the design and training of a two-level stacked ensemble model. The goal is to predict financial data patterns effectively, even in the presence of noise, imbalance, and rapidly changing distributions.
Only a subset of successful runs and model checkpoints are stored in this repository for backup.
Note: The latest commit corresponds to the final submission, with a meta-test correlation of 0.018.
- Developed a stacked ensemble architecture comprising 6 expert models and a meta-model (XGBoost, Random Forest, LightGBM, etc.).
- Applied meta-learning techniques with feature neutralization and balanced sampling strategies to address dataset drift and class imbalance.
- Improved generalization by combining diverse models and tuning the ensemble against meta-test correlation and feature-neutral metrics.
Tech Stack: Python, Pandas, NumPy, Scikit-learn, XGBoost, LightGBM, Imbalanced-learn, Git/GitHub
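The two-level design above can be sketched with scikit-learn's stacking API. This is an illustrative stand-in, not the repo's exact pipeline: the real ensemble uses six experts including XGBoost and LightGBM, while this sketch uses two sklearn experts and synthetic data for brevity.

```python
import numpy as np
from sklearn.ensemble import (StackingRegressor, RandomForestRegressor,
                              GradientBoostingRegressor)
from sklearn.linear_model import Ridge

# Synthetic stand-in for NumerAI features/targets (illustrative only).
rng = np.random.default_rng(0)
X = rng.normal(size=(200, 10))
y = X[:, 0] * 0.1 + rng.normal(scale=0.5, size=200)

# Level 1: expert models. Level 2: a meta-model fit on the experts'
# out-of-fold predictions (cv=3 produces those folds internally).
experts = [
    ("rf", RandomForestRegressor(n_estimators=50, random_state=0)),
    ("gbm", GradientBoostingRegressor(random_state=0)),
]
stack = StackingRegressor(estimators=experts, final_estimator=Ridge(), cv=3)
stack.fit(X, y)
preds = stack.predict(X)
print(preds.shape)  # (200,)
```

The out-of-fold training of the meta-model is what keeps the second level from simply memorizing the experts' in-sample fit.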
```bash
python pre-installs.py
```

Function of `pre-installs.py`:
- Downloads and installs all required libraries.
- Creates:
  - `data/`: for downloading NumerAI data via the API.
  - `saved_models/`: for downloading pretrained models from Google Drive using `gdown`.
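A minimal sketch of the workspace setup that `pre-installs.py` performs. The directory names come from the repo; the dataset filename and Google Drive file ID in the comments are placeholders, not the repo's actual values.

```python
import os

def prepare_workspace():
    """Create the data/ and saved_models/ directories pre-installs.py expects."""
    for d in ("data", "saved_models"):
        os.makedirs(d, exist_ok=True)  # fresh directories for data and models
    return all(os.path.isdir(d) for d in ("data", "saved_models"))

# The downloads would look roughly like this (placeholder names, not run here):
#   from numerapi import NumerAPI
#   NumerAPI().download_dataset("vX/train.parquet", "data/train.parquet")
#   import gdown
#   gdown.download(id="YOUR_DRIVE_FILE_ID", output="saved_models/model.pkl")

print(prepare_workspace())  # True
```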
```bash
python Models/models.py
```

- Initializes all expert models and the meta-model architecture.
```bash
# Jupyter
jupyter notebook train.ipynb
# Or Script
python train.py
```

- Trains models and saves new pickle files to `saved_models/`.
- To revert to the original Google Drive models, re-run `pre-installs.py`.
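Saving a trained model as a pickle in `saved_models/` can be sketched as follows; the filename and toy data are hypothetical, but the save/load round-trip is the mechanism `train.py` relies on.

```python
import os
import pickle
import numpy as np
from sklearn.linear_model import Ridge

os.makedirs("saved_models", exist_ok=True)

# Toy data standing in for the NumerAI training parquet.
rng = np.random.default_rng(0)
X = rng.normal(size=(100, 5))
y = rng.normal(size=100)

model = Ridge().fit(X, y)
path = os.path.join("saved_models", "ridge.pkl")  # hypothetical filename
with open(path, "wb") as f:
    pickle.dump(model, f)  # persist the fitted model

with open(path, "rb") as f:
    restored = pickle.load(f)  # later runs reload it instead of retraining
print(np.allclose(model.predict(X), restored.predict(X)))  # True
```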
```bash
# Jupyter
jupyter notebook validation.ipynb
# Or Script
python validation.py
```

- Evaluates performance on the meta-testing (validation) parquet.
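The feature-neutral metric mentioned above can be illustrated as follows: project the predictions onto the feature matrix and subtract that component, then correlate the residual with the target. This is a common NumerAI-style neutralization sketch on synthetic data, not the repo's exact validation code.

```python
import numpy as np

def neutralize(preds, features, proportion=1.0):
    """Remove the component of preds linearly explained by features."""
    coef = np.linalg.lstsq(features, preds, rcond=None)[0]
    neutral = preds - proportion * (features @ coef)
    return neutral / neutral.std()  # rescale; correlation is scale-invariant

rng = np.random.default_rng(0)
F = rng.normal(size=(500, 8))                       # stand-in feature matrix
target = rng.normal(size=500)
preds = F[:, 0] * 0.5 + rng.normal(scale=0.1, size=500)  # feature-exposed preds

raw_corr = np.corrcoef(preds, target)[0, 1]
fn_corr = np.corrcoef(neutralize(preds, F), target)[0, 1]
print(round(raw_corr, 3), round(fn_corr, 3))
```

After full neutralization the predictions are (numerically) orthogonal to every feature column, so any remaining correlation with the target is signal the features do not already carry.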
```bash
# Jupyter
jupyter notebook predict.ipynb
# Or Script
python predict.py
```

- Generates and saves predictions in a new `predictions/` directory.
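The prediction step can be sketched as: load every pickle in `saved_models/`, predict, combine, and write to `predictions/`. The filenames, the toy model, and the simple averaging below are assumptions for illustration; the repo combines experts through its meta-model rather than a plain mean.

```python
import os
import pickle
import numpy as np
import pandas as pd
from sklearn.linear_model import Ridge

os.makedirs("saved_models", exist_ok=True)
os.makedirs("predictions", exist_ok=True)

# Self-contained stand-in: train and save one expert (hypothetical filename).
rng = np.random.default_rng(0)
X = rng.normal(size=(50, 5))
with open("saved_models/expert.pkl", "wb") as f:
    pickle.dump(Ridge().fit(X, rng.normal(size=50)), f)

# Load every saved pickle, predict, and write a submission-style frame.
preds = []
for name in os.listdir("saved_models"):
    if name.endswith(".pkl"):
        with open(os.path.join("saved_models", name), "rb") as f:
            preds.append(pickle.load(f).predict(X))
out = pd.DataFrame({"prediction": np.mean(preds, axis=0)})
out.to_csv("predictions/predictions.csv", index=False)
print(len(out))  # 50
```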
IMPORTANT NOTE
- Before every run, execute `pre-installs.py` to ensure a fresh slate on every iteration.
- If you're using newly trained models, comment out the block that downloads models from Google Drive using `gdown`.
```
.
├── data/                 # NumerAI dataset (auto-downloaded)
├── saved_models/         # Trained model pickles (auto-downloaded)
├── predictions/          # Stores output predictions
├── Models/
│   └── models.py         # Model architecture definitions
├── train.ipynb / .py     # Training pipeline
├── validation.ipynb/.py  # Validation script
├── predict.ipynb/.py     # Prediction generation
└── pre-installs.py       # Setup script
```