- Link relevance scoring
- Summary relevance scoring
- Summary embedding similarity to expected answer and prompt
- Markdown tables and charts for visualization of scoring results
- Python 3.10+
-
Clone the repository:
git clone https://github.com/Chkhikvadze/ai-search-benchmark.git cd meta-benchmark
-
Create and activate a virtual environment:
python3 -m venv venv source venv/bin/activate
-
Install the required packages:
pip install -r requirements.txt
export OPENAI_API_KEY="<your_openai_api_key>"
-
Run the application:
cd scoring python3 main.py
- Concurrent tasks can be configured with
CONCURRENT_TASKS
constant. - At the end on file you can run both twitter and web providers, you can comment out the one you don't want to run.