@llm4eval

LLM4Eval

LLM4Eval: Large Language Model for Evaluation in IR

Pinned

  1. LLMJudge Public

    LLMJudge: LLM4Eval workshop data challenge on automatic relevance judgment

    Jupyter Notebook 12 2

  2. llm4eval.github.io Public

    LLM4Eval: Large Language Model for Evaluation in IR

Repositories

Showing 6 of 6 repositories
  • SIGIR2025 Public
    SCSS 0 MIT 0 0 0 Updated Apr 23, 2025
  • WSDM2025 Public

    WSDM 2025

    SCSS 0 MIT 0 0 0 Updated Apr 22, 2025
  • LLMJudge-benchmark Public

    Judging the Judges: A Collection of LLM-Generated Relevance Judgements

    JavaScript 0 MIT 0 0 0 Updated Feb 22, 2025
  • llm4eval.github.io Public

    LLM4Eval: Large Language Model for Evaluation in IR

    0 0 0 0 Updated Feb 18, 2025
  • LLMJudge Public

    LLMJudge: LLM4Eval workshop data challenge on automatic relevance judgment

    Jupyter Notebook 12 2 1 0 Updated Jan 2, 2025
  • SIGIR2024 Public

    SIGIR 2024

    SCSS 0 MIT 0 0 0 Updated Oct 6, 2024
