Science Cast

Jury: A Comprehensive Evaluation Toolkit

Devrim Çavusoglu, October 4, 2023 6:12am

This paper is a preprint and has not been certified by peer review.

arXiv, October 3, 2023

Authors

Devrim Cavusoglu, Ulas Sert, Secil Sen, Sinan Altinuc

Abstract

Evaluation plays a critical role in deep learning as a fundamental block of any prediction-based system. However, the vast number of Natural Language Processing (NLP) tasks and the development of various metrics have led to challenges in evaluating different systems with different metrics. To address these challenges, we introduce jury, a toolkit that provides a unified evaluation framework with standardized structures for performing evaluation across different tasks and metrics. The objective of jury is to standardize and improve metric evaluation for all systems and aid the community in overcoming the challenges in evaluation. Since its open-source release, jury has reached a wide audience and is available at https://github.com/obss/jury.
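To make the idea of a unified evaluation framework with standardized structures concrete, here is a minimal, self-contained Python sketch. It is not jury's actual API: the names `evaluate`, `exact_match`, and `token_f1` are hypothetical, and the choice to keep the best score over all prediction-reference pairs for each item is an assumption of this sketch rather than something stated in the abstract.

```python
from collections import Counter
from typing import Callable, Dict, List

# Hypothetical standardized structure: every item carries a list of
# candidate predictions and a list of acceptable references.
Predictions = List[List[str]]
References = List[List[str]]
Metric = Callable[[str, str], float]


def exact_match(prediction: str, reference: str) -> float:
    """Return 1.0 if the strings match after trimming and lowercasing."""
    return float(prediction.strip().lower() == reference.strip().lower())


def token_f1(prediction: str, reference: str) -> float:
    """Whitespace-token F1 between a prediction and a reference."""
    pred_tokens = prediction.lower().split()
    ref_tokens = reference.lower().split()
    overlap = sum((Counter(pred_tokens) & Counter(ref_tokens)).values())
    if overlap == 0:
        return 0.0
    precision = overlap / len(pred_tokens)
    recall = overlap / len(ref_tokens)
    return 2 * precision * recall / (precision + recall)


def evaluate(
    predictions: Predictions,
    references: References,
    metrics: Dict[str, Metric],
) -> Dict[str, float]:
    """Score the same inputs with every metric through one interface."""
    if not predictions or len(predictions) != len(references):
        raise ValueError("predictions and references must be non-empty and aligned")
    scores: Dict[str, float] = {}
    for name, metric in metrics.items():
        per_item = [
            # Assumption of this sketch: keep the best pairwise score per item.
            max(metric(p, r) for p in preds for r in refs)
            for preds, refs in zip(predictions, references)
        ]
        scores[name] = sum(per_item) / len(per_item)
    return scores


predictions = [["the cat sat"], ["hello world"]]
references = [["the cat sat", "a cat sat"], ["hello there"]]
results = evaluate(
    predictions,
    references,
    {"exact_match": exact_match, "token_f1": token_f1},
)
print(results)
```

The point of the sketch is that the input format stays fixed while metrics are swapped in and out, so different systems can be compared under several metrics with a single call.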
