人気の記事一覧
ConSiDERS-The-Human Evaluation Framework: Rethinking Human Evaluation for Generative Large Language Models
MEDVOC: Vocabulary Adaptation for Fine-tuning Pre-trained Language Models on Medical Text Summarization
UltraEval: A Lightweight Platform for Flexible and Comprehensive Evaluation for LLMs