人気の記事一覧

FAITHSCORE: Evaluating Hallucinations in Large Vision-Language Models

9か月前

KNVQA: A Benchmark for evaluation knowledge-based VQA

9か月前