Popular Articles

FAITHSCORE: Evaluating Hallucinations in Large Vision-Language Models

7 months ago

KNVQA: A Benchmark for evaluation knowledge-based VQA

7 months ago