Popular articles
Cephalo: Multi-Modal Vision-Language Models for Bio-Inspired Materials Analysis and Design
10 AI Terms Everyone Should Know (2023/11/08, news release)
"Phi-3-vision," a multimodal model integrating vision and language, and other models announced on Microsoft Azure
TextCoT: Zoom In for Enhanced Multimodal Text-Rich Image Understanding
PathMMU: A Massive Multimodal Expert-Level Benchmark for Understanding and Reasoning in Pathology
Chatbots and Large Language Models in Radiology: A Practical Primer for Clinical and Research Applications
CaMML: Context-Aware Multimodal Learner for Large Models
A Survey on Image-text Multimodal Models