T2I models aim to create images that accurately align with the text prompt and exhibit high perceptual quality. Therefore, the proposed A-Bench includes two parts to diagnose whether LMMs are masters at ...
New large multimodal models (LMMs) are released all the time, but fine-tuning them is not always straightforward. This codebase aims to provide a unified, minimal ...
Abstract: By harnessing the capabilities of large language models (LLMs), recent large multimodal models (LMMs) have shown remarkable versatility in open-world multimodal understanding. Nevertheless, ...
Abstract: We introduce WildVideo, an open-world benchmark dataset designed to assess hallucination in Large Multi-modal Models (LMMs) when understanding video-language interaction in the ...