
GroundingLMM
About
[CVPR 2024 🔥] Grounding Large Multimodal Model (GLaMM), the first-of-its-kind model capable of generating natural language responses that are seamlessly integrated with object segmentation masks.
Key Features
- foundation-models
- llm-agent
- lmm
- vision-and-language
- vision-language-model
Pricing
Free. Open source. You supply your own LLM API keys.
Categories
Research
