Back to directory
GroundingLMM logo

GroundingLMM

Free959

[CVPR 2024 ๐Ÿ”ฅ] Grounding Large Multimodal Model (GLaMM), the first-of-its-kind model capable of generating natural langu

About

[CVPR 2024 ๐Ÿ”ฅ] Grounding Large Multimodal Model (GLaMM), the first-of-its-kind model capable of generating natural language responses that are seamlessly integrated with object segmentation masks.

Key Features

  • foundation-models
  • llm-agent
  • lmm
  • vision-and-language
  • vision-language-model

Pricing

Free

Open source. You supply your own LLM API keys.

Categories

Research

Details

VerifiedJune 4, 2026
GitHub starsโ˜… 959