How to use vision models

I see that TaskingAI already supports vision models such as GPT-4V and similar, but I haven't figured out how to use them. Does anyone know?

Hi Riley,

TaskingAI has indeed integrated multimodal large models like GPT-4V. However, the image upload and reception functionality is still under active development. We expect to ship these features in the next version, so please stay tuned.

Hey @simson, any ETA on this? :slight_smile:

Hello, TaskingAI has now integrated these vision models and supports image upload. However, native image reception by the models themselves is still under development and is expected to be released within two weeks.

In the meantime, a more powerful feature can temporarily cover this need: add a tool to the assistant and select a vision plugin, such as Gemini. This way, any model can gain image-comprehension capabilities comparable to Gemini's.
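The workaround above amounts to attaching a vision plugin as a tool on an otherwise text-only assistant. Here is a minimal sketch of that idea as plain data; the field names (`model_id`, `tools`, `type`, `plugin_id`) and the ids used are illustrative assumptions, not TaskingAI's actual schema, so check the dashboard or SDK docs for the real configuration.

```python
# Illustrative sketch of the workaround: give a text-only assistant a vision
# plugin tool. All field names and ids here are assumptions, not TaskingAI's
# real API schema.

def add_vision_plugin(assistant_config: dict, plugin_id: str) -> dict:
    """Return a copy of the assistant config with a vision plugin tool attached."""
    config = dict(assistant_config)
    tools = list(config.get("tools", []))
    tools.append({"type": "plugin", "plugin_id": plugin_id})  # hypothetical tool entry
    config["tools"] = tools
    return config

# A text-only model gains image comprehension through the plugin tool:
base = {"model_id": "gpt-3.5-turbo", "tools": []}        # hypothetical model id
vision_ready = add_vision_plugin(base, "gemini_vision")  # hypothetical plugin id

print(vision_ready["tools"])  # → [{'type': 'plugin', 'plugin_id': 'gemini_vision'}]
```

Once the native image reception feature ships, this plugin route should become optional for models that accept images directly.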
