I mean it in the sense that I can upload a low-quality phone photo of a page from a Chinese cookbook and it will OCR it, translate it into English, and give me a summary of the ingredients.
I’ve been looking into vision models, but they seem daunting to set up, and the specs list things like 384x384 input resolution, so it doesn’t seem like they’d be able to do what I’m looking for. Am I even searching in the right direction?
No idea what your skill level is, but try installing Open WebUI and downloading any of the Ollama vision models.
There’s a bit of a learning curve to running Docker, but ChatGPT can easily get you to the point where it’s running.
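Once Ollama itself is up, you don’t even need the UI to test things. Here’s a rough sketch of hitting Ollama’s REST API from Python with an image attached. It assumes Ollama is on its default port (11434) and that you’ve pulled some vision model (e.g. `ollama pull llama3.2-vision`); the model name, image path, and prompt are all placeholders:

```python
import base64
import json
import urllib.request

# Read the photo and base64-encode it; Ollama's chat API takes images
# as base64 strings. "cookbook.jpg" is a placeholder path.
with open("cookbook.jpg", "rb") as f:
    image_b64 = base64.b64encode(f.read()).decode("ascii")

payload = {
    "model": "llama3.2-vision",  # any vision model you've pulled
    "stream": False,
    "messages": [{
        "role": "user",
        "content": "OCR this page, translate it into English, "
                   "and summarize the ingredients.",
        "images": [image_b64],
    }],
}

req = urllib.request.Request(
    "http://localhost:11434/api/chat",  # Ollama's default local endpoint
    data=json.dumps(payload).encode("utf-8"),
    headers={"Content-Type": "application/json"},
)
with urllib.request.urlopen(req) as resp:
    reply = json.loads(resp.read())

print(reply["message"]["content"])
```

If that works from a script but not in the browser, you know the problem is the UI and not the model.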
I’m not sure if I’m doing something wrong here, but Open WebUI has been weird for me. I tried running nanonets-ocr, but it only read the last few lines visible in the photo. And other models would start reprocessing the whole chat and ignore the latest image I posted, answering with the context of the previous reply instead… Web search is easy with it, though, so I’ll keep an eye on it and maybe try again later.
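Edit: one thing I might try for the “only reads the last lines” issue: if the model downscales tall phone photos internally (that 384x384 spec again), slicing the page into overlapping strips and OCR-ing each one separately could help. A rough, untested sketch using Pillow and the ollama Python client; the model name and paths are placeholders:

```python
from PIL import Image  # pip install pillow
import ollama          # pip install ollama; needs a running Ollama server

# If the model downscales tall photos internally, cut the page into
# overlapping horizontal strips so every line gets seen at least once.
page = Image.open("cookbook.jpg")
w, h = page.size
strip_h = w  # roughly square strips suit square model inputs
results = []
for top in range(0, h, strip_h // 2):  # 50% overlap so no line is cut in half
    strip = page.crop((0, top, w, min(top + strip_h, h)))
    strip.save("/tmp/strip.png")
    reply = ollama.chat(
        model="llama3.2-vision",
        messages=[{
            "role": "user",
            "content": "Transcribe all text in this image.",
            "images": ["/tmp/strip.png"],
        }],
    )
    results.append(reply["message"]["content"])

print("\n".join(results))
```

You’d still have to de-duplicate the overlapping lines afterwards, but at least it would show whether the truncation is a resolution problem or an Open WebUI problem.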