T-Ragx - Enhancing Translation with RAG-Powered LLMs

rayliuca · 1 year ago

T-Ragx - Enhancing Translation with RAG-Powered LLMs

slacktoid@lemmy.ml · edit-2 1 year ago

Can you elaborate on why vector databases arent the best option? also really cool project.

rayliuca · 1 year ago

Thanks! Vector databases store the semantic vector representation of each record and compare it to the query for retrieval, which would give results close to the meaning of the text, but not necessary the text surface. A lexical search, i.e. BM25 and levenshtein distance, seems to work better as translation examples in this case

slacktoid@lemmy.ml · 1 year ago

Understood, very cool. Thank you. will have to explore this more!!

T-Ragx - Enhancing Translation with RAG-Powered LLMs

T-Ragx - Enhancing Translation with RAG-Powered LLMs

GitHub - rayliuca/T-Ragx: Enhancing Translation with RAG-Powered Large Language Models