11 comments
jadbox · 5 days ago
Vector searching had strange quirks where searching for "cat" would return mostly a lot of paragraphs unrelated to the word. I was using 3072 length for OAI text-embedding-3-large. Each entry was roughly 1-2 paragraphs. For my recent project, I found that PGroonga was more reliable for full text document lookup (with some fuzzy matching support).

Show replies

simedw · 5 days ago
Very interesting breakdown, OP have you deep dived in pgvectorscale as well?

Show replies

· 5 days ago
[deleted]
mkesper · 5 days ago
I wanted to read this article. Gave up because of absolutely missing contrast. Please, if you publish something, use black (#000) for text and almost white for background and not darker grey on a lighter grey background.

Show replies