The issue is ingestion (extracting the right data in the right format). This is mainly a problem with PDFs, and sometimes with tables embedded as images in Docx files too. You need a mix of text extraction and OCR to get the data out correctly before you start chunking and adding embeddings.
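A minimal sketch of what that hybrid pass might look like: try native text extraction first, and fall back to OCR when a page yields too little text (e.g. scanned pages or tables embedded as images). `extract_text` and `run_ocr` here are hypothetical callables standing in for a real PDF library (such as pdfplumber or PyMuPDF) and an OCR engine (such as Tesseract); the threshold is an assumption you would tune per corpus.

```python
MIN_CHARS = 20  # assumption: below this, treat the page as an image needing OCR

def extract_page(page, extract_text, run_ocr):
    """Return (method, text) for one page: native text if available, else OCR."""
    text = extract_text(page) or ""
    if len(text.strip()) >= MIN_CHARS:
        return ("text", text)
    return ("ocr", run_ocr(page))

def ingest(pages, extract_text, run_ocr):
    """Extract every page up front, before any chunking or embedding."""
    return [extract_page(p, extract_text, run_ocr) for p in pages]
```

The point of routing per page (rather than per document) is that real PDFs mix both kinds of content, so a single global choice of extractor loses data either way.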
Do vector databases perform better with long, grouped text than with table formats?