Multimodal RAG — Intuitively and Exhaustively…

Daniel Warfield

Jul 25, 2024

40

2

1

Modern RAG for modern models.

Read →

2 Comments

Daniel Warfield

Jul 25

I would be thrilled to answer any questions or thoughts you might have. An article combined with thoughts, ideas, and considerations holds much more educational power!

Expand full comment

You have shown three patterns of multi-modal RAG, but both input and output are in text format. Is this practical in real-world? I mean, what if the input contain both text and image?

Expand full comment

Reply

Share

#nojs-banner { position: fixed; bottom: 0; left: 0; padding: 16px 16px 16px 32px; width: 100%; box-sizing: border-box; background: red; color: white; font-family: -apple-system, "Segoe UI", Roboto, Helvetica, Arial, sans-serif, "Apple Color Emoji", "Segoe UI Emoji", "Segoe UI Symbol"; font-size: 13px; line-height: 13px; } #nojs-banner a { color: inherit; text-decoration: underline; } This site requires JavaScript to run correctly. Please turn on JavaScript or unblock scripts

Intuitively and Exhaustively Explained

Multimodal RAG — Intuitively and Exhaustively…