Why Is Retrieval Augmented Generation Or RAG Popular Today?
You could say that it’s like chocolate and peanut butter – two great tastes that go really well together.
Alternatively, you might explain it in a more technical way. In essence, Retrieval Augmented Generation is when additional information is provided to the LLM as it applies its own training data and knowledge to perform a task.
The experts at GeeksforGeeks describe it this way:
“In traditional LLMs, the model generates responses based only on the data it was trained on, which may not include up-to-date or specific details needed for certain tasks. RAG overcomes this by incorporating a retrieval mechanism that allows the model to access external databases or documents in real-time.”
There's also a helpful flow chart with “data chunks” and other components that illustrate how this system functions.
Imagine how this would work in practice: suppose you provide a chatbot with a set of white papers about your business and then ask it questions about your business strategy. On a personal level, if you want the AI to understand you better, you might upload personal documents such as diary entries or past writings to help it build a more accurate understanding of who you are.
Broadly speaking, RAG involves introducing any information not present in the original training set. This could be done for reasons of specificity, timing, intent, or simply to tailor the output more precisely.
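To make that concrete, here is a minimal sketch of the pattern in Python. Everything in it is illustrative rather than any particular vendor's API: the tiny DOCUMENTS list, the keyword-overlap retrieve function, and the call_llm stub are all stand-ins for a real document store, retriever, and model call.

```python
# Minimal retrieval-augmented prompting sketch (illustrative only).
# A real system would use a vector database and an actual LLM API;
# here a keyword-overlap score and a stub stand in for both.

DOCUMENTS = [
    "Our 2025 strategy focuses on expanding the mid-market segment in Europe.",
    "The white paper recommends bundling support contracts with annual licenses.",
    "Hiring priorities this year are data engineering and customer success roles.",
]

def retrieve(query: str, docs: list[str], k: int = 2) -> list[str]:
    """Rank documents by crude keyword overlap with the query."""
    q_words = set(query.lower().split())
    return sorted(docs, key=lambda d: -len(q_words & set(d.lower().split())))[:k]

def call_llm(prompt: str) -> str:
    """Placeholder for a real model call (e.g., an API request)."""
    return f"[model response to a {len(prompt)}-character prompt]"

def answer(question: str) -> str:
    # Retrieved text goes into the prompt ahead of the question.
    context = "\n".join(retrieve(question, DOCUMENTS))
    prompt = (
        "Answer using only the context below.\n\n"
        f"Context:\n{context}\n\n"
        f"Question: {question}"
    )
    return call_llm(prompt)

print(answer("What is our strategy for Europe?"))
```

The shape of the prompt is the whole trick: retrieved text sits ahead of the question, so the model grounds its answer in material that was never part of its training set.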
Getting to the Point
I found this particularly interesting:
At Learn By Building AI, Bill Chambers outlines a straightforward approach to RAG.
He contrasts it with the following explanation he came across at Facebook:
“Building a model that researches and contextualizes is more complex, but vital for future progress. Recently, we've made significant strides in this area with our Retrieval Augmented Generation (RAG) architecture, an end-to-end differentiable model that combines an information retrieval component (Facebook AI’s dense-passage retrieval system) with a seq2seq generator (our Bidirectional and Auto-Regressive Transformers [BART] model). RAG can be fine-tuned on knowledge-intensive downstream tasks to achieve state-of-the-art results compared with even the largest pretrained seq2seq language models. And unlike these pretrained models, RAG’s internal knowledge can be easily modified or even expanded on the fly, allowing researchers and engineers to control what RAG knows and doesn’t know without having to retrain the entire model from scratch.”
Good grief…
Chambers then presents a clear diagram showing a “corpus of documents” being connected to an LLM through user input.
That made sense: RAG means integrating specific informational resources! Of course, there are technical nuances involved, but I thought the tutorial did a solid job explaining things, so it's another resource for those wanting to grasp how RAG works in practice.
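Since the tutorial's diagram is essentially "corpus of documents, user input, LLM," here is a rough sketch of the retrieval half of that picture, folding in the "data chunks" idea from the GeeksforGeeks flow chart. This is not Chambers' code; the chunk, cosine, and top_chunks helpers are illustrative stand-ins for a real embedding model and vector index.

```python
# Rough sketch of the "corpus of documents -> user input -> LLM" flow.
# Documents are split into chunks and scored against the user's input;
# the best-matching chunks are what would be handed to the model as context.
import math
from collections import Counter

def chunk(text: str, size: int = 40) -> list[str]:
    """Split a document into fixed-size word windows ("data chunks")."""
    words = text.split()
    return [" ".join(words[i:i + size]) for i in range(0, len(words), size)]

def cosine(a: Counter, b: Counter) -> float:
    """Cosine similarity between two bag-of-words vectors."""
    dot = sum(a[w] * b[w] for w in a)
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(sum(v * v for v in b.values()))
    return dot / norm if norm else 0.0

def top_chunks(corpus: list[str], user_input: str, k: int = 3) -> list[str]:
    """Return the k chunks most similar to the user's input."""
    chunks = [c for doc in corpus for c in chunk(doc)]
    q = Counter(user_input.lower().split())
    return sorted(chunks, key=lambda c: -cosine(Counter(c.lower().split()), q))[:k]

corpus = [
    "RAG lets a model pull in external documents at query time. " * 5,
    "Fine-tuning changes model weights; retrieval only changes the prompt. " * 5,
]
print(top_chunks(corpus, "how does retrieval differ from fine-tuning?", k=2))
```

In production you would swap the bag-of-words vectors for embeddings and the sort for a nearest-neighbor lookup, but the flow of information is the same.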
Using RAG
I also wanted to mention a tech talk by Soundararajan Srinivasan, Sr. Director of AI Program at Microsoft, and his colleague Reshmi Ghosh, a Microsoft Sr. Applied Scientist, at Imagination in Action in April, where they discussed practical applications of RAG.
Using terms like “knowledge store,” “vector database,” “orchestrator,” and “meta prompt,” Srinivasan walked through how these systems operate, highlighting their role in clarifying AI’s limitations within a given context.
The word “context” is especially important because, as he explains, a larger context window increases capability while potentially reducing the model’s reliance on its own internal memory.
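As a rough illustration of how those pieces might fit together (not the specific system Srinivasan described), the sketch below has an orchestrator that queries a stand-in vector_search function, packs passages into a meta prompt until the context budget runs out, and then appends the user's question. The function names and the character-based budget are assumptions made for the example.

```python
# Illustrative orchestrator: it queries a knowledge store, then packs as many
# retrieved passages into the meta prompt as the context budget allows.
# vector_search is a stand-in for a real vector-database lookup.

META_PROMPT = (
    "You are an assistant for internal documents. "
    "Answer only from the passages below; if they do not contain the answer, say so.\n"
)

def vector_search(query: str) -> list[str]:
    """Stand-in for a vector-database query (e.g., nearest-neighbor search)."""
    return [
        "Passage A about the query topic ...",
        "Passage B with supporting detail ...",
        "Passage C that is only loosely related ...",
    ]

def build_prompt(query: str, context_window: int = 600) -> str:
    """Assemble meta prompt + passages + query within a character budget."""
    prompt = META_PROMPT
    for passage in vector_search(query):
        candidate = prompt + "\n" + passage
        if len(candidate) + len(query) > context_window:
            break  # a larger window would let more retrieved knowledge in
        prompt = candidate
    return prompt + "\n\nUser question: " + query

print(build_prompt("What does the Q3 report say about churn?"))
```

The loop also shows why context-window size matters: a bigger budget means more retrieved knowledge makes it into the prompt, so less has to come from the model's internal memory.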
Here are some additional reasons the presenters cited for using RAG:
- To integrate knowledge and reasoning
- To make model usage more accessible
- For efficient use of time and resources
Ghosh then explained how to determine whether a model actually uses the RAG-supplied information during processing.
“You have all these different contexts that are sent along with the query to inform the model, ‘hey, here's the external knowledge you may or may not already know,’” she said.
“When designing systems with large language models, or even smaller ones like Llama and Phi, we're finding that by sending context in a compartmentalized format rather than fine-tuning the model, you can still get accurate factual responses.”
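A hypothetical version of that “compartmentalized format” might look like the sketch below, where each retrieved source becomes its own labeled block sent alongside the query. This is an assumption for illustration, not the format Ghosh's team actually uses, and compartmentalized_request is an invented helper.

```python
# One way to send context "in a compartmentalized format" alongside the query:
# each retrieved source gets its own labeled block, so the model can use or
# ignore it, and the model's weights never change (no fine-tuning).
import json

def compartmentalized_request(query: str, contexts: dict[str, str]) -> list[dict]:
    """Build a chat-style request with labeled context blocks plus the query."""
    blocks = "\n\n".join(
        f"[source: {name}]\n{text}" for name, text in contexts.items()
    )
    return [
        {"role": "system", "content": "Answer from the labeled sources when possible."},
        {"role": "user", "content": f"{blocks}\n\nQuestion: {query}"},
    ]

messages = compartmentalized_request(
    "When was the pricing change announced?",
    {
        "press_release_2025": "The new pricing took effect on March 1, 2025 ...",
        "internal_faq": "Sales were briefed on the pricing change in February ...",
    },
)
print(json.dumps(messages, indent=2))
```

The point of the structure is that the grounding lives entirely in the request: nothing about the model's weights changes, which is exactly the contrast with fine-tuning that Ghosh draws.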
Ghosh also touched on multi-modality.
“You can essentially have databases containing images, voice notes, sounds, or any kind of audio, and still develop AI applications around them with similar benefits, since models are now leaning more towards utilizing RAG context and relying less on internal memory, opening new possibilities for emerging frameworks.”
This, she added, aligns well with protocols like MCP (Model Context Protocol) and A2A (Agent-to-Agent).
This is significant as we enter a new era of interfaces beyond typing—voice is now part of the mix, and image and video generation will soon offer rich alternatives to text-based models.
Some might argue we're entering a world of limitless possibilities once thought unattainable.
RAG could be one key to ensuring we maintain control and deliver the outcomes we desire. It contributes to what might be called “convergence” in a digital intelligence system. So stay tuned for further developments in these methodologies as we continue building increasingly advanced AI tools and solutions.