[Question]: Gaining better control over the insert process? #1828

WilliamDiakite · 2025-05-14T22:00:13Z

Do you need to ask a question?

I have searched the existing questions and discussions, and this question has not already been answered.
I believe this is a legitimate question, not just a bug or feature request.

Your Question

rag.insert(...) takes a text document (or a collection of documents) as input. From there, LightRAG uses an LLM to extract entities, relationships, and other information. The extraction is driven by a rather complex prompt whose formatted output is later parsed by LightRAG.

My first question concerns the output format: why this particular default format (the one described in prompt.py)? Why not ask the LLM to output something like JSON or even XML, which can be easily handled by machines and humans alike? Not to mention that LLMs are trained on such formats (or languages, in the case of XML). I couldn't find a reason for this design choice in the paper.
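To make the comparison concrete, here is a minimal sketch of what parsing each style involves. The delimiter values and the record layout are assumptions modeled on the defaults in prompt.py and may differ between LightRAG versions. The JSON schema and the parse_delimited helper are hypothetical and exist only for illustration; this is not LightRAG's actual parser.

```python
import json

# Assumed delimiters, modeled on the defaults in prompt.py; check your version.
TUPLE_DELIMITER = "<|>"
RECORD_DELIMITER = "##"
COMPLETION_DELIMITER = "<|COMPLETE|>"


def parse_delimited(raw: str) -> list[list[str]]:
    """Hypothetical helper: split delimiter-formatted output into records of string fields."""
    body = raw.replace(COMPLETION_DELIMITER, "")
    records = []
    for chunk in body.split(RECORD_DELIMITER):
        chunk = chunk.strip()
        if not (chunk.startswith("(") and chunk.endswith(")")):
            continue  # skip empty or malformed records
        fields = chunk[1:-1].split(TUPLE_DELIMITER)
        records.append([field.strip().strip('"') for field in fields])
    return records


# Illustrative LLM output in the delimiter style (made-up content).
delimited_output = (
    '("entity"<|>"Alice"<|>"person"<|>"A quantum physics researcher.")##\n'
    '("relationship"<|>"Alice"<|>"Bob"<|>"They co-author papers."<|>"collaboration"<|>8)'
    "<|COMPLETE|>"
)

# The same information in a hypothetical JSON schema.
json_output = """
{
  "entities": [
    {"name": "Alice", "type": "person", "description": "A quantum physics researcher."}
  ],
  "relationships": [
    {"source": "Alice", "target": "Bob", "description": "They co-author papers.",
     "keywords": "collaboration", "strength": 8}
  ]
}
"""

records = parse_delimited(delimited_output)
as_json = json.loads(json_output)

print(records[0])
print(as_json["relationships"][0]["strength"])
```

In this sketch the delimiter parser returns every field as a string and relies on field position, while json.loads returns named, typed values but fails outright if the LLM emits a single malformed character. Neither observation explains the original design choice; it only shows the trade-off in question.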

The follow-up question is whether rag.insert(...) can be given formatted data rather than plain text (using a schema specified by LightRAG). This way, data preparation could be handled outside LightRAG, allowing easier testing and better control (moreover, an explicit schema for the extracted data would make it more comfortable to modify the LLM prompt). Is there an approach that already allows this kind of interaction?

Additional Context

No response

WilliamDiakite (Author) · 2025-06-23T18:22:55Z

Answering the second part of my question: one can insert a custom knowledge base using rag.insert_custom_kg(custom_kg) (see README.md:941).
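For anyone landing here, a rough sketch of what such a custom_kg payload can look like. The field names (chunks, entities, relationships, entity_name, src_id, tgt_id, and so on) follow the README example as best I recall, so treat them as assumptions and verify them against the README of your installed version. The find_dangling_relationships helper is hypothetical, not part of LightRAG; it only illustrates the kind of check that becomes possible once data preparation happens outside the library.

```python
# Assumed payload shape, based on the README example; verify against your LightRAG version.
custom_kg = {
    "chunks": [
        {
            "content": "Alice and Bob are collaborating on quantum computing research.",
            "source_id": "doc-1",
        }
    ],
    "entities": [
        {
            "entity_name": "Alice",
            "entity_type": "person",
            "description": "Alice is a researcher specializing in quantum physics.",
            "source_id": "doc-1",
        },
        {
            "entity_name": "Bob",
            "entity_type": "person",
            "description": "Bob is a mathematician.",
            "source_id": "doc-1",
        },
    ],
    "relationships": [
        {
            "src_id": "Alice",
            "tgt_id": "Bob",
            "description": "Alice and Bob are research partners.",
            "keywords": "collaboration research",
            "weight": 1.0,
            "source_id": "doc-1",
        }
    ],
}


def find_dangling_relationships(kg: dict) -> list[tuple[str, str]]:
    """Hypothetical pre-insert check: relationships whose endpoints are not declared entities."""
    known = {entity["entity_name"] for entity in kg.get("entities", [])}
    return [
        (rel["src_id"], rel["tgt_id"])
        for rel in kg.get("relationships", [])
        if rel["src_id"] not in known or rel["tgt_id"] not in known
    ]


dangling = find_dangling_relationships(custom_kg)
print(dangling)

# With an initialized LightRAG instance named `rag` (setup not shown here):
# rag.insert_custom_kg(custom_kg)
```

Since the payload is a plain dictionary, it can be built, validated, and unit-tested without calling an LLM, which is the kind of control the original question was after.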
