Testset generation broken after migrating from 0.1.x to 0.2.4 #1660
Comments
Hey, the reason could be that the default summary similarity threshold is too high for your docs. You may do either or both of the following.
Hey @shahules786, but I still have a question. For testing purposes, I changed the By changing the return random.random() < 0.25 to
Is there a reason why default_filter is coded this way? I understood the if conditions but couldn't find any explanation for the random sampling. Thanks in advance!
Hey @malikbrh, great, I'm amazed that you could debug it without much help from the docs. I would love any contributions from you to improve ragas. To answer your question, the idea was to sample random summaries from the given document set, cluster them, and use one summary from each cluster (as the cluster's representative) to estimate the persona that could interact with it.
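A minimal sketch of the sample, cluster, and pick-a-representative idea described above. It is not the ragas implementation: the function names, the dict-based node layout, the `summary_embedding` key, and the 0.75 similarity threshold are all assumptions made for illustration. Only the 0.25 sampling probability comes from the thread.

```python
import math
import random


def cosine(a, b):
    """Cosine similarity of two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def sample_summaries(nodes, keep_probability=0.25, rng=random):
    """Keep each node that has a summary embedding with probability keep_probability."""
    return [
        node
        for node in nodes
        if node.get("summary_embedding") is not None
        and rng.random() < keep_probability
    ]


def cluster_representatives(nodes, threshold=0.75):
    """Greedy clustering: a node starts a new cluster unless it is at least
    `threshold`-similar to an existing representative (hypothetical scheme)."""
    representatives = []
    for node in nodes:
        if not any(
            cosine(node["summary_embedding"], rep["summary_embedding"]) >= threshold
            for rep in representatives
        ):
            representatives.append(node)
    return representatives


# Hypothetical nodes: "a" and "b" point the same way, "c" is orthogonal to both.
nodes = [
    {"summary": "a", "summary_embedding": [1.0, 0.0]},
    {"summary": "b", "summary_embedding": [0.99, 0.1]},
    {"summary": "c", "summary_embedding": [0.0, 1.0]},
]
sampled = sample_summaries(nodes, keep_probability=1.0)  # keep everything for the demo
representatives = cluster_representatives(sampled)
```

Random sampling of this kind can return an empty list on a small input. With two eligible nodes and a keep probability of 0.25, the chance that neither is kept is 0.75² ≈ 0.56, so downstream code should be prepared for that case.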
[X] I have checked the documentation and related resources and couldn't resolve my bug.
Describe the bug
I get a ZeroDivisionError while generating my testset. The same code structure worked fine in v0.1; after migrating to 0.2 it broke. I have tried multiple models and finally settled on OpenAI GPT-4o-mini and text-embedding-3-small.
I added two documents to generate the testset, but it always fails at the same place. When I explore the KnowledgeGraph in the debugger, it looks fine, with multiple nodes generated by the previous steps.
Ragas version: v0.2.4
Python version: v3.11.10
Code to Reproduce
Note: documents are llama_index documents
`kg = KnowledgeGraph()`
Error trace
Expected behavior
I would expect the testset to be generated properly, or at least a more self-explainable error. I do not really know what would be the next best steps to debug.
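As a sketch of what a more self-explanatory error could look like: the thread does not show where the division happens, so this assumes it is an average taken over an empty collection after filtering. The function name and the message are hypothetical, not part of ragas.

```python
def mean_similarity(scores):
    """Average a list of similarity scores, failing with an actionable message
    instead of a bare ZeroDivisionError when the list is empty."""
    if not scores:
        raise ValueError(
            "No summaries passed the filter; try lowering the similarity "
            "threshold or adding more documents."
        )
    return sum(scores) / len(scores)


# Hypothetical usage: an empty input now explains itself.
try:
    mean_similarity([])
except ValueError as exc:
    message = str(exc)
```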
Additional context
I can provide more details if needed; just ask in the comments and I'll see what I can post, as my project is supposed to stay confidential. Thanks in advance for your help!