Golden Eval Suite
5 Q/A pairs · heuristic scoring · no LLM · pass ≥ 40% overall
Select corpus files and click Run Eval to score these 5 pairs:
1
Where does the app store its data — backend or browser?
Architecture decision retrieval
browserbackendin-browser
2
What file types can be uploaded and indexed?
Feature coverage retrieval
markdownjsontypescripttext
3
What is PromptOps and how does it relate to memory blocks?
Concept linkage retrieval
promptopsmemory blockprompt assetversioned
4
What are the planned roadmap features after the MVP?
Roadmap / future plans retrieval
roadmapsemanticvectorembedding
5
What block types does the heuristic extractor produce?
Block type enumeration retrieval
featuredecisionrisktodoconcept