Build a feedback-collection tool

1. Collect a set of contexts to generate questions for -> list of texts
2. In batch, ask each of our LMs to generate several questions for each context and log all of them; aggregate the results -> for each text, a list of questions plus metadata (which system generated each one)
3. Streamlit app: pick a random context from the list, pick a subset of its questions (at random?), and ask the user which one is "best" (most appropriate, most helpful, ...?) -> the user's choice of question. Open question: have them rank the questions (drag them into best-to-worst order), or just pick the best?
   - Log each rating in a database with these fields:

```
context, question, system_that_generated, rater_id, rank
```
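As a starting point for discussion, here is a minimal sketch of the sampling and logging steps. It assumes SQLite as the database and a table named `ratings`; the helper names (`open_db`, `sample_candidates`, `log_ranking`) and the dict keys `question` and `system` are hypothetical, not something already decided in this issue. Only the column names come from the field list above.

```python
import random
import sqlite3

# Assumed schema: one row per (rater, question) pair, columns as listed above.
# "rank" is quoted to avoid any clash with SQL function names.
SCHEMA = """
CREATE TABLE IF NOT EXISTS ratings (
    context TEXT NOT NULL,
    question TEXT NOT NULL,
    system_that_generated TEXT NOT NULL,
    rater_id TEXT NOT NULL,
    "rank" INTEGER NOT NULL
)
"""


def open_db(path=":memory:"):
    """Open (or create) the ratings database and make sure the table exists."""
    conn = sqlite3.connect(path)
    conn.execute(SCHEMA)
    return conn


def sample_candidates(questions, k=3, rng=random):
    """Pick up to k questions at random to show the rater.

    questions: list of dicts with keys 'question' and 'system' (hypothetical shape).
    """
    return rng.sample(questions, min(k, len(questions)))


def log_ranking(conn, context, ranked, rater_id):
    """Store one rater's ordering; ranked is best-to-worst, so rank 1 is best."""
    rows = [
        (context, cand["question"], cand["system"], rater_id, position)
        for position, cand in enumerate(ranked, start=1)
    ]
    conn.executemany("INSERT INTO ratings VALUES (?, ?, ?, ?, ?)", rows)
    conn.commit()
```

If we go with "pick the best" instead of a full ranking, the same table could still work by logging only the chosen question with rank 1. In the Streamlit app, the candidates returned by `sample_candidates` could be shown with something like `st.radio`, and the result passed to `log_ranking`.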

Build a feedback-collection tool #12

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

Uh oh!

Build a feedback-collection tool #12

Description

Metadata

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

Issue actions