Add a simple Question Answering notebook with Haystack #12
Conversation
NOTE: this PR also includes the sample dataset from #11 (same commit) in order to have data to work on.
Great start! The adversarial example was a good addition :D Haystack makes implementing the retriever-reader approach quite intuitive. For the source of the context, it would be good to have a way to point back to the originating file; it seems like a simple string search problem...
For the PR, I think it also includes the commits from the file-addition PR. If we separate them, then this one can be merged independently.
Looking forward to the generative example from Haystack as well!
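For reference, a minimal sketch of that retriever-reader pattern in Haystack 1.x. The file name and sample text are hypothetical, and keeping the source file in each document's `meta` is one possible way to point answers back to their file; the notebook itself uses BM25, but `TfidfRetriever` keeps the sketch self-contained with the in-memory store:

```python
# Sketch of the retriever-reader approach in Haystack 1.x.
# File name and content are hypothetical; storing the source file in
# each document's meta lets every answer point back to its origin.
from haystack.document_stores import InMemoryDocumentStore
from haystack.nodes import TfidfRetriever, FARMReader
from haystack.pipelines import ExtractiveQAPipeline

document_store = InMemoryDocumentStore()
document_store.write_documents([
    {"content": "ROSA stands for Red Hat OpenShift Service on AWS.",
     "meta": {"name": "docs/rosa-intro.md"}},  # hypothetical file
])

retriever = TfidfRetriever(document_store=document_store)
reader = FARMReader(model_name_or_path="deepset/roberta-base-squad2")
pipeline = ExtractiveQAPipeline(reader, retriever)

prediction = pipeline.run(query="What does ROSA stand for?")
for answer in prediction["answers"]:
    # The document meta travels with the answer, so the file can be traced.
    print(answer.answer, "->", answer.meta.get("name"))
```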
Line #24: `details="minimum"  ## Choose from "minimum", "medium", and "all"`
Looks great!! Thanks Pep.
I was looking at some of the contexts, and I have some observations and suggestions:
- Does the "all" choice mean the whole document?
- I also observed that some of the questions got included in the context along with the answers. What if we separate the questions from the answers in the corpus text, and train our model on text containing only the answers and not the questions? Let me know what you think. I will also try to apply it in my QA model testing work.
`all` means: show all the details about each answer (e.g. score, offset, ...) vs `minimum`, which only shows the answer and context. I thought `minimum` would be enough here, but happy to change it to show all the details.
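To illustrate the three levels (a sketch assuming Haystack 1.x, where `prediction` stands for the dict returned by a `pipeline.run(...)` call):

```python
# The three detail levels of Haystack's print_answers utility (1.x API);
# `prediction` is assumed to be the output of a pipeline.run(...) call.
from haystack.utils import print_answers

print_answers(prediction, details="minimum")  # answer and context only
print_answers(prediction, details="medium")   # also shows the relevance score
print_answers(prediction, details="all")      # full Answer objects: score, offsets, meta, ...
```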
About context that includes questions: yes, that comes from some of the source documents that do include questions, most notably the FAQ in the ROSA workshop: https://www.rosaworkshop.io/rosa/14-faq/
If we try to pre-process the content to separate questions from answers, I'm not sure how successful that would be in general, because most of the answers need their respective questions in order to make sense (extreme case: there are answers that are just "Yes"). But for that document in particular (the FAQ) it might make sense to use it for the "squad-like" test... only I have not verified whether all the answers there can be obtained from other documents.
Converted this PR to draft while I'm working to expand it with a Generative QA approach.
Force-pushed from d3100d1 to 56a6b2a
Updated with the current version, which adds 3 generative QA types: RAG, LFQA and OpenAI-based. The context now includes the full ROSA docs (plus the ROSA workshop and the MOBB material in the data/external samples). Results are not great; I'm still trying to see if they can be improved a bit. I also need to elaborate/document.
Force-pushed from 56a6b2a to f1db9ea
Ok, I believe this is ready for another review. The RAG version is not working well for some reason that has so far escaped me. @Shreyanand @suppathak if you have suggestions, especially on that part, they would be most welcome. I have added the retrieval of the whole ROSA docs from S3 storage, and these docs together with the in-repo samples (ROSA workshop and MOBB) are used for context. There are now more comments/docs and the structure has also been updated, hopefully making it easier to follow.
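For context, fetching the docs from S3 could look roughly like this (a sketch only; the bucket name, key prefix and local path are hypothetical, and the notebook's actual client setup may differ):

```python
# Hypothetical sketch of downloading the ROSA docs from S3 with boto3;
# bucket, prefix and local paths are placeholders, not the notebook's values.
import os
import boto3

s3 = boto3.client("s3")
bucket = "example-rosa-docs"   # hypothetical bucket
prefix = "rosa/docs/"          # hypothetical key prefix
local_dir = "data/external/rosa-docs"
os.makedirs(local_dir, exist_ok=True)

paginator = s3.get_paginator("list_objects_v2")
for page in paginator.paginate(Bucket=bucket, Prefix=prefix):
    for obj in page.get("Contents", []):
        filename = os.path.basename(obj["Key"])
        if filename:  # skip "directory" placeholder keys
            s3.download_file(bucket, obj["Key"], os.path.join(local_dir, filename))
```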
One way to inspect the model would be to observe what the retriever is fetching. Maybe it's not providing enough context to the generator. What happens if we tweak the retriever model parameters?
Also, another possible explanation could be that these embedding models may not be advanced enough to capture the language constructs, leading to poor answers.
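For example, something along these lines could surface what the retriever returns and vary how much context reaches the generator (a sketch assuming Haystack 1.x; `retriever` and `pipeline` stand for the objects built in the notebook, and the query is hypothetical):

```python
# Sketch for inspecting the retriever and tweaking its parameters
# (Haystack 1.x); `retriever` and `pipeline` are assumed to be the
# objects already built in the notebook.
query = "How do I create a ROSA cluster?"  # hypothetical test question

# Look at what the retriever fetches on its own.
for doc in retriever.retrieve(query=query, top_k=5):
    print(doc.score, doc.meta.get("name"), doc.content[:80])

# Feed the generator more (or less) context by varying top_k at run time.
result = pipeline.run(query=query, params={"Retriever": {"top_k": 10}})
print(result["answers"][0].answer)
```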
In essence, this notebook compares the free and open-source model `bart_lfqa` and the paid OpenAI model `text-davinci-003` for the long-form generative QA task. For the extractive QA task it tries `roberta-base-squad2`. For all of these experiments, the retriever is BM25. Also, it has a separate experiment that combines `DPR` as the retriever and `facebook/rag-token-nq` as the generator model.
Could we add this classification in the summary/conclusion? Once we have a validation dataset and metrics defined, we can add the results of these experiments based on those metrics as well.
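For reference, that combined DPR + RAG experiment looks roughly like this in Haystack 1.x (a sketch; `document_store` is assumed to already hold the ROSA documents, and the query is hypothetical):

```python
# Sketch of the DPR retriever + RAG generator combination (Haystack 1.x);
# `document_store` is assumed to already contain the ROSA documents.
from haystack.nodes import DensePassageRetriever, RAGenerator
from haystack.pipelines import GenerativeQAPipeline

retriever = DensePassageRetriever(
    document_store=document_store,
    query_embedding_model="facebook/dpr-question_encoder-single-nq-base",
    passage_embedding_model="facebook/dpr-ctx_encoder-single-nq-base",
)
# DPR needs dense embeddings for every stored document.
document_store.update_embeddings(retriever)

generator = RAGenerator(model_name_or_path="facebook/rag-token-nq")
pipeline = GenerativeQAPipeline(generator, retriever)

result = pipeline.run(query="What is ROSA?")  # hypothetical question
print(result["answers"][0].answer)
```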
> In essence, this notebook compares free and OS model `bart_lfqa` and paid OpenAI model `text-davinci-003` for the long form generative QA task.

While it does have a free model for LFQA and an OpenAI version, the main goal of the notebook is to explore Haystack as a framework.

> For all of these experiments, the retriever is BM25.

Dense vectors are used for generative QA - and a dense retriever accordingly. I expanded the introduction a bit to hopefully explain that (although I am not getting into details - should I?)
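To make that retriever distinction concrete, a sketch of a dense setup on the LFQA side, assuming Haystack 1.x (the embedding model here is just a common sentence-transformers choice, not necessarily the notebook's actual configuration):

```python
# Sketch of a dense retriever feeding the bart_lfqa generator (Haystack 1.x);
# `document_store` is assumed from earlier, and the embedding model is a
# common choice for QA retrieval, not necessarily the one the notebook uses.
from haystack.nodes import EmbeddingRetriever, Seq2SeqGenerator
from haystack.pipelines import GenerativeQAPipeline

retriever = EmbeddingRetriever(
    document_store=document_store,
    embedding_model="sentence-transformers/multi-qa-mpnet-base-dot-v1",
    model_format="sentence_transformers",
)
# Dense retrieval needs an embedding per stored document.
document_store.update_embeddings(retriever)

generator = Seq2SeqGenerator(model_name_or_path="vblagoje/bart_lfqa")
pipeline = GenerativeQAPipeline(generator, retriever)

result = pipeline.run(query="How do I delete a ROSA cluster?")  # hypothetical
print(result["answers"][0].answer)
```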
I think the detail level is appropriate now.

> While it does have a free model for LFQA and an OpenAI version, the main goal of the notebook is to explore Haystack as a framework

That makes sense.
Signed-off-by: Pep Turró Mauri <[email protected]>
Force-pushed from f1db9ea to 0f82abb
Another update:
/lgtm
In #9 we are exploring various QA systems. This PR provides a simple experiment with Extractive and Generative QA using Haystack.