Question answering with pretrained transformer-based models from Hugging Face.
Question Answering (QA) is the task of generating an answer to a question from a passage containing the needed information. Optionally, the history of previous question-answer turns can also be used to produce the answer.
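To make the conversational setting concrete, the snippet below sketches how the passage, the current question, and the previous question-answer turns could be flattened into a single model input string. The `Q:`/`A:` markers and the `</s>` separator are illustrative assumptions, not the exact format used in the notebook.

```python
def build_qa_input(passage, question, history):
    """Concatenate previous QA turns, the current question and the passage
    into one input string (the separator format here is an assumption)."""
    history_text = " ".join(f"Q: {q} A: {a}" for q, a in history)
    parts = [history_text, f"Q: {question}", "</s>", passage]
    # Drop the history part when the conversation has just started
    return " ".join(p for p in parts if p)

example = build_qa_input(
    passage="Tom has a cat named Whiskers.",
    question="What is its name?",
    history=[("Does Tom have a pet?", "Yes, a cat.")],
)
print(example)
```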
The CoQA dataset has been used: https://stanfordnlp.github.io/coqa/.
In our work, we use a model consisting of two modules: the token importances extractor and the encoder-decoder (i.e. seq2seq) model. The first module computes an importance score for each input token; these scores are then used by the encoder-decoder to generate the answer.
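As a rough illustration of the first module, the sketch below maps per-token hidden states to a score in [0, 1] with a linear layer followed by a sigmoid. This is only a minimal NumPy stand-in for the real transformer-based extractor; the head shape and the sigmoid activation are assumptions.

```python
import numpy as np

def token_importances(hidden_states, w):
    """Map per-token hidden states (seq_len, d) to an importance score
    in [0, 1] per token, via a linear head + sigmoid (hypothetical)."""
    logits = hidden_states @ w            # shape: (seq_len,)
    return 1.0 / (1.0 + np.exp(-logits))  # element-wise sigmoid

rng = np.random.default_rng(0)
scores = token_importances(rng.normal(size=(5, 8)), rng.normal(size=8))
print(scores.shape)  # (5,) — one score per token
```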
Two different pre-trained models have been considered, namely DistilRoBERTa and BERT-Tiny. Different random seeds have been set across our experiments. Finally, whether or not to use the conversation history has also been taken into account.
To evaluate these different experiments, the average SQuAD F1 score has been computed on both the validation and test datasets.
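For reference, the SQuAD F1 score measures the token-level overlap between a predicted answer and the gold answer. A minimal version of the metric (without the full answer normalization of the official SQuAD evaluation script) looks like this:

```python
from collections import Counter

def squad_f1(prediction, ground_truth):
    """Token-level F1 between predicted and gold answer strings,
    in the style of the SQuAD evaluation script (simplified: no
    punctuation/article normalization)."""
    pred_tokens = prediction.lower().split()
    gold_tokens = ground_truth.lower().split()
    common = Counter(pred_tokens) & Counter(gold_tokens)
    num_same = sum(common.values())
    if num_same == 0:
        return 0.0
    precision = num_same / len(pred_tokens)
    recall = num_same / len(gold_tokens)
    return 2 * precision * recall / (precision + recall)

print(squad_f1("in the garden", "the garden"))  # 0.8
```

The reported score is then the average of this per-example F1 over the whole validation or test set.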
```
.
├── coqa                      # It contains the dataset files
├── images                    # It contains some explanatory images
├── models                    # It contains the models
├── utils                     # It contains the python files with useful functions
├── weigths                   # It contains the model weights
├── Assignment.ipynb          # Task description
├── question answering.ipynb  # Task resolution
├── .gitignore
├── LICENSE
├── report.pdf                # Report of the assignment
└── README.md
```
Git is used for versioning.
Name | Surname | Email | Username
---|---|---|---
Samuele | Bortolato | [email protected] | Sam
Antonio | Politano | [email protected] | S1082351
Enrico | Pittini | [email protected] | EnricoPittini
Riccardo | Spolaor | [email protected] | RiccardoSpolaor
This project is licensed under the MIT License - see the LICENSE file for details.