If you are using the contents of this repository please cite it this way:
@article{Setty:2023:Arxiv,
title={Extreme Classification for Answer Type Prediction in Question Answering},
author={Setty, Vinay},
journal={arXiv preprint arXiv:2304.12395},
year={2023}
}
Semantic answer type prediction (SMART) is known to be a useful step towards designing effective question answering (QA) systems. The SMART task involves predicting the top-$k$ knowledge graph (KG) types for a given natural language question. This is challenging due to the large number of types in KGs. In this paper, we propose use of extreme multi-label classification using Transformer models (XBERT) by clustering KG types using structural and semantic features based on question text. We specifically improve the clustering stage of the XBERT pipleline using textual and structural features derived from KGs. We show that these features can improve end-to-end performance for the SMART task, and yield state-of-the-art results.
- Create a conda environment conda create -n jcdl-smart-2023
- Install the requirements pip install -r requirements.txt
- Install apex a. Clone apex repository b. install with command pip install -v --disable-pip-version-check --no-cache-dir ./
- On a GPU server run python -m smart.classification.category.bert_category_classifier
- Modify the bash script scripts/run_experiments.sh to specify appropriate datasets, embedding types and Transformer models.
EMB_LIST=( pifa-neural text-emb pifa-tfidf )
dataset_list=( wikidata dbpedia )
model_list=( roberta xlnet )
- Then run
./scripts/run_experiments.sh
- It should run full pipeline with feature generation to model training, ranking, prediction and evaluation using the script. It should print NDCG for DBPedia and MRR for Wikidata.
- To run with a sample data and for pytest unit tests to pass run
./scripts/run_experiments.sh True
The True flag runs it in test mode.
Method | DBpedia Type Classification | DBpedia End-to-End | Wikidata Type Classification | Wikidata End-to-End | ||||||
---|---|---|---|---|---|---|---|---|---|---|
NDCG@3 | NDCG@5 | NDCG@10 | NDCG@3 | NDCG@5 | NDCG@10 | MRR | MRR | |||
Question text (TF-IDF) | 0.717 | 0.693 | 0.650 | 0.824 | 0.811 | 0.787 | 0.66 | 0.76 | ||
KG-TypeSim | 0.725 | 0.697 | 0.662 | 0.828 | 0.813 | 0.793 | 0.67 | 0.77 | ||
KG-RDF2Vec | 0.729 | 0.701 | 0.656 | 0.831 | 0.815 | 0.791 | 0.67 | 0.78 | ||
BERT-TypeDesc | 0.727 | 0.706 | 0.665 | 0.830 | 0.818 | 0.795 | 0.67 | 0.78 | ||
BERT-TypeDesc-FT | 0.734 | 0.712 | 0.678 | 0.834 | 0.822 | 0.802 | 0.68 | 0.79 |