Code for my EMNLP 2018 paper "SimpleQuestions Nearly Solved: A New Upperbound and Baseline Approach"
This is the code for the EMNLP 2018 paper "SimpleQuestions Nearly Solved: A New Upperbound and Baseline Approach".
On the SimpleQuestions dataset task, one of the most commonly used benchmarks for studying single-relation factoid questions, we:
.
├── /notebooks/
│ ├── /Simple QA End-To-End/ # Experiments on components of the end-to-end QA pipeline
│ ├── /Simple QA Models # Experiments on various neural models
│ ├── /Simple QA KG to PostgreSQL DB # Scripts to populate postgreSQL
│ ├── /Simple QA Numbers # Scripts for computing and verifying various numbers
├── /pretrained_models/
├── /lib/ # Various utility functionality
├── /tests/
├── .flake8
└── requirements.txt # Required python packages
This repository requires Python 3.5 or greater and PostgreSQL.
git clone https://github.com/PetrochukM/Simple-QA-EMNLP-2018.git
cd Simple-QA-EMNLP-2018
python -m pip install -r requirements.txt
Create and populate a PostgreSQL table named fb_two_subject_name
with notebooks/Simple QA KG to PostgreSQL DB/fb_two_subject_name.csv.gz
Create a .pass
file using the below template:
DB_NAME=
DB_PORT=
DB_USER=
DB_HOST=
DB_PASS=
Such that:
Download the SimpleQuestions v2 dataset from Facebook Research. Use the notebook at Simple-QA-EMNLP-2018/notebooks/Simple QA KG to PostgreSQL DB/FB5M & FB2M KG to DB.ipynb
to create and populate a PostgreSQL table.
You're done! Feel free to run Simple-QA-EMNLP-2018/notebooks/Simple QA End-To-End
.
The slides used for our EMNLP talk.
@article{Petrochuk2018SimpleQuestionsNS,
title={SimpleQuestions Nearly Solved: A New Upperbound and Baseline Approach},
author={Michael Petrochuk and Luke S. Zettlemoyer},
journal={CoRR},
year={2018},
volume={abs/1804.08798}
}
(01g4wmh, music/album/acquire_webpage, 02q5zps)
.