Introduction
This is the dataset used in the paper:
Mining Compatible/Incompatible Entities from Question and Answering via Yes/No Answer Classification using Distant Label Expansion
arXiv preprint arXiv:1612.04499
Hu Xu, Lei Shu, Jingyuan Zhang, Philip S. Yu
[paper], [Annotated Dataset], [bib]
It contains 10 annotated products, more than in the original paper, and each product has more QA pairs than in the original paper.
Usage
Python:
import json

# Load the full dataset: a list of products.
with open('CER_PCQA_10.json', 'r') as f:
    products = json.load(f)

# Print the list of question word-label pairs of the 1st QA in the first product.
print(products[0]['qa'][0]['q'])
# Print the list of answers of the 1st QA in the first product.
print(products[0]['qa'][0]['a'])
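To get a quick overview of the whole file, the same access pattern can be applied to every product. The following is a minimal sketch, not part of the original release: it assumes only what the snippet above shows (the file is a list of products, each with a 'qa' list), and the helper names count_qa_pairs and load_products are hypothetical.

```python
import json
import os


def load_products(path='CER_PCQA_10.json'):
    """Load the dataset file and return the parsed list of products."""
    with open(path, 'r') as f:
        return json.load(f)


def count_qa_pairs(products):
    """Return the number of QA pairs of each product, in file order."""
    # Assumption: each product is a dict with a 'qa' list, as in the
    # snippet above. A product without a 'qa' key is counted as 0.
    return [len(product.get('qa', [])) for product in products]


# Only runs when the dataset file is present in the working directory.
if __name__ == '__main__' and os.path.exists('CER_PCQA_10.json'):
    products = load_products()
    for index, n in enumerate(count_qa_pairs(products)):
        print('product %d: %d QA pairs' % (index, n))
```

The counting is kept separate from the file loading so that it can be tried on a small hand-made list before running it on the full dataset.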
Java: Java SE does not include a JSON parser, so you will need a third-party JSON library to load the file.