This is the dataset used in paper:
Mining Compatible/Incompatible Entities from Question and Answering via Yes/No Answer Classification using Distant Label Expansion
arXiv preprint arXiv:1612.04499
Hu Xu, Lei Shu, Jingyuan Zhang, Philip S. Yu
[paper], [Annotated Dataset], [bib]
It has 10 annotated products (more products than the original paper). Each product has more QA pairs than the original paper.
import json with open('CER_PCQA_10.json', 'r') as f: products=json.load(f) #print the list of question word-label pairs of the 1st QA in the first product. print products['qa']['q'] #print the list of answers of the 1st QA in the first product. print products['qa']['a']
Java: You may need a library to load JSON format.