Introduction

This is the dataset used in paper:
Mining Compatible/Incompatible Entities from Question and Answering via Yes/No Answer Classification using Distant Label Expansion
arXiv preprint arXiv:1612.04499
Hu Xu, Lei Shu, Jingyuan Zhang, Philip S. Yu
[paper], [Annotated Dataset], [bib]

It has 10 annotated products (more products than the original paper). Each product has more QA pairs than the original paper.

Usage

Python:

import json
with open('CER_PCQA_10.json', 'r') as f:
	products=json.load(f)
	
#print the list of question word-label pairs of the 1st QA in the first product.
print products[0]['qa'][0]['q']
#print the list of answers of the 1st QA in the first product.
print products[0]['qa'][0]['a']

Java: You may need a library to load JSON format.