SQuAD (Stanford Question Answering Dataset) is a reading comprehension dataset. It consists of questions posed by crowdworkers on a set of Wikipedia articles. The answer to each question is a segment of text, or span, from the corresponding Wikipedia passage; alternatively, the question may be unanswerable.
SQuAD2.0 combines the 100k questions from its predecessor, SQuAD1.1, with 50k+ additional unanswerable questions written adversarially by crowdworkers to look similar to answerable ones.
SQuAD1.1 consists of 100,000+ question-answer pairs on 500+ articles.
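
To make the span-based answer format concrete, here is a minimal Python sketch. It assumes the field names commonly seen in the distributed JSON files (`answers`, `text`, `answer_start`, `is_impossible`); none of these are stated in the description above, so treat them as assumptions. The passage and questions are hand-written for illustration and are not taken from the dataset, and `extract_answer` is a hypothetical helper.

```python
from typing import Optional

# Hypothetical passage, hand-written for illustration (not from the dataset).
context = "The Amazon rainforest covers much of the Amazon basin of South America."

# Assumed record layout: an answer is stored as its text plus the character
# offset at which that text starts inside the passage.
answerable = {
    "question": "What does the Amazon rainforest cover?",
    "answers": [
        {
            "text": "much of the Amazon basin",
            "answer_start": context.index("much of the Amazon basin"),
        }
    ],
    "is_impossible": False,
}

# SQuAD2.0-style unanswerable question: no answer span exists in the passage.
unanswerable = {
    "question": "Who first mapped the Amazon rainforest?",
    "answers": [],
    "is_impossible": True,
}


def extract_answer(passage: str, qa: dict) -> Optional[str]:
    """Return the answer span sliced from the passage, or None if unanswerable."""
    if qa.get("is_impossible", False) or not qa["answers"]:
        return None
    answer = qa["answers"][0]
    start = answer["answer_start"]
    end = start + len(answer["text"])
    span = passage[start:end]
    # The stored text should match the slice taken at the stored offset.
    if span != answer["text"]:
        raise ValueError("answer_start does not point at the answer text")
    return span


print(extract_answer(context, answerable))
print(extract_answer(context, unanswerable))
```

Returning `None` for unanswerable questions keeps the two cases distinct for callers: a model evaluated on SQuAD2.0 has to decide whether to abstain, not only where the span begins.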