Too Long; Didn't Read
The Stanford Question Answering Dataset (SQuAD) is used to test NLP models and their ability to understand natural language. SQuAD2.0 consists of a set of paragraphs from Wikipedia articles, along with 100,000 question-answer pairs derived from these paragraphs, and 50,000 unanswerable questions. The problem is to get the correct answer to a question based on a fragment of a Wikipedia article, or a segment of text from the corresponding passage, or the question may not have an answer at all.