Name

MWIDSRQA1.0 short for Malawi Integrated Disease Surveillance and Response (IDSR) Questions and Answers 1.0

Description

The dataset contains questions and answers over a text containing technical guidelines for disease surveillance in Malawi. For each question we mention the location in the text that applies. The dataset focuses on disease surveillance and can be used for tasks relevant to hierarchical text classification, machine learning, information retrieval, QA from texts and structured data, multi-document summarization and many other areas. Additionally the dataset can also be used in developing training materials and tests to be used as part of public health / community surveillance courses and degrees.

Creator

Amelia Taylor, Kuyesera AI Lab, Malawi University of Business and Applied Sciences

Methodology

The dataset has two parts: one that was obtained via an automatic process of extracting questions and answers from text. The gold standard contains questions and answers that have been curated by academics and public health experts.

Data Source

The source of the dataset are six booklets containing the Technical Guidelines for Integrated Disease Surveillance in Malawi. The six booklets are country specific and have been adapted from the WHO Technical Guidelines for IDSR. The booklets are organised into sections covering different areas for disease surveillance and response. The citation for these Booklets is given below:

Permissions

Some rights reserved. This work is available under the Creative Commons Attribution-NonCommercialShareAlike 3.0 IGO licence (CC BY-NC-SA 3.0 IGO; https://creativecommons.org/licenses/by-ncsa/3.0/igo).

Competitions

Zindi : Strengthening Health Systems: LLM Challenge for Integrated Disease Surveillance and Response in Malawi