GithubHelp home page GithubHelp logo

qasina's Introduction

QASiNa: Question Answering Sirah Nabawiyah Dataset

Dataset Summary

Question Answering Sirah Nabawiyah Dataset (QASiNa) is Extractive QA Dataset which build to perform QA task in Sirah Nabawiyah domain.

Languages

Language: Indonesian

Dataset Format

There are two dataset format, in JSON and CSV format with the same data

CSV columns

  • ID : Question ID (integer)
  • cotext_id : Context ID (integer)
  • context_title : Context title (string)
  • context_length : Length of context (integer)
  • context : Full text of context (string)
  • question : Question sentence (string)
  • answer : Answer sentence (string)
  • start position : Starting position of answer sentence (integer)
  • question type : Type of question (string)

JSON format

{
    "context_id": unique integer,
    "context": string,
    "question_answers": [
      {
        "type": what / who / where / how many / when,
        "question": string,
        "answer": string,
        "answer_start": integer,
        "question_id": unique integer
      },
    "context_length": integer,
    "context_title": string
  }

Data Example

JSON example

{
    "context_id": 0,
    "context": "Pada saat Nabi sudah hijrah ke Madinah masih sering terjadi peperangan antara orang Islam dengan kafir Quraisy, diantaranya adalah perang Badar. Perang badar merupakan salah satu perang yang sangat menentukan masa depan negara Islam yang terjadi pada tahun kedua di daerah Badar kurang lebih 120 km dari Madinah. Perang badar ada tiga macam, yaitu perang badar pertama, perang badar kubra, dan perang badar yang terakhir (Ghazwah al-Sawiq) terjadi pada abad keempat hijrah. Perang badar kubra didahului oleh Sariyah Abdullah Ibn Jahsy ke daerah Nakhlah yang berada di antara Mekkah dan Thaif yang terjadi pada bulan Rajab tahun ke-2 H. Sariyah inilah yang menjadi penyebab paling kuat terhadap perang Badar Kubra.",
    "question_answers": [
      {
        "type": "who",
        "question": "Siapa yang menjadi penyebab terjadinya perang Badar Kubra?",
        "answer": "Sariyah Abdullah Ibn Jahsy",
        "answer_start": 508,
        "question_id": 0
      },
      {
        "type": "when",
        "question": "Kapan perang Badar terjadi?",
        "answer": "tahun kedua",
        "answer_start": 251,
        "question_id": 1
      },
      {
        "type": "what",
        "question": "Apa saja macam-macam perang Badar yang terjadi?",
        "answer": "perang badar pertama, perang badar kubra, dan perang badar yang terakhir (Ghazwah al-Sawiq)",
        "answer_start": 348,
        "question_id": 2
      }
    ],
    "context_length": 713,
    "context_title": "Penyebab Perang Badar (1)"
  }

Citation

@INPROCEEDINGS{10390123,
  author={Rizqullah, Muhammad Razif and Purwarianti, Ayu and Aji, Alham Fikri},
  booktitle={2023 10th International Conference on Advanced Informatics: Concept, Theory and Application (ICAICTA)}, 
  title={QASiNa: Religious Domain Question Answering Using Sirah Nabawiyah}, 
  year={2023},
  volume={},
  number={},
  pages={1-6},
  keywords={Sociology;Question answering (information retrieval);Task analysis;Statistics;Informatics;question answering;low resources;religious domain;mBERT;XLM-R;IndoBERT;Chat GPT;QASiNa},
  doi={10.1109/ICAICTA59291.2023.10390123}}

qasina's People

Contributors

rizquuula avatar

Stargazers

Yusuf Syaifudin avatar Balma Bahira Adzkia avatar Galih Anggara avatar Ray Ramadita avatar Danang Wijaya avatar Choirul Septyono avatar

Watchers

 avatar  avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.