Workshop Description

Given the success of the first, second, third, and fourth workshops on Open-Source Arabic Corpora and Corpora Processing Tools (OSACT) in LREC 2014, LREC 2016, LREC 2018 and LREC 2020, the fifth workshop comes to encourage researchers and practitioners of Arabic language technologies, including computational linguistics (CL), natural language processing (NLP), and information retrieval (IR) to share and discuss their latest research efforts, corpora, and tools. The workshop will also give special attention on Multilingualism and Language Technology for All, which is one of LREC 2022 hot topics. In addition to the general topics of CL, NLP and IR, the workshop will give a special emphasis on two shared tasks, namely: Quran QA and Fine-Grained Hate Speech Detection.

Shared Task

Quran QA Shared Task


For more information about Q&A for Quran shared task please visit this website

Important dates:

  • 6 February 2022: Train/dev set release
  • 26-29 March 2022: Runs submission (Test set available)
  • 31 March 2022: Announcing runs results
  • 10 April 2022: Shared-task paper submission deadline
  • 1 May 2022: Notification of acceptance
  • 25 May 2022: Camera ready submission of manuscripts

  • Fine-grained detection of hate speech on Arabic Twitter Shared Task


    For more information about Fine-grained detection of hate speech on Arabic Twitter shared task please visit this website

    Important dates:

  • 6 February 2022: Train/dev set release
  • 26-29 March 2022: Runs submission (Test set available)
  • 31 March 2022: Announcing runs results
  • 10 April 2022: Shared-task paper submission deadline
  • 1 May 2022: Notification of acceptance
  • 25 May 2022: Camera ready submission of manuscripts
  • Motivation and Topics of interest

    In the NLP, CL, and IR communities, Arabic is considered to be relatively resource-poor compared to English. This situation was thought to be the reason for the limited number of corpus-based studies in Arabic. However, the past years witnessed the emergence of new considerably free Modern Standard Arabic (MSA) corpora and to a lesser extent Arabic processing tools.

    This workshop follows the footsteps of previous editions of OSACT to provide a forum for researchers to share and discuss their ongoing work. This workshop is timely given the continued rise in research projects focusing on Arabic Language Resources.

    Language Resources:

  • Pre-trained Arabic language models and their applications.
  • Surveying and evaluating the design of available Arabic corpora, their associated and processing tools.
  • Availing new annotated corpora for NLP and IR applications such as named entity recognition, machine translation, sentiment analysis, text classification, and language learning.
  • Evaluating the use of crowdsourcing platforms for Arabic data annotation.
  • Open source Arabic processing toolkits.

  • Tools and Technologies:
  • Language education, e.g., L1 and L2.
  • Language modeling and pre-trained models.
  • Tokenization, normalization, word segmentation, morphological analysis, part-of-speech tagging, etc.
  • Sentiment analysis, dialect identification, and text classification
  • Dialect translation
  • Fake news detection
  • Web and social media search and analytics
  • Issues in the design, construction and use of Arabic LRs: text, speech, sign, gesture, image, in single or multimodal/multimedia data
  • Guidelines, standards, best practices and models for LRs interoperability
  • Methodologies and tools for LRs construction and annotation
  • Methodologies and tools for extraction and acquisition of knowledge
  • Ontologies, terminology and knowledge representation
  • LRs and Semantic Web (including Linked Data, Knowledge Graphs, etc.)

  • Issues in the design, construction and use of Arabic LRs: text, speech, sign, gesture, image, in single or multimodal/multimedia data:
  • Guidelines, standards, best practices and models for LRs interoperability
  • Methodologies and tools for LRs construction and annotation
  • Methodologies and tools for extraction and acquisition of knowledge
  • Ontologies, terminology and knowledge representation
  • LRs and Semantic Web (including Linked Data, Knowledge Graphs, etc.)
  • Important Dates

    Submission deadline: April 10, 2022
    Notification of acceptance: May 1, 2022
    Camera Ready of manuscripts: May 25, 2022
    Workshop date: June 20, 2022

    Submission guidelines

    The language of the workshop is English and submissions should be with respect to LREC 2022 paper submission instructions (https://lrec2022.lrec-conf.org/en/submission2020/authors-kit/). All papers will be peer reviewed, possibly by three independent referees. Papers must be submitted electronically in PDF format to the STAR system.

    When submitting a paper from the START page, authors will be asked to provide essential information about resources (in a broad sense, i.e. also technologies, standards, evaluation kits, etc.) that have been used for the work described in the paper or are a new result of your research.

    Moreover, ELRA encourages all LREC authors to share the described LRs (data, tools, services, etc.) to enable their reuse and replicability of experiments (including evaluation ones).

    Submission link: START Page

    Committees

    Organizing Committee

    • Hend Al-Khalifa, King Saud University, KSA
    • Walid Magdy, University of Edinburgh, UK
    • Kareem Darwish, Qatar Computing Research Institute, Qatar
    • Tamer Elsayed, Qatar University, Qatar
    • Hamdy Mubarak, Qatar Computing Research Institute, Qatar
    • Abdulmohsen Al-Thubaity, KACST, KSA

    Programme Committee

    • Nizar Habash, New York University Abu Dhabi, UAE
    • Wajdi Zaghouani, Carnegie Mellon University, Qatar
    • Mahmoud El-Haj, Lancaster University, UK
    • Wassim El-Hajj, American University of Beirut, Lebanon
    • Irina Temnikova, Qatar Computing Research Institute, Qatar
    • Khaled Shaalan, The British University in Dubai, UAE
    • Fethi Bougares, Université du Maine, Avenue Laënnec, France
    • Hazem Hajj, American University of Beirut, Lebanon
    • Nadi Tomeh, LIPN University of Paris 13, Sorbonne Paris CitéParis, France
    • Samhaa R. El-Beltagy, Nile UniversitySheikh Zayed, GizaEgypt
    • Muhammad Abdul-Mageed, The university of British Columbia, Canada
    • Lamia Hadrich Belguith, University of Sfax, Tunisia
    • Reem Suwaileh, Qatar University, Qatar
    • Maram Hasanain, Qatar University, Qatar
    • Mucahid Kutlu, TOBB University, Turkey
    • Abdulrahman Almuhareb, King Abdulaziz City for Science and Technology, KSA
    • Waleed Alsanie, King Abdulaziz City for Science and Technology, KSA
    • Sakhar Alkhereyf, King Abdulaziz City for Science and Technology, KSA

    • More names to come . . .