Resources

Browse and download the lab's datasets, shared task resources, lexicons, and tools. Most corpora are released for academic use on request.

Access. Public resources link directly to their catalog entries. Others are shared on request via the resource access form.
59 of 59 resources

Led by MARSAD

Datasets, lexicons, tools, and guidelines created and released by the MARSAD Lab team. 39 resources in total.

Corpora 35 entries

Annotated Arabic text and multimodal corpora covering social media, news, dialectal materials, religious text, and specialized domains.

Lexicons 1 entries

Specialized word lists covering morphology, MSA vocabulary, dialect terms, and social media categories.

Tools 1 entries

Software, annotation systems, and analytical platforms.

  • T001 MARSAD AI Platform 2026
    Live Arabic social media observatory platform with topic modeling, sentiment, toxicity, network, and geographic analysis. QRDI-funded under the Digital Citizenship cluster.
    Arabic Public web-platformqrdisocial-media
Guidelines 2 entries

Published annotation protocols and methodological frameworks.

Collaborations

Resources led by external partners where MARSAD contributed as collaborator or co-author. 20 resources in total.

Corpora 18 entries

Annotated Arabic text and multimodal corpora covering social media, news, dialectal materials, religious text, and specialized domains.

Lexicons 1 entries

Specialized word lists covering morphology, MSA vocabulary, dialect terms, and social media categories.

Guidelines 1 entries

Published annotation protocols and methodological frameworks.