Natural Language Processing Faculty Publications

Target-Based Offensive Language Identification

Marcos Zampieri, George Mason University
Skye Morgan, Rochester Institute of Technology
Kai North, George Mason University
Tharindu Ranasinghe, Aston University
Austin Simmons, Rochester Institute of Technology
Paridhi Khandelwal, Rochester Institute of Technology
Sara Rosenthal, IBM Research
Preslav Nakov, Mohamed Bin Zayed University of Artificial IntelligenceFollow

Document Type

Conference Proceeding

Publication Title

Proceedings of the Annual Meeting of the Association for Computational Linguistics

Abstract

We present TBO, a new dataset for Target-based Offensive language identification. TBO contains post-level annotations regarding the harmfulness of an offensive post and token-level annotations comprising of the target and the offensive argument expression. Popular offensive language identification datasets for social media focus on annotation taxonomies only at the post level and more recently, some datasets have been released that feature only token-level annotations. TBO is an important resource that bridges the gap between post-level and token-level annotation datasets by introducing a single comprehensive unified annotation taxonomy. We use the TBO taxonomy to annotate post-level and token-level offensive language on English Twitter posts. We release an initial dataset of over 4,500 instances collected from Twitter and we carry out multiple experiments to compare the performance of different models trained and tested on TBO.

First Page

762

Last Page

770

DOI

10.18653/v1/2023.acl-short.66

Publication Date

7-2023

Comments

Open Access, archived thanks to ACL Anthology

Uploaded: Feb 05, 2024

Recommended Citation

M. Zampieri et al., “Target-based offensive language identification,” Proceedings of the 61st Annual Meeting of the Assoc. for Computational Linguistics (Vol. 2: Short Papers), July 2023. doi:10.18653/v1/2023.acl-short.66

Additional Links

Publisher link: https://aclanthology.org/2023.acl-short.66/

Download

Included in

Artificial Intelligence and Robotics Commons

COinS

Natural Language Processing Faculty Publications

Target-Based Offensive Language Identification

Document Type

Publication Title

Abstract

First Page

Last Page

DOI

Publication Date

Comments

Recommended Citation

Additional Links

Included in

Browse

Contribute

Links

Natural Language Processing Faculty Publications

Target-Based Offensive Language Identification

Authors

Document Type

Publication Title

Abstract

First Page

Last Page

DOI

Publication Date

Comments

Recommended Citation

Additional Links

Included in

Share

Browse

Contribute

Links