Search Results - "Zampieri, Marcos"
-
1
An Evaluation of Multilingual Offensive Language Identification Methods for the Languages of India
Published in Information (Basel) (01-08-2021)“…The pervasiveness of offensive content in social media has become an important reason for concern for online platforms. With the aim of improving online…”
Get full text
Journal Article -
2
Features of lexical complexity: insights from L1 and L2 speakers
Published in Frontiers in artificial intelligence (30-11-2023)“…We discover sizable differences between the lexical complexity assignments of first language (L1) and second language (L2) English speakers. The complexity…”
Get full text
Journal Article -
3
Lexical simplification benchmarks for English, Portuguese, and Spanish
Published in Frontiers in artificial intelligence (22-09-2022)“…Even in highly-developed countries, as many as 15–30% of the population can only understand texts written using a basic vocabulary. Their understanding of…”
Get full text
Journal Article -
4
The Role of Machine Translation Quality Estimation in the Post-Editing Workflow
Published in Informatics (Basel) (01-09-2021)“…As Machine Translation (MT) becomes increasingly ubiquitous, so does its use in professional translation workflows. However, its proliferation in the…”
Get full text
Journal Article -
5
Challenges in discriminating profanity from hate speech
Published in Journal of experimental & theoretical artificial intelligence (04-03-2018)“…In this study, we approach the problem of distinguishing general profanity from hate speech in social media, something which has not been widely considered…”
Get full text
Journal Article -
6
Automatic Language Identification in Texts: A Survey
Published in The Journal of artificial intelligence research (01-01-2019)“…Language identification (“LI”) is the problem of determining the natural language that a document or part thereof is written in. Automatic LI has been…”
Get full text
Journal Article -
7
Predicting lexical complexity in English texts: the Complex 2.0 dataset
Published in Language resources and evaluation (01-12-2022)“…Identifying words which may cause difficulty for a reader is an essential step in most lexical text simplification systems prior to lexical substitution and…”
Get full text
Journal Article -
8
Deep learning approaches to lexical simplification: A survey
Published in Journal of intelligent information systems (02-09-2024)“…Abstract Lexical Simplification (LS) is the task of substituting complex words within a sentence for simpler alternatives while maintaining the sentence’s…”
Get full text
Journal Article -
9
Offensive language identification with multi-task learning
Published in Journal of intelligent information systems (01-06-2023)“…The widespread presence of offensive content is a major issue in social media. This has motivated the development of computational models to identify such…”
Get full text
Journal Article -
10
Predicting the type and target of offensive social media posts in Marathi
Published in Social network analysis and mining (01-12-2022)“…The presence of offensive language on social media is very common motivating platforms to invest in strategies to make communities safer. This includes…”
Get full text
Journal Article -
11
Health text simplification: An annotated corpus for digestive cancer education and novel strategies for reinforcement learning
Published in Journal of biomedical informatics (01-10-2024)“…The reading level of health educational materials significantly influences the understandability and accessibility of the information, particularly for…”
Get full text
Journal Article -
12
SOLD: Sinhala offensive language dataset
Published in Language resources and evaluation (06-03-2024)“…Abstract The widespread of offensive content online, such as hate speech and cyber-bullying, is a global phenomenon. This has sparked interest in the…”
Get full text
Journal Article -
13
An Ensemble Approach for Annotating Source Code Identifiers With Part-of-Speech Tags
Published in IEEE transactions on software engineering (01-09-2022)“…This paper presents an ensemble part-of-speech tagging approach for source code identifiers. Ensemble tagging is a technique that uses machine-learning and the…”
Get full text
Journal Article -
14
A Text-to-Text Model for Multilingual Offensive Language Identification
Published 06-12-2023“…The ubiquity of offensive content on social media is a growing cause for concern among companies and government organizations. Recently, transformer-based…”
Get full text
Journal Article -
15
Toward more effective and equitable learning: Identifying barriers and solutions for the future of online education
Published in Technology, mind, and behavior (31-03-2022)Get full text
Journal Article -
16
Improving translation memory matching and retrieval using paraphrases
Published in Machine translation (01-06-2016)“…Most current translation memory (TM) systems work on the string level (character or word level) and lack semantic knowledge while matching. They use simple…”
Get full text
Journal Article -
17
mHumanEval -- A Multilingual Benchmark to Evaluate Large Language Models for Code Generation
Published 19-10-2024“…Recent advancements in large language models (LLMs) have significantly enhanced code generation from natural language prompts. The HumanEval Benchmark,…”
Get full text
Journal Article -
18
Deep Contrastive Active Learning for Out-of-domain Filtering in Dialog Systems
Published in 2024 IEEE 11th International Conference on Data Science and Advanced Analytics (DSAA) (06-10-2024)“…Task-oriented dialog systems have shown to foster effective human-chatbot collaborations for accomplishing goal-specific tasks through intent classification…”
Get full text
Conference Proceeding -
19
Multilingual Offensive Language Identification for Low-resource Languages
Published 12-05-2021“…Offensive content is pervasive in social media and a reason for concern to companies and government organizations. Several studies have been recently published…”
Get full text
Journal Article -
20
A Federated Learning Approach to Privacy Preserving Offensive Language Identification
Published 17-04-2024“…The spread of various forms of offensive speech online is an important concern in social media. While platforms have been investing heavily in ways of coping…”
Get full text
Journal Article