Reflect, Reason, Rephrase (R³-Detox): An In-Context Learning Approach to Text Detoxification

  • University of Deusto

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

Abstract

Traditional content moderation, while effective in reducing toxicity through content removal or censoring, can discourage user participation by making users feel restricted or unfairly targeted, especially in nuanced discussions. Text detoxification offers a more constructive alternative by rephrasing offensive language into respectful forms. We propose R3-Detox, a Reflect-Reason-Rephrase framework that structures detoxification into three steps within a single prompt. The model identifies potentially toxic elements guided by Shapley values to reduce fabricated predictions, evaluates overall toxicity, and then revises the text to eliminate toxicity while retaining meaning. We augment three offensive text paraphrasing datasets (ParaDetox, Parallel Detoxification, APPDIA) with explicit detoxification reasoning. Evaluated with in-context learning, R3-Detox outperforms state-of-the-art methods, including instruction-following models.
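The three-step, single-prompt structure described in the abstract could be assembled roughly as in the sketch below. This is only an illustration under stated assumptions, not the authors' implementation: the `Demo` fields, the prompt wording, the `threshold` value, and the helper names `flag_tokens` and `build_prompt` are all hypothetical, and the per-token attribution scores are assumed to have been computed beforehand (for example with a Shapley-value explainer over a toxicity classifier).

```python
from dataclasses import dataclass


@dataclass
class Demo:
    """One in-context demonstration (hypothetical structure)."""
    text: str       # original, potentially toxic text
    reflect: str    # which elements may be toxic
    reason: str     # overall toxicity assessment
    rephrase: str   # detoxified rewrite


def flag_tokens(attributions: list[tuple[str, float]], threshold: float = 0.2) -> list[str]:
    """Keep tokens whose precomputed attribution score meets the threshold, in input order."""
    return [token for token, score in attributions if score >= threshold]


def build_prompt(
    text: str,
    attributions: list[tuple[str, float]],
    demos: list[Demo],
    threshold: float = 0.2,
) -> str:
    """Assemble one prompt that asks for Reflect, Reason, Rephrase in sequence."""
    parts = ["Detoxify the text in three steps: Reflect, Reason, Rephrase.", ""]
    # Few-shot demonstrations carry the explicit reasoning for each step.
    for demo in demos:
        parts += [
            f"Text: {demo.text}",
            f"Reflect: {demo.reflect}",
            f"Reason: {demo.reason}",
            f"Rephrase: {demo.rephrase}",
            "",
        ]
    # Attribution scores are passed as a hint for the Reflect step.
    flagged = flag_tokens(attributions, threshold)
    hint = ", ".join(flagged) if flagged else "none"
    parts += [
        f"Text: {text}",
        f"Candidate toxic tokens (attribution hint): {hint}",
        "Reflect:",
    ]
    return "\n".join(parts)


# Illustrative data only; the scores are made up for the example.
example_demos = [
    Demo(
        text="This idea is stupid.",
        reflect="The word 'stupid' is insulting.",
        reason="The sentence is mildly toxic.",
        rephrase="This idea is not convincing.",
    )
]
example_scores = [("Shut", 0.1), ("up", 0.15), ("idiot", 0.9)]
example_prompt = build_prompt("Shut up, you idiot.", example_scores, example_demos)
print(example_prompt)
```

The resulting string would be sent to a language model in a single call, with the model expected to continue from the final `Reflect:` line through the remaining two steps.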

Original language: English
Title of host publication: BDCAT 2025 - IEEE/ACM International Conference on Big Data Computing, Applications and Technologies, Co Located Conference UCC 2025
Publisher: Association for Computing Machinery, Inc
ISBN (electronic): 9798400722868
DOI
Status: Published - 24 Dec 2025
Event: 12th IEEE/ACM International Conference on Big Data Computing, Applications and Technologies, BDCAT 2025 - Nantes, France
Duration: 1 Dec 2025 to 4 Dec 2025

Publication series

Name: BDCAT 2025 - IEEE/ACM International Conference on Big Data Computing, Applications and Technologies, Co Located Conference UCC 2025

Conference

Conference: 12th IEEE/ACM International Conference on Big Data Computing, Applications and Technologies, BDCAT 2025
Country/Territory: France
City: Nantes
Period: 1 Dec 2025 to 4 Dec 2025
