Back to Conversation AI Research Overview
Research resources
This page outlines the high level questions we are exploring. Further related research can be found on the WikiDetox outline of related work.
We also have a page that provides a brief introduction to ML to help make this research easier to understand.
Public datasets
One of our contributions is to produce datasets to support research:
Open Source Code
We also have various public github projects to support research into having better converstions online:
We also have some smaller projects that provide specific technical demonstration code for working with the Perspective API:
Lots more hacks built using our API can be found at the Perspective Hacks Gallery site, including:
- Toxicity Timeline: See exactly when negative conversations happen and discover the patterns behind them.
- Hot Topics: Compare unpublished articles with others that have created debate - before you push them live.
- Comment Blur Filter: Easily find and hide comments based on your tolerance for toxicity.
Research Contributions
- Toxicity Detection: Does Context Really Matter? studies the importance of context, and when as well as to what extent it can change the perceived toxicity of posts, finding that context can both amplify or mitigate the perceived toxicity of posts, and moreover for a subset of the data that including context can even reverse the toxicity label.
- Classifying Constructive Comments (preprint) introduces a new dataset of constructive comments, defines a taxonomy of characteristics of constructiveness, and provides models for constructiveness trained on this dataset.
- Debiasing Embeddings for Reduced Gender Bias in Text Classification demonstrates how traditional techniques for debiasing word embeddings can actually increase model bias on downstream tasks and proposes novel debiasing methods to ameliorate the issue.
- Model Cards for Model Reporting proposes a framework to encourage transparent reporting of the context, use-cases, and performance characteristics of machine learning models across domains.
- Nuanced Metrics for Measuring Unintended Bias with Real Data for Text Classification introduces a suite of threshold-agnostic metrics that provide a nuanced view of unintended bias in text classification, by exploring the various ways that a classifier’s score distribution can vary across designated groups.
- Crowdsourcing Subjective Tasks: The Case Study of Understanding Toxicity in Online Discussions discusses open questions and research challenges toward the goal of effective crowdsourcing of online toxicity as well as presenting a survey of recent work that addresses these.
- WikiDetox Visualization presents a novel data visualization and moderation tool for Wikipedia that is built on top of the Perspective API.
- Conversations Gone Awry: Detecting Early Signs of Conversational Failure introduces the task of predicting whether a given conversation is on the verge of being derailed by the antisocial actions of one of its participants and demonstrates that a simple model using conversational and linguistic features can achieve performance close to that of humans for this task.
- Measuring and Mitigating Unintended Bias in Text Classification develops methods for measuring the unintended bias in a text classifier according to terms that appear in the text, as well as approaches to help mitigate them. The limitations of these methods are expanded on in the follow up paper Limitations of Pinned AUC for Measuring Unintended Bias.
- Correlating Self-Report and Trace Data Measures of Incivility: A Proof of Concept connects trace data and machine learning classifiers to self-reported survey information about user’s online behaviour demonstrating the correlation between the two.
- WikiConv: A Corpus of the Complete Conversational History of a Large Online Collaborative Community presents an unprecedented view of the complete history of conversations between contributors of English Wikipedia by recording the intermediate states of conversations—including not only comments and replies, but also their modifications, deletions and restorations.
- Ex machina: Personal attacks seen at scale outlines how crowdsourcing and machine learning can be used to scale our understanding of online personal attacks and applies these methods to the challenge on Wikipedia.
- Network Traffic Obfuscation and Automated Internet Censorship surveys approaches that use machine learning to obfuscate network traffic to circumvent censorship.