KG-BIAS Workshop 2020

Overview

Knowledge Graphs (KGs) store human knowledge about the world in structured format, e.g., triples of facts or graphs of entities and relations, to be processed by AI systems. In the past decade, extensive research efforts have gone into constructing and utilizing knowledge graphs for tasks in natural language processing, information retrieval, recommender systems, and more. Once constructed, knowledge graphs are often considered “gold standard” data sources that safeguard the correctness of other systems. Because the biases inherent to KGs may become magnified and spread through such systems, it is crucial that we acknowledge and address various types of bias in knowledge graph construction.

Biases may originate in the very design of the KG, in the source data from which it is created (semi-)automatically, and in the algorithms used to sample, aggregate, and process that data. Causes of bias include systematic errors due to selecting non-random items (selection bias), misremembering certain events (recall bias), and interpreting facts in a way that affirms individuals’ preconceptions (confirmation bias). Biases typically appear subliminally in expressions, utterances, and text in general and can carry over into downstream representations such as embeddings and knowledge graphs.

This workshop – to be held for the first time at AKBC 2020 – addresses the questions: “How do such biases originate?”, “How do we identify them?”, and “What is the appropriate way to handle them, if at all?” This topic is as yet unexplored, and the goal of our workshop is to start a meaningful, long-lasting dialogue among researchers from a wide variety of backgrounds and communities.

KG-BIAS topics of interest include, but are not limited to, the following:

Ethics, bias, and fairness
Qualitatively and quantitatively defining types of bias
- Implicit or explicit human bias reflected in data people generate
- Algorithmic bias represented in learned models or rules
- Taxonomies and categorizations of different biases
Empirically observing biases
- Measuring diversity of opinions
- Language, gender, geography, or interest bias
- Implications of existing bias to human end-users
- Benchmarks and datasets for bias in KGs
Measuring or remediating bias
- De-biased KG completion methods
- Algorithms for making inferences interpretable and explainable
- De-biasing or post-processing algorithms
- Creating user awareness of cognitive biases
- Ethics of data collection for bias management
- Diversification of information sources
- Provenance and traceability

Program

KG-BIAS will be held virtually on June 25, 2020. All times are in Pacific Time.

8.00-8.15: Welcome
8.15-9.00: Keynote + QA
- Jahna Otterbacher. Bias in Data and Algorithmic Systems: A "Fish-Eye View" of Problems, Solutions and Stakeholders.
  Mitigating bias in algorithmic processes and systems is a critical issue drawing increasing attention across research communities within the information and computer sciences. Given the complexity of the problem and the involvement of multiple stakeholders – not only developers, but also end-users and third parties – there is a need to understand the landscape of the sources of bias, as well as the solutions being proposed to address them. In this talk, I present insights from a recent survey of 250+ articles across four domains (machine learning, information retrieval, HCI, and RecSys), providing a “fish-eye view” of the field. I will also discuss the particular challenges of data biases, given the heterogeneity of data sources available for learning about our world, drawing on examples of my previous work on image data.
9.00-9.30: Invited paper + QA
- Joseph Fisher, Dave Palfrey, Arpit Mittal and Christos Christodoulopoulos. Measuring social bias in knowledge graph embeddings.
9.30-9.45: Break
9.45-10.45: Paper presentations + QA
- Christine Wolf. From Knowledge Graphs to Knowledge Practices: On the Need for Transparency and Explainability in Enterprise Knowledge Graph Applications.
- Shubhanshu Mishra, Sijun He and Luca Belli. Assessing Demographic Bias in Named Entity Recognition.
- Emma Gerritse, Faegheh Hasibi and Arjen P. de Vries. Bias in Conversational Search: The Double-Edged Sword of the Personalized Knowledge Graph.
10.45-11.00: Break
11.00-11.45: Keynote + QA
11.45-12.00: Break
12.00-13.00: Plenary discussion and wrap-up