Biases in Face & Emotion Tracking

We seem to subscribe to the popular oversimplification that machines are less biased than humans. However, if you are familiar with how machines are trained to read and focus on different aspects of data, you will know it’s just not that simple.

Machines are not free of bias if they are trained by humans.

The following is a presentation of the different types of biases that can occur in face tracking and expression labeling. Many of these biases can be reduced, so I have also included suggestions for improved methods. If you are working on face & emotion tracking of any type, it is your responsibility to be aware of these biases.

View the slides below OR the video linked here: YouTube Video

Text: Many face and emotion tracking companies try to use Paul Ekman’s Facial Action Coding System (FACS), but many don’t take the time to use it right.

Text: What happens when you don’t use FACS right?

- incorrect expression classification
- inconsistent expression classification
- biased labeling (racial, cultural, age-related, etc.)
- anarchy

Even if you use FACS properly there will always be bias and inconsistency – but by taking careful measures, these issues can be significantly reduced.

Text: incorrect expression classification

Facial actions are subtle and difficult to differentiate without intensive study.
Most FACS references (excluding the original FACS Manual) provide incorrect FACS visuals – even sources considered credible.
Despite these inaccuracies, such sources are often used as references by face tracking engineers and researchers.
Because tech companies do not invest enough in data-based roles, they likely do not possess the right staff or resources to differentiate important facial actions.

Text: incorrect expression classification

Basic shapes like “lip tightener” regularly get confused with actions like “lip presser” and/or “lip pucker.”
Lip tightener is important in both emotion expressions and speech production.
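
One way to check whether a labeling team is mixing up these shapes is to compare its labels against a small set of reference labels verified by someone trained in FACS, and count which pairs get swapped most often. The following is only a minimal sketch: the function name `confusion_pairs` and the sample data are hypothetical, made up for illustration, and are not results from any real dataset.

```python
from collections import Counter


def confusion_pairs(reference, predicted):
    """Count (reference label, assigned label) pairs where the two disagree."""
    if len(reference) != len(predicted):
        raise ValueError("both label lists must have the same length")
    return Counter(
        (ref, pred) for ref, pred in zip(reference, predicted) if ref != pred
    )


# Hypothetical example data: labels verified by a FACS-trained reviewer...
reference = [
    "lip tightener", "lip tightener", "lip presser",
    "lip pucker", "lip tightener",
]
# ...and the labels a labeler assigned to the same five images.
predicted = [
    "lip presser", "lip tightener", "lip presser",
    "lip tightener", "lip presser",
]

pairs = confusion_pairs(reference, predicted)
for (ref, pred), count in pairs.most_common():
    print(f"{ref} labeled as {pred}: {count}")
```

If one pair dominates the list, that is a hint the labeling guidelines or reference visuals for those two actions need attention.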

Text: incorrect expression classification

Above is a true representation of lip tightener.
This is just one of many shapes that fly under the radar each time they are:
– mistaught – misclassified – misused

Text: incorrect expression classification

The same issues surrounding incorrect classification also contribute to inconsistent classification.

Text: inconsistent expression classification

If tech companies don’t thoroughly invest in data quality, their data classification rules cannot be standardized.

Due to:

- a lack of investment in hiring and/or training employees for data-based roles
- a lack of quality FACS resources
- an inherent difficulty in differentiating facial actions

→ Labelers classify expressions inconsistently.
→ Trackers develop quirks, linking unrelated expressions to each other while confusing others.
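
Labeler inconsistency can be measured rather than guessed at. A common statistic for this is Cohen's kappa, which compares how often two labelers agree with how often they would agree by chance. The sketch below is an illustration only: the function name `cohens_kappa` and the sample labels are assumptions, not data or tooling from this presentation.

```python
from collections import Counter


def cohens_kappa(labels_a, labels_b):
    """Cohen's kappa for two labelers who labeled the same items in the same order."""
    if len(labels_a) != len(labels_b) or not labels_a:
        raise ValueError("need two non-empty label lists of equal length")
    n = len(labels_a)
    # Fraction of items where both labelers chose the same label.
    observed = sum(a == b for a, b in zip(labels_a, labels_b)) / n
    # Agreement expected by chance, from each labeler's label frequencies.
    count_a, count_b = Counter(labels_a), Counter(labels_b)
    expected = sum(count_a[k] * count_b[k] for k in count_a) / (n * n)
    if expected == 1.0:
        return 1.0  # both labelers used one identical label for every item
    return (observed - expected) / (1 - expected)


# Hypothetical example: two labelers classify the same six images.
labeler_a = ["lip tightener", "lip presser", "lip pucker",
             "lip tightener", "lip presser", "lip tightener"]
labeler_b = ["lip tightener", "lip tightener", "lip pucker",
             "lip presser", "lip presser", "lip tightener"]

kappa = cohens_kappa(labeler_a, labeler_b)
print(f"kappa = {kappa:.3f}")
```

A kappa near 1 means the labelers apply the rules the same way, while a value near 0 means their agreement is no better than chance. Tracking this number across labelers is one way to tell whether classification rules are actually standardized.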

Text: inconsistent expression classification

NOTE: This diagram was made to explain problems in shape activation for avatars, but the same basic concepts apply to face and emotion tracking.
