Deepfake detection improves when using algorithms that are more aware of demographic diversity

Deepfake detection software may unfairly target people from some groups. JLco – Ana Suanes/iStock via Getty Images

Deepfakes, which essentially put words in someone else’s mouth in a very believable way, are becoming more sophisticated by the day and increasingly hard to spot. Recent examples include fake nude images of Taylor Swift, an audio recording of President Joe Biden telling New Hampshire residents not to vote, and a video of Ukrainian President Volodymyr Zelenskyy calling on his troops to lay down their arms.

Although companies have created detectors to help spot deepfakes, studies have found that biases in the data used to train these tools can lead to certain demographic groups being unfairly targeted.

A 2022 deepfake of Ukrainian President Volodymyr Zelenskyy purported to show him calling on his troops to lay down their arms.
Olivier Douliery/AFP via Getty Images

My team and I developed new methods that improve both the fairness and the accuracy of the algorithms used to detect deepfakes.

To do so, we used a large dataset of facial forgeries that lets researchers like us train our deep-learning approaches. We built our work around the state-of-the-art Xception detection algorithm, which is a widely used foundation for deepfake detection systems and can detect deepfakes with an accuracy of 91.5%.

We created two separate deepfake detection methods intended to encourage fairness.

One was focused on making the algorithm more aware of demographic diversity by labeling datasets by gender and race to minimize errors among underrepresented groups.

The other aimed to improve fairness without relying on demographic labels by focusing instead on features not visible to the human eye.

It turns out the first method worked best. It increased accuracy from the 91.5% baseline to 94.17%, a larger gain than our second method and several other approaches we tested achieved. Moreover, it raised accuracy while also enhancing fairness, which was our main focus.
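To illustrate what making a training objective "aware of demographic diversity" can mean, here is a minimal sketch in Python. It is not our published method: the group-balanced loss shown here is a common, generic technique, and every function name and number in it is a hypothetical chosen for illustration. The idea is that an ordinary average loss lets the largest group dominate, while averaging per-group losses gives an underrepresented group equal weight.

```python
import math
from collections import defaultdict

def bce(p, y, eps=1e-7):
    """Binary cross-entropy for one predicted probability p and label y (1 = fake)."""
    p = min(max(p, eps), 1 - eps)  # clamp to avoid log(0)
    return -(y * math.log(p) + (1 - y) * math.log(1 - p))

def plain_loss(probs, labels):
    """Ordinary mean loss: every sample counts equally, so large groups dominate."""
    return sum(bce(p, y) for p, y in zip(probs, labels)) / len(probs)

def group_losses(probs, labels, groups):
    """Mean loss computed separately for each demographic label."""
    buckets = defaultdict(list)
    for p, y, g in zip(probs, labels, groups):
        buckets[g].append(bce(p, y))
    return {g: sum(v) / len(v) for g, v in buckets.items()}

def group_balanced_loss(probs, labels, groups):
    """Average of per-group losses: every group counts equally, whatever its size."""
    per_group = group_losses(probs, labels, groups)
    return sum(per_group.values()) / len(per_group)

# Toy data (made up): group "A" has four well-classified fakes,
# group "B" has a single fake that the detector scores poorly.
probs = [0.9, 0.9, 0.9, 0.9, 0.4]
labels = [1, 1, 1, 1, 1]
groups = ["A", "A", "A", "A", "B"]

per_group = group_losses(probs, labels, groups)
plain = plain_loss(probs, labels)
balanced = group_balanced_loss(probs, labels, groups)
print(per_group, plain, balanced)
```

In this toy case the balanced loss is larger than the plain one, because the poorly handled group "B" is no longer outvoted by group "A". A detector trained to reduce the balanced loss therefore has a stronger incentive to fix errors on the smaller group.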

We believe fairness and accuracy are crucial if the public is to accept artificial intelligence technology. When large language models like ChatGPT “hallucinate,” they can perpetuate erroneous information. This affects public trust and safety.

Likewise, deepfake images and videos can undermine the adoption of AI if they cannot be quickly and accurately detected. Improving the fairness of these detection algorithms, so that certain demographic groups aren’t disproportionately harmed by them, is a key part of this.
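One way to make "disproportionately harmed" concrete is to compare how often genuine content from each group is wrongly flagged as fake. The Python sketch below is a simple audit under stated assumptions: the false-positive-rate gap is one of several possible fairness measures, not necessarily the one used in our study, and the function names and data are hypothetical.

```python
from collections import defaultdict

def false_positive_rates(preds, labels, groups):
    """Share of real items (label 0) flagged as fake (pred 1), per group."""
    flagged = defaultdict(int)
    real = defaultdict(int)
    for pred, label, g in zip(preds, labels, groups):
        if label == 0:          # only genuine content can be a false positive
            real[g] += 1
            flagged[g] += pred  # adds 1 when the detector wrongly says "fake"
    return {g: flagged[g] / real[g] for g in real}

def fpr_gap(rates):
    """Difference between the worst-treated and best-treated group."""
    return max(rates.values()) - min(rates.values())

# Toy audit data (made up): 1 = fake, 0 = real.
labels = [0, 0, 0, 0, 1, 0, 0, 0, 0, 1]
preds  = [1, 0, 0, 0, 1, 1, 1, 0, 0, 1]
groups = ["A"] * 5 + ["B"] * 5

rates = false_positive_rates(preds, labels, groups)
gap = fpr_gap(rates)
print(rates, gap)
```

Here genuine content from group "B" is flagged twice as often as genuine content from group "A". A fairer detector would shrink that gap without giving up overall accuracy.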

Our research addresses deepfake detection algorithms’ fairness, rather than just attempting to balance the data. It offers a new approach to algorithm design that considers demographic fairness as a core aspect.

Siwei Lyu receives funding from the National Science Foundation and DARPA.

Yan Ju receives funding from the U.S. Defense Advanced Research Projects Agency (DARPA) Semantic Forensics (SemaFor) program, under Contract No. HR001120C0123.
