How Deepfakes Are Made

Gabe Regan

VP of Human Engagement

Disclaimer

This article is for educational purposes only. Understanding how deepfakes are made helps organizations, security professionals, and individuals recognize and defend against synthetic media threats. Knowledge of the deepfake creation process encourages better detection and protection strategies against malicious actors who exploit this technology for fraud and deception.

Simple Explanation

Deepfake technology relies on artificial intelligence systems that learn to create convincing fake media by analyzing vast amounts of data. At its core, how deepfakes are made involves training AI models on hundreds or thousands of images, videos or audio samples of an individual.

The AI system studies patterns in facial expressions, voice characteristics, speech patterns and visual features. Through machine learning algorithms, particularly GAN technology (Generative Adversarial Networks), it learns to generate new content that mimics the original person's appearance or voice. The content can be so convincing that most people can’t differentiate between it and the real thing.
Data requirements are substantial. Compelling deepfakes typically require feeding hundreds or thousands of data points into a deep learning network, training it to reconstruct visual, audio and textual patterns. But some tools now claim to produce basic deepfakes with just a few minutes of audio or a handful of photos.

The Deepfake Creation Process

Step 1 (Data Collection): Gathering source material, including photos, videos and audio recordings of the target person, often scraped from social media or public appearances.
Step 2 (Data Preprocessing): Cleaning and organizing the collected material, extracting faces from videos, isolating voice samples, and preparing data for training.
Step 3 (Model Training): Using GAN technology or similar AI frameworks to train neural networks on the prepared data.
Step 4: (Generation): Running the trained model to create new synthetic content, fine-tuning parameters to improve realism and believability.
Step 5 (Post-Processing): Refining the output through editing software to eliminate obvious artifacts and enhance overall quality.

Time required varies based on the desired quality and available computing power. Rudimentary deepfakes can be created in under 30 seconds using off-the-shelf tools, while high-quality results may require days to weeks of processing time on more powerful systems.

Tools Used

Deepfake software ranges from user-friendly consumer applications to sophisticated development frameworks, including mobile apps that create basic face swaps and professional-grade platforms that require technical expertise.

GAN technology powers most deepfake creation, using two competing neural networks: one generates fake content while another tries to detect it, iteratively improving until the output becomes highly convincing.

Accessibility to deepfake tools has dramatically increased, with cloud-based services and simplified interfaces bringing creative capabilities to users without technical backgrounds. “Generative artificial intelligence tools make it easy for even low-skill threat actors to create deepfakes,” according to the FS-ISAC Artificial Intelligence Risk Working Group.

Why Detection Works

Deepfakes contain inherent flaws that detection systems can identify, including visual inconsistencies, audio issues and movement irregularities. Reality Defender uses inference-based methods to detect signs of manipulation or synthetic generation. Our advantage is using proprietary models that analyze video, audio or image files from thousands of approaches — all in milliseconds.

See Reality Defender's deepfake detection in action. Get in touch with our team for a demo.

Insights