A week ago, Balaji announced a challenge to create an open-source demo of a President Biden deepfake. This isn't new; we've seen such videos before. However, there hasn't been a project that combines all the open-source models and allows us to talk and interact with the deepfake. This is the initial release. Responses will stream live. To make this real-time on consumer GPUs and indistinguishable, a lot of work needs to be done, such as aligning LLMs, training text-to-speech, and optimizing the LipSync video generation model.
The same post I submitted earlier was flagged by someone for being political in nature and is no longer visible. Please do not flag it; this is not a political post. Let's discuss the implications of such a project on our society.