The rapid advancements in natural language generation have introduced powerful tools capable of shaping public opinion on social media platforms. Enhanced language modeling techniques have significantly improved the generative capabilities of deep neural networks, enabling them to produce highly realistic and contextually accurate text. This progress has led to the emergence of sophisticated text-generative models that can be exploited by adversaries to power social bots, creating convincing deepfake posts that influence public discourse. To combat this growing threat, the development of robust and accurate detection methods for identifying machine-generated content is essential. In response, this study focuses on the detection of deepfake tweets on platforms such as Twitter. A deep learning-based approach is proposed, employing a Convolutional Neural Network (CNN) architecture in combination with FastText word embeddings to classify tweets as either human-generated or botgenerated. The model is trained and evaluated using the publicly available Tweepfake dataset. To validate the effectiveness of the proposed method, it is benchmarked against several traditional machine learning models utilizing features like Term Frequency (TF), Term Frequency– Inverse Document Frequency (TF-IDF), FastText, and FastText subword embeddings. Additionally, comparisons are made with other deep learning architectures, including Long Short-Term Memory (LSTM) and hybrid CNN-LSTM models.
Keywords : Deepfake detection, social bots, text generation, CNN, FastText, Tweepfake dataset, machine-generated text, Twitter, natural language generation, fake tweets, TF-IDF, LSTM, CNNLSTM, social media manipulation, bot detection
Author : Prathima Patnaik,M.koushik Reddy, K.Bala Krishna
Title : DEEPFAKE DETECTION ON SOCIAL MEDIA LEVERAGING DEEP LEARNING AND FASTTEXT EMBEDDINGS FOR IDENTIFYING MACHINE-GENERATED TWEETS
Volume/Issue : 2025;02(10)
Page No : 4-8