Platform-Invariant Topic Modeling via Contrastive Learning to Mitigate Platform-Induced Bias

Koo, Minseo; Kim, Doeun; Han, Sungwon; Park, Sungkyu

doi:10.18653/v1/2024.findings-emnlp.650

DC Field	Value	Language
dc.contributor.author	Koo, Minseo	-
dc.contributor.author	Kim, Doeun	-
dc.contributor.author	Han, Sungwon	-
dc.contributor.author	Park, Sungkyu	-
dc.date.available	2025-03-04T07:13:39Z	-
dc.date.created	2025-02-21	-
dc.date.issued	2024-11-14	-
dc.identifier.uri	https://archives.kdischool.ac.kr/handle/11125/59036	-
dc.description.abstract	Cross-platform topic dissemination is one of the research subjects explored in media analysis, but such analysis sometimes fails to capture the authentic topics because of platform-induced biases, which may arise when documents from multiple platforms are aggregated and run through an existing topic model. This work examines the impact of unique platform characteristics on the performance of topic models and proposes a new approach to enhance the effectiveness of topic modeling. The data used in this study consist of a total of 1.5 million posts collected using the keyword "ChatGPT" on three social media platforms. The proposed model reduces platform influence in topic models by developing a platform-invariant contrastive learning algorithm and removing platform-specific jargon word sets. The approach was validated through quantitative and qualitative experiments against standard and state-of-the-art topic models and showed superior performance. This method can mitigate biases arising from platform influences when modeling topics from texts collected across various platforms.	-
dc.language	English	-
dc.publisher	Association for Computational Linguistics (ACL)	-
dc.relation.isPartOf	EMNLP 2024 - 2024 Conference on Empirical Methods in Natural Language Processing, Findings of EMNLP 2024	-
dc.title	Platform-Invariant Topic Modeling via Contrastive Learning to Mitigate Platform-Induced Bias	-
dc.type	Conference	-
dc.identifier.bibliographicCitation	The 2024 Conference on Empirical Methods in Natural Language Processing (EMNLP 2024), pp. 11123-11139	-
dc.description.journalClass	1	-
dc.citation.conferenceDate	2024-11-12	-
dc.citation.conferencePlace	US	-
dc.citation.conferencePlace	Hyatt Regency Miami Hotel	-
dc.citation.endPage	11139	-
dc.citation.startPage	11123	-
dc.citation.title	The 2024 Conference on Empirical Methods in Natural Language Processing (EMNLP 2024)	-
dc.contributor.affiliatedAuthor	Park, Sungkyu	-
dc.identifier.doi	https://doi.org/10.18653/v1/2024.findings-emnlp.650	-
dc.identifier.url	https://aclanthology.org/2024.findings-emnlp.650/	-